This repository has been archived by the owner on Jan 13, 2025. It is now read-only.

Fix _symm for SB_ENABLE_JOINT_MATRIX=1
pgorlani committed Oct 20, 2023
1 parent 7e85bf9 commit eae0f8b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/interface/blas3/backend/nvidia_gpu.hpp
@@ -55,7 +55,7 @@ typename sb_handle_t::event_t _gemm(

#ifdef SB_ENABLE_JOINT_MATRIX
const char* en_joint_matrix = std::getenv("SB_ENABLE_JOINT_MATRIX");
- if (en_joint_matrix != NULL && *en_joint_matrix == '1') {
+ if (en_joint_matrix != NULL && *en_joint_matrix == '1' && !s_a && !s_b) {
if (_M > 1024 && _N > 1024) {
return blas::Gemm_Launcher<
container_0_t, container_1_t, container_2_t, 256, false, true, true,
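The changed condition can be read in isolation as a small predicate. The sketch below is illustrative only: `joint_matrix_allowed` and `joint_matrix_allowed_from_env` are hypothetical helpers that do not exist in the repository, and it assumes that `s_a` and `s_b` flag a symmetric A or B operand (as the commit title's reference to `_symm` suggests). It shows that after this change the joint-matrix path is taken only when the environment variable starts with `'1'` and neither flag is set.

```cpp
#include <cassert>
#include <cstdlib>

// Hypothetical helper (not part of the library): mirrors the patched
// condition. env_value is the raw result of std::getenv, which may be null.
// s_a / s_b are assumed to mark a symmetric A / B operand.
inline bool joint_matrix_allowed(const char* env_value, bool s_a, bool s_b) {
  return env_value != nullptr && *env_value == '1' && !s_a && !s_b;
}

// Hypothetical convenience wrapper that reads the variable named in the diff.
inline bool joint_matrix_allowed_from_env(bool s_a, bool s_b) {
  return joint_matrix_allowed(std::getenv("SB_ENABLE_JOINT_MATRIX"), s_a, s_b);
}
```

Under these assumptions, a `_symm` call that reaches `_gemm` with `s_a` or `s_b` set falls through to the non-joint-matrix configurations even when `SB_ENABLE_JOINT_MATRIX=1`.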
