fix(sgemm): destroy cublas handle to avoid alloc failed by lnxtree · Pull Request #415 · xlite-dev/LeetCUDA

lnxtree · 2026-04-05T03:19:20Z

What
Fix cuBLAS handle leak in kernels/sgemm/sgemm_cublas.cu by destroying handle after GEMM, and add status checks for cuBLAS API calls.
Why
Repeated benchmark calls can accumulate unreleased cuBLAS handles, then torch.matmul may fail with CUBLAS_STATUS_ALLOC_FAILED when creating a new handle.
Changes
Added CHECK_CUBLAS macro for cuBLAS status checking
cublas_sgemm: create -> setMathMode -> gemm -> destroy
cublas_sgemm_tf32: create -> setMathMode -> gemm -> destroy
Repro error
RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling cublasCreate(handle)
Link issue
Fixes SGEMM benchmark may hit CUBLAS_STATUS_ALLOC_FAILED due to cublas handle leak in kernels/sgemm/sgemm_cublas.cu #414
Environment
GPU: NVIDIA H200 NVL (143771 MiB)
Driver: 580.126.09
CUDA Toolkit (nvcc): 12.8 (Build cuda_12.8.r12.8/compiler.35583870_0)
PyTorch: 2.5.0+cu124
PyTorch CUDA runtime: 12.4
CUDA available in torch: True
Compute capability: (9, 0)

lnxtree · 2026-04-05T03:23:37Z

Hi @DefTruth, could you please help review this PR when you have time?
It fixes a cuBLAS handle leak in SGEMM that may cause CUBLAS_STATUS_ALLOC_FAILED in [torch.matmul]. Thanks!

DefTruth

LGTM~ Thanks for this fix!

fix(sgemm): destroy cublas handle to avoid alloc failed

d16aac5

DefTruth approved these changes Apr 5, 2026

View reviewed changes

DefTruth merged commit 0b0a4f5 into xlite-dev:main Apr 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(sgemm): destroy cublas handle to avoid alloc failed#415

fix(sgemm): destroy cublas handle to avoid alloc failed#415
DefTruth merged 1 commit intoxlite-dev:mainfrom
lnxtree:fix/sgemm-cublas-handle-leak

lnxtree commented Apr 5, 2026

Uh oh!

lnxtree commented Apr 5, 2026 •

edited

Loading

Uh oh!

DefTruth left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

lnxtree commented Apr 5, 2026

Uh oh!

lnxtree commented Apr 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DefTruth left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lnxtree commented Apr 5, 2026 •

edited

Loading