Add compute kernels to Windows CUDA build #1062

CarlGao4 · 2025-12-07T13:27:54Z

Fixes #1061, Fixes #1060

Current status: running CI on my fork

Changes:

Updated Jimver/cuda-toolkit to 0.2.22
Updated CUDA compiler to 12.8.1
Fixes single CUDA architecture issue

Current supported GPUs:

6.1: Tesla P40 generation, GTX 10 Series, Quadro Pascal generation
7.0: V100
7.5: Tesla T4, GTX 16 Series, RTX 20 Series, Quadro Turing Series
8.0: A100
8.6: A40, Workstation Ampere Series, RTX 30 Series
8.9: L40, Workstation Ada Series, RTX 40 Series
9.0: H100, H200
10.0: B200
12.0: Workstation Blackwell Series, RTX 50 Series

leejet · 2025-12-07T14:12:29Z

LGTM. Thanks.

CarlGao4 · 2025-12-07T15:32:57Z

However, with this multiple-kernel CUDA support, build time would increase significantly, so I suggest adding a cache mechanism. I've used ccache on Linux C++ project and upload the cache to github, but I'm not sure whether it works with Windows and nvcc

LostRuins · 2025-12-08T11:46:10Z

tbh this seems a bit overkill considering GGML does not utilize any new features after Ada Lovelace.
I'd probably recommend stopping at Hopper and just build PTX for anything after.

CarlGao4 · 2025-12-08T12:03:53Z

But it's sure that there are users using RTX 50 series (I'm using 4060, but one of my friends is using 5070)

LostRuins · 2025-12-09T02:09:19Z

Yes the way PTX targets work (e.g. 80-virtual) is that they are forward compatible. It will work on 5070 and future GPUs as well

CarlGao4 added 4 commits December 7, 2025 18:53

Fix syntax for CUDA architecture definitions

9ab647d

Extend CUDA support to GTX 10 Series to RTX 50 Series

1f3378d

update cuda installer step version to install cuda 12.8.1

09da541

Remove unsupported compute capability

c47281b

leejet merged commit 0392273 into leejet:master Dec 7, 2025

leejet mentioned this pull request Dec 8, 2025

add z-image support #1020

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add compute kernels to Windows CUDA build #1062

Add compute kernels to Windows CUDA build #1062

Uh oh!

CarlGao4 commented Dec 7, 2025 •

edited

Loading

Uh oh!

leejet commented Dec 7, 2025

Uh oh!

CarlGao4 commented Dec 7, 2025

Uh oh!

LostRuins commented Dec 8, 2025

Uh oh!

CarlGao4 commented Dec 8, 2025

Uh oh!

LostRuins commented Dec 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add compute kernels to Windows CUDA build #1062

Add compute kernels to Windows CUDA build #1062

Uh oh!

Conversation

CarlGao4 commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leejet commented Dec 7, 2025

Uh oh!

CarlGao4 commented Dec 7, 2025

Uh oh!

LostRuins commented Dec 8, 2025

Uh oh!

CarlGao4 commented Dec 8, 2025

Uh oh!

LostRuins commented Dec 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CarlGao4 commented Dec 7, 2025 •

edited

Loading