Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

gguf : reject empty metadata key names during parsing ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#22410 opened Apr 26, 2026 by rxho Loading…
fix: re-throw raised_exception in statement::execute without wrapping jinja parser Issues related to the jinja parser
#22409 opened Apr 26, 2026 by rxho Loading…
ggml-webgpu: add layer norm ops ggml changes relating to the ggml tensor library for machine learning WebGPU
#22406 opened Apr 26, 2026 by Constannnnnt Contributor Loading…
Docs/SeepSeek V4 GGUF Stanardization documentation Improvements or additions to documentation
#22405 opened Apr 26, 2026 by Nottlespike Draft
llama: allow partial seq_rm for GDN models for speculative decoding examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs server testing Everything test related Vulkan Issues specific to the Vulkan backend
#22400 opened Apr 26, 2026 by am17an Contributor Draft
spec : refactor params examples server testing Everything test related
#22397 opened Apr 26, 2026 by ggerganov Member Draft
fix: rpc-server cache may not work in Windows environments examples ggml changes relating to the ggml tensor library for machine learning
#22394 opened Apr 26, 2026 by unraido Loading…
Windows: raise stdio limit for loading many GGUF shards
#22385 opened Apr 26, 2026 by Thireus Contributor Loading…
Wip/deepseek v4 support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes testing Everything test related
#22378 opened Apr 26, 2026 by nisparks Contributor Draft
ggml-webgpu: add Q1_0 support ggml changes relating to the ggml tensor library for machine learning WebGPU
#22374 opened Apr 25, 2026 by SharmaRithik Contributor Loading…
Add DeepSeek V4 GGUF conversion python python script changes
#22359 opened Apr 25, 2026 by nisparks Contributor Loading…
convert : remove input_scale for dequantized fp8 modelopt python python script changes
#22356 opened Apr 25, 2026 by CISC Member Loading…
ggml : revert to -lm linking instead of find_library ggml changes relating to the ggml tensor library for machine learning
#22355 opened Apr 25, 2026 by angt Member Loading…
rpc: add ipv6 support examples ggml changes relating to the ggml tensor library for machine learning
#22350 opened Apr 25, 2026 by alphaonex86 Loading…
hexagon: hmx flash attention ggml changes relating to the ggml tensor library for machine learning Hexagon testing Everything test related
#22347 opened Apr 25, 2026 by njsyw1997 Contributor Draft
ggml-webgpu: fast matrix-vector multiplication for i-quants ggml changes relating to the ggml tensor library for machine learning WebGPU
#22344 opened Apr 25, 2026 by SharmaRithik Contributor Loading…
chat: preserve media markers for typed-content templates
#22342 opened Apr 25, 2026 by AlexonOliveiraRH Loading…
2 tasks done
ggml: implement gguf_init_from_buffer ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#22341 opened Apr 24, 2026 by giladgd Contributor Loading…
common: fix missing exports in llama-common examples merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge.
#22340 opened Apr 24, 2026 by max-krasnyansky Member Loading…
opencl: refactor Adreno q4_0 ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#22335 opened Apr 24, 2026 by lhez Contributor Draft
ProTip! Add no:assignee to see everything that’s not assigned.