convert : qwen2/3moe : set yarn metadata if present by CISC · Pull Request #13331 · ggml-org/llama.cpp

CISC · 2025-05-06T06:58:55Z

Set YaRN metadata if present on Qwen2/3MoE just like Qwen2/3.
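As a rough illustration of what "set YaRN metadata if present" means, the sketch below maps a Hugging Face style `rope_scaling` config entry to GGUF-style RoPE scaling keys. It is a standalone approximation, not the code from `convert_hf_to_gguf.py`: the helper name `yarn_metadata` is hypothetical, the key strings are assumed to follow the `{arch}.rope.scaling.*` naming convention, and the config values are made up for the example.

```python
from typing import Any


def yarn_metadata(hparams: dict[str, Any], arch: str) -> dict[str, Any]:
    """Return RoPE scaling metadata implied by an HF config dict (hypothetical helper)."""
    # "rope_scaling" may be absent or null in config.json; treat both as empty.
    rope_scaling = hparams.get("rope_scaling") or {}
    # Assumption: newer configs name the field "rope_type", older ones "type".
    rope_type = rope_scaling.get("rope_type", rope_scaling.get("type"))
    if rope_type != "yarn" or "factor" not in rope_scaling:
        # No YaRN settings present: emit nothing.
        return {}
    # Assumption: key names follow the "{arch}.rope.scaling.*" convention.
    meta: dict[str, Any] = {
        f"{arch}.rope.scaling.type": "yarn",
        f"{arch}.rope.scaling.factor": float(rope_scaling["factor"]),
    }
    if "original_max_position_embeddings" in rope_scaling:
        meta[f"{arch}.rope.scaling.original_context_length"] = int(
            rope_scaling["original_max_position_embeddings"]
        )
    return meta


# Illustrative config values only; not taken from any specific model.
example_config = {
    "rope_scaling": {
        "rope_type": "yarn",
        "factor": 4.0,
        "original_max_position_embeddings": 32768,
    }
}
print(yarn_metadata(example_config, "qwen3moe"))
```

A config without a `rope_scaling` entry, or with a non-YaRN type, yields an empty dict, which mirrors the "if present" wording in the PR title.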

convert_hf_to_gguf.py

Co-authored-by: Xuan-Son Nguyen <son@huggingface.co>

* origin/master: (27 commits)
  llama : fix build_ffn without gate (ggml-org#13336)
  CUDA: fix bad asserts for partial offload (ggml-org#13337)
  convert : qwen2/3moe : set yarn metadata if present (ggml-org#13331)
  CUDA: fix --split-mode row for MMQ (ggml-org#13323)
  gguf-py : avoid requiring pyside6 for other scripts (ggml-org#13036)
  CUDA: fix logic for clearing padding with -ngl 0 (ggml-org#13320)
  sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (ggml-org#13264)
  server : Webui - change setText command from parent window to also send the message. (ggml-org#13309)
  mtmd : rename llava directory to mtmd (ggml-org#13311)
  clip : fix confused naming ffn_up and ffn_down (ggml-org#13290)
  convert : bailingmoe : set yarn metadata if present (ggml-org#13312)
  SYCL: Disable mul_mat kernels for noncontiguous tensor b (ggml-org#13308)
  mtmd : add C public API (ggml-org#13184)
  rpc : use backend registry, support dl backends (ggml-org#13304)
  ggml : activate s390x simd for Q3_K (ggml-org#13301)
  llava/mtmd : fixes to fully support dl backends (ggml-org#13303)
  llama : build windows releases with dl backends (ggml-org#13220)
  CUDA: fix race condition in MMQ stream-k fixup (ggml-org#13299)
  CUDA: fix race condition in MMQ ids_dst (ggml-org#13294)
  vulkan: Additional type support for unary, binary, and copy (ggml-org#13266)
  ...

* set yarn metadata if present
* add comment about enabling YaRN

Co-authored-by: Xuan-Son Nguyen <son@huggingface.co>

---------

Co-authored-by: Xuan-Son Nguyen <son@huggingface.co>

set yarn metadata if present

dd8086c

CISC linked an issue May 6, 2025 that may be closed by this pull request

Feature Request: Support YaRN RoPE Scaling on Qwen2MoeModel/Qwen3MoeModel models on convert_hf_to_gguf.py #13322

Closed


CISC requested a review from ngxson May 6, 2025 06:59

github-actions bot added the python python script changes label May 6, 2025

ngxson reviewed May 6, 2025


convert_hf_to_gguf.py (review comment resolved)

ngxson approved these changes May 6, 2025


add comment about enabling YaRN

de1583b

Co-authored-by: Xuan-Son Nguyen <son@huggingface.co>

CISC merged commit 764b856 into master May 6, 2025
7 checks passed

CISC deleted the convert-qwen2-3moe-yarn branch May 6, 2025 09:12

CISC merged 2 commits into master from convert-qwen2-3moe-yarn
