Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add SwinTransformer support for torch_onnx quantization workflow
#1235 opened Apr 10, 2026 by ajrasane Contributor Loading…
3 tasks
Fix test_collect_hidden_states: use synthetic short conversations
#1234 opened Apr 10, 2026 by yeyu-nvidia Contributor Loading…
2 tasks
vLLM fakequant: add recipe-based quantization support
#1233 opened Apr 10, 2026 by kinjalpatel27 Contributor Loading…
support Qwen3.5 quantization
#1230 opened Apr 10, 2026 by deepindeed2022 Loading…
[2/N] PTQ skill change for transformers 5.0
#1229 opened Apr 10, 2026 by mxinO Contributor Loading…
[2/3] Implicit Gemm NVFP4
#1227 opened Apr 9, 2026 by jingyu-ml Contributor Draft
Add LTX-2 third-party license notices for legal compliance cherry-pick After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1226 opened Apr 9, 2026 by kevalmorabia97 Collaborator Loading…
3 tasks
added gptq nvfp4 default recipe + docstring fix
#1224 opened Apr 9, 2026 by sugunav14 Contributor Loading…
1 task done
GPTQ vector
#1223 opened Apr 9, 2026 by sugunav14 Contributor Draft
Add Gemma4 MoE quantization support
#1219 opened Apr 9, 2026 by yueshen2016 Contributor Loading…
4 tasks done
Add WaterSIC for KV-cache quantization
#1217 opened Apr 9, 2026 by kaix-nv Contributor Draft
Add TriAttention For KV Cache Compression
#1216 opened Apr 9, 2026 by kaix-nv Contributor Draft
Add VLM base model support for auto_quantize in hf_ptq
#1214 opened Apr 9, 2026 by yueshen2016 Contributor Loading…
Add FP8 QKVO + NVFP4 MLP PTQ recipe
#1213 opened Apr 9, 2026 by yueshen2016 Contributor Loading…
add: DFlash block diffusion speculative decoding
#1211 opened Apr 8, 2026 by ChenhanYu Collaborator Loading…
fix: handle accelerate CPU-offloaded models in FakeQuant export
#1194 opened Apr 8, 2026 by sungsooha Contributor Loading…
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191 opened Apr 7, 2026 by realAsma Contributor Loading…
4 of 6 tasks
GPTQ test
#1179 opened Apr 6, 2026 by sugunav14 Contributor Draft
ProTip! Follow long discussions with comments:>50.