Pull requests: NVIDIA/Model-Optimizer
Pull requests list
Add SwinTransformer support for torch_onnx quantization workflow
#1235
opened Apr 10, 2026 by
ajrasane
Contributor
3 tasks
Fix test_collect_hidden_states: use synthetic short conversations
#1234
opened Apr 10, 2026 by
yeyu-nvidia
Contributor
2 tasks
vLLM fakequant: add recipe-based quantization support
#1233
opened Apr 10, 2026 by
kinjalpatel27
Contributor
Pre-initialize torch._dynamo to prevent double-registration with peft torch.compile() call
#1228
opened Apr 9, 2026 by
hychiang-git
Contributor
Add LTX-2 third-party license notices for legal compliance
cherry-pick
After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1226
opened Apr 9, 2026 by
kevalmorabia97
Collaborator
3 tasks
added gptq nvfp4 default recipe + docstring fix
#1224
opened Apr 9, 2026 by
sugunav14
Contributor
1 task done
consolidate mbridge distillation: merge distill_hf.py into distill.py
#1220
opened Apr 9, 2026 by
j-rausch
Contributor
Add Gemma4 MoE quantization support
#1219
opened Apr 9, 2026 by
yueshen2016
Contributor
4 tasks done
Add VLM base model support for auto_quantize in hf_ptq
#1214
opened Apr 9, 2026 by
yueshen2016
Contributor
add: DFlash block diffusion speculative decoding
#1211
opened Apr 8, 2026 by
ChenhanYu
Collaborator
Add Z-Image (NextDiT/Lumina2) PTQ quantization support in diffusers example
#1205
opened Apr 8, 2026 by
andrea-pilzer
Add support for postprocess exported model for block scale swizzling and support for different padding strategy
#1195
opened Apr 8, 2026 by
ynankani
Contributor
fix: handle accelerate CPU-offloaded models in FakeQuant export
#1194
opened Apr 8, 2026 by
sungsooha
Contributor
Simplify KDTrainer and enhance ModelOptHFTrainer
#1191
opened Apr 7, 2026 by
realAsma
Contributor
4 of 6 tasks
Add ModelOpt Triton attention kernels for WAN2.2 diffusion (sparse, skip-softmax, NVFP4)
#1190
opened Apr 7, 2026 by
yeyu-nvidia
Contributor
5 tasks
[chore]: weekly bump of uv.lock on main (2026-04-06)
#1180
opened Apr 6, 2026 by
github-actions
bot
feat: parallelize fakequant export across GPUs via ThreadPoolExecutor
#1177
opened Apr 3, 2026 by
sungsooha
Contributor