Fix _init_weights to safely skip int8 quantized weights by KaparthyReddy · Pull Request #41522 · huggingface/transformers

KaparthyReddy · 2025-10-11T13:28:13Z

This PR addresses a RuntimeError that occurs when loading W8A8 quantized Qwen2.5-VL models.

Changes:

Modified _init_weights in Qwen2_5_VLPreTrainedModel to safely skip non-floating tensors (e.g., int8 quantized weights) during initialization.
Ensures float tensors are initialized normally, while int8 or other non-float tensors are ignored, preventing errors from normal_() on integer dtypes.
Improves compatibility for loading quantized models without vLLM and allows safe custom hooks for research purposes.

Note: This PR does not include the tester file test_init_weights_safe.py; that file exists only in the fork for private testing.

…Reddy/hf-transformers-contributions into fix-llmcompressor-error the commit.

…logits_to_keep

github-actions · 2025-10-11T13:29:22Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: qwen2_5_vl

Rocketknight1 · 2025-10-13T13:57:22Z

Can you link the issue this is fixing..?

Applied from PR huggingface#41522 because direct merge conflicted with current model initialization.

Kaparthy Reddy and others added 8 commits October 9, 2025 22:41

Fix _init_weights to safely skip int8 tensors in Qwen2_5_VL model

0faf375

Fix _init_weights to safely skip int8 tensors

7bd3914

Delete test-fix.py

49ef9e2

Add tester file for _init_weights and logits_to_keep

7087663

Merge branch 'fix-llmcompressor-error' of https://github.com/Kaparthy…

41d3010

…Reddy/hf-transformers-contributions into fix-llmcompressor-error the commit.

Fix _init_weights to safely skip int8 tensors and update forward for …

1d3afa9

…logits_to_keep

Add tester for _init_weights safety with int8 tensors

de16a62

Fix _init_weights to safely skip int8 quantized weights

2358b8d

evalstate added a commit to evalstate/transformers that referenced this pull request Apr 29, 2026

Skip Qwen2.5-VL init for non-floating weights

9a81445

Applied from PR huggingface#41522 because direct merge conflicted with current model initialization.

evalstate mentioned this pull request Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix _init_weights to safely skip int8 quantized weights#41522

Fix _init_weights to safely skip int8 quantized weights#41522
KaparthyReddy wants to merge 8 commits into
huggingface:mainfrom
KaparthyReddy:add-init-weights-tester

KaparthyReddy commented Oct 11, 2025

Uh oh!

github-actions Bot commented Oct 11, 2025

Uh oh!

Rocketknight1 commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

KaparthyReddy commented Oct 11, 2025

Uh oh!

github-actions Bot commented Oct 11, 2025

Uh oh!

Rocketknight1 commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants