Document tensor parallelism configuration with Trainer by Likhita-17 · Pull Request #42876 · huggingface/transformers

Likhita-17 · 2025-12-15T14:40:55Z

This PR adds a concise documentation section explaining how to configure tensor parallelism (TP) when training with Trainer.

It clarifies that TP must be managed by Accelerate rather than during model loading, and explains why device_map="auto" should not be used with distributed training. A minimal, runnable TP-only Trainer example is included.

Fixes #41141

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).

Who can review?

@stevhliu

Added a section on configuring tensor parallelism with Trainer, including an example script for TP-only training.

stevhliu

thanks!

i would put this under the Trainer optimization section here
add a link to tensor parallelism docs
i think it's possible to just set tp_plan="auto" in from_pretrained without explicitly using ParallelismConfig. i believe Trainer creates this for you here right @SunMarc? 🙏

SunMarc · 2026-01-08T13:50:17Z

i think it's possible to just set tp_plan="auto" in from_pretrained without explicitly using ParallelismConfig. i believe Trainer creates this for you here right @SunMarc? 🙏

Yeah you can pass tp_plan="auto" which will shard the model on the gpus that you have and Trainer will create the correct ParallelismConfig. You can also pass a device_mesh created from ParallelismConfig when initializing the model. As for device_map="auto", if you run it with torchrun it will also enable tp when loading the model.

KrishnaKarunya-2K6 · 2026-01-08T16:28:33Z

Thanks very much for the review and constructive feedback!

I’m happy to make the suggested adjustments:

Move the TP configuration section under the existing Trainer optimization area as recommended.
Add a link to the tensor parallelism docs for additional context.
Clarify that in many cases, users can simply pass tp_plan="auto" in from_pretrained() and Trainer will create the correct parallelism configuration for TP-only training.

If it’s helpful, I can update the doc text to reflect these points — just let me know if there’s a preferred location/heading for the Trainer optimization section.

Thanks again for guiding this consolidated effort — I’m glad the content has come together nicely and addresses the original issue (#41141)!

stevhliu · 2026-01-09T17:46:21Z

sounds good, you can add a "tensor parallelism" section after this

KrishnaKarunya-2K6 · 2026-01-10T18:21:10Z

Thanks for the guidance! I’ve added the Tensor Parallelism section immediately after the torchcompile heading. Please let me know if you'd like any refinements or additional detail.

docs: document tensor parallelism configuration with Trainer

71c8a34

Added a section on configuring tensor parallelism with Trainer, including an example script for TP-only training.

stevhliu mentioned this pull request Jan 7, 2026

Docs: clarify TP-only training with Trainer KrishnaKarunya-2K6/transformers#1

Closed

5 tasks

stevhliu reviewed Jan 7, 2026

View reviewed changes

evalstate mentioned this pull request Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Document tensor parallelism configuration with Trainer#42876

Document tensor parallelism configuration with Trainer#42876
Likhita-17 wants to merge 1 commit into
huggingface:mainfrom
Likhita-17:patch-1

Likhita-17 commented Dec 15, 2025

Uh oh!

stevhliu left a comment

Uh oh!

SunMarc commented Jan 8, 2026

Uh oh!

KrishnaKarunya-2K6 commented Jan 8, 2026

Uh oh!

stevhliu commented Jan 9, 2026

Uh oh!

KrishnaKarunya-2K6 commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Likhita-17 commented Dec 15, 2025

Before submitting

Who can review?

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

SunMarc commented Jan 8, 2026

Uh oh!

KrishnaKarunya-2K6 commented Jan 8, 2026

Uh oh!

stevhliu commented Jan 9, 2026

Uh oh!

KrishnaKarunya-2K6 commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants