Add chatterbox support by manmay-nakhashi · Pull Request #42413 · huggingface/transformers

manmay-nakhashi · 2025-11-26T04:08:16Z

No description provided.

…ansformers into add-s3gen-hifinet

Rocketknight1 · 2025-11-26T14:02:51Z

This PR is very overwhelming, like a lot of code agent PRs. It's not clear to me how novel some of these architectures are, or whether we really need three whole new architectures in the codebase! 😅

Ideally we could cut down the PR size a lot by using modular files and importing, and maybe treating some of the submodels as components or reusing existing architectures for those?

cc @ebezzam @eustlb since it's an audio model/pipeline

manmay-nakhashi · 2025-11-26T14:27:13Z

@Rocketknight1 t3 model is very similar to llama model but because this is a whole Text-to-Speech pipeline we have tokenizer(s3tokenizer different pr which compresses the audio to 25 tok/ sec we need this for conditioning), encoder(T3- mod-llama model (main changes are speech tokens and conditioning )), decoder(s3gen cfm based model from cozyvoice2) and hifinet(from cozyvoice2) which converts mel to wav.
i checked we don't have this models in the hf currently

manmay-nakhashi · 2025-12-04T12:47:24Z

@eustlb

ebezzam

@manmay-nakhashi I've done an initial review of some things stuck out to me, and to help you familiarize with different Transformers conventions.

Most notably, there are several modules that already exist within the Transformers library from other models, and those should be used via modular to create your modeling files.

Moreover, here are some PRs of other TTS models that may also help you see how to prepare the various files:

CSM: #36719
Dia: #38405
VibeVoice (ongoing but reflects more recent conventions): #40546

ebezzam · 2025-12-08T11:06:23Z

The modeling tests should contain integration tests to compare the outputs of Transformers version with original model. For example:

DAC:

transformers/tests/models/dac/test_modeling_dac.py

Line 305 in 5ee9ffe

Integration tests for DAC.

AudioFlamingo3 (more recent):

transformers/tests/models/audioflamingo3/test_modeling_audioflamingo3.py

Line 252 in 5ee9ffe

def test_fixture_single_matches(self):

github-actions · 2026-01-06T17:29:38Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, chatterbox, s3gen, s3tokenizer

manmay-nakhashi added 25 commits November 20, 2025 12:47

added s3tokenizer model support

da84c1c

add space

5538f6d

remove einops and fix tests

81de3db

ruff formatting

40c587e

added docs and fixed test

b20333e

fix tests

dc8def9

fix styles and unittests

cfc30cc

added model docstring fixed init weights

1ba22c0

fix formatting

140a113

fix tests

8ed8746

added to OBJECTS_TO_IGNORE

469bd25

Merge branch 'main' into manmay-add-s3tokenizer

a56e7d8

Merge branch 'main' into manmay-add-s3tokenizer

bc5307b

update s3gen code

35323b9

Merge branch 'add-s3gen-hifinet' of https://github.com/resemble-ai/tr…

8e53160

…ansformers into add-s3gen-hifinet

fix test imports

b6defe8

Merge branch 'add-s3gen-hifinet' of https://github.com/resemble-ai/tr…

578dd05

…ansformers into add-s3gen-hifinet

Merge branch 'main' into manmay-add-s3tokenizer

e9337b9

added working chatterbox support

194d53c

Merge branch 'main' into add-s3gen-hifinet

9f738c7

fix ruff formatting

2e99ea8

Merge branch 'add-s3gen-hifinet' of https://github.com/resemble-ai/tr…

f26be53

…ansformers into add-s3gen-hifinet

Merge branch 'main' into add-s3gen-hifinet

b2a5b03

fix style

0f367ec

Merge branch 'add-s3gen-hifinet' of https://github.com/resemble-ai/tr…

acc0dba

…ansformers into add-s3gen-hifinet

remove T3huggigfaceBackend

837b09c

manmay-nakhashi added 2 commits November 26, 2025 15:13

fix diffusers import and ruff fix

f9202f0

ruff fix

5ec1a6c

manmay-nakhashi added 2 commits December 1, 2025 21:38

Merge branch 'main' into add-s3gen-hifinet

1c885dc

Merge branch 'main' into add-s3gen-hifinet

b7a0ba0

ebezzam added New model Audio labels Dec 5, 2025

ebezzam reviewed Dec 8, 2025

View reviewed changes

manmay-nakhashi added 21 commits December 23, 2025 13:15

resolve comments

d74e6c0

fix test

ad53461

fix tests

d0868a2

fix ruff added conversion script

347a7ab

fix unit tests

f520f02

fix ruff

0b511b2

Merge branch 'main' into add-s3gen-hifinet

d8ccb98

added kwargs

401105f

add post init

ca9a968

Merge branch 'main' into add-s3gen-hifinet

2dba921

conditioning changes

14cbca7

fix tests

f4ac7c7

Merge branch 'main' into add-s3gen-hifinet

d416b19

fix trch import check

17d5e63

fix torch import

71f7f3c

fixes

8226783

fix

2532b5a

fix import

ef14acd

Merge branch 'main' into add-s3gen-hifinet

0904d33

fix unit test

f75b192

Merge branch 'main' into add-s3gen-hifinet

77cafeb

manmay-nakhashi requested a review from ebezzam January 8, 2026 06:08

evalstate mentioned this pull request Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add chatterbox support#42413

Add chatterbox support#42413
manmay-nakhashi wants to merge 66 commits into
huggingface:mainfrom
resemble-ai:add-s3gen-hifinet

manmay-nakhashi commented Nov 26, 2025

Uh oh!

Rocketknight1 commented Nov 26, 2025

Uh oh!

manmay-nakhashi commented Nov 26, 2025 •

edited

Loading

Uh oh!

manmay-nakhashi commented Dec 4, 2025

Uh oh!

ebezzam left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ebezzam Dec 8, 2025

Uh oh!

Uh oh!

github-actions Bot commented Jan 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

manmay-nakhashi commented Nov 26, 2025

Uh oh!

Rocketknight1 commented Nov 26, 2025

Uh oh!

manmay-nakhashi commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

manmay-nakhashi commented Dec 4, 2025

Uh oh!

ebezzam left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ebezzam Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Jan 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

manmay-nakhashi commented Nov 26, 2025 •

edited

Loading