diff --git a/docs/source/bn/_toctree.yml b/docs/source/bn/_toctree.yml new file mode 100644 index 000000000000..666844afed34 --- /dev/null +++ b/docs/source/bn/_toctree.yml @@ -0,0 +1,3 @@ +- sections: + - local: index + title: 🤗 Transformers \ No newline at end of file diff --git a/docs/source/bn/index.md b/docs/source/bn/index.md new file mode 100644 index 000000000000..90d44d877520 --- /dev/null +++ b/docs/source/bn/index.md @@ -0,0 +1,362 @@ + + +# 🤗 Transformers + +[PyTorch](https://pytorch.org/), [TensorFlow](https://www.tensorflow.org/), [JAX](https://jax.readthedocs.io/en/latest/) āĻāϰ āϜāĻ¨ā§āϝ āφāϧ⧁āύāĻŋāĻ• āĻŽā§‡āĻļāĻŋāύ āϞāĻžāĻ°ā§āύāĻŋāĻ‚ + +🤗 Transformers āφāĻĒāύāĻžāϕ⧇ āϏāĻšāĻœā§‡ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āφāϧ⧁āύāĻŋāĻ• āĻŽāĻĄā§‡āϞāϗ⧁āϞāĻŋ āĻĄāĻžāωāύāϞ⧋āĻĄ āĻ“ āĻĒā§āϰāĻļāĻŋāĻ•ā§āώāĻŖ āĻ•āϰāĻžāϰ API āĻāĻŦāĻ‚ āϟ⧁āϞ āĻĒā§āϰāĻĻāĻžāύ āĻ•āϰ⧇āĨ¤ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇ āĻ•āĻŽ āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻŋāĻ‚ āĻ–āϰāϚ āĻ“ āĻ•āĻŽ āĻ•āĻžāĻ°ā§āĻŦāύ āύāĻŋāσāϏāϰāĻŖ āĻšāϝāĻŧ, āĻāĻŦāĻ‚ āĻļ⧁āϰ⧁ āĻĨ⧇āϕ⧇ āĻŽāĻĄā§‡āϞ āĻŸā§āϰ⧇āύ āĻ•āϰāĻžāϰ āϏāĻŽāϝāĻŧ āĻ“ āϰāĻŋāϏ⧋āĻ°ā§āϏ āĻŦāĻžāρāĻšā§‡āĨ¤ āφāĻŽāĻžāĻĻ⧇āϰ āĻŽāĻĄā§‡āϞāϗ⧁āϞāĻŋ āύāĻžāύāĻž āĻ•ā§āώ⧇āĻ¤ā§āϰ⧇āϰ āĻ•āĻžāϜ āϏāĻŽāĻ°ā§āĻĨāύ āĻ•āϰ⧇āĨ¤ + +📝 **āĻĒā§āϰāĻžāĻ•ā§ƒāϤāĻŋāĻ• āĻ­āĻžāώāĻž āĻĒā§āϰāĻ•ā§āϰāĻŋāϝāĻŧāĻžāĻ•āϰāĻŖ**: āĻŸā§‡āĻ•ā§āϏāϟ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ, āύāĻžāĻŽā§āĻŦāĻžāϰ āϏāύāĻžāĻ•ā§āϤāĻ•āϰāĻŖ, āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ, āĻ­āĻžāώāĻž āĻŽāĻĄā§‡āϞāĻŋāĻ‚, āϏāĻžāϰāĻžāĻ‚āĻļ, āĻ…āύ⧁āĻŦāĻžāĻĻ, āĻŽāĻžāĻ˛ā§āϟāĻŋāĻĒāϞ āϚāϝāĻŧ⧇āϏ QA, āĻŸā§‡āĻ•ā§āϏāϟ āĻœā§‡āύāĻžāϰ⧇āĻļāύ +đŸ–ŧī¸ **āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āĻ­āĻŋāĻļāύ**: āĻ›āĻŦāĻŋ āĻļā§āϰ⧇āĻŖāĻŋāĻŦāĻŋāĻ¨ā§āϝāĻžāϏ, āĻ…āĻŦāĻœā§‡āĻ•ā§āϟ āĻĄāĻŋāĻŸā§‡āĻ•āĻļāύ, āϏ⧇āĻ—āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ +đŸ—Ŗī¸ **āĻ…āĻĄāĻŋāĻ“**: āĻ¸ā§āĻŦāϝāĻŧāĻ‚āĻ•ā§āϰāĻŋāϝāĻŧ āĻ¸ā§āĻĒāĻŋāϚ āϰāĻŋāĻ•āĻ—āύāĻŋāĻļāύ, āĻ…āĻĄāĻŋāĻ“ āĻļā§āϰ⧇āĻŖāĻŋāĻŦāĻŋāĻ¨ā§āϝāĻžāϏ +🐙 **āĻŽāĻžāĻ˛ā§āϟāĻŋāĻŽā§‹āĻĄāĻžāϞ**: āĻŸā§‡āĻŦāĻŋāϞ QA, OCR, āĻ¸ā§āĻ•ā§āϝāĻžāύ āĻ•āϰāĻž āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āϟ āĻĨ⧇āϕ⧇ āϤāĻĨā§āϝ āĻŦ⧇āϰ āĻ•āϰāĻž, āĻ­āĻŋāĻĄāĻŋāĻ“ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ, āĻ­āĻŋāĻœā§āϝ⧁āϝāĻŧāĻžāϞ āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ + +🤗 Transformers PyTorch, TensorFlow āĻāĻŦāĻ‚ JAX-āĻāϰ āĻŽāĻ§ā§āϝ⧇ āχāĻ¨ā§āϟāĻžāϰāĻ…āĻĒāĻžāϰ⧇āĻŦāĻŋāϞāĻŋāϟāĻŋ āϏāĻžāĻĒā§‹āĻ°ā§āϟ āĻ•āϰ⧇āĨ¤ āφāĻĒāύāĻŋ āϏāĻšāĻœā§‡āχ āĻŽāĻĄā§‡āϞ⧇āϰ āĻŦāĻŋāĻ­āĻŋāĻ¨ā§āύ āϧāĻžāĻĒ⧇ āĻ­āĻŋāĻ¨ā§āύ āĻĢā§āϰ⧇āĻŽāĻ“āϝāĻŧāĻžāĻ°ā§āĻ• āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇āύāĨ¤ āϝ⧇āĻŽāύ, āϕ⧋āĻĄā§‡āϰ āĻŽāĻžāĻ¤ā§āϰ ā§ŠāϟāĻŋ āϞāĻžāχāύ āĻĻāĻŋāϝāĻŧ⧇ āĻŽāĻĄā§‡āϞ āĻĒā§āϰāĻļāĻŋāĻ•ā§āώāĻŖ āĻ•āϰ⧇ āϤāĻžāϰāĻĒāϰ āĻ…āĻ¨ā§āϝ āĻĢā§āϰ⧇āĻŽāĻ“āϝāĻŧāĻžāĻ°ā§āϕ⧇ āχāύāĻĢāĻžāϰ⧇āĻ¨ā§āϏ āĻ•āϰāϤ⧇ āĻĒāĻžāϰāĻŦ⧇āύāĨ¤ āĻĒā§āϰ⧋āĻĄāĻžāĻ•āĻļāύ-āĻ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇āϰ āϜāĻ¨ā§āϝ āĻŽāĻĄā§‡āϞ ONNX āĻŦāĻž TorchScript āĻĢāϰāĻŽā§āϝāĻžāĻŸā§‡āĻ“ āϰāĻĒā§āϤāĻžāύāĻŋ āĻ•āϰāĻž āϝāĻžāϝāĻŧāĨ¤ + +āĻ•āĻŽāĻŋāωāύāĻŋāϟāĻŋāϤ⧇ āϝ⧋āĻ— āĻĻāĻŋāϤ⧇ āϚāĻžāχāϞ⧇ [Hub](https://huggingface.co/models), [Forum](https://discuss.huggingface.co/), [Discord](https://discord.com/invite/JfAtkvEtRb) āĻĒāϰāĻŋāĻĻāĻ°ā§āĻļāύ āĻ•āϰ⧁āύ! + +## Hugging Face 팀ęŗŧ 링렑 ëŒ€í™”í•˜ęŗ  ė‹ļėœŧė‹ ę°€ėš”?[[hugging-face-team]] + + + HuggingFace Expert Acceleration Program + + +## āĻŦāĻŋāώāϝāĻŧāĻŦāĻ¸ā§āϤ⧁[[contents]] + +āφāĻŽāĻžāĻĻ⧇āϰ āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ āĻĒā§āϰāϧāĻžāύāϤ ā§ĢāϟāĻŋ āĻ­āĻžāϗ⧇ āĻŦāĻŋāĻ­āĻ•ā§āϤ: + +- **āĻļ⧁āϰ⧁ āĻ•āϰ⧁āύ**-āĻ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āϏāĻŽā§āĻĒāĻ°ā§āϕ⧇ āϏāĻ‚āĻ•ā§āώ⧇āĻĒ⧇ āϜāĻžāύāϤ⧇ āĻāĻŦāĻ‚ āχāύāĻ¸ā§āϟāϞ⧇āĻļāύ āĻĒāĻĻā§āϧāϤāĻŋ āĻĒāĻžāĻŦ⧇āύāĨ¤ +- **āϟāĻŋāωāĻŸā§‹āϰāĻŋāϝāĻŧāĻžāϞ**-āĻ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻļ⧇āĻ–āĻžāϰ āϜāĻ¨ā§āϝ āĻŦāĻŋāĻ¸ā§āϤāĻžāϰāĻŋāϤ āĻ“ āϏāĻšāϜ āĻ—āĻžāχāĻĄ āĻĒāĻžāĻŦ⧇āύāĨ¤ +- **āĻšāĻžāω-āϟ⧁ āĻ—āĻžāχāĻĄ**-āĻ āĻ­āĻžāώāĻž āĻŽāĻĄā§‡āϞāĻŋāĻ‚āϝāĻŧ⧇āϰ āϜāĻ¨ā§āϝ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻĢāĻžāχāύ-āϟāĻŋāωāύāĻŋāĻ‚, āύāĻŋāϜāĻ¸ā§āĻŦ āĻŽāĻĄā§‡āϞ āϤ⧈āϰāĻŋ āĻ“ āĻļ⧇āϝāĻŧāĻžāϰ āχāĻ¤ā§āϝāĻžāĻĻāĻŋāϰ āύāĻŋāĻ°ā§āĻĻāĻŋāĻˇā§āϟ āĻ—āĻžāχāĻĄāϞāĻžāχāύ āĻĒāĻžāĻŦ⧇āύāĨ¤ +- **āĻ•āύāϏ⧇āĻĒā§āϟ āĻ—āĻžāχāĻĄ**-āĻ 🤗 Transformers āĻĄāĻŋāϜāĻžāχāύ āĻ…āϤ⧀āϤ⧇āϰ āĻĻāĻ°ā§āĻļāύ, āĻŦāĻŋāĻ­āĻŋāĻ¨ā§āύ āĻŽāĻĄā§‡āϞ āĻŦāĻž āϟāĻžāĻ¸ā§āĻ• āĻĒ⧇āĻ›āύ⧇āϰ āϧāĻžāϰāĻŖāĻž āĻ“ āĻŦā§āϝāĻžāĻ–ā§āϝāĻž āĻĒāĻžāĻŦ⧇āύāĨ¤ +- **API**-āϤ⧇ āϏāĻ•āϞ āĻ•ā§āϞāĻžāϏ āĻ“ āĻĢāĻžāĻ‚āĻļāύ āϏāĻŽā§āĻĒāĻ°ā§āϕ⧇ āĻŦā§āϝāĻžāĻ–ā§āϝāĻž āφāϛ⧇āĨ¤ + + - **āĻŽā§‚āϞ āĻ•ā§āϞāĻžāϏ**-āĻ configuration, model, tokenizer, pipeline āĻĒā§āϰāϭ⧃āϤāĻŋ āϗ⧁āϰ⧁āĻ¤ā§āĻŦāĻĒā§‚āĻ°ā§āĻŖ āĻ•ā§āϞāĻžāϏ⧇āϰ āĻŦāĻŋāĻ¸ā§āϤāĻžāϰāĻŋāϤ āĻŦāĻ°ā§āĻŖāύāĻžāĨ¤ + - **āĻŽāĻĄā§‡āϞ**-āĻ āĻĒā§āϰāϤāĻŋāϟāĻŋ āĻ…āĻ¨ā§āϤāĻ°ā§āϭ⧁āĻ•ā§āϤ āĻŽāĻĄā§‡āϞ⧇āϰ āϏāĻ™ā§āϗ⧇ āϏāĻ‚āĻļā§āϞāĻŋāĻˇā§āϟ āĻ•ā§āϞāĻžāϏ/āĻĢāĻžāĻ‚āĻļāύ āĻŦāĻŋāĻ¸ā§āϤāĻžāϰāĻŋāϤāĨ¤ + - **āχāĻ¨ā§āϟāĻžāĻ°ā§āύāĻžāϞ āχāωāϟāĻŋāϞāĻŋāϟāĻŋ**-āϤ⧇ āĻ—āĻ­ā§€āϰāĻ­āĻžāĻŦ⧇ āĻŦā§āϝāĻŦāĻšā§ƒāϤ āχāωāϟāĻŋāϞāĻŋāϟāĻŋ āĻ•ā§āϞāĻžāϏ āĻ“ āĻĢāĻžāĻ‚āĻļāύ⧇āϰ āĻŦā§āϝāĻžāĻ–ā§āϝāĻžāĨ¤ + +### 맀뛐 ëĒ¨ë¸[[supported-models]] + + + +1. **[ALBERT](model_doc/albert)** (from Google Research and the Toyota Technological Institute at Chicago) released with the paper [ALBERT: A Lite BERT for Self-supervised Learning of Language Representations](https://huggingface.co/papers/1909.11942), by Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut. +1. **[BART](model_doc/bart)** (from Facebook) released with the paper [BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension](https://huggingface.co/papers/1910.13461) by Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov and Luke Zettlemoyer. +1. **[BARThez](model_doc/barthez)** (from École polytechnique) released with the paper [BARThez: a Skilled Pretrained French Sequence-to-Sequence Model](https://huggingface.co/papers/2010.12321) by Moussa Kamal Eddine, Antoine J.-P. Tixier, Michalis Vazirgiannis. +1. **[BARTpho](model_doc/bartpho)** (from VinAI Research) released with the paper [BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese](https://huggingface.co/papers/2109.09701) by Nguyen Luong Tran, Duong Minh Le and Dat Quoc Nguyen. +1. **[BEiT](model_doc/beit)** (from Microsoft) released with the paper [BEiT: BERT Pre-Training of Image Transformers](https://huggingface.co/papers/2106.08254) by Hangbo Bao, Li Dong, Furu Wei. +1. **[BERT](model_doc/bert)** (from Google) released with the paper [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](https://huggingface.co/papers/1810.04805) by Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova. +1. **[BERT For Sequence Generation](model_doc/bert-generation)** (from Google) released with the paper [Leveraging Pre-trained Checkpoints for Sequence Generation Tasks](https://huggingface.co/papers/1907.12461) by Sascha Rothe, Shashi Narayan, Aliaksei Severyn. +1. **[BERTweet](model_doc/bertweet)** (from VinAI Research) released with the paper [BERTweet: A pre-trained language model for English Tweets](https://aclanthology.org/2020.emnlp-demos.2/) by Dat Quoc Nguyen, Thanh Vu and Anh Tuan Nguyen. +1. **[BigBird-Pegasus](model_doc/bigbird_pegasus)** (from Google Research) released with the paper [Big Bird: Transformers for Longer Sequences](https://huggingface.co/papers/2007.14062) by Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed. +1. **[BigBird-RoBERTa](model_doc/big_bird)** (from Google Research) released with the paper [Big Bird: Transformers for Longer Sequences](https://huggingface.co/papers/2007.14062) by Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed. +1. **[Blenderbot](model_doc/blenderbot)** (from Facebook) released with the paper [Recipes for building an open-domain chatbot](https://huggingface.co/papers/2004.13637) by Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric M. Smith, Y-Lan Boureau, Jason Weston. +1. **[BlenderbotSmall](model_doc/blenderbot-small)** (from Facebook) released with the paper [Recipes for building an open-domain chatbot](https://huggingface.co/papers/2004.13637) by Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Kurt Shuster, Eric M. Smith, Y-Lan Boureau, Jason Weston. +1. **[BLOOM](model_doc/bloom)** (from BigScience workshop) released by the [BigScience Workshop](https://bigscience.huggingface.co/). +1. **[BORT](model_doc/bort)** (from Alexa) released with the paper [Optimal Subarchitecture Extraction For BERT](https://huggingface.co/papers/2010.10499) by Adrian de Wynter and Daniel J. Perry. +1. **[ByT5](model_doc/byt5)** (from Google Research) released with the paper [ByT5: Towards a token-free future with pre-trained byte-to-byte models](https://huggingface.co/papers/2105.13626) by Linting Xue, Aditya Barua, Noah Constant, Rami Al-Rfou, Sharan Narang, Mihir Kale, Adam Roberts, Colin Raffel. +1. **[CamemBERT](model_doc/camembert)** (from Inria/Facebook/Sorbonne) released with the paper [CamemBERT: a Tasty French Language Model](https://huggingface.co/papers/1911.03894) by Louis Martin*, Benjamin Muller*, Pedro Javier Ortiz SuÃĄrez*, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, DjamÊ Seddah and BenoÃŽt Sagot. +1. **[CANINE](model_doc/canine)** (from Google Research) released with the paper [CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation](https://huggingface.co/papers/2103.06874) by Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting. +1. **[CLIP](model_doc/clip)** (from OpenAI) released with the paper [Learning Transferable Visual Models From Natural Language Supervision](https://huggingface.co/papers/2103.00020) by Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever. +1. **[CLIPSeg](model_doc/clipseg)** (from University of GÃļttingen) released with the paper [Image Segmentation Using Text and Image Prompts](https://huggingface.co/papers/2112.10003) by Timo LÃŧddecke and Alexander Ecker. +1. **[CodeGen](model_doc/codegen)** (from Salesforce) released with the paper [A Conversational Paradigm for Program Synthesis](https://huggingface.co/papers/2203.13474) by Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong. +1. **[Conditional DETR](model_doc/conditional_detr)** (from Microsoft Research Asia) released with the paper [Conditional DETR for Fast Training Convergence](https://huggingface.co/papers/2108.06152) by Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang. +1. **[ConvBERT](model_doc/convbert)** (from YituTech) released with the paper [ConvBERT: Improving BERT with Span-based Dynamic Convolution](https://huggingface.co/papers/2008.02496) by Zihang Jiang, Weihao Yu, Daquan Zhou, Yunpeng Chen, Jiashi Feng, Shuicheng Yan. +1. **[ConvNeXT](model_doc/convnext)** (from Facebook AI) released with the paper [A ConvNet for the 2020s](https://huggingface.co/papers/2201.03545) by Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie. +1. **[ConvNeXTV2](model_doc/convnextv2)** (from Facebook AI) released with the paper [ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders](https://huggingface.co/papers/2301.00808) by Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie. +1. **[CPM](model_doc/cpm)** (from Tsinghua University) released with the paper [CPM: A Large-scale Generative Chinese Pre-trained Language Model](https://huggingface.co/papers/2012.00413) by Zhengyan Zhang, Xu Han, Hao Zhou, Pei Ke, Yuxian Gu, Deming Ye, Yujia Qin, Yusheng Su, Haozhe Ji, Jian Guan, Fanchao Qi, Xiaozhi Wang, Yanan Zheng, Guoyang Zeng, Huanqi Cao, Shengqi Chen, Daixuan Li, Zhenbo Sun, Zhiyuan Liu, Minlie Huang, Wentao Han, Jie Tang, Juanzi Li, Xiaoyan Zhu, Maosong Sun. +1. **[CTRL](model_doc/ctrl)** (from Salesforce) released with the paper [CTRL: A Conditional Transformer Language Model for Controllable Generation](https://huggingface.co/papers/1909.05858) by Nitish Shirish Keskar*, Bryan McCann*, Lav R. Varshney, Caiming Xiong and Richard Socher. +1. **[CvT](model_doc/cvt)** (from Microsoft) released with the paper [CvT: Introducing Convolutions to Vision Transformers](https://huggingface.co/papers/2103.15808) by Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang. +1. **[Data2Vec](model_doc/data2vec)** (from Facebook) released with the paper [Data2Vec: A General Framework for Self-supervised Learning in Speech, Vision and Language](https://huggingface.co/papers/2202.03555) by Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli. +1. **[DeBERTa](model_doc/deberta)** (from Microsoft) released with the paper [DeBERTa: Decoding-enhanced BERT with Disentangled Attention](https://huggingface.co/papers/2006.03654) by Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen. +1. **[DeBERTa-v2](model_doc/deberta-v2)** (from Microsoft) released with the paper [DeBERTa: Decoding-enhanced BERT with Disentangled Attention](https://huggingface.co/papers/2006.03654) by Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen. +1. **[Decision Transformer](model_doc/decision_transformer)** (from Berkeley/Facebook/Google) released with the paper [Decision Transformer: Reinforcement Learning via Sequence Modeling](https://huggingface.co/papers/2106.01345) by Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch. +1. **[Deformable DETR](model_doc/deformable_detr)** (from SenseTime Research) released with the paper [Deformable DETR: Deformable Transformers for End-to-End Object Detection](https://huggingface.co/papers/2010.04159) by Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai. +1. **[DeiT](model_doc/deit)** (from Facebook) released with the paper [Training data-efficient image transformers & distillation through attention](https://huggingface.co/papers/2012.12877) by Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco Massa, Alexandre Sablayrolles, HervÊ JÊgou. +1. **[DETR](model_doc/detr)** (from Facebook) released with the paper [End-to-End Object Detection with Transformers](https://huggingface.co/papers/2005.12872) by Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko. +1. **[DialoGPT](model_doc/dialogpt)** (from Microsoft Research) released with the paper [DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation](https://huggingface.co/papers/1911.00536) by Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu, Bill Dolan. +1. **[DistilBERT](model_doc/distilbert)** (from HuggingFace), released together with the paper [DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter](https://huggingface.co/papers/1910.01108) by Victor Sanh, Lysandre Debut and Thomas Wolf. The same method has been applied to compress GPT2 into [DistilGPT2](https://github.com/huggingface/transformers-research-projects/tree/main/distillation), RoBERTa into [DistilRoBERTa](https://github.com/huggingface/transformers-research-projects/tree/main/distillation), Multilingual BERT into [DistilmBERT](https://github.com/huggingface/transformers-research-projects/tree/main/distillation) and a German version of DistilBERT. +1. **[DiT](model_doc/dit)** (from Microsoft Research) released with the paper [DiT: Self-supervised Pre-training for Document Image Transformer](https://huggingface.co/papers/2203.02378) by Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei. +1. **[Donut](model_doc/donut)** (from NAVER), released together with the paper [OCR-free Document Understanding Transformer](https://huggingface.co/papers/2111.15664) by Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park. +1. **[DPR](model_doc/dpr)** (from Facebook) released with the paper [Dense Passage Retrieval for Open-Domain Question Answering](https://huggingface.co/papers/2004.04906) by Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih. +1. **[DPT](master/model_doc/dpt)** (from Intel Labs) released with the paper [Vision Transformers for Dense Prediction](https://huggingface.co/papers/2103.13413) by RenÊ Ranftl, Alexey Bochkovskiy, Vladlen Koltun. +1. **[EfficientNet](model_doc/efficientnet)** (from Google Research) released with the paper [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks](https://huggingface.co/papers/1905.11946) by Mingxing Tan and Quoc V. Le. +1. **[ELECTRA](model_doc/electra)** (from Google Research/Stanford University) released with the paper [ELECTRA: Pre-training text encoders as discriminators rather than generators](https://huggingface.co/papers/2003.10555) by Kevin Clark, Minh-Thang Luong, Quoc V. Le, Christopher D. Manning. +1. **[EncoderDecoder](model_doc/encoder-decoder)** (from Google Research) released with the paper [Leveraging Pre-trained Checkpoints for Sequence Generation Tasks](https://huggingface.co/papers/1907.12461) by Sascha Rothe, Shashi Narayan, Aliaksei Severyn. +1. **[ERNIE](model_doc/ernie)** (from Baidu) released with the paper [ERNIE: Enhanced Representation through Knowledge Integration](https://huggingface.co/papers/1904.09223) by Yu Sun, Shuohuan Wang, Yukun Li, Shikun Feng, Xuyi Chen, Han Zhang, Xin Tian, Danxiang Zhu, Hao Tian, Hua Wu. +1. **[ESM](model_doc/esm)** (from Meta AI) are transformer protein language models. **ESM-1b** was released with the paper [Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences](https://www.pnas.org/content/118/15/e2016239118) by Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, Jason Liu, Demi Guo, Myle Ott, C. Lawrence Zitnick, Jerry Ma, and Rob Fergus. **ESM-1v** was released with the paper [Language models enable zero-shot prediction of the effects of mutations on protein function](https://doi.org/10.1101/2021.07.09.450648) by Joshua Meier, Roshan Rao, Robert Verkuil, Jason Liu, Tom Sercu and Alexander Rives. **ESM-2 and ESMFold** were released with the paper [Language models of protein sequences at the scale of evolution enable accurate structure prediction](https://doi.org/10.1101/2022.07.20.500902) by Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Allan dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Sal Candido, Alexander Rives. +1. **[FLAN-T5](model_doc/flan-t5)** (from Google AI) released in the repository [google-research/t5x](https://github.com/google-research/t5x/blob/main/docs/models.md#flan-t5-checkpoints) by Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Eric Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Zhao, Yanping Huang, Andrew Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, and Jason Wei +1. **[FlauBERT](model_doc/flaubert)** (from CNRS) released with the paper [FlauBERT: Unsupervised Language Model Pre-training for French](https://huggingface.co/papers/1912.05372) by Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, BenoÃŽt CrabbÊ, Laurent Besacier, Didier Schwab. +1. **[FLAVA](model_doc/flava)** (from Facebook AI) released with the paper [FLAVA: A Foundational Language And Vision Alignment Model](https://huggingface.co/papers/2112.04482) by Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, and Douwe Kiela. +1. **[FNet](model_doc/fnet)** (from Google Research) released with the paper [FNet: Mixing Tokens with Fourier Transforms](https://huggingface.co/papers/2105.03824) by James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon. +1. **[Funnel Transformer](model_doc/funnel)** (from CMU/Google Brain) released with the paper [Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing](https://huggingface.co/papers/2006.03236) by Zihang Dai, Guokun Lai, Yiming Yang, Quoc V. Le. +1. **[GLPN](model_doc/glpn)** (from KAIST) released with the paper [Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth](https://huggingface.co/papers/2201.07436) by Doyeon Kim, Woonghyun Ga, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim. +1. **[GPT](model_doc/openai-gpt)** (from OpenAI) released with the paper [Improving Language Understanding by Generative Pre-Training](https://openai.com/research/language-unsupervised/) by Alec Radford, Karthik Narasimhan, Tim Salimans and Ilya Sutskever. +1. **[GPT Neo](model_doc/gpt_neo)** (from EleutherAI) released in the repository [EleutherAI/gpt-neo](https://github.com/EleutherAI/gpt-neo) by Sid Black, Stella Biderman, Leo Gao, Phil Wang and Connor Leahy. +1. **[GPT NeoX](model_doc/gpt_neox)** (from EleutherAI) released with the paper [GPT-NeoX-20B: An Open-Source Autoregressive Language Model](https://huggingface.co/papers/2204.06745) by Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach +1. **[GPT NeoX Japanese](model_doc/gpt_neox_japanese)** (from ABEJA) released by Shinya Otani, Takayoshi Makabe, Anuj Arora, and Kyo Hattori. +1. **[GPT-2](model_doc/gpt2)** (from OpenAI) released with the paper [Language Models are Unsupervised Multitask Learners](https://openai.com/research/better-language-models/) by Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei and Ilya Sutskever. +1. **[GPT-J](model_doc/gptj)** (from EleutherAI) released in the repository [kingoflolz/mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax/) by Ben Wang and Aran Komatsuzaki. +1. **[GPTSAN-japanese](model_doc/gptsan-japanese)** released in the repository [tanreinama/GPTSAN](https://github.com/tanreinama/GPTSAN/blob/main/report/model.md) by Toshiyuki Sakamoto(tanreinama). +1. **[GroupViT](model_doc/groupvit)** (from UCSD, NVIDIA) released with the paper [GroupViT: Semantic Segmentation Emerges from Text Supervision](https://huggingface.co/papers/2202.11094) by Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang. +1. **[Hubert](model_doc/hubert)** (from Facebook) released with the paper [HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units](https://huggingface.co/papers/2106.07447) by Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed. +1. **[I-BERT](model_doc/ibert)** (from Berkeley) released with the paper [I-BERT: Integer-only BERT Quantization](https://huggingface.co/papers/2101.01321) by Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer. +1. **[ImageGPT](model_doc/imagegpt)** (from OpenAI) released with the paper [Generative Pretraining from Pixels](https://openai.com/blog/image-gpt/) by Mark Chen, Alec Radford, Rewon Child, Jeffrey Wu, Heewoo Jun, David Luan, Ilya Sutskever. +1. **[Jukebox](model_doc/jukebox)** (from OpenAI) released with the paper [Jukebox: A Generative Model for Music](https://huggingface.co/papers/2005.00341) by Prafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford, Ilya Sutskever. +1. **[LayoutLM](model_doc/layoutlm)** (from Microsoft Research Asia) released with the paper [LayoutLM: Pre-training of Text and Layout for Document Image Understanding](https://huggingface.co/papers/1912.13318) by Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou. +1. **[LayoutLMv2](model_doc/layoutlmv2)** (from Microsoft Research Asia) released with the paper [LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding](https://huggingface.co/papers/2012.14740) by Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou. +1. **[LayoutLMv3](model_doc/layoutlmv3)** (from Microsoft Research Asia) released with the paper [LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking](https://huggingface.co/papers/2204.08387) by Yupan Huang, Tengchao Lv, Lei Cui, Yutong Lu, Furu Wei. +1. **[LayoutXLM](model_doc/layoutxlm)** (from Microsoft Research Asia) released with the paper [LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding](https://huggingface.co/papers/2104.08836) by Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Furu Wei. +1. **[LED](model_doc/led)** (from AllenAI) released with the paper [Longformer: The Long-Document Transformer](https://huggingface.co/papers/2004.05150) by Iz Beltagy, Matthew E. Peters, Arman Cohan. +1. **[LeViT](model_doc/levit)** (from Meta AI) released with the paper [LeViT: A Vision Transformer in ConvNet's Clothing for Faster Inference](https://huggingface.co/papers/2104.01136) by Ben Graham, Alaaeldin El-Nouby, Hugo Touvron, Pierre Stock, Armand Joulin, HervÊ JÊgou, Matthijs Douze. +1. **[LiLT](model_doc/lilt)** (from South China University of Technology) released with the paper [LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding](https://huggingface.co/papers/2202.13669) by Jiapeng Wang, Lianwen Jin, Kai Ding. +1. **[Longformer](model_doc/longformer)** (from AllenAI) released with the paper [Longformer: The Long-Document Transformer](https://huggingface.co/papers/2004.05150) by Iz Beltagy, Matthew E. Peters, Arman Cohan. +1. **[LongT5](model_doc/longt5)** (from Google AI) released with the paper [LongT5: Efficient Text-To-Text Transformer for Long Sequences](https://huggingface.co/papers/2112.07916) by Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang. +1. **[LUKE](model_doc/luke)** (from Studio Ousia) released with the paper [LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention](https://huggingface.co/papers/2010.01057) by Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto. +1. **[LXMERT](model_doc/lxmert)** (from UNC Chapel Hill) released with the paper [LXMERT: Learning Cross-Modality Encoder Representations from Transformers for Open-Domain Question Answering](https://huggingface.co/papers/1908.07490) by Hao Tan and Mohit Bansal. +1. **[M-CTC-T](model_doc/mctct)** (from Facebook) released with the paper [Pseudo-Labeling For Massively Multilingual Speech Recognition](https://huggingface.co/papers/2111.00161) by Loren Lugosch, Tatiana Likhomanenko, Gabriel Synnaeve, and Ronan Collobert. +1. **[M2M100](model_doc/m2m_100)** (from Facebook) released with the paper [Beyond English-Centric Multilingual Machine Translation](https://huggingface.co/papers/2010.11125) by Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin. +1. **[MarianMT](model_doc/marian)** Machine translation models trained using [OPUS](http://opus.nlpl.eu/) data by JÃļrg Tiedemann. The [Marian Framework](https://marian-nmt.github.io/) is being developed by the Microsoft Translator Team. +1. **[MarkupLM](model_doc/markuplm)** (from Microsoft Research Asia) released with the paper [MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding](https://huggingface.co/papers/2110.08518) by Junlong Li, Yiheng Xu, Lei Cui, Furu Wei. +1. **[Mask2Former](model_doc/mask2former)** (from FAIR and UIUC) released with the paper [Masked-attention Mask Transformer for Universal Image Segmentation](https://huggingface.co/papers/2112.01527) by Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar. +1. **[MaskFormer](model_doc/maskformer)** (from Meta and UIUC) released with the paper [Per-Pixel Classification is Not All You Need for Semantic Segmentation](https://huggingface.co/papers/2107.06278) by Bowen Cheng, Alexander G. Schwing, Alexander Kirillov. +1. **[mBART](model_doc/mbart)** (from Facebook) released with the paper [Multilingual Denoising Pre-training for Neural Machine Translation](https://huggingface.co/papers/2001.08210) by Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov, Marjan Ghazvininejad, Mike Lewis, Luke Zettlemoyer. +1. **[mBART-50](model_doc/mbart)** (from Facebook) released with the paper [Multilingual Translation with Extensible Multilingual Pretraining and Finetuning](https://huggingface.co/papers/2008.00401) by Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan. +1. **[Megatron-BERT](model_doc/megatron-bert)** (from NVIDIA) released with the paper [Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism](https://huggingface.co/papers/1909.08053) by Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper and Bryan Catanzaro. +1. **[Megatron-GPT2](model_doc/megatron_gpt2)** (from NVIDIA) released with the paper [Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism](https://huggingface.co/papers/1909.08053) by Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper and Bryan Catanzaro. +1. **[mLUKE](model_doc/mluke)** (from Studio Ousia) released with the paper [mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models](https://huggingface.co/papers/2110.08151) by Ryokan Ri, Ikuya Yamada, and Yoshimasa Tsuruoka. +1. **[MobileBERT](model_doc/mobilebert)** (from CMU/Google Brain) released with the paper [MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices](https://huggingface.co/papers/2004.02984) by Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang, and Denny Zhou. +1. **[MobileViT](model_doc/mobilevit)** (from Apple) released with the paper [MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer](https://huggingface.co/papers/2110.02178) by Sachin Mehta and Mohammad Rastegari. +1. **[MPNet](model_doc/mpnet)** (from Microsoft Research) released with the paper [MPNet: Masked and Permuted Pre-training for Language Understanding](https://huggingface.co/papers/2004.09297) by Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu. +1. **[MT5](model_doc/mt5)** (from Google AI) released with the paper [mT5: A massively multilingual pre-trained text-to-text transformer](https://huggingface.co/papers/2010.11934) by Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel. +1. **[MVP](model_doc/mvp)** (from RUC AI Box) released with the paper [MVP: Multi-task Supervised Pre-training for Natural Language Generation](https://huggingface.co/papers/2206.12131) by Tianyi Tang, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen. +1. **[Nezha](model_doc/nezha)** (from Huawei Noah’s Ark Lab) released with the paper [NEZHA: Neural Contextualized Representation for Chinese Language Understanding](https://huggingface.co/papers/1909.00204) by Junqiu Wei, Xiaozhe Ren, Xiaoguang Li, Wenyong Huang, Yi Liao, Yasheng Wang, Jiashu Lin, Xin Jiang, Xiao Chen and Qun Liu. +1. **[NLLB](model_doc/nllb)** (from Meta) released with the paper [No Language Left Behind: Scaling Human-Centered Machine Translation](https://huggingface.co/papers/2207.04672) by the NLLB team. +1. **[NystrÃļmformer](model_doc/nystromformer)** (from the University of Wisconsin - Madison) released with the paper [NystrÃļmformer: A NystrÃļm-Based Algorithm for Approximating Self-Attention](https://huggingface.co/papers/2102.03902) by Yunyang Xiong, Zhanpeng Zeng, Rudrasis Chakraborty, Mingxing Tan, Glenn Fung, Yin Li, Vikas Singh. +1. **[OneFormer](model_doc/oneformer)** (from SHI Labs) released with the paper [OneFormer: One Transformer to Rule Universal Image Segmentation](https://huggingface.co/papers/2211.06220) by Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi. +1. **[OPT](master/model_doc/opt)** (from Meta AI) released with the paper [OPT: Open Pre-trained Transformer Language Models](https://huggingface.co/papers/2205.01068) by Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen et al. +1. **[OWL-ViT](model_doc/owlvit)** (from Google AI) released with the paper [Simple Open-Vocabulary Object Detection with Vision Transformers](https://huggingface.co/papers/2205.06230) by Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, and Neil Houlsby. +1. **[Pegasus](model_doc/pegasus)** (from Google) released with the paper [PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization](https://huggingface.co/papers/1912.08777) by Jingqing Zhang, Yao Zhao, Mohammad Saleh and Peter J. Liu. +1. **[PEGASUS-X](model_doc/pegasus_x)** (from Google) released with the paper [Investigating Efficiently Extending Transformers for Long Input Summarization](https://huggingface.co/papers/2208.04347) by Jason Phang, Yao Zhao, and Peter J. Liu. +1. **[Perceiver IO](model_doc/perceiver)** (from Deepmind) released with the paper [Perceiver IO: A General Architecture for Structured Inputs & Outputs](https://huggingface.co/papers/2107.14795) by Andrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier HÊnaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, JoÃŖo Carreira. +1. **[PhoBERT](model_doc/phobert)** (from VinAI Research) released with the paper [PhoBERT: Pre-trained language models for Vietnamese](https://www.aclweb.org/anthology/2020.findings-emnlp.92/) by Dat Quoc Nguyen and Anh Tuan Nguyen. +1. **[PLBart](model_doc/plbart)** (from UCLA NLP) released with the paper [Unified Pre-training for Program Understanding and Generation](https://huggingface.co/papers/2103.06333) by Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang. +1. **[PoolFormer](model_doc/poolformer)** (from Sea AI Labs) released with the paper [MetaFormer is Actually What You Need for Vision](https://huggingface.co/papers/2111.11418) by Yu, Weihao and Luo, Mi and Zhou, Pan and Si, Chenyang and Zhou, Yichen and Wang, Xinchao and Feng, Jiashi and Yan, Shuicheng. +1. **[ProphetNet](model_doc/prophetnet)** (from Microsoft Research) released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://huggingface.co/papers/2001.04063) by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou. +1. **[QDQBert](model_doc/qdqbert)** (from NVIDIA) released with the paper [Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation](https://huggingface.co/papers/2004.09602) by Hao Wu, Patrick Judd, Xiaojie Zhang, Mikhail Isaev and Paulius Micikevicius. +1. **[RAG](model_doc/rag)** (from Facebook) released with the paper [Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks](https://huggingface.co/papers/2005.11401) by Patrick Lewis, Ethan Perez, Aleksandara Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich KÃŧttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela. +1. **[REALM](model_doc/realm.html)** (from Google Research) released with the paper [REALM: Retrieval-Augmented Language Model Pre-Training](https://huggingface.co/papers/2002.08909) by Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat and Ming-Wei Chang. +1. **[Reformer](model_doc/reformer)** (from Google Research) released with the paper [Reformer: The Efficient Transformer](https://huggingface.co/papers/2001.04451) by Nikita Kitaev, Łukasz Kaiser, Anselm Levskaya. +1. **[RegNet](model_doc/regnet)** (from META Platforms) released with the paper [Designing Network Design Space](https://huggingface.co/papers/2003.13678) by Ilija Radosavovic, Raj Prateek Kosaraju, Ross Girshick, Kaiming He, Piotr DollÃĄr. +1. **[RemBERT](model_doc/rembert)** (from Google Research) released with the paper [Rethinking embedding coupling in pre-trained language models](https://huggingface.co/papers/2010.12821) by Hyung Won Chung, Thibault FÊvry, Henry Tsai, M. Johnson, Sebastian Ruder. +1. **[ResNet](model_doc/resnet)** (from Microsoft Research) released with the paper [Deep Residual Learning for Image Recognition](https://huggingface.co/papers/1512.03385) by Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun. +1. **[RoBERTa](model_doc/roberta)** (from Facebook), released together with the paper [RoBERTa: A Robustly Optimized BERT Pretraining Approach](https://huggingface.co/papers/1907.11692) by Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov. +1. **[RoCBert](model_doc/roc_bert)** (from WeChatAI) released with the paper [RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining](https://aclanthology.org/2022.acl-long.65.pdf) by HuiSu, WeiweiShi, XiaoyuShen, XiaoZhou, TuoJi, JiaruiFang, JieZhou. +1. **[RoFormer](model_doc/roformer)** (from ZhuiyiTechnology), released together with the paper [RoFormer: Enhanced Transformer with Rotary Position Embedding](https://huggingface.co/papers/2104.09864) by Jianlin Su and Yu Lu and Shengfeng Pan and Bo Wen and Yunfeng Liu. +1. **[SegFormer](model_doc/segformer)** (from NVIDIA) released with the paper [SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers](https://huggingface.co/papers/2105.15203) by Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo. +1. **[SEW](model_doc/sew)** (from ASAPP) released with the paper [Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition](https://huggingface.co/papers/2109.06870) by Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi. +1. **[SEW-D](model_doc/sew_d)** (from ASAPP) released with the paper [Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition](https://huggingface.co/papers/2109.06870) by Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi. +1. **[SpeechToTextTransformer](model_doc/speech_to_text)** (from Facebook), released together with the paper [fairseq S2T: Fast Speech-to-Text Modeling with fairseq](https://huggingface.co/papers/2010.05171) by Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino. +1. **[SpeechToTextTransformer2](model_doc/speech_to_text_2)** (from Facebook), released together with the paper [Large-Scale Self- and Semi-Supervised Learning for Speech Translation](https://huggingface.co/papers/2104.06678) by Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau. +1. **[Splinter](model_doc/splinter)** (from Tel Aviv University), released together with the paper [Few-Shot Question Answering by Pretraining Span Selection](https://huggingface.co/papers/2101.00438) by Ori Ram, Yuval Kirstain, Jonathan Berant, Amir Globerson, Omer Levy. +1. **[SqueezeBERT](model_doc/squeezebert)** (from Berkeley) released with the paper [SqueezeBERT: What can computer vision teach NLP about efficient neural networks?](https://huggingface.co/papers/2006.11316) by Forrest N. Iandola, Albert E. Shaw, Ravi Krishna, and Kurt W. Keutzer. +1. **[Swin Transformer](model_doc/swin)** (from Microsoft) released with the paper [Swin Transformer: Hierarchical Vision Transformer using Shifted Windows](https://huggingface.co/papers/2103.14030) by Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo. +1. **[Swin Transformer V2](model_doc/swinv2)** (from Microsoft) released with the paper [Swin Transformer V2: Scaling Up Capacity and Resolution](https://huggingface.co/papers/2111.09883) by Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo. +1. **[T5](model_doc/t5)** (from Google AI) released with the paper [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://huggingface.co/papers/1910.10683) by Colin Raffel and Noam Shazeer and Adam Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu. +1. **[T5v1.1](model_doc/t5v1.1)** (from Google AI) released in the repository [google-research/text-to-text-transfer-transformer](https://github.com/google-research/text-to-text-transfer-transformer/blob/main/released_checkpoints.md#t511) by Colin Raffel and Noam Shazeer and Adam Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu. +1. **[Table Transformer](model_doc/table-transformer)** (from Microsoft Research) released with the paper [PubTables-1M: Towards Comprehensive Table Extraction From Unstructured Documents](https://huggingface.co/papers/2110.00061) by Brandon Smock, Rohith Pesala, Robin Abraham. +1. **[TAPAS](model_doc/tapas)** (from Google AI) released with the paper [TAPAS: Weakly Supervised Table Parsing via Pre-training](https://huggingface.co/papers/2004.02349) by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas MÃŧller, Francesco Piccinno and Julian Martin Eisenschlos. +1. **[TAPEX](model_doc/tapex)** (from Microsoft Research) released with the paper [TAPEX: Table Pre-training via Learning a Neural SQL Executor](https://huggingface.co/papers/2107.07653) by Qian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, Jian-Guang Lou. +1. **[Time Series Transformer](model_doc/time_series_transformer)** (from HuggingFace). +1. **[Trajectory Transformer](model_doc/trajectory_transformers)** (from the University of California at Berkeley) released with the paper [Offline Reinforcement Learning as One Big Sequence Modeling Problem](https://huggingface.co/papers/2106.02039) by Michael Janner, Qiyang Li, Sergey Levine +1. **[Transformer-XL](model_doc/transfo-xl)** (from Google/CMU) released with the paper [Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context](https://huggingface.co/papers/1901.02860) by Zihang Dai*, Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov. +1. **[TrOCR](model_doc/trocr)** (from Microsoft), released together with the paper [TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models](https://huggingface.co/papers/2109.10282) by Minghao Li, Tengchao Lv, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei. +1. **[UL2](model_doc/ul2)** (from Google Research) released with the paper [Unifying Language Learning Paradigms](https://huggingface.co/papers/2205.05131v1) by Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Neil Houlsby, Donald Metzler +1. **[UniSpeech](model_doc/unispeech)** (from Microsoft Research) released with the paper [UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data](https://huggingface.co/papers/2101.07597) by Chengyi Wang, Yu Wu, Yao Qian, Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang. +1. **[UniSpeechSat](model_doc/unispeech-sat)** (from Microsoft Research) released with the paper [UNISPEECH-SAT: UNIVERSAL SPEECH REPRESENTATION LEARNING WITH SPEAKER AWARE PRE-TRAINING](https://huggingface.co/papers/2110.05752) by Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu. +1. **[VAN](model_doc/van)** (from Tsinghua University and Nankai University) released with the paper [Visual Attention Network](https://huggingface.co/papers/2202.09741) by Meng-Hao Guo, Cheng-Ze Lu, Zheng-Ning Liu, Ming-Ming Cheng, Shi-Min Hu. +1. **[VideoMAE](model_doc/videomae)** (from Multimedia Computing Group, Nanjing University) released with the paper [VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training](https://huggingface.co/papers/2203.12602) by Zhan Tong, Yibing Song, Jue Wang, Limin Wang. +1. **[ViLT](model_doc/vilt)** (from NAVER AI Lab/Kakao Enterprise/Kakao Brain) released with the paper [ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision](https://huggingface.co/papers/2102.03334) by Wonjae Kim, Bokyung Son, Ildoo Kim. +1. **[Vision Transformer (ViT)](model_doc/vit)** (from Google AI) released with the paper [An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale](https://huggingface.co/papers/2010.11929) by Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby. +1. **[VisualBERT](model_doc/visual_bert)** (from UCLA NLP) released with the paper [VisualBERT: A Simple and Performant Baseline for Vision and Language](https://huggingface.co/papers/1908.03557) by Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang. +1. **[ViTMAE](model_doc/vit_mae)** (from Meta AI) released with the paper [Masked Autoencoders Are Scalable Vision Learners](https://huggingface.co/papers/2111.06377) by Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr DollÃĄr, Ross Girshick. +1. **[ViTMSN](model_doc/vit_msn)** (from Meta AI) released with the paper [Masked Siamese Networks for Label-Efficient Learning](https://huggingface.co/papers/2204.07141) by Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas. +1. **[Wav2Vec2](model_doc/wav2vec2)** (from Facebook AI) released with the paper [wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations](https://huggingface.co/papers/2006.11477) by Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli. +1. **[Wav2Vec2-Conformer](model_doc/wav2vec2-conformer)** (from Facebook AI) released with the paper [FAIRSEQ S2T: Fast Speech-to-Text Modeling with FAIRSEQ](https://huggingface.co/papers/2010.05171) by Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Sravya Popuri, Dmytro Okhonko, Juan Pino. +1. **[Wav2Vec2Phoneme](model_doc/wav2vec2_phoneme)** (from Facebook AI) released with the paper [Simple and Effective Zero-shot Cross-lingual Phoneme Recognition](https://huggingface.co/papers/2109.11680) by Qiantong Xu, Alexei Baevski, Michael Auli. +1. **[WavLM](model_doc/wavlm)** (from Microsoft Research) released with the paper [WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing](https://huggingface.co/papers/2110.13900) by Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei. +1. **[Whisper](model_doc/whisper)** (from OpenAI) released with the paper [Robust Speech Recognition via Large-Scale Weak Supervision](https://cdn.openai.com/papers/whisper.pdf) by Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever. +1. **[X-CLIP](model_doc/xclip)** (from Microsoft Research) released with the paper [Expanding Language-Image Pretrained Models for General Video Recognition](https://huggingface.co/papers/2208.02816) by Bolin Ni, Houwen Peng, Minghao Chen, Songyang Zhang, Gaofeng Meng, Jianlong Fu, Shiming Xiang, Haibin Ling. +1. **[XGLM](model_doc/xglm)** (From Facebook AI) released with the paper [Few-shot Learning with Multilingual Language Models](https://huggingface.co/papers/2112.10668) by Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li. +1. **[XLM](model_doc/xlm)** (from Facebook) released together with the paper [Cross-lingual Language Model Pretraining](https://huggingface.co/papers/1901.07291) by Guillaume Lample and Alexis Conneau. +1. **[XLM-ProphetNet](model_doc/xlm-prophetnet)** (from Microsoft Research) released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://huggingface.co/papers/2001.04063) by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou. +1. **[XLM-RoBERTa](model_doc/xlm-roberta)** (from Facebook AI), released together with the paper [Unsupervised Cross-lingual Representation Learning at Scale](https://huggingface.co/papers/1911.02116) by Alexis Conneau*, Kartikay Khandelwal*, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco GuzmÃĄn, Edouard Grave, Myle Ott, Luke Zettlemoyer and Veselin Stoyanov. +1. **[XLM-RoBERTa-XL](model_doc/xlm-roberta-xl)** (from Facebook AI), released together with the paper [Larger-Scale Transformers for Multilingual Masked Language Modeling](https://huggingface.co/papers/2105.00572) by Naman Goyal, Jingfei Du, Myle Ott, Giri Anantharaman, Alexis Conneau. +1. **[XLNet](model_doc/xlnet)** (from Google/CMU) released with the paper [​XLNet: Generalized Autoregressive Pretraining for Language Understanding](https://huggingface.co/papers/1906.08237) by Zhilin Yang*, Zihang Dai*, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le. +1. **[XLS-R](model_doc/xls_r)** (from Facebook AI) released with the paper [XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale](https://huggingface.co/papers/2111.09296) by Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli. +1. **[XLSR-Wav2Vec2](model_doc/xlsr_wav2vec2)** (from Facebook AI) released with the paper [Unsupervised Cross-Lingual Representation Learning For Speech Recognition](https://huggingface.co/papers/2006.13979) by Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli. +1. **[YOLOS](model_doc/yolos)** (from Huazhong University of Science & Technology) released with the paper [You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection](https://huggingface.co/papers/2106.00666) by Yuxin Fang, Bencheng Liao, Xinggang Wang, Jiemin Fang, Jiyang Qi, Rui Wu, Jianwei Niu, Wenyu Liu. +1. **[YOSO](model_doc/yoso)** (from the University of Wisconsin - Madison) released with the paper [You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling](https://huggingface.co/papers/2111.09714) by Zhanpeng Zeng, Yunyang Xiong, Sathya N. Ravi, Shailesh Acharya, Glenn Fung, Vikas Singh. + + +### 맀뛐 í”„ë ˆėž„ė›ŒíŦ[[supported-framework]] + +āύ⧀āĻšā§‡āϰ āĻŸā§‡āĻŦāĻŋāϞāϟāĻŋ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋāϤ⧇ āĻ…āĻ¨ā§āϤāĻ°ā§āϭ⧁āĻ•ā§āϤ āĻĒā§āϰāϤāĻŋāϟāĻŋ āĻŽāĻĄā§‡āϞ⧇āϰ āϏāĻŽāĻ°ā§āĻĨāύ-āϏāĻ‚āĻ•ā§āϰāĻžāĻ¨ā§āϤ āĻ…āĻŦāĻ¸ā§āĻĨāĻž āĻĻ⧇āĻ–āĻžāϝāĻŧāĨ¤ āĻŸā§‹āϕ⧇āύāĻžāχāĻœā§‡āĻļāύ Python (āϝāĻžāϰ āφāϰ⧇āĻ• āύāĻžāĻŽ "slow") āĻ…āĻĨāĻŦāĻž 🤗 Tokenizers (āϝāĻžāϰ āφāϰ⧇āĻ• āύāĻžāĻŽ "fast") āĻĻāĻŋāϝāĻŧ⧇ āĻ•āϰāĻž āĻšāĻšā§āϛ⧇ āĻ•āĻŋ āύāĻž; āĻāĻŦāĻ‚ (Flax-āĻāϰ āĻŽāĻžāĻ§ā§āϝāĻŽā§‡) Jax, PyTorch, TensorFlow āĻĢā§āϰ⧇āĻŽāĻ“āϝāĻŧā§āϝāĻžāϰāϗ⧁āϞ⧋āϰ āĻŽāĻ§ā§āϝ⧇ āϕ⧋āύāϗ⧁āϞ⧋ āϏāĻŽāĻ°ā§āĻĨāύ āĻ•āϰ⧇, āϤāĻž āĻāĻ–āĻžāύ⧇ āĻĻ⧇āĻ–āĻžāύ⧋ āĻšāϝāĻŧ⧇āϛ⧇āĨ¤ + + + +| Model | Tokenizer slow | Tokenizer fast | PyTorch support | TensorFlow support | Flax Support | +|:---------------------------:|:--------------:|:--------------:|:---------------:|:------------------:|:------------:| +| ALBERT | ✅ | ✅ | ✅ | ✅ | ✅ | +| BART | ✅ | ✅ | ✅ | ✅ | ✅ | +| BEiT | ❌ | ❌ | ✅ | ❌ | ✅ | +| BERT | ✅ | ✅ | ✅ | ✅ | ✅ | +| Bert Generation | ✅ | ❌ | ✅ | ❌ | ❌ | +| BigBird | ✅ | ✅ | ✅ | ❌ | ✅ | +| BigBird-Pegasus | ❌ | ❌ | ✅ | ❌ | ❌ | +| Blenderbot | ✅ | ✅ | ✅ | ✅ | ✅ | +| BlenderbotSmall | ✅ | ✅ | ✅ | ✅ | ✅ | +| BLOOM | ❌ | ✅ | ✅ | ❌ | ❌ | +| CamemBERT | ✅ | ✅ | ✅ | ✅ | ❌ | +| CANINE | ✅ | ❌ | ✅ | ❌ | ❌ | +| CLIP | ✅ | ✅ | ✅ | ✅ | ✅ | +| CLIPSeg | ❌ | ❌ | ✅ | ❌ | ❌ | +| CodeGen | ✅ | ✅ | ✅ | ❌ | ❌ | +| Conditional DETR | ❌ | ❌ | ✅ | ❌ | ❌ | +| ConvBERT | ✅ | ✅ | ✅ | ✅ | ❌ | +| ConvNeXT | ❌ | ❌ | ✅ | ✅ | ❌ | +| CTRL | ✅ | ❌ | ✅ | ✅ | ❌ | +| CvT | ❌ | ❌ | ✅ | ✅ | ❌ | +| Data2VecAudio | ❌ | ❌ | ✅ | ❌ | ❌ | +| Data2VecText | ❌ | ❌ | ✅ | ❌ | ❌ | +| Data2VecVision | ❌ | ❌ | ✅ | ✅ | ❌ | +| DeBERTa | ✅ | ✅ | ✅ | ✅ | ❌ | +| DeBERTa-v2 | ✅ | ✅ | ✅ | ✅ | ❌ | +| Decision Transformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| Deformable DETR | ❌ | ❌ | ✅ | ❌ | ❌ | +| DeiT | ❌ | ❌ | ✅ | ✅ | ❌ | +| DETR | ❌ | ❌ | ✅ | ❌ | ❌ | +| DistilBERT | ✅ | ✅ | ✅ | ✅ | ✅ | +| DonutSwin | ❌ | ❌ | ✅ | ❌ | ❌ | +| DPR | ✅ | ✅ | ✅ | ✅ | ❌ | +| DPT | ❌ | ❌ | ✅ | ❌ | ❌ | +| ELECTRA | ✅ | ✅ | ✅ | ✅ | ✅ | +| Encoder decoder | ❌ | ❌ | ✅ | ✅ | ✅ | +| ERNIE | ❌ | ❌ | ✅ | ❌ | ❌ | +| ESM | ✅ | ❌ | ✅ | ✅ | ❌ | +| FairSeq Machine-Translation | ✅ | ❌ | ✅ | ❌ | ❌ | +| FlauBERT | ✅ | ❌ | ✅ | ✅ | ❌ | +| FLAVA | ❌ | ❌ | ✅ | ❌ | ❌ | +| FNet | ✅ | ✅ | ✅ | ❌ | ❌ | +| Funnel Transformer | ✅ | ✅ | ✅ | ✅ | ❌ | +| GLPN | ❌ | ❌ | ✅ | ❌ | ❌ | +| GPT Neo | ❌ | ❌ | ✅ | ❌ | ✅ | +| GPT NeoX | ❌ | ✅ | ✅ | ❌ | ❌ | +| GPT NeoX Japanese | ✅ | ❌ | ✅ | ❌ | ❌ | +| GPT-J | ❌ | ❌ | ✅ | ✅ | ✅ | +| GroupViT | ❌ | ❌ | ✅ | ✅ | ❌ | +| Hubert | ❌ | ❌ | ✅ | ✅ | ❌ | +| I-BERT | ❌ | ❌ | ✅ | ❌ | ❌ | +| ImageGPT | ❌ | ❌ | ✅ | ❌ | ❌ | +| Jukebox | ✅ | ❌ | ✅ | ❌ | ❌ | +| LayoutLM | ✅ | ✅ | ✅ | ✅ | ❌ | +| LayoutLMv2 | ✅ | ✅ | ✅ | ❌ | ❌ | +| LayoutLMv3 | ✅ | ✅ | ✅ | ✅ | ❌ | +| LED | ✅ | ✅ | ✅ | ✅ | ❌ | +| LeViT | ❌ | ❌ | ✅ | ❌ | ❌ | +| LiLT | ❌ | ❌ | ✅ | ❌ | ❌ | +| Longformer | ✅ | ✅ | ✅ | ✅ | ❌ | +| LongT5 | ❌ | ❌ | ✅ | ❌ | ✅ | +| LUKE | ✅ | ❌ | ✅ | ❌ | ❌ | +| LXMERT | ✅ | ✅ | ✅ | ✅ | ❌ | +| M-CTC-T | ❌ | ❌ | ✅ | ❌ | ❌ | +| M2M100 | ✅ | ❌ | ✅ | ❌ | ❌ | +| Marian | ✅ | ❌ | ✅ | ✅ | ✅ | +| MarkupLM | ✅ | ✅ | ✅ | ❌ | ❌ | +| MaskFormer | ❌ | ❌ | ✅ | ❌ | ❌ | +| mBART | ✅ | ✅ | ✅ | ✅ | ✅ | +| Megatron-BERT | ❌ | ❌ | ✅ | ❌ | ❌ | +| MobileBERT | ✅ | ✅ | ✅ | ✅ | ❌ | +| MobileViT | ❌ | ❌ | ✅ | ✅ | ❌ | +| MPNet | ✅ | ✅ | ✅ | ✅ | ❌ | +| MT5 | ✅ | ✅ | ✅ | ✅ | ✅ | +| MVP | ✅ | ✅ | ✅ | ❌ | ❌ | +| Nezha | ❌ | ❌ | ✅ | ❌ | ❌ | +| NystrÃļmformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| OpenAI GPT | ✅ | ✅ | ✅ | ✅ | ❌ | +| OpenAI GPT-2 | ✅ | ✅ | ✅ | ✅ | ✅ | +| OPT | ❌ | ❌ | ✅ | ✅ | ✅ | +| OWL-ViT | ❌ | ❌ | ✅ | ❌ | ❌ | +| Pegasus | ✅ | ✅ | ✅ | ✅ | ✅ | +| PEGASUS-X | ❌ | ❌ | ✅ | ❌ | ❌ | +| Perceiver | ✅ | ❌ | ✅ | ❌ | ❌ | +| PLBart | ✅ | ❌ | ✅ | ❌ | ❌ | +| PoolFormer | ❌ | ❌ | ✅ | ❌ | ❌ | +| ProphetNet | ✅ | ❌ | ✅ | ❌ | ❌ | +| QDQBert | ❌ | ❌ | ✅ | ❌ | ❌ | +| RAG | ✅ | ❌ | ✅ | ✅ | ❌ | +| REALM | ✅ | ✅ | ✅ | ❌ | ❌ | +| Reformer | ✅ | ✅ | ✅ | ❌ | ❌ | +| RegNet | ❌ | ❌ | ✅ | ✅ | ✅ | +| RemBERT | ✅ | ✅ | ✅ | ✅ | ❌ | +| ResNet | ❌ | ❌ | ✅ | ✅ | ✅ | +| RetriBERT | ✅ | ✅ | ✅ | ❌ | ❌ | +| RoBERTa | ✅ | ✅ | ✅ | ✅ | ✅ | +| RoCBert | ✅ | ❌ | ✅ | ❌ | ❌ | +| RoFormer | ✅ | ✅ | ✅ | ✅ | ✅ | +| SegFormer | ❌ | ❌ | ✅ | ✅ | ❌ | +| SEW | ❌ | ❌ | ✅ | ❌ | ❌ | +| SEW-D | ❌ | ❌ | ✅ | ❌ | ❌ | +| Speech Encoder decoder | ❌ | ❌ | ✅ | ❌ | ✅ | +| Speech2Text | ✅ | ❌ | ✅ | ✅ | ❌ | +| Speech2Text2 | ✅ | ❌ | ❌ | ❌ | ❌ | +| Splinter | ✅ | ✅ | ✅ | ❌ | ❌ | +| SqueezeBERT | ✅ | ✅ | ✅ | ❌ | ❌ | +| Swin Transformer | ❌ | ❌ | ✅ | ✅ | ❌ | +| Swin Transformer V2 | ❌ | ❌ | ✅ | ❌ | ❌ | +| T5 | ✅ | ✅ | ✅ | ✅ | ✅ | +| Table Transformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| TAPAS | ✅ | ❌ | ✅ | ✅ | ❌ | +| Time Series Transformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| Trajectory Transformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| Transformer-XL | ✅ | ❌ | ✅ | ✅ | ❌ | +| TrOCR | ❌ | ❌ | ✅ | ❌ | ❌ | +| UniSpeech | ❌ | ❌ | ✅ | ❌ | ❌ | +| UniSpeechSat | ❌ | ❌ | ✅ | ❌ | ❌ | +| VAN | ❌ | ❌ | ✅ | ❌ | ❌ | +| VideoMAE | ❌ | ❌ | ✅ | ❌ | ❌ | +| ViLT | ❌ | ❌ | ✅ | ❌ | ❌ | +| Vision Encoder decoder | ❌ | ❌ | ✅ | ✅ | ✅ | +| VisionTextDualEncoder | ❌ | ❌ | ✅ | ❌ | ✅ | +| VisualBERT | ❌ | ❌ | ✅ | ❌ | ❌ | +| ViT | ❌ | ❌ | ✅ | ✅ | ✅ | +| ViTMAE | ❌ | ❌ | ✅ | ✅ | ❌ | +| ViTMSN | ❌ | ❌ | ✅ | ❌ | ❌ | +| Wav2Vec2 | ✅ | ❌ | ✅ | ✅ | ✅ | +| Wav2Vec2-Conformer | ❌ | ❌ | ✅ | ❌ | ❌ | +| WavLM | ❌ | ❌ | ✅ | ❌ | ❌ | +| Whisper | ✅ | ❌ | ✅ | ✅ | ❌ | +| X-CLIP | ❌ | ❌ | ✅ | ❌ | ❌ | +| XGLM | ✅ | ✅ | ✅ | ✅ | ✅ | +| XLM | ✅ | ❌ | ✅ | ✅ | ❌ | +| XLM-ProphetNet | ✅ | ❌ | ✅ | ❌ | ❌ | +| XLM-RoBERTa | ✅ | ✅ | ✅ | ✅ | ✅ | +| XLM-RoBERTa-XL | ❌ | ❌ | ✅ | ❌ | ❌ | +| XLNet | ✅ | ✅ | ✅ | ✅ | ❌ | +| YOLOS | ❌ | ❌ | ✅ | ❌ | ❌ | +| YOSO | ❌ | ❌ | ✅ | ❌ | ❌ | + + diff --git a/i18n/README_ar.md b/i18n/README_ar.md index cdf813445d6f..c4cde926ed44 100644 --- a/i18n/README_ar.md +++ b/i18n/README_ar.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_bn.md b/i18n/README_bn.md new file mode 100644 index 000000000000..2212872b510e --- /dev/null +++ b/i18n/README_bn.md @@ -0,0 +1,336 @@ + + +

+ + + + Hugging Face Transformers Library + +
+
+

+ +

+ Build + GitHub + Documentation + GitHub release + Contributor Covenant + DOI +

+ +

+

+ English | + įŽ€äŊ“中文 | + įšéĢ”ä¸­æ–‡ | + 한ęĩ­ė–´ | + EspaÃąol | + æ—ĨæœŦčĒž | + ā¤šā¤ŋ⤍āĨā¤ĻāĨ€ | + Đ ŅƒŅŅĐēиК | + Đ ortuguÃĒs | + ā°¤āą†ā°˛āąā°—āą | + Français | + Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | + Tiáēŋng Viáģ‡t | + Ø§Ų„ØšØąØ¨ŲŠØŠ | + Ø§ØąØ¯Ųˆ | +

+

+ +

+   

JAX, PyTorch āĻāĻŦāĻ‚ TensorFlow-āĻāϰ āϜāĻ¨ā§āϝ āĻ…āĻ¤ā§āϝāĻžāϧ⧁āύāĻŋāĻ• āĻŽā§‡āĻļāĻŋāύ āϞāĻžāĻ°ā§āύāĻŋāĻ‚

+

+ +

+    +

+ +🤗 Transformers āĻšāĻžāϜāĻžāϰ⧋ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻĒā§āϰāĻĻāĻžāύ āĻ•āϰ⧇, āϝāĻž āĻŸā§‡āĻ•ā§āϏāϟ, āχāĻŽā§‡āϜ āĻāĻŦāĻ‚ āĻ…āĻĄāĻŋāĻ“āϰ āĻŽāϤ⧋ āĻŦāĻŋāĻ­āĻŋāĻ¨ā§āύ modality-āϤ⧇ āϟāĻžāĻ¸ā§āĻ• āϏāĻŽā§āĻĒāĻžāĻĻāύ⧇āϰ āϜāĻ¨ā§āϝ āĻŦā§āϝāĻŦāĻšā§ƒāϤ āĻšāϝāĻŧāĨ¤ + +āĻāχ āĻŽāĻĄā§‡āϞāϗ⧁āϞ⧋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻž āϝāĻžāϝāĻŧ: + +* 📝 āĻŸā§‡āĻ•ā§āϏāϟ — āĻŸā§‡āĻ•ā§āϏāϟ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ, āϤāĻĨā§āϝ āφāĻšāϰāĻŖ, āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ, āĻ¸ā§āĻŦāϝāĻŧāĻ‚āĻ•ā§āϰāĻŋāϝāĻŧ āϏāĻžāϰāĻžāĻ‚āĻļ, āĻŽā§‡āĻļāĻŋāύ āĻŸā§āϰāĻžāĻ¨ā§āϏāϞ⧇āĻļāύ āĻāĻŦāĻ‚ ā§§ā§Ļā§ĻāϟāĻŋāϰāĻ“ āĻŦ⧇āĻļāĻŋ āĻ­āĻžāώāĻžāϝāĻŧ āĻŸā§‡āĻ•ā§āϏāϟ āĻœā§‡āύāĻžāϰ⧇āĻļāύ⧇āϰ āĻŽāϤ⧋ āϟāĻžāĻ¸ā§āϕ⧇āĨ¤ +* đŸ–ŧī¸ āĻ›āĻŦāĻŋ — āĻ›āĻŦāĻŋ āĻļā§āϰ⧇āĻŖāĻŋāĻŦāĻŋāĻ¨ā§āϝāĻžāϏ, āĻ…āĻŦāĻœā§‡āĻ•ā§āϟ āĻļāύāĻžāĻ•ā§āϤāĻ•āϰāĻŖ āĻāĻŦāĻ‚ āϏ⧇āĻ—āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ⧇āϰ āĻŽāϤ⧋ āϟāĻžāĻ¸ā§āϕ⧇āĨ¤ +* đŸ—Ŗī¸ āĻ…āĻĄāĻŋāĻ“ — āĻ¸ā§āĻĒāĻŋāϚ āϰāĻŋāĻ•āĻ—āύāĻŋāĻļāύ āĻ“ āĻ…āĻĄāĻŋāĻ“ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ⧇āϰ āĻŽāϤ⧋ āϟāĻžāĻ¸ā§āϕ⧇āĨ¤ + +Transformer āĻŽāĻĄā§‡āϞāϗ⧁āϞ⧋ **āĻŦāĻšā§ modality-āĻāϰ āϏāĻ‚āĻŽāĻŋāĻļā§āϰāϪ⧇āĻ“** āĻŦāĻŋāĻ­āĻŋāĻ¨ā§āύ āĻ•āĻžāϜ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇, āϝ⧇āĻŽāύ: āĻŸā§‡āĻŦāĻŋāϞāĻ­āĻŋāĻ¤ā§āϤāĻŋāĻ• āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ, āĻ…āĻĒāϟāĻŋāĻ•ā§āϝāĻžāϞ āĻ•ā§āϝāĻžāϰ⧇āĻ•ā§āϟāĻžāϰ āϰāĻŋāĻ•āĻ—āύāĻŋāĻļāύ (OCR), āĻ¸ā§āĻ•ā§āϝāĻžāύāĻ•ā§ƒāϤ āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āϟ āĻĨ⧇āϕ⧇ āϤāĻĨā§āϝ āφāĻšāϰāĻŖ, āĻ­āĻŋāĻĄāĻŋāĻ“ āĻļā§āϰ⧇āĻŖāĻŋāĻŦāĻŋāĻ¨ā§āϝāĻžāϏ āĻ“ āĻ­āĻŋāĻœā§āϝ⧁āϝāĻŧāĻžāϞ āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰāĨ¤ + +🤗 Transformers āϖ⧁āĻŦ āĻĻā§āϰ⧁āϤ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻĄāĻžāωāύāϞ⧋āĻĄ āĻ“ āĻŸā§‡āĻ•ā§āϏāĻŸā§‡āϰ āϜāĻ¨ā§āϝ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇āϰ API āĻĻ⧇āϝāĻŧ, āύāĻŋāĻœā§‡āϰ āĻĄā§‡āϟāĻžāϏ⧇āϟ-āĻ āĻĢāĻžāχāύ-āϟāĻŋāωāύ āĻ•āϰāϤ⧇ āĻāĻŦāĻ‚ āφāĻŽāĻžāĻĻ⧇āϰ [Model Hub](https://huggingface.co/models)-āĻ āĻ•āĻŽāĻŋāωāύāĻŋāϟāĻŋāϰ āϏāĻžāĻĨ⧇ āĻļ⧇āϝāĻŧāĻžāϰ āĻ•āϰāϤ⧇ āϏāĻžāĻšāĻžāĻ¯ā§āϝ āĻ•āϰ⧇āĨ¤ āĻāĻ•āχ āϏāĻŽāϝāĻŧ⧇, āĻĒā§āϰāϤāĻŋāϟāĻŋ Python āĻŽāĻĄāĻŋāωāϞ āϝāĻžāϰ āĻŽāĻžāĻ§ā§āϝāĻŽā§‡ āφāĻ°ā§āĻ•āĻŋāĻŸā§‡āĻ•āϚāĻžāϰ āϏāĻ‚āĻœā§āĻžāĻžāϝāĻŧāĻŋāϤ, āϏāĻŽā§āĻĒā§‚āĻ°ā§āĻŖ āĻ¸ā§āĻŦāϤāĻ¨ā§āĻ¤ā§āϰ āĻ“ āϏāĻŽā§āĻĒāĻžāĻĻāύāϝ⧋āĻ—ā§āϝ, āϝāĻžāϤ⧇ āĻāϟāĻĒāϟ āĻ—āĻŦ⧇āώāĻŖāĻžāĻŽā§‚āϞāĻ• āĻĒāϰ⧀āĻ•ā§āώāĻž-āύāĻŋāϰ⧀āĻ•ā§āώāĻž āĻ•āϰāĻž āϝāĻžāϝāĻŧāĨ¤ + +🤗 Transformers āϤāĻŋāύāϟāĻŋ āϜāύāĻĒā§āϰāĻŋāϝāĻŧ āĻĄāĻŋāĻĒ āϞāĻžāĻ°ā§āύāĻŋāĻ‚ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ—[Jax](https://jax.readthedocs.io/en/latest/), [PyTorch](https://pytorch.org/), āĻāĻŦāĻ‚ [TensorFlow](https://www.tensorflow.org/)-āĻāϰ āϏāĻžāĻĨ⧇ āϏāĻšāϜ āχāĻ¨ā§āϟāĻŋāĻ—ā§āϰ⧇āĻļāύ āϏāĻžāĻĒā§‹āĻ°ā§āϟ āĻ•āϰ⧇āĨ¤ āĻāĻ• āĻĢā§āϰ⧇āĻŽāĻ“āϝāĻŧāĻžāĻ°ā§āϕ⧇ āĻŽāĻĄā§‡āϞ āĻŸā§āϰ⧇āύ āĻ•āϰ⧁āύ āĻāĻŦāĻ‚ āϏāĻšāĻœā§‡āχ āφāϰ⧇āĻ•āϟāĻŋāϤ⧇ inference āĻ•āϰ⧁āύāĨ¤ + +## āĻ…āύāϞāĻžāχāύ āĻĄā§‡āĻŽā§‹ + +āφāĻĒāύāĻŋ āφāĻŽāĻžāĻĻ⧇āϰ āĻŦ⧇āĻļāĻŋāϰāĻ­āĻžāĻ— āĻŽāĻĄā§‡āϞ āϏāϰāĻžāϏāϰāĻŋ [Model Hub](https://huggingface.co/models)-āĻ āϤāĻžāĻĻ⧇āϰ āύāĻŋāϜ āύāĻŋāϜ āĻĒ⧇āĻœā§‡ āĻĒāϰ⧀āĻ•ā§āώāĻž āĻ•āϰāϤ⧇ āĻĒāĻžāϰāĻŦ⧇āύāĨ¤ āφāĻŽāϰāĻž [āĻĒā§āϰāĻžāχāϭ⧇āϟ āĻŽāĻĄā§‡āϞ āĻšā§‹āĻ¸ā§āϟāĻŋāĻ‚, āĻ­āĻžāĻ°ā§āϏāύāĻŋāĻ‚, āĻāĻŦāĻ‚ āχāύāĻĢāĻžāϰ⧇āĻ¨ā§āϏ API](https://huggingface.co/pricing) āĻĒāĻžāĻŦāϞāĻŋāĻ• āĻ“ āĻĒā§āϰāĻžāχāϭ⧇āϟ āĻŽāĻĄā§‡āϞ⧇āϰ āϜāĻ¨ā§āϝ āĻĒā§āϰāĻĻāĻžāύ āĻ•āϰāĻŋāĨ¤ + +āĻāĻ–āĻžāύ⧇ āĻ•āĻŋāϛ⧁ āωāĻĻāĻžāĻšāϰāĻŖ: + +āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āϞāĻŋāĻ™ā§āϗ⧁āχāĻ¸ā§āϟāĻŋāĻ•āϏ⧇: + +- [BERT āĻĻāĻŋāϝāĻŧ⧇ āĻŽāĻžāĻ¸ā§āĻ•āĻĄ āĻ“āϝāĻŧāĻžāĻ°ā§āĻĄ āĻ•āĻŽāĻĒā§āϞāĻŋāĻļāύ](https://huggingface.co/google-bert/bert-base-uncased?text=Paris+is+the+%5BMASK%5D+of+France) +- [Electra āĻĻāĻŋāϝāĻŧ⧇ āύāĻŋāϜ āύāĻžāĻŽ āϏāύāĻžāĻ•ā§āϤāĻ•āϰāĻŖ](https://huggingface.co/dbmdz/electra-large-discriminator-finetuned-conll03-english?text=My+name+is+Sarah+and+I+live+in+London+city) +- [GPT-2 āĻĻāĻŋāϝāĻŧ⧇ āĻŸā§‡āĻ•ā§āϏāϟ āĻœā§‡āύāĻžāϰ⧇āĻļāύ](https://huggingface.co/openai-community/gpt2?text=A+long+time+ago%2C+) +- [RoBERTa āĻĻā§āĻŦāĻžāϰāĻž āĻ¨ā§āϝāĻžāϚāĻžāϰāĻžāϞ āĻ˛ā§āϝāĻžāĻ‚āϗ⧁āϝāĻŧ⧇āϜ āχāύāĻĢāĻžāϰ⧇āĻ¨ā§āϏ](https://huggingface.co/FacebookAI/roberta-large-mnli?text=The+dog+was+lost.+Nobody+lost+any+animal) +- [BART āĻĻāĻŋāϝāĻŧ⧇ āĻ¸ā§āĻŦāϝāĻŧāĻ‚āĻ•ā§āϰāĻŋāϝāĻŧ āĻŸā§‡āĻ•ā§āϏāϟ āϏāĻžāϰāĻžāĻ‚āĻļ](https://huggingface.co/facebook/bart-large-cnn?text=The+tower+is+324+metres+%281%2C063+ft%29+tall%2C+about+the+same+height+as+an+81-storey+building%2C+and+the+tallest+structure+in+Paris.+Its+base+is+square%2C+measuring+125+metres+%28410+ft%29+on+each+side.+During+its+construction%2C+the+Eiffel+Tower+surpassed+the+Washington+Monument+to+become+the+tallest+man-made+structure+in+the+world%2C+a+title+it+held+for+41+years+until+the+Chrysler+Building+in+New+York+City+was+finished+in+1930.+It+was+the+first+structure+to+reach+a+height+of+300+metres.+Due+to+the+addition+of+a+broadcasting+aerial+at+the+top+of+the+tower+in+1957%2C+it+is+now+taller+than+the+Chrysler+Building+by+5.2+metres+%2817+ft%29.+Excluding+transmitters%2C+the+Eiffel+Tower+is+the+second+tallest+free-standing+structure+in+France+after+the+Millau+Viaduct) +- [DistilBERT āĻĻāĻŋāϝāĻŧ⧇ āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ](https://huggingface.co/distilbert/distilbert-base-uncased-distilled-squad?text=Which+name+is+also+used+to+describe+the+Amazon+rainforest+in+English%3F&context=The+Amazon+rainforest+%28Portuguese%3A+Floresta+Amaz%C3%B4nica+or+Amaz%C3%B4nia%3B+Spanish%3A+Selva+Amaz%C3%B3nica%2C+Amazon%C3%ADa+or+usually+Amazonia%3B+French%3A+For%C3%AAt+amazonienne%3B+Dutch%3A+Amazoneregenwoud%29%2C+also+known+in+English+as+Amazonia+or+the+Amazon+Jungle%2C+is+a+moist+broadleaf+forest+that+covers+most+of+the+Amazon+basin+of+South+America.+This+basin+encompasses+7%2C000%2C000+square+kilometres+%282%2C700%2C000+sq+mi%29%2C+of+which+5%2C500%2C000+square+kilometres+%282%2C100%2C000+sq+mi%29+are+covered+by+the+rainforest.+This+region+includes+territory+belonging+to+nine+nations.+The+majority+of+the+forest+is+contained+within+Brazil%2C+with+60%25+of+the+rainforest%2C+followed+by+Peru+with+13%25%2C+Colombia+with+10%25%2C+and+with+minor+amounts+in+Venezuela%2C+Ecuador%2C+Bolivia%2C+Guyana%2C+Suriname+and+French+Guiana.+States+or+departments+in+four+nations+contain+%22Amazonas%22+in+their+names.+The+Amazon+represents+over+half+of+the+planet%27s+remaining+rainforests%2C+and+comprises+the+largest+and+most+biodiverse+tract+of+tropical+rainforest+in+the+world%2C+with+an+estimated+390+billion+individual+trees+divided+into+16%2C000+species) +- [T5 āĻĻāĻŋāϝāĻŧ⧇ āĻŽā§‡āĻļāĻŋāύ āĻ…āύ⧁āĻŦāĻžāĻĻ](https://huggingface.co/google-t5/t5-base?text=My+name+is+Wolfgang+and+I+live+in+Berlin) + +āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āĻ­āĻŋāĻļāύ⧇: + +- [ViT āĻĻāĻŋāϝāĻŧ⧇ āχāĻŽā§‡āϜ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ](https://huggingface.co/google/vit-base-patch16-224) +- [DETR āĻĻāĻŋāϝāĻŧ⧇ āĻ…āĻŦāĻœā§‡āĻ•ā§āϟ āĻļāύāĻžāĻ•ā§āϤāĻ•āϰāĻŖ](https://huggingface.co/facebook/detr-resnet-50) +- [SegFormer āĻĻāĻŋāϝāĻŧ⧇ āϏ⧇āĻŽāĻžāĻ¨ā§āϟāĻŋāĻ• āϏ⧇āĻ—āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ](https://huggingface.co/nvidia/segformer-b0-finetuned-ade-512-512) +- [MaskFormer āĻĻāĻŋāϝāĻŧ⧇ āĻĒā§āϝāĻžāύ⧋āĻĒāϟāĻŋāĻ• āϏ⧇āĻ—āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ](https://huggingface.co/facebook/maskformer-swin-small-coco) +- [DPT āĻĻāĻŋāϝāĻŧ⧇ āĻĄā§‡āĻĒā§āĻĨ āĻāĻ¸ā§āϟāĻŋāĻŽā§‡āĻļāύ](https://huggingface.co/docs/transformers/model_doc/dpt) +- [VideoMAE āĻĻāĻŋāϝāĻŧ⧇ āĻ­āĻŋāĻĄāĻŋāĻ“ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ](https://huggingface.co/docs/transformers/model_doc/videomae) +- [OneFormer āĻĻāĻŋāϝāĻŧ⧇ āχāωāύāĻŋāĻ­āĻžāĻ°ā§āϏāĻžāϞ āϏ⧇āĻ—āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ](https://huggingface.co/shi-labs/oneformer_ade20k_dinat_large) + +āĻ…āĻĄāĻŋāĻ“ āĻŦāĻŋāĻ­āĻžāϗ⧇: + +- [Wav2Vec2 āĻĻāĻŋāϝāĻŧ⧇ āĻ¸ā§āĻŦāϝāĻŧāĻ‚āĻ•ā§āϰāĻŋāϝāĻŧ āĻ¸ā§āĻĒāĻŋāϚ āϰāĻŋāĻ•āĻ—āύāĻŋāĻļāύ](https://huggingface.co/facebook/wav2vec2-base-960h) +- [Wav2Vec2 āĻĻāĻŋāϝāĻŧ⧇ āϕ⧀āĻ“āϝāĻŧāĻžāĻ°ā§āĻĄ āĻļāύāĻžāĻ•ā§āϤāĻ•āϰāĻŖ](https://huggingface.co/superb/wav2vec2-base-superb-ks) +- [Audio Spectrogram Transformer āĻĻāĻŋāϝāĻŧ⧇ āĻ…āĻĄāĻŋāĻ“ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ](https://huggingface.co/MIT/ast-finetuned-audioset-10-10-0.4593) + +āĻŽāĻžāĻ˛ā§āϟāĻŋāĻŽā§‹āĻĄāĻžāϞ āϟāĻžāĻ¸ā§āϕ⧇: + +- [TAPAS āĻĻāĻŋāϝāĻŧ⧇ āĻŸā§‡āĻŦāĻŋāϞāĻ­āĻŋāĻ¤ā§āϤāĻŋāĻ• āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ](https://huggingface.co/google/tapas-base-finetuned-wtq) +- [ViLT āĻĻāĻŋāϝāĻŧ⧇ āĻ­āĻŋāĻœā§āϝ⧁āϝāĻŧāĻžāϞ āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ](https://huggingface.co/dandelin/vilt-b32-finetuned-vqa) +- [CLIP āĻĻāĻŋāϝāĻŧ⧇ āϜāĻŋāϰ⧋-āĻļāϟ āχāĻŽā§‡āϜ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ](https://huggingface.co/openai/clip-vit-large-patch14) +- [LayoutLM āĻĻāĻŋāϝāĻŧ⧇ āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āϟ-āĻ­āĻŋāĻ¤ā§āϤāĻŋāĻ• āĻĒā§āϰāĻļā§āύ⧋āĻ¤ā§āϤāϰ](https://huggingface.co/impira/layoutlm-document-qa) +- [X-CLIP āĻĻāĻŋāϝāĻŧ⧇ āϜāĻŋāϰ⧋-āĻļāϟ āĻ­āĻŋāĻĄāĻŋāĻ“ āĻ•ā§āϞāĻžāϏāĻŋāĻĢāĻŋāϕ⧇āĻļāύ](https://huggingface.co/docs/transformers/model_doc/xclip) + +## 🤗 Transformers- āĻŦā§āϝāĻŦāĻšāĻžāϰāĻ•āĻžāϰ⧀ ā§§ā§Ļā§ĻāϟāĻŋ āĻĒā§āϰāĻ•āĻ˛ā§āĻĒ + +🤗 Transformers āĻļ⧁āϧ⧁āĻŽāĻžāĻ¤ā§āϰ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇āϰ āϟ⧁āϞāĻ•āĻŋāϟ āύāϝāĻŧ: āĻāϟāĻŋ āĻĒā§āϰāĻœā§‡āĻ•ā§āϟāϗ⧁āϞ⧋āϰ āĻāĻ•āϟāĻŋ āĻ•āĻŽāĻŋāωāύāĻŋāϟāĻŋ, āϝ⧇āϗ⧁āϞ⧋ Hugging Face Hub āĻāϰ āϚāĻžāϰāĻĒāĻžāĻļ⧇ āĻ—āĻĄāĻŧ⧇ āωāϠ⧇āϛ⧇āĨ¤ āφāĻŽāϰāĻž āϚāĻžāχ, 🤗 Transformers āĻĄā§‡āϭ⧇āϞāĻĒāĻžāϰ, āĻ—āĻŦ⧇āώāĻ•, āĻ›āĻžāĻ¤ā§āϰ, āĻļāĻŋāĻ•ā§āώāĻ•, āχāĻžā§āϜāĻŋāύāĻŋāϝāĻŧāĻžāϰ āĻ“ āϏāĻŦāĻžāχāϕ⧇ āϤāĻžāĻĻ⧇āϰ āĻ¸ā§āĻŦāĻĒā§āύ⧇āϰ āĻĒā§āϰāĻ•āĻ˛ā§āĻĒ āĻŦāĻžāĻ¸ā§āϤāĻŦāĻžāϝāĻŧāύ⧇ āϏāĻšāĻžāϝāĻŧāϤāĻž āĻ•āϰ⧁āĻ•āĨ¤ + +🤗 Transformers-āĻāϰ ā§§ā§Ļā§Ļ,ā§Ļā§Ļā§Ļ āĻ¸ā§āϟāĻžāϰ āωāĻĻāϝāĻžāĻĒāύ⧇āϰ āϜāĻ¨ā§āϝ, āφāĻŽāϰāĻž āĻ•āĻŽāĻŋāωāύāĻŋāϟāĻŋāϕ⧇ āϏāĻžāĻŽāύ⧇ āĻāύ⧇ [awesome-transformers](./awesome-transformers.md) āĻĒ⧇āϜāϟāĻŋ āϤ⧈āϰāĻŋ āĻ•āϰ⧇āĻ›āĻŋ, āϝ⧇āĻ–āĻžāύ⧇ 🤗 Transformers āĻĻāĻŋāϝāĻŧ⧇ āĻ•āϰāĻž ā§§ā§Ļā§ĻāϟāĻŋ āĻ…āϏāĻžāϧāĻžāϰāĻŖ āĻĒā§āϰāĻ•āĻ˛ā§āĻĒ⧇āϰ āϤāĻžāϞāĻŋāĻ•āĻž āĻĻāĻŋāϝāĻŧ⧇āĻ›āĻŋāĨ¤ + +āϝāĻĻāĻŋ āφāĻĒāύāĻžāϰ āĻ•āĻžāϛ⧇ āĻāĻŽāύ āϕ⧋āύ⧋ āĻĒā§āϰāĻ•āĻ˛ā§āĻĒ āĻĨāĻžāϕ⧇ āĻŦāĻž āφāĻĒāύāĻŋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧇āύ, āϝ⧇āϟāĻŋ āĻāχ āϤāĻžāϞāĻŋāĻ•āĻžāϝāĻŧ āĻĨāĻžāĻ•āĻž āωāϚāĻŋāϤ āĻŦāϞ⧇ āĻŽāύ⧇ āĻ•āϰ⧇āύ, āĻĻāϝāĻŧāĻž āĻ•āϰ⧇ āϏ⧇āϟāĻŋ āϝ⧋āĻ— āĻ•āϰāϤ⧇ āĻāĻ•āϟāĻŋ PR (Pull Request) āϖ⧁āϞ⧁āύ! + + +## āφāĻĒāύāĻŋ āϝāĻĻāĻŋ Hugging Face āϟāĻŋāĻŽā§‡āϰ āĻ•āĻžāĻ› āĻĨ⧇āϕ⧇ āĻŦā§āϝāĻ•ā§āϤāĻŋāĻ—āϤ āϏāĻšāĻžāϝāĻŧāϤāĻž āϚāĻžāύ + + +    HuggingFace Expert Acceleration Program +
+ +## āĻĻā§āϰ⧁āϤ āĻļ⧁āϰ⧁ āĻ•āϰ⧁āύ + +āύāĻŋāĻ°ā§āĻĻāĻŋāĻˇā§āϟ āϕ⧋āύ⧋ āχāύāĻĒ⧁āϟ (āĻŸā§‡āĻ•ā§āϏāϟ, āĻ›āĻŦāĻŋ, āĻ…āĻĄāĻŋāĻ“ ...) āύāĻŋāϝāĻŧ⧇ āĻĻā§āϰ⧁āϤ āϕ⧋āύ⧋ āĻŽāĻĄā§‡āϞ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāϤ⧇ āϚāĻžāχāϞ⧇ āφāĻŽāϰāĻž `pipeline`-API āϏāϰāĻŦāϰāĻžāĻš āĻ•āϰāĻŋāĨ¤ āĻĒāĻžāχāĻĒāϞāĻžāχāύ āĻāĻ•āϟāĻŋ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻ“ āϤāĻžāϰ āϏāĻžāĻĨ⧇ āĻŦā§āϝāĻŦāĻšā§ƒāϤ āĻĒā§āϰāĻŋāĻĒā§āϰāϏ⧇āϏāĻŋāĻ‚āϕ⧇ āĻāĻ•āĻ¤ā§āϰāĻŋāϤ āĻ•āϰ⧇, āϝāĻž āĻŸā§āϰ⧇āύāĻŋāĻ‚ā§Ÿā§‡āϰ āϏāĻŽāϝāĻŧ āĻ•āĻžāĻœā§‡ āϞāĻžāĻ—āĻžāύ⧋ āĻšāϝāĻŧ⧇āĻ›āĻŋāϞāĨ¤ āύāĻŋāĻšā§‡ āĻĻ⧇āĻ–āĻžāύ⧋ āĻšāϝāĻŧ⧇āϛ⧇, āϕ⧀āĻ­āĻžāĻŦ⧇ āĻĻā§āϰ⧁āϤ āĻāĻ•āϟāĻŋ āĻĒāĻžāχāĻĒāϞāĻžāχāύ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧇ āχāϤāĻŋāĻŦāĻžāϚāĻ• āĻāĻŦāĻ‚ āύ⧇āϤāĻŋāĻŦāĻžāϚāĻ• āĻŸā§‡āĻ•ā§āϏāϟ āĻļā§āϰ⧇āĻŖāĻŋāĻŦāĻŋāĻ¨ā§āϝāĻžāϏ āĻ•āϰāĻž āϝāĻžāϝāĻŧ: + + + +```python +>>> from transformers import pipeline + +# āϏ⧇āĻ¨ā§āϟāĻŋāĻŽā§‡āĻ¨ā§āϟ āĻŦāĻŋāĻļā§āϞ⧇āώāϪ⧇āϰ āϜāĻ¨ā§āϝ āĻāĻ•āϟāĻŋ āĻĒāĻžāχāĻĒāϞāĻžāχāύ āĻŦāϰāĻžāĻĻā§āĻĻ āĻ•āϰāĻŋ +>>> classifier = pipeline('sentiment-analysis') +>>> classifier('We are very happy to introduce pipeline to the transformers repository.') +[{'label': 'POSITIVE', 'score': 0.9996980428695679}] +``` + + +āĻĻā§āĻŦāĻŋāϤ⧀āϝāĻŧ āϕ⧋āĻĄāϞāĻžāχāύ⧇ āĻĒāĻžāχāĻĒāϞāĻžāχāύ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇āϰ āϜāĻ¨ā§āϝ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āϞ⧋āĻĄ āĻāĻŦāĻ‚ āĻ•ā§āϝāĻžāĻļ āĻ•āϰāĻž āĻšāϝāĻŧ, āĻāĻŦāĻ‚ āϤ⧃āĻ¤ā§€ā§Ÿ āϞāĻžāχāύ⧇ āĻĻ⧇āϝāĻŧāĻž āĻŸā§‡āĻ•ā§āϏāĻŸā§‡ āϏ⧇āϟāĻž āĻĒāϰ⧀āĻ•ā§āώāĻž āĻ•āϰāĻž āĻšāϝāĻŧāĨ¤ āĻāĻ–āĻžāύ⧇ āωāĻ¤ā§āϤāϰāϟāĻŋ "āχāϤāĻŋāĻŦāĻžāϚāĻ•" ⧝⧝.⧝⧭% āφāĻ¤ā§āĻŽāĻŦāĻŋāĻļā§āĻŦāĻžāϏāϏāĻšāĨ¤ + +āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āĻ­āĻžāώāĻžāϤāĻ¤ā§āĻ¤ā§āĻŦ, āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āĻ­āĻŋāĻļāύ āĻāĻŦāĻ‚ āĻ¸ā§āĻĒāĻŋāϚ āĻĒā§āϰāϏ⧇āϏāĻŋāĻ‚āϝāĻŧ⧇ āĻ…āύ⧇āĻ• āϟāĻžāĻ¸ā§āϕ⧇āϰ āϜāĻ¨ā§āϝāχ āĻĒā§āϰāĻ¸ā§āϤ⧁āϤ āĻ•āϰāĻž āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ `pipeline` āĻ°ā§Ÿā§‡āϛ⧇āĨ¤ āϝ⧇āĻŽāύ, āφāĻŽāϰāĻž āϏāĻšāĻœā§‡āχ āϕ⧋āύ⧋ āĻ›āĻŦāĻŋāϤ⧇ āĻļāύāĻžāĻ•ā§āϤ āĻšāĻ“āϝāĻŧāĻž āĻ…āĻŦāĻœā§‡āĻ•ā§āϟ āĻŦ⧇āϰ āĻ•āϰāϤ⧇ āĻĒāĻžāϰāĻŋ: + + + +``` python +>>> import requests +>>> from PIL import Image +>>> from transformers import pipeline + +# āϏ⧁āĻ¨ā§āĻĻāϰ āĻŦāĻŋāĻĄāĻŧāĻžāϞ⧇āϰ āĻ›āĻŦāĻŋ āĻĄāĻžāωāύāϞ⧋āĻĄ āĻ•āϰ⧁āύ +>>> url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/coco_sample.png" +>>> image_data = requests.get(url, stream=True).raw +>>> image = Image.open(image_data) + +# āĻ…āĻŦāĻœā§‡āĻ•ā§āϟ āĻļāύāĻžāĻ•ā§āϤāĻ•āϰāϪ⧇āϰ āϜāĻ¨ā§āϝ āĻĒāĻžāχāĻĒāϞāĻžāχāύ +>>> object_detector = pipeline('object-detection') +>>> object_detector(image) +[{'score': 0.9982201457023621, + 'label': 'remote', + 'box': {'xmin': 40, 'ymin': 70, 'xmax': 175, 'ymax': 117}}, + {'score': 0.9960021376609802, + 'label': 'remote', + 'box': {'xmin': 333, 'ymin': 72, 'xmax': 368, 'ymax': 187}}, + {'score': 0.9954745173454285, + 'label': 'couch', + 'box': {'xmin': 0, 'ymin': 1, 'xmax': 639, 'ymax': 473}}, + {'score': 0.9988006353378296, + 'label': 'cat', + 'box': {'xmin': 13, 'ymin': 52, 'xmax': 314, 'ymax': 470}}, + {'score': 0.9986783862113953, + 'label': 'cat', + 'box': {'xmin': 345, 'ymin': 23, 'xmax': 640, 'ymax': 368}}] +``` + + +āĻāĻ–āĻžāύ⧇ āφāĻŽāϰāĻž āĻ›āĻŦāĻŋāϤ⧇ āĻļāύāĻžāĻ•ā§āϤ āĻšāĻ“āϝāĻŧāĻž āĻ…āĻŦāĻœā§‡āĻ•ā§āĻŸā§‡āϰ āĻāĻ•āϟāĻŋ āϤāĻžāϞāĻŋāĻ•āĻž āĻĒāĻžāχ, āϝ⧇āϗ⧁āϞ⧋āϰ āϏāĻžāĻĨ⧇ āĻŦāĻžāωāĻ¨ā§āĻĄāĻŋāĻ‚ āĻŦāĻžāĻ•ā§āϏ āĻ“ āφāĻ¤ā§āĻŽāĻŦāĻŋāĻļā§āĻŦāĻžāϏ⧇āϰ āĻŽāĻžāύāϏāĻš āωāĻĒāĻ¸ā§āĻĨāĻžāĻĒāĻŋāϤ āĻšāϝāĻŧāĨ¤ āύāĻŋāĻšā§‡ āĻŦāĻžāĻŽā§‡ āĻŽā§‚āϞ āĻ›āĻŦāĻŋ āĻāĻŦāĻ‚ āĻĄāĻžāύ⧇ āĻĒā§‚āĻ°ā§āĻŦāĻžāĻ­āĻžāϏ āĻĻ⧇āĻ–āĻžāύ⧋ āĻšāϝāĻŧ⧇āϛ⧇: + +

+    +    +

+ +`pipeline`-API āϕ⧋āύ āϕ⧋āύ āϟāĻžāĻ¸ā§āĻ• āϏāĻžāĻĒā§‹āĻ°ā§āϟ āĻ•āϰ⧇, āϤāĻž [āĻāχ āϟāĻŋāωāĻŸā§‹āϰāĻŋāϝāĻŧāĻžāϞ⧇](https://huggingface.co/docs/transformers/task_summary) āϜāĻžāύāϤ⧇ āĻĒāĻžāϰāĻŦ⧇āύāĨ¤ + +`pipeline` āĻ›āĻžāĻĄāĻŧāĻžāĻ“ āĻŽāĻžāĻ¤ā§āϰ āϤāĻŋāύāϟāĻŋ āϕ⧋āĻĄāϞāĻžāχāύ⧇āχ āϝ⧇āϕ⧋āύ⧋ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āύāĻžāĻŽāĻŋāϝāĻŧ⧇ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻž āϝāĻžāϝāĻŧāĨ¤ āύāĻŋāĻšā§‡ PyTorch-āĻāϰ āϜāĻ¨ā§āϝ āĻāĻ•āϟāĻŋ āωāĻĻāĻžāĻšāϰāĻŖ: + + + +```python +>>> from transformers import AutoTokenizer, AutoModel + +>>> tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-uncased") +>>> model = AutoModel.from_pretrained("google-bert/bert-base-uncased") + +>>> inputs = tokenizer("Hello world!", return_tensors="pt") +>>> outputs = model(**inputs) +``` + +āĻāĻŦāĻ‚ āĻāϟāĻŋ TensorFlow-āĻāϰ āϜāĻ¨ā§āϝ āĻāĻ•āχ āωāĻĻāĻžāĻšāϰāĻŖ: + +```python +>>> from transformers import AutoTokenizer, TFAutoModel + +>>> tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-uncased") +>>> model = TFAutoModel.from_pretrained("google-bert/bert-base-uncased") + +>>> inputs = tokenizer("Hello world!", return_tensors="tf") +>>> outputs = model(**inputs) +``` + + +Tokenizer-āĻāϰ āĻ•āĻžāϜ āĻšāĻšā§āϛ⧇ āĻĒāϰāĻŋāĻĒā§āϰ⧇āĻ•ā§āώāĻŋāϤ āĻ…āύ⧁āϏāĻžāϰ⧇ āĻĒā§āϰāĻŋ-āĻĒā§āϰāϏ⧇āϏāĻŋāĻ‚ āĻ•āϰāĻž, āϝāĻž āĻŽāĻĄā§‡āϞ⧇āϰ āϜāĻ¨ā§āϝ āĻĻāϰāĻ•āĻžāϰ āĻšāϝāĻŧ—āĻāϟāĻž āĻāĻ•āĻ• āĻ¸ā§āĻŸā§āϰāĻŋāĻ‚ āĻŦāĻž āĻāĻ•āϟāĻŋ āϞāĻŋāĻ¸ā§āĻŸā§‡āϰ āĻ“āĻĒāϰ āϏāϰāĻžāϏāϰāĻŋ āϚāϞāϤ⧇ āĻĒāĻžāϰ⧇āĨ¤ āĻāϟāĻŋ āĻāĻ•āϟāĻŋ āĻĄāĻŋāĻ•āĻļāύāĻžāϰāĻŋ āφāωāϟāĻĒ⧁āϟ āĻĻ⧇āϝāĻŧ, āϝ⧇āϟāĻŋ āĻĒāϰāĻŦāĻ°ā§āϤ⧀ āϕ⧋āĻĄā§‡ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻž āϝ⧇āϤ⧇ āĻĒāĻžāϰ⧇ āĻŦāĻž āϏāϰāĻžāϏāϰāĻŋ āĻŽāĻĄā§‡āϞ⧇ āĻĒāĻžāĻ āĻžāύ⧋ āϝ⧇āϤ⧇ āĻĒāĻžāϰ⧇ (Python-āĻāϰ ** āĻ…āĻĒāĻžāϰ⧇āϟāϰ āĻĻāĻŋāϝāĻŧ⧇)āĨ¤ + +āĻŽāĻĄā§‡āϞ āύāĻŋāĻœā§‡āχ PyTorch-āĻāϰ [nn.Module](https://pytorch.org/docs/stable/nn.html#torch.nn.Module) āĻ…āĻĨāĻŦāĻž TensorFlow-āĻāϰ [tf.keras.Model](https://www.tensorflow.org/api_docs/python/tf/keras/Model) (āĻŦā§āϝāĻžāĻ•āĻāĻ¨ā§āĻĄ āĻ…āύ⧁āϝāĻžāϝāĻŧā§€), āϝ⧇āϟāĻŋ āĻšā§‡āύāĻž āύāĻŋāϝāĻŧāĻŽ āĻŽā§‡āύ⧇ āϚāĻžāϞāĻžāύ⧋ āϝāĻžāϝāĻŧāĨ¤ [āĻāχ āϟāĻŋāωāĻŸā§‹āϰāĻŋ⧟āĻžāϞāϟāĻŋ](https://huggingface.co/docs/transformers/training) āĻĻ⧇āϖ⧁āύ, āĻ•āĻŋāĻ­āĻžāĻŦ⧇ āĻŸā§āϰ⧇āύāĻŋāĻ‚ āϞ⧁āĻĒ āĻŦāĻž āφāĻŽāĻžāĻĻ⧇āϰ `Trainer`-API āĻĻāĻŋāϝāĻŧ⧇ āĻ…āϏāĻ‚āĻ–ā§āϝ āĻĄā§‡āϟāĻžāϏ⧇āĻŸā§‡ āĻĻā§āϰ⧁āϤ āĻĢāĻžāχāύ-āϟāĻŋāωāύ āĻ•āϰāĻž āϝāĻžāϝāĻŧāĨ¤ + +## āϕ⧇āύ āφāĻĒāύāĻŋ 🤗 Transformers āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻŦ⧇āύ? + +1. āĻŦā§āϝāĻŦāĻšāĻžāϰ-āĻŦāĻžāĻ¨ā§āϧāĻŦ āφāϧ⧁āύāĻŋāĻ• āĻŽāĻĄā§‡āϞ: +    - āĻ¨ā§āϝāĻžāϚāĻžāϰāĻžāϞ āĻ˛ā§āϝāĻžāĻ™ā§āϗ⧁āϝāĻŧ⧇āϜ āφāĻ¨ā§āĻĄāĻžāϰāĻ¸ā§āĻŸā§āϝāĻžāĻ¨ā§āĻĄāĻŋāĻ‚ āĻ“ āĻœā§‡āύāĻžāϰ⧇āĻļāύ, āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻžāϰ āĻ­āĻŋāĻļāύ āĻ“ āĻ…āĻĄāĻŋāĻ“ āϟāĻžāĻ¸ā§āĻ•āϗ⧁āϞ⧋āϰ āϜāĻ¨ā§āϝ āωāĻšā§āϚ āĻĻāĻ•ā§āώāϤāĻžāĨ¤ +    - āĻļāĻŋāĻ•ā§āώāĻžāĻ°ā§āĻĨā§€ āĻ“ āϚāĻ°ā§āϚāĻžāĻ•āĻžāϰ⧀āĻĻ⧇āϰ āϜāĻ¨ā§āϝ āϏāĻšāϜ āĻĒā§āϰāĻŦ⧇āĻļāϝ⧋āĻ—ā§āϝāϤāĻžāĨ¤ +    - āĻļ⧁āϧ⧁āĻŽāĻžāĻ¤ā§āϰ āϤāĻŋāύāϟāĻŋ āĻŽā§‚āϞ āĻ•ā§āϞāĻžāϏ āĻļāĻŋāĻ–āϞ⧇āχ āĻšāĻŦ⧇āĨ¤ +    - āφāĻŽāĻžāĻĻ⧇āϰ āϏāĻŦ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āĻŦā§āϝāĻŦāĻšāĻžāϰ⧇āϰ āϜāĻ¨ā§āϝ āĻāĻ•āĻ• APIāĨ¤ + +2. āĻ•āĻŽ āĻ•āĻŽā§āĻĒāĻŋāωāϟāĻŋāĻ‚ āĻ–āϰāϚ, āϛ⧋āϟ CO2 āĻĢā§āϞ⧁āϟāĻĒā§āϰāĻŋāĻ¨ā§āϟ: +    - āĻ—āĻŦ⧇āώāĻ•āϰāĻž āϤāĻžāĻĻ⧇āϰ āĻĒā§āϰāĻļāĻŋāĻ•ā§āώāĻŋāϤ āĻŽāĻĄā§‡āϞ āĻļ⧇āϝāĻŧāĻžāϰ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇, āĻŦāĻžāϰāĻŦāĻžāϰ āĻŸā§āϰ⧇āύ āĻ•āϰāϤ⧇ āĻšāϝāĻŧ āύāĻžāĨ¤ +    - āĻĒā§āĻ°â€ā§āϝāĻžāĻ•ā§āϟāĻŋāĻļāύāĻžāϰāϰāĻž āĻ•āĻŽ āϏāĻŽāϝāĻŧ⧇ āĻ“ āĻ•āĻŽ āĻ–āϰāĻšā§‡ āĻ•āĻžāϜ āĻļ⧇āώ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇āύāĨ¤ +    - āĻĄāϜāύ āĻ–āĻžāύ⧇āĻ• āφāĻ°ā§āĻ•āĻŋāĻŸā§‡āĻ•āϚāĻžāϰ āĻ“ ā§Ē āϞāĻžāϖ⧇āϰāĻ“ āĻŦ⧇āĻļāĻŋ āĻĒā§āϰāĻŋ-āĻŸā§āϰ⧇āχāĻ¨ā§āĻĄ āĻŽāĻĄā§‡āϞ āϏāĻŦ modality-āĻāϰ āϜāĻ¨ā§āϝāĨ¤ + +3. āĻŽāĻĄā§‡āϞ āύāĻŋāĻ°ā§āĻŽāĻžāϪ⧇āϰ āĻĒā§āϰāϤāĻŋāϟāĻŋ āϧāĻžāĻĒ⧇ āĻĒāĻ›āĻ¨ā§āĻĻ⧇āϰ āĻĢā§āϰ⧇āĻŽāĻ“ā§ŸāĻžāĻ°ā§āĻ• āĻŦ⧇āϛ⧇ āύāĻŋāύ: +    - āĻŽāĻžāĻ¤ā§āϰ ā§ŠāĻŸā§‡ āϕ⧋āĻĄāϞāĻžāχāύ⧇ āφāϧ⧁āύāĻŋāĻ• āĻŽāĻĄā§‡āϞ āĻŸā§āϰ⧇āύāĻŋāĻ‚āĨ¤ +    - TF2.0, PyTorch āĻŦāĻž JAX-āĻ āĻŽā§āĻ•ā§āϤāĻ­āĻžāĻŦ⧇ āĻāĻ•āχ āĻŽāĻĄā§‡āϞ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧁āύāĨ¤ +    - āĻŸā§āϰ⧇āύāĻŋāĻ‚, āĻŽā§‚āĻ˛ā§āϝāĻžāϝāĻŧāύ āĻ“ āĻĒā§āϰ⧋āĻĄāĻžāĻ•āĻļāύ⧇āϰ āϜāĻ¨ā§āϝ āϏāĻšāĻœā§‡āχ āϏāĻ āĻŋāĻ• āĻĢā§āϰ⧇āĻŽāĻ“ā§ŸāĻžāĻ°ā§āĻ• āĻŦ⧇āϛ⧇ āύāĻŋāύāĨ¤ + +4. āϏāĻšāĻœā§‡āχ āĻ•āĻžāĻ¸ā§āϟāĻŽāĻžāχāϜ āĻ•āϰ⧁āύ: +    - āĻĒā§āϰāϤāĻŋāϟāĻŋ āφāĻ°ā§āĻ•āĻŋāĻŸā§‡āĻ•āϚāĻžāϰ⧇āϰ āϜāĻ¨ā§āϝ āφāĻŽāĻžāĻĻ⧇āϰ āĻ•āĻžāϛ⧇ āϰ⧇āĻĢāĻžāϰ⧇āĻ¨ā§āϏ āĻāĻ•ā§āϏāĻžāĻŽā§āĻĒāϞ āϰāϝāĻŧ⧇āϛ⧇, āĻŽā§‚āϞ āϞ⧇āĻ–āĻ•āĻĻ⧇āϰ āĻĢāϞāĻžāĻĢāϞ āĻĒ⧁āύāϰ⧁āĻ¤ā§āĻĒāĻžāĻĻāύ⧇āϰ āϜāĻ¨ā§āϝāĨ¤ +    - āĻŽāĻĄā§‡āϞ⧇āϰ āφāĻ­ā§āϝāĻ¨ā§āϤāϰ⧀āĻŖ āĻ—āĻ āύ āϝāϤāϟāĻž āϏāĻŽā§āĻ­āĻŦ āĻ…āĻ­āĻŋāĻ¨ā§āύ āϰāĻžāĻ–āĻž āĻšāϝāĻŧ⧇āϛ⧇āĨ¤ +    - āĻŽāĻĄā§‡āϞ āĻĢāĻžāχāϞ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āĻ›āĻžāĻĄāĻŧāĻžāĻ“ āĻ¸ā§āĻŦāĻžāϧ⧀āύāĻ­āĻžāĻŦ⧇ āĻ—āĻŦ⧇āώāĻŖāĻžāϰ āϜāĻ¨ā§āϝ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻž āϝāĻžāĻŦ⧇āĨ¤ + +## āĻ•āĻ–āύ 🤗 Transformers āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻŦ⧇āύ āύāĻž? + +- āĻāχ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋāϟāĻŋ āύāĻŋāϰ⧇āϟ āύāĻŋāωāϰāĻžāϞ āύ⧇āϟāĻ“āϝāĻŧāĻžāĻ°ā§āĻ• āĻŦāĻŋāĻ˛ā§āĻĄāĻŋāĻ‚ āĻŦā§āϞāĻ• āϏāϰāĻŦāϰāĻžāĻš āĻ•āϰ⧇ āύāĻž, āĻŦāϰāĻ‚ āύāĻŋāĻ°ā§āĻĻāĻŋāĻˇā§āϟ āĻŽāĻĄā§‡āϞ⧇āϰ āĻĻā§āϰ⧁āϤ āωāĻ¨ā§āύāϝāĻŧāύ āĻ“ āĻ—āĻŦ⧇āώāĻŖāĻžāϰ āϜāĻ¨ā§āϝ āύāĻ•āĻļāĻž āĻ•āϰāĻž āĻšāϝāĻŧ⧇āϛ⧇, āϝāĻžāϤ⧇ āφāĻĒāύāĻžāϕ⧇ āĻ…āĻĒā§āĻ°ā§Ÿā§‹āϜāĻ¨ā§€ā§Ÿ āĻ…ā§āϝāĻžāĻŦāĻ¸ā§āĻŸā§āϰāĻžāĻ•āĻļāύ⧇ āύāĻž āϝ⧇āϤ⧇ āĻšāϝāĻŧāĨ¤ +- Training API āϏāĻŦ āϧāϰāϪ⧇āϰ āĻŽāĻĄā§‡āϞ⧇āϰ āϜāĻ¨ā§āϝ āύāϝāĻŧ; āĻāϟāĻŋ āĻŦāĻŋāĻļ⧇āώāĻ­āĻžāĻŦ⧇ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋāϰ āύāĻŋāϜāĻ¸ā§āĻŦ āĻŽāĻĄā§‡āϞāϗ⧁āϞ⧋āϰ āϜāĻ¨ā§āϝ āĻ…āĻĒā§āϟāĻŋāĻŽāĻžāχāϜāĻĄāĨ¤ āϏāĻžāϧāĻžāϰāĻŖ āĻŽā§‡āĻļāĻŋāύ āϞāĻžāĻ°ā§āύāĻŋāĻ‚ āĻŸā§āϰ⧇āύāĻŋāĻ‚ āϞ⧁āĻĒ⧇āϰ āϜāĻ¨ā§āϝ āĻ…āĻ¨ā§āϝ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧁āύ (āϝ⧇āĻŽāύ [Accelerate](https://huggingface.co/docs/accelerate))āĨ¤ +- āφāĻŽāĻžāĻĻ⧇āϰ [`examples`](./examples) āĻĢā§‹āĻ˛ā§āĻĄāĻžāϰ-āĻ āĻĨāĻžāĻ•āĻž āĻ¸ā§āĻ•ā§āϰāĻŋāĻĒā§āϟāϗ⧁āϞ⧋ āĻŽā§‚āϞāϤ āύāĻŽā§āύāĻž; āĻāϗ⧁āϞ⧋ āϏāϰāĻžāϏāϰāĻŋ āφāĻĒāύāĻžāϰ āĻĒā§āϰāĻœā§‡āĻ•ā§āĻŸā§‡ āϚāϞāĻŦ⧇ āύāĻžāĻ“āĨ¤ āφāĻĒāύāĻžāϕ⧇ āĻ•āĻŋāϛ⧁ āϕ⧋āĻĄ āĻĒāϰāĻŋāĻŦāĻ°ā§āϤāύ āĻ•āϰāϤ⧇ āĻšāϤ⧇ āĻĒāĻžāϰ⧇āĨ¤ + +## āχāύāĻ¸ā§āϟāϞ⧇āĻļāύ + +### pip āĻĻāĻŋāϝāĻŧ⧇ + +āĻāχ āϰ⧇āĻĒā§‹āϜāĻŋāϟāϰāĻŋāϟāĻŋāϤ⧇ Python 3.9+, Flax 0.4.1+, PyTorch 2.1+, āĻāĻŦāĻ‚ TensorFlow 2.6+ āĻĻāĻŋāϝāĻŧ⧇ āĻĒāϰ⧀āĻ•ā§āώāĻž āĻ•āϰāĻž āĻšāϝāĻŧ⧇āϛ⧇āĨ¤ + +āφāĻĒāύāĻŋ [āĻ­āĻžāĻ°ā§āϚ⧁āϝāĻŧāĻžāϞ āĻāύāĻ­āĻžāϝāĻŧāϰāύāĻŽā§‡āĻ¨ā§āĻŸā§‡](https://docs.python.org/3/library/venv.html) 🤗 Transformers āχāύāĻ¸ā§āϟāϞ āĻ•āϰāĻžāϰ āĻĒāϰāĻžāĻŽāĻ°ā§āĻļ āĻĻ⧇āĻ“āϝāĻŧāĻž āĻšāϝāĻŧāĨ¤ āĻ­āĻžāĻ°ā§āϚ⧁āϝāĻŧāĻžāϞ āĻāύāĻ­āĻžāϝāĻŧāϰāύāĻŽā§‡āĻ¨ā§āĻŸā§‡āϰ āϏāĻžāĻĨ⧇ āĻĒāϰāĻŋāϚāĻŋāϤ āύāĻž āĻšāϞ⧇ [āĻŦā§āϝāĻŦāĻšāĻžāϰāĻ•āĻžāϰ⧀ āĻ—āĻžāχāĻĄ](https://packaging.python.org/guides/installing-using-pip-and-virtual-environments/) āĻĻ⧇āϖ⧁āύāĨ¤ + +āĻĒā§āϰāĻĨāĻŽā§‡, āφāĻĒāύāĻŋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāϤ⧇ āϚāĻžāĻ“āϝāĻŧāĻž Python āĻ­āĻžāĻ°ā§āϏāύ āĻĻāĻŋāϝāĻŧ⧇ āĻāύāĻ­āĻžāϝāĻŧāϰāύāĻŽā§‡āĻ¨ā§āϟ āϤ⧈āϰāĻŋ āĻ“ āĻ…ā§āϝāĻžāĻ•ā§āϟāĻŋāϭ⧇āϟ āĻ•āϰ⧁āύāĨ¤ + +āĻāϰāĻĒāϰ Flax, PyTorch āĻŦāĻž TensorFlow āϝ⧇āϟāĻŋāχ āφāĻĒāύāĻžāϰ āĻĻāϰāĻ•āĻžāϰ āϏ⧇āϟāĻŋ āχāύāĻ¸ā§āϟāϞ āĻ•āϰ⧁āύāĨ¤ [TensorFlow](https://www.tensorflow.org/install/), [PyTorch](https://pytorch.org/get-started/locally/#start-locally), [Flax](https://github.com/google/flax#quick-install) āĻāĻŦāĻ‚ [Jax](https://github.com/google/jax#installation) āĻāϰ āύāĻŋāĻ°ā§āĻĻāĻŋāĻˇā§āϟ āχāύāĻ¸ā§āϟāϞ⧇āĻļāύ āĻ—āĻžāχāĻĄ āϰ⧇āĻĢāĻžāϰ āĻ•āϰ⧁āύāĨ¤ + +āϝ⧇āϕ⧋āύ⧋ āĻāĻ•āϟāĻŋ Backend āχāύāĻ¸ā§āϟāϞ āĻĨāĻžāĻ•āĻžāϰ āĻĒāϰ, 🤗 Transformers āĻāχāĻ­āĻžāĻŦ⧇ pip āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧇ āχāύāĻ¸ā§āϟāϞ āĻ•āϰ⧁āύ: + + + +```bash +pip install transformers +``` + + +āφāĻĒāύāĻŋ āϝāĻĻāĻŋ āωāĻĻāĻžāĻšāϰāĻŖ āĻ¸ā§āĻ•ā§āϰāĻŋāĻĒā§āϟ āĻŦāĻž āϕ⧋āĻĄā§‡āϰ āϏāĻ°ā§āĻŦāĻļ⧇āώ āϏāĻ‚āĻ¸ā§āĻ•āϰāĻŖ āϚāĻžāύ āĻāĻŦāĻ‚ āύāϤ⧁āύ āϰāĻŋāϞāĻŋāϜ āύāĻž āφāϏāĻž āĻĒāĻ°ā§āϝāĻ¨ā§āϤ āĻ…āĻĒ⧇āĻ•ā§āώāĻž āĻ•āϰāϤ⧇ āύāĻž āϚāĻžāύ, āϤāĻžāĻšāϞ⧇ [āϏ⧋āĻ°ā§āϏ āĻĨ⧇āϕ⧇ āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āχāύāĻ¸ā§āϟāϞ](https://huggingface.co/docs/transformers/installation#installing-from-source) āĻ•āϰ⧁āύāĨ¤ + +### conda āĻĻāĻŋāϝāĻŧ⧇ + +conda āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰ⧇āĻ“ 🤗 Transformers āχāύāĻ¸ā§āϟāϞ āĻ•āϰāĻž āϝāĻžāϝāĻŧ: + + + +```shell script +conda install conda-forge::transformers +``` + + +> **_āύ⧋āϟ:_** `huggingface`-āĻšā§āϝāĻžāύ⧇āϞ āĻĨ⧇āϕ⧇ `transformers` āχāύāĻ¸ā§āϟāϞ āĻ•āϰāĻž āĻĒ⧁āϰāύ⧋ āĻĒāĻĻā§āϧāϤāĻŋāĨ¤ + +Flax, PyTorch, āĻŦāĻž TensorFlow-āĻāϰ āχāύāĻ¸ā§āϟāϞ⧇āĻļāύ āϜāĻžāύāϤ⧇ āϤāĻžāĻĻ⧇āϰ āĻ…āĻĢāĻŋāϏāĻŋāϝāĻŧāĻžāϞ āĻ—āĻžāχāĻĄ āĻĻ⧇āϖ⧁āύāĨ¤ + +> **_āύ⧋āϟ:_** āωāχāĻ¨ā§āĻĄā§‹āĻœā§‡ āϕ⧇āĻļāĻŋāĻ‚ āϏ⧁āĻŦāĻŋāϧāĻž āύāĻŋāϤ⧇ āφāĻĒāύāĻžāϕ⧇ developers' mode āϚāĻžāϞ⧁ āĻ•āϰāϤ⧇ āĻŦāϞāĻž āĻšāϤ⧇ āĻĒāĻžāϰ⧇āĨ¤ āĻāϟāĻŋ āύāĻž āĻĒāĻžāϰāϞ⧇ [āĻāχ āχāĻ¸ā§āϝ⧁āϤ⧇](https://github.com/huggingface/huggingface_hub/issues/1062) āϜāĻžāύāĻžāύāĨ¤ + +## āĻŽāĻĄā§‡āϞ āφāĻ°ā§āĻ•āĻŋāĻŸā§‡āĻ•āϚāĻžāϰ + +**[āϏāĻŦ āĻŽāĻĄā§‡āϞ-āĻšā§‡āĻ•āĻĒāϝāĻŧ⧇āĻ¨ā§āϟ](https://huggingface.co/models)**, āϝāĻž 🤗 Transformers āϏāϰāĻŦāϰāĻžāĻš āĻ•āϰ⧇, huggingface.co [Model Hub](https://huggingface.co/models) āĻĨ⧇āϕ⧇ āϏāϰāĻžāϏāϰāĻŋ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāĻž āϝāĻžāϝāĻŧ, āĻŦā§āϝāĻŦāĻšāĻžāϰāĻ•āĻžāϰ⧀ āĻ“ āϏāĻ‚āĻ—āĻ āύ āωāĻ­āϝāĻŧāχ āϏ⧇āĻ–āĻžāύ⧇ āφāĻĒāϞ⧋āĻĄ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇āύāĨ¤ + +āĻŦāĻ°ā§āϤāĻŽāĻžāύ⧇ āĻšā§‡āĻ•āĻĒāϝāĻŧ⧇āĻ¨ā§āϟ āϏāĻ‚āĻ–ā§āϝāĻž: ![](https://img.shields.io/endpoint?url=https://huggingface.co/api/shields/models&color=brightgreen) + +🤗 Transformers āĻŦāĻ°ā§āϤāĻŽāĻžāύ⧇ āύāĻŋāĻŽā§āύ⧋āĻ•ā§āϤ āφāĻ°ā§āĻ•āĻŋāĻŸā§‡āĻ•āϚāĻžāϰ āϏāϰāĻŦāϰāĻžāĻš āĻ•āϰ⧇: āĻŦāĻŋāĻ¸ā§āϤāĻžāϰāĻŋāϤ āĻĻ⧇āĻ–āϤ⧇ [āĻāĻ–āĻžāύ⧇ āĻ•ā§āϞāĻŋāĻ• āĻ•āϰ⧁āύ](https://huggingface.co/docs/transformers/model_summary)āĨ¤ + +āĻĒā§āϰāĻ¤ā§āϝ⧇āĻ• āĻŽāĻĄā§‡āϞ⧇ Flax, PyTorch āĻŦāĻž TensorFlow āĻŦāĻžāĻ¸ā§āϤāĻŦāĻžāϝāĻŧāύ āφāϛ⧇ āĻ•āĻŋāύāĻž āĻāĻŦāĻ‚ 🤗 Tokenizers āĻĻā§āĻŦāĻžāϰāĻž āϏāĻŽāĻ°ā§āĻĨāĻŋāϤ āĻŸā§‹āϕ⧇āύāĻžāχāϜāĻžāϰ āϰāϝāĻŧ⧇āϛ⧇ āĻ•āĻŋāύāĻž āϜāĻžāύāϤ⧇, [āĻāχ āĻŸā§‡āĻŦāĻŋāϞ](https://huggingface.co/docs/transformers/index#supported-frameworks) āĻĻ⧇āϖ⧁āύāĨ¤ + +āĻāχāϏāĻŦ āĻŦāĻžāĻ¸ā§āϤāĻŦāĻžāϝāĻŧāύ āĻŦāĻŋāĻ­āĻŋāĻ¨ā§āύ āĻĄā§‡āϟāĻžāϏ⧇āĻŸā§‡ āĻĒāϰ⧀āĻ•ā§āώāĻž āĻ•āϰāĻž āĻšāϝāĻŧ⧇āϛ⧇ (āωāĻĻāĻžāĻšāϰāĻŖ āĻ¸ā§āĻ•ā§āϰāĻŋāĻĒā§āϟ āĻĻ⧇āϖ⧁āύ) āĻāĻŦāĻ‚ āĻŽā§‚āϞ āĻŦāĻžāĻ¸ā§āϤāĻŦāĻžāϝāĻŧāύ⧇āϰ āĻĢāϞāĻžāĻĢāϞ⧇āϰ āϏāĻžāĻĨ⧇ āĻŽāĻŋāϞ⧇ āϝāĻžāĻ“āϝāĻŧāĻžāϰ āĻ•āĻĨāĻžāĨ¤ āφāϰāĻ“ āĻŦāĻŋāĻ¸ā§āϤāĻžāϰāĻŋāϤ āϜāĻžāύāϤ⧇ āωāĻĻāĻžāĻšāϰāĻŖ āϏ⧇āĻ•āĻļāύ āĻ“ [āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ](https://github.com/huggingface/transformers/tree/main/examples) āĻšā§‡āĻ• āĻ•āϰ⧁āύāĨ¤ + +## āφāϰāĻ“ āϜāĻžāύ⧁āύ + +| āĻŦāĻŋāĻ­āĻžāĻ— | āĻŦāĻ°ā§āĻŖāύāĻž | +|---|---| +| [āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ](https://huggingface.co/docs/transformers/) | āϏāĻŽā§āĻĒā§‚āĻ°ā§āĻŖ API āĻĄāϕ⧁āĻŽā§‡āĻ¨ā§āĻŸā§‡āĻļāύ āĻ“ āϟāĻŋāωāĻŸā§‹āϰāĻŋāϝāĻŧāĻžāϞ | +| [āĻ•āĻžāĻœā§‡āϰ āϏāĻ‚āĻ•ā§āώāĻŋāĻĒā§āϤ āϤāĻžāϞāĻŋāĻ•āĻž](https://huggingface.co/docs/transformers/task_summary) | 🤗 Transformers āĻĻā§āĻŦāĻžāϰāĻž āϏāĻŽāĻ°ā§āĻĨāĻŋāϤ āϟāĻžāĻ¸ā§āĻ• | +| [āĻĒā§āϰāĻŋāĻĒā§āϰāϏ⧇āϏāĻŋāĻ‚ āϟāĻŋāωāĻŸā§‹āϰāĻŋāϝāĻŧāĻžāϞ](https://huggingface.co/docs/transformers/preprocessing) | āĻĄā§‡āϟāĻž āĻŽāĻĄā§‡āϞ⧇āϰ āϜāĻ¨ā§āϝ āĻĒā§āϰāĻ¸ā§āϤ⧁āϤ āĻ•āϰāϤ⧇ `Tokenizer`-āĻ•ā§āϞāĻžāϏ⧇āϰ āĻŦā§āϝāĻŦāĻšāĻžāϰ | +| [āĻŸā§āϰ⧇āύāĻŋāĻ‚ āĻ“ āĻĢāĻžāχāύ-āϟāĻŋāωāύāĻŋāĻ‚](https://huggingface.co/docs/transformers/training) | PyTorch/TensorFlow āϞ⧁āĻĒ āĻ“ `Trainer`-API-āϰ āϏāĻžāĻĨ⧇ āĻŽāĻĄā§‡āϞ āĻĢāĻžāχāύ-āϟāĻŋāωāύāĻŋāĻ‚ | +| [āĻĻā§āϰ⧁āϤ āĻļ⧁āϰ⧁: āĻĢāĻžāχāύāϟāĻŋāωāύāĻŋāĻ‚/āĻāĻĒā§āϞāĻŋāϕ⧇āĻļāύ āĻ¸ā§āĻ•ā§āϰāĻŋāĻĒā§āϟ](https://github.com/huggingface/transformers/tree/main/examples) | āĻŦāĻšā§āĻŦāĻŋāϧ āϟāĻžāĻ¸ā§āϕ⧇ āĻŽāĻĄā§‡āϞ āĻĢāĻžāχāύāϟāĻŋāωāύāĻŋāĻ‚āϝāĻŧ⧇āϰ āϜāĻ¨ā§āϝ āύāĻŽā§āύāĻž āĻ¸ā§āĻ•ā§āϰāĻŋāĻĒā§āϟ | +| [āĻŽāĻĄā§‡āϞ āφāĻĒāϞ⧋āĻĄ āĻ“ āĻļ⧇āϝāĻŧāĻžāϰ](https://huggingface.co/docs/transformers/model_sharing) | āφāĻĒāύāĻžāϰ āĻĢāĻžāχāύ-āϟāĻŋāωāύ āĻŽāĻĄā§‡āϞ āφāĻĒāϞ⧋āĻĄ āĻ•āϰ⧁āύ āĻ“ āĻ•āĻŽāĻŋāωāύāĻŋāϟāĻŋāϤ⧇ āĻļ⧇āϝāĻŧāĻžāϰ āĻ•āϰ⧁āύ | + +## āϰ⧇āĻĢāĻžāϰ⧇āĻ¨ā§āϏ + +āφāĻŽāĻžāĻĻ⧇āϰ [āĻāĻ•āϟāĻŋ āĻĒ⧇āĻĒāĻžāϰ](https://www.aclweb.org/anthology/2020.emnlp-demos.6/) āφāϛ⧇, āϝāĻž āφāĻĒāύāĻŋ 🤗 Transformers āϞāĻžāχāĻŦā§āϰ⧇āϰāĻŋ āϰ⧇āĻĢāĻžāϰ⧇āĻ¨ā§āϏ āĻ•āϰāϤ⧇ āĻŦā§āϝāĻŦāĻšāĻžāϰ āĻ•āϰāϤ⧇ āĻĒāĻžāϰ⧇āύāĨ¤ + + +```bibtex +@inproceedings{wolf-etal-2020-transformers, + title = "Transformers: State-of-the-Art Natural Language Processing", + author = "Thomas Wolf and Lysandre Debut and Victor Sanh and Julien Chaumond and Clement Delangue and Anthony Moi and Pierric Cistac and Tim Rault and RÊmi Louf and Morgan Funtowicz and Joe Davison and Sam Shleifer and Patrick von Platen and Clara Ma and Yacine Jernite and Julien Plu and Canwen Xu and Teven Le Scao and Sylvain Gugger and Mariama Drame and Quentin Lhoest and Alexander M. Rush", + booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations", + month = oct, + year = "2020", + address = "Online", + publisher = "Association for Computational Linguistics", + url = "https://www.aclweb.org/anthology/2020.emnlp-demos.6", + pages = "38--45" +} +``` diff --git a/i18n/README_de.md b/i18n/README_de.md index b913df894dc1..09fb092aa772 100644 --- a/i18n/README_de.md +++ b/i18n/README_de.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_es.md b/i18n/README_es.md index d31b7f5f76c3..68b1e0e3a901 100644 --- a/i18n/README_es.md +++ b/i18n/README_es.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_fr.md b/i18n/README_fr.md index 6512b4af0700..d8526a138343 100644 --- a/i18n/README_fr.md +++ b/i18n/README_fr.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_hd.md b/i18n/README_hd.md index 1eb220efadc0..cbe7056ce11d 100644 --- a/i18n/README_hd.md +++ b/i18n/README_hd.md @@ -72,6 +72,7 @@ checkpoint: ā¤œā¤žā¤ā¤š ā¤Ŧā¤ŋ⤂ā¤ĻāĨ ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_ja.md b/i18n/README_ja.md index 5d5db4993239..530d9f8d248c 100644 --- a/i18n/README_ja.md +++ b/i18n/README_ja.md @@ -82,6 +82,7 @@ user: ãƒĻãƒŧã‚ļ ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_ko.md b/i18n/README_ko.md index fded56a37c9b..9d740abcc15d 100644 --- a/i18n/README_ko.md +++ b/i18n/README_ko.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_pt-br.md b/i18n/README_pt-br.md index e3c71c6a3f35..7f52aaa234d6 100644 --- a/i18n/README_pt-br.md +++ b/i18n/README_pt-br.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_ru.md b/i18n/README_ru.md index c30237fef885..5d02415dbbea 100644 --- a/i18n/README_ru.md +++ b/i18n/README_ru.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_te.md b/i18n/README_te.md index aee579b52abd..f0a7d50a2819 100644 --- a/i18n/README_te.md +++ b/i18n/README_te.md @@ -49,6 +49,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_ur.md b/i18n/README_ur.md index bba5988e7717..0a6044de9de1 100644 --- a/i18n/README_ur.md +++ b/i18n/README_ur.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_vi.md b/i18n/README_vi.md index f78e3b6d4e9b..1a387767c16d 100644 --- a/i18n/README_vi.md +++ b/i18n/README_vi.md @@ -47,6 +47,7 @@ limitations under the License. ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_zh-hans.md b/i18n/README_zh-hans.md index decbf5e21b1d..d04806174c9b 100644 --- a/i18n/README_zh-hans.md +++ b/i18n/README_zh-hans.md @@ -72,6 +72,7 @@ checkpoint: æŖ€æŸĨį‚š ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ | diff --git a/i18n/README_zh-hant.md b/i18n/README_zh-hant.md index da6ed40910ea..66527a404d03 100644 --- a/i18n/README_zh-hant.md +++ b/i18n/README_zh-hant.md @@ -84,6 +84,7 @@ user: äŊŋᔍ者 ā°¤āą†ā°˛āąā°—āą | Français | Deutsch | + āĻŦāĻžāĻ‚āϞāĻž | Tiáēŋng Viáģ‡t | Ø§Ų„ØšØąØ¨ŲŠØŠ | Ø§ØąØ¯Ųˆ |