mt5-base Open-source Multilingual Processing Model - Free Support for Text Processing Tasks in 101 Languages

Mt5 Base

Developed by google

mT5 is a multilingual variant of the T5 model, pretrained on the mC4 corpus covering 101 languages, suitable for multilingual text processing tasks.

Large Language Model Supports Multiple LanguagesOpen Source License:Apache-2.0 #Multilingual text generation #101 language support #Unsupervised pretraining

Downloads 118.49k

Release Time : 3/2/2022

Model Overview

mT5 is a large-scale multilingual pretrained model based on the T5 architecture, supporting text-to-text transformation tasks in 101 languages. Requires fine-tuning for downstream tasks.

Model Features

Multilingual support

Covers 101 languages, including low-resource languages such as Hmong and Hawaiian

Unified text framework

Adopts T5's text-to-text unified architecture, adaptable to various NLP tasks

Large-scale pretraining

Unsupervised pretraining based on the mC4 multilingual corpus

Model Capabilities

Multilingual text generation

Machine translation

Text summarization

Question answering systems

Text classification

Use Cases

Cross-language applications

Multilingual customer service system

Supports automated Q&A and dialogue in multiple languages

Can serve user groups in 101 languages

Content processing

News summarization generation

Generates summaries for news in different languages

🚀 Google's mT5

mT5 is a multilingual pre - trained text - to - text transformer that offers powerful language processing capabilities across 101 languages.

🚀 Quick Start

mT5 is pretrained on the mC4 corpus, which covers 101 languages: Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Burmese, Catalan, Cebuano, Chichewa, Chinese, Corsican, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Kurdish, Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Nepali, Norwegian, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Samoan, Scottish Gaelic, Serbian, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Sotho, Spanish, Sundanese, Swahili, Swedish, Tajik, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, West Frisian, Xhosa, Yiddish, Yoruba, Zulu.

⚠️ Important Note

mT5 was only pre - trained on mC4 excluding any supervised training. Therefore, this model has to be fine - tuned before it is useable on a downstream task.

📚 Documentation

Pretraining Dataset

The model is pretrained on the mC4 dataset.

Other Community Checkpoints

You can find other community checkpoints here.

Paper

Authors

The authors of the paper are Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al - Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel.

Abstract

The recent "Text - to - Text Transfer Transformer" (T5) leveraged a unified text - to - text format and scale to attain state - of - the - art results on a wide variety of English - language NLP tasks. In this paper, we introduce mT5, a multilingual variant of T5 that was pre - trained on a new Common Crawl - based dataset covering 101 languages. We describe the design and modified training of mT5 and demonstrate its state - of - the - art performance on many multilingual benchmarks. All of the code and model checkpoints used in this work are publicly available.

📄 License

This project is licensed under the Apache 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご