Vietnamese FlanT5 Large

Developed by Hatto
A multilingual sequence-to-sequence model based on Flan-T5-Large, supporting Vietnamese, English, and Chinese, suitable for tasks such as summarization, translation, and question answering.
Downloads 116
Release Time: 11/22/2023

Model Overview

This model is a continually pretrained version of Flan-T5-Large. It improves multilingual processing by expanding the vocabulary and training on Vietnamese, English, and Chinese data.

Model Features

Multilingual Support
Supports Vietnamese, English, and Chinese processing by retraining the tokenizer and expanding the vocabulary.
Incremental Pretraining
Ran a single round of continual pretraining on Flan-T5-Large, using over 100GB of diverse data.
Vocabulary Expansion
Retrained the tokenizer using SentencePiece, resulting in a combined vocabulary of 106,611 tokens.
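
To give a sense of what the vocabulary expansion means for model size, the sketch below estimates the embedding parameter count before and after. It assumes the hidden size (1024) and embedding row count (32,128) from the public Flan-T5-Large configuration, and takes the 106,611 figure from the description above. The released checkpoint may pad its embedding matrix to a different row count, so treat the result as an approximation.

```python
# Assumed values: D_MODEL and BASE_VOCAB come from the public Flan-T5-Large
# config; EXPANDED_VOCAB is the token count quoted in the model description.
D_MODEL = 1024
BASE_VOCAB = 32_128
EXPANDED_VOCAB = 106_611


def embedding_params(vocab_size: int, d_model: int = D_MODEL) -> int:
    """Number of weights in one vocab_size x d_model embedding matrix."""
    return vocab_size * d_model


base = embedding_params(BASE_VOCAB)
expanded = embedding_params(EXPANDED_VOCAB)
added = expanded - base

print(f"base embedding matrix:     {base:,} parameters")
print(f"expanded embedding matrix: {expanded:,} parameters")
print(f"added per matrix:          {added:,} parameters")
```

Flan-T5 checkpoints keep the input embedding and the output projection as separate matrices, so the added parameter count would apply to each of them if the expanded model follows the same layout.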

Model Capabilities

Text summarization
Machine translation
Question answering
Mask filling
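
A minimal sketch of how these capabilities might be exercised with the Hugging Face transformers library follows. The page does not document a prompt format or a repository name, so the templates in PROMPTS are hypothetical and model_id must be supplied by the reader. Mask filling is not covered here.

```python
# Hypothetical prompt templates: the model description does not specify
# a prompt format, so these are illustrative only.
PROMPTS = {
    "summarize": "Summarize the following text: {text}",
    "translate": "Translate from {src} to {tgt}: {text}",
    "qa": "Answer the question using the context. Context: {context} Question: {question}",
}


def build_prompt(task: str, **fields: str) -> str:
    """Fill the template for the given task with the supplied fields."""
    if task not in PROMPTS:
        raise ValueError(f"unknown task: {task!r}")
    return PROMPTS[task].format(**fields)


def generate(model_id: str, prompt: str, max_new_tokens: int = 128) -> str:
    """Run one prompt through a seq2seq checkpoint and return the decoded text.

    model_id is a placeholder for the checkpoint's repository name or local path.
    """
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For example, build_prompt("translate", src="Vietnamese", tgt="English", text="Xin chào") produces a prompt that can be passed to generate together with the checkpoint name. Since the model is continually pretrained rather than described as instruction-tuned on these tasks, output quality without further fine-tuning may vary.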

Use Cases

Natural Language Processing
News Summarization
Generate concise summaries of Vietnamese news content.
Multilingual Translation
Perform text translation between Vietnamese, English, and Chinese.
Legal Text Processing
Legal Document Analysis
Process Vietnamese legal documents and texts.