I

It5 Base Oscar

Developed by gsarti
The first large-scale sequence-to-sequence Transformer model pre-trained specifically for Italian, based on the OSCAR corpus
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is the base version of the IT5 model family, specifically pre-trained for Italian using the T5 architecture, suitable for various sequence-to-sequence tasks.

Model Features

Italian-specific pre-training
The first large-scale pre-trained sequence-to-sequence Transformer model for Italian
Based on OSCAR corpus
Trained using the Italian portion of the OSCAR corpus
Improved tokenizer
Utilizes SentencePieceUnigramTokenizer trained on the Italian portion of mC4
TPU-optimized training
Training completed on Google Cloud's TPU3v8-VM machines, sponsored by Google TPU Research Cloud

Model Capabilities

Italian text understanding
Italian text generation
Sequence-to-sequence conversion

Use Cases

Natural Language Processing
Natural language inference
Can be used for natural language inference tasks, such as premise-hypothesis relationship judgment
See fine-tuned model gsarti/it5-base-nli
Text summarization
Can be used for automatic summarization of Italian text
Machine translation
Can be used for Italian-related translation tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase