Flan T5 Xxl Sharded Fp16
FLAN-T5 XXL is a variant of Google's T5 model, fine-tuned on over 1,000 additional tasks, supports multiple languages, and outperforms the original T5 model.
Downloads 531
Release Time : 1/27/2023
Model Overview
This is a forked version of google/flan-t5-xxl, implementing a custom handler.py as an example for using t5-11b on a single NVIDIA A10G via an inference endpoint.
Model Features
Multi-task fine-tuning
Fine-tuned on over 1,000 additional tasks, covering multiple languages and task types
Quantized version
Supports running on a single NVIDIA A10G GPU, reducing hardware requirements
Multilingual support
Supports processing and generation in over 60 languages
Superior performance
Outperforms the original T5 model with the same parameter scale
Model Capabilities
Text generation
Question answering systems
Multilingual translation
Instruction understanding
Text summarization
Use Cases
Natural language processing
Multilingual question answering system
Build an intelligent question answering system supporting multiple languages
Achieves 75.2% accuracy on five-shot MMLU
Text summarization
Automatically generate summaries of articles or documents
Machine translation
Supports mutual translation between multiple languages
Featured Recommended AI Models
ยฉ 2025AIbase