F

Flan T5 Xxl Sharded Fp16

Developed by philschmid
FLAN-T5 XXL is a variant of Google's T5 model, fine-tuned on over 1,000 additional tasks, supports multiple languages, and outperforms the original T5 model.
Downloads 531
Release Time : 1/27/2023

Model Overview

This is a forked version of google/flan-t5-xxl, implementing a custom handler.py as an example for using t5-11b on a single NVIDIA A10G via an inference endpoint.

Model Features

Multi-task fine-tuning
Fine-tuned on over 1,000 additional tasks, covering multiple languages and task types
Quantized version
Supports running on a single NVIDIA A10G GPU, reducing hardware requirements
Multilingual support
Supports processing and generation in over 60 languages
Superior performance
Outperforms the original T5 model with the same parameter scale

Model Capabilities

Text generation
Question answering systems
Multilingual translation
Instruction understanding
Text summarization

Use Cases

Natural language processing
Multilingual question answering system
Build an intelligent question answering system supporting multiple languages
Achieves 75.2% accuracy on five-shot MMLU
Text summarization
Automatically generate summaries of articles or documents
Machine translation
Supports mutual translation between multiple languages
Featured Recommended AI Models
ยฉ 2025AIbase