# Ultra-long context understanding
Llama 4 Maverick 17B 128E Instruct FP8
Other
A natively multimodal model in the Llama 4 series that supports text and image understanding and uses a mixture-of-experts architecture. Suitable for commercial and research use.
Multimodal Fusion
Transformers Supports Multiple Languages

RedHatAI
5,679
1
Llama 3.1 8B UltraLong 4M Instruct
A large language model designed for processing ultra-long text sequences (variants support context windows of 1 million, 2 million, and 4 million tokens) while maintaining excellent performance on standard benchmarks.
Large Language Model
Transformers English

nvidia
264
27
Llama 3.1 Nemotron 8B UltraLong 4M Instruct
Nemotron-UltraLong-8B is a language model specifically designed for processing ultra-long text sequences, supporting a context window of up to 4 million tokens while maintaining outstanding performance on standard benchmarks.
Large Language Model
Transformers English

nvidia
4,363
103
Llama 3.1 8B UltraLong 1M Instruct
The Nemotron-UltraLong-8B series is a family of language models designed for processing ultra-long text sequences, supporting context windows of up to 4 million tokens while maintaining exceptional performance.
Large Language Model
Transformers English

nvidia
1,387
26
Llama 3.1 Nemotron 8B UltraLong 1M Instruct
A large language model designed for processing ultra-long text sequences (variants support context windows of 1 million, 2 million, and 4 million tokens) while maintaining outstanding performance on standard benchmarks.
Large Language Model
Transformers English

nvidia
4,025
40
Featured Recommended AI Models