M

Mistral Supra

Developed by TRI-ML
Mistral-SUPRA is a linear RNN model initialized based on Mistral-7B, combining the functions of Transformer and recurrent models.
Downloads 163
Release Time : 4/9/2024

Model Overview

This model transforms Mistral-7B into a linear RNN through a specific training process, supporting the selection of parallel or recurrent modes during inference, and is suitable for text generation tasks.

Model Features

Linear RNN Architecture
Transforms Mistral-7B into a linear RNN, combining the functions of Transformer and recurrent models
Dual-Mode Inference
Supports both parallel and recurrent inference modes, which can be selected according to requirements
Efficient Training
Completes training in only 1.5 days on a dataset of 100B tokens

Model Capabilities

Text Generation
Language Understanding

Use Cases

Natural Language Processing
Text Completion
Generates coherent subsequent content based on a given text fragment
Example output: 'Machine learning is a branch of artificial intelligence (AI) that enables computers to learn from experience...'
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase