
Granite 3.1 1B A400M Base

Developed by ibm-granite
Granite-3.1-1B-A400M-Base is a language model developed by IBM. A progressive training strategy extends its context length from 4K to 128K tokens, and it supports a range of multilingual text processing tasks.
Downloads 3,299
Release Time: 12/6/2024

Model Overview

This model is intended for text generation, summarization, classification, extraction, and question answering. It supports 12 languages and uses a sparse mixture-of-experts (MoE) Transformer architecture.

Model Features

Long Context Support
A progressive training strategy extends the context length from 4K to 128K tokens.
Multilingual Support
Supports 12 languages, including English, Chinese, and Japanese.
Sparse Mixture of Experts Architecture
Uses an MoE architecture with fine-grained experts, no-drop token routing, and a load-balancing loss.
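
To make the routing terms above concrete, here is a minimal sketch in plain Python of top-k expert routing without token dropping, plus one common form of auxiliary load-balancing loss (number of experts times the dot product of per-expert dispatch fractions and mean router probabilities). It is an illustration under stated assumptions, not the model's actual implementation: the function names, the expert count, and the top_k values are hypothetical, and the exact loss used in training is not stated on this page.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_tokens(router_logits, top_k):
    """For each token, pick its top_k experts and renormalize their weights.

    No capacity limit is applied, so every token is always assigned
    exactly top_k experts (the "no-drop" property).
    """
    assignments = []
    for logits in router_logits:
        probs = softmax(logits)
        ranked = sorted(range(len(probs)), key=lambda e: probs[e], reverse=True)
        chosen = ranked[:top_k]
        norm = sum(probs[e] for e in chosen)
        assignments.append([(e, probs[e] / norm) for e in chosen])
    return assignments


def load_balancing_loss(router_logits, assignments):
    """Auxiliary loss: num_experts * sum_e(dispatch_fraction[e] * mean_prob[e]).

    Assumed formulation for illustration; it grows when routing
    concentrates on a few experts.
    """
    num_tokens = len(router_logits)
    num_experts = len(router_logits[0])
    top_k = len(assignments[0])

    # Fraction of all routed (token, expert) slots that went to each expert.
    counts = [0] * num_experts
    for token_assignment in assignments:
        for expert, _weight in token_assignment:
            counts[expert] += 1
    dispatch_fraction = [c / (num_tokens * top_k) for c in counts]

    # Mean router probability per expert across all tokens.
    mean_prob = [0.0] * num_experts
    for logits in router_logits:
        for expert, p in enumerate(softmax(logits)):
            mean_prob[expert] += p / num_tokens

    return num_experts * sum(f * p for f, p in zip(dispatch_fraction, mean_prob))


# Hypothetical example: 4 tokens, 4 experts, every token strongly prefers expert 0.
skewed_logits = [[5.0, 0.0, 0.0, 0.0] for _ in range(4)]
skewed_assignments = route_tokens(skewed_logits, top_k=1)
skewed_loss = load_balancing_loss(skewed_logits, skewed_assignments)
```

In this sketch, a router that sends every token to the same expert yields a larger loss than a router whose probabilities are uniform, which is the pressure that encourages balanced expert usage during training.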

Model Capabilities

Text Generation
Text Summarization
Text Classification
Information Extraction
Question Answering System

Use Cases

Text Processing
Question Answering System
Answer questions raised by users, such as 'Where is the Thomas J. Watson Research Center located?'
Generate accurate answers
Text Summarization
Summarize long texts and extract key information
Generate concise summaries
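
The two use cases above could be tried with the Hugging Face transformers library roughly as follows. This is a hedged sketch: the model identifier is an assumption inferred from the developer and model names on this page, the prompt formats and helper names are hypothetical, and a base (non-instruct) model may need different prompting to give good results. The generate function requires transformers and PyTorch and downloads the model weights on first use.

```python
# Assumed Hugging Face identifier, inferred from the names on this page.
MODEL_ID = "ibm-granite/granite-3.1-1b-a400m-base"


def build_qa_prompt(question):
    """Hypothetical completion-style prompt for question answering."""
    return f"Question: {question}\nAnswer:"


def build_summary_prompt(text):
    """Hypothetical completion-style prompt for summarization."""
    return f"{text.strip()}\n\nSummary:"


def generate(prompt, max_new_tokens=64):
    """Generate a continuation of the prompt with the model."""
    # Imported lazily so the prompt helpers work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


qa_prompt = build_qa_prompt("Where is the Thomas J. Watson Research Center located?")
```

Calling generate(qa_prompt) would then return the model's continuation after "Answer:". For summarization, pass build_summary_prompt(long_text) instead and raise max_new_tokens as needed.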