# Trillion-Token Training
## Sarashina2 13B

A large language model trained by SB Intuitions, based on the Llama2 architecture and supporting Japanese and English.

- License: MIT
- Category: Large Language Model
- Tags: Transformers · Multilingual
- Publisher: sbintuitions
- Listing stats: 1,167 · 17
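
A minimal usage sketch for this model follows. It assumes the checkpoint is published on Hugging Face under the repo id `sbintuitions/sarashina2-13b` (inferred from the listing, not confirmed here) and that the `transformers` and `accelerate` packages are installed; since this is a base model rather than an instruction-tuned one, it is prompted with a plain text prefix to complete.

```python
# Sketch: loading Sarashina2 13B for text generation.
# The repo id below is an assumption based on the listing above.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "sbintuitions/sarashina2-13b"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # keep the checkpoint's native precision
    device_map="auto",    # spread layers over available devices (needs accelerate)
)

# The model is bilingual (Japanese and English), so either works as a prompt.
prompt = "日本の首都は"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```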
## DeepSeek Coder 1.3B Base OV INT8

A 1.3-billion-parameter, multi-head-attention code generation model trained on 1 trillion tokens, supporting code completion with a 16K context window; as the name indicates, this release is Intel's OpenVINO INT8-quantized build of DeepSeek Coder 1.3B Base.

- License: MIT
- Category: Large Language Model
- Tags: Transformers · English
- Publisher: Intel
- Listing stats: 52 · 3
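
A minimal code-completion sketch for this quantized model follows. It assumes the repo id `Intel/deepseek-coder-1.3b-base-ov-int8` (inferred from the listing, not confirmed here) and uses `optimum-intel`'s `OVModelForCausalLM`, which loads a pre-exported OpenVINO model directly; install it with `pip install "optimum[openvino]"`.

```python
# Sketch: running the OpenVINO INT8 build of DeepSeek Coder 1.3B
# for code completion. The repo id below is an assumption.
from optimum.intel import OVModelForCausalLM
from transformers import AutoTokenizer

model_id = "Intel/deepseek-coder-1.3b-base-ov-int8"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = OVModelForCausalLM.from_pretrained(model_id)  # loads the quantized IR, no export step

# Base code model: give it a code prefix and let it complete.
prompt = "def fibonacci(n: int) -> int:\n"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Because the weights are already INT8-quantized for OpenVINO, inference runs on CPU without a GPU; that is the main practical difference from loading the original DeepSeek Coder checkpoint through `transformers`.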