AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Cultural corpus training

# Cultural corpus training

Llama 2 7b Ukrainian
Llama-2-7b-Ukrainian Version is a bilingual pre-trained model supporting Ukrainian and English, based on continued pre-training of Llama-2-7b using 5 billion tokens of data from CulturaX.
Large Language Model Transformers Supports Multiple Languages
L
tartuNLP
141
2
Sambalingo Turkish Base
SambaLingo-Turkish-Base is a bilingual (Turkish and English) model based on Llama-2-7b pre-training, adapted for Turkish by training on 42 billion tokens from the Turkish portion of the Cultura-X dataset.
Large Language Model Transformers Supports Multiple Languages
S
sambanovasystems
29
38
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase