Sambalingo Turkish Base

Developed by sambanovasystems
SambaLingo-Turkish-Base is a pretrained bilingual (Turkish and English) model that adapts Llama-2-7b to Turkish by continuing training on 42 billion tokens from the Turkish portion of the Cultura-X dataset.
Downloads 29
Release Time: 2/15/2024

Model Overview

This is a pretrained (base) language model for Turkish and English, intended primarily for text generation and language understanding tasks.

Model Features

Bilingual support
Supports both Turkish and English, suitable for bilingual tasks.
Large-scale pre-training
Trained on 42 billion tokens from the Turkish portion of the Cultura-X dataset, optimizing Turkish language performance.
Extended vocabulary
Extends the base Llama model's vocabulary with up to 25,000 new target-language (Turkish) tokens that do not overlap with the existing vocabulary.
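
The vocabulary extension described above can be pictured with a small sketch. This is an illustration only, not the procedure the model's authors used: the function name `extend_vocabulary`, the toy token lists, and the idea of taking candidates in a given priority order are all assumptions made for the example. It shows what "up to N non-overlapping tokens" means in practice: candidates already present in the base vocabulary are skipped, and additions stop once the cap is reached.

```python
def extend_vocabulary(base_vocab, candidate_tokens, max_new_tokens=25_000):
    """Append up to max_new_tokens candidates that are not already in base_vocab.

    Hypothetical helper for illustration; candidate_tokens is assumed to be
    ordered by priority (for example, by frequency in the target language).
    """
    seen = set(base_vocab)
    extended = list(base_vocab)  # existing tokens keep their original ids
    added = 0
    for token in candidate_tokens:
        if added >= max_new_tokens:
            break  # cap reached
        if token in seen:
            continue  # overlapping (or repeated) token, skip it
        seen.add(token)
        extended.append(token)  # new tokens receive ids after the base ones
        added += 1
    return extended


# Toy data, not taken from the real tokenizer.
base = ["▁the", "▁and", "ing"]
turkish_candidates = ["▁ve", "▁bir", "ing", "▁için", "▁ve"]

extended = extend_vocabulary(base, turkish_candidates, max_new_tokens=2)
print(extended)
```

Appending new tokens after the existing ones keeps the original token ids stable, which is why the base model's learned embeddings can be reused while only the added rows need to be learned.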

Model Capabilities

Text generation
Language understanding
Bilingual translation

Use Cases

Natural Language Processing
Turkish text generation
Generate Turkish text for content creation, automated responses, and similar scenarios.
Bilingual translation
Perform translation tasks between Turkish and English.
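
Because this is a base model rather than a chat-tuned one, tasks such as translation are usually posed as few-shot prompts that the model continues. The following is a minimal sketch under stated assumptions: the Hugging Face Hub id `sambanovasystems/SambaLingo-Turkish-Base` is inferred from the developer and model name above, the helper names `build_translation_prompt` and `generate` and the "English:/Turkish:" prompt layout are hypothetical, and `device_map="auto"` assumes the `accelerate` package is installed alongside `transformers`.

```python
def build_translation_prompt(examples, source_text):
    """Build a few-shot English-to-Turkish prompt (hypothetical layout)."""
    lines = []
    for english, turkish in examples:
        lines.append(f"English: {english}")
        lines.append(f"Turkish: {turkish}")
        lines.append("")  # blank line between demonstrations
    lines.append(f"English: {source_text}")
    lines.append("Turkish:")  # the model is expected to continue from here
    return "\n".join(lines)


def generate(prompt, model_id="sambanovasystems/SambaLingo-Turkish-Base",
             max_new_tokens=64):
    """Greedy continuation of the prompt. The model id is an assumption."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, device_map="auto", torch_dtype="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Keep only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


EXAMPLES = [
    ("Good morning.", "Günaydın."),
    ("Thank you.", "Teşekkür ederim."),
]

prompt = build_translation_prompt(EXAMPLES, "Where is the library?")
print(prompt)
```

Calling `generate(prompt)` downloads the 7B-parameter weights on first use, so it is left out of the example run. For plain Turkish text generation, the same `generate` helper can be given a Turkish passage to continue instead of a translation prompt.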