Turkish-Llama-8b-DPO-v0.1 Open-Source Large Model - Free Deployment to Boost Turkish Text Generation

Turkish Llama 8b DPO V0.1

Developed by ytu-ce-cosmos

CosmosLLaMa-Instruction DPO is a large language model specifically designed for Turkish text generation tasks, capable of continuing text fragments in a coherent and contextually appropriate manner.

Large Language Model

Safetensors

Other#Turkish text generation #DPO optimized dialogue #Multi-turn instruction understanding

Downloads 5,182

Release Time : 9/4/2024

Model Overview

This model is the latest iteration of CosmosLLaMa, developed by merging two independently trained CosmosLLaMa-Instruction DPO models, suitable for Turkish text generation tasks.

Model Features

Turkish Language Optimization

Specifically optimized for Turkish, capable of generating high-quality Turkish text.

Instruction Following

Capable of understanding and executing user instructions, completing tasks step-by-step and explaining each step.

DPO Training

Trained using Direct Preference Optimization (DPO) method, improving the quality and consistency of model outputs.

Model Capabilities

Turkish text generation

Instruction following

Task execution

Step-by-step reasoning

Use Cases

Education

Math Problem Solving

Solves math problems and explains the solution process step-by-step

As shown in the example, the model can correctly calculate car mileage and explain the calculation steps

Content Creation

Turkish Content Generation

Generates coherent Turkish text content

Capable of generating Turkish text that conforms to grammar and semantics based on context

🚀 Cosmos LLaMa Instruct-DPO

This is the newest and most advanced iteration of CosmosLLama, designed for text generation tasks.

This is the latest and most advanced version of CosmosLLama. The model is developed by merging two separately trained CosmosLLaMa - Instruct DPO models. It is tailored for text generation, capable of coherently continuing a given text snippet. However, due to the diverse training data from websites, books, and other text sources, the model may show biases. Users should be aware of these biases and use the model responsibly.

You can easily test the model through this demo link: https://cosmos.yildiz.edu.tr/cosmosllama

Cosmos LLaMa Image

🚀 Quick Start

Model Information

Property	Details
License	llama3
Language	Turkish
Pipeline Tag	text - generation
Base Model	ytu - ce - cosmos/Turkish - Llama - 8b - Instruct - v0.1
Tags	Turkish, turkish, Llama, Llama3

💻 Usage Examples

Basic Usage

import transformers
import torch

model_id = "ytu-ce-cosmos/Turkish-Llama-8b-DPO-v0.1"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "Sen bir yapay zeka asistanısın. Kullanıcı sana bir görev verecek. Amacın görevi olabildiğince sadık bir şekilde tamamlamak. Görevi yerine getirirken adım adım düşün ve adımlarını gerekçelendir."},
    {"role": "user", "content": "Soru: Bir arabanın deposu 60 litre benzin alabiliyor. Araba her 100 kilometrede 8 litre benzin tüketiyor. Depo tamamen doluyken araba kaç kilometre yol alabilir?"},
]

terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = pipeline(
    messages,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
print(outputs[0]["generated_text"][-1])

Advanced Usage

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "ytu-ce-cosmos/Turkish-Llama-8b-DPO-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "system", "content": "Sen bir yapay zeka asistanısın. Kullanıcı sana bir görev verecek. Amacın görevi olabildiğince sadık bir şekilde tamamlamak. Görevi yerine getirirken adım adım düşün ve adımlarını gerekçelendir."},
    {"role": "user", "content": "Soru: Bir arabanın deposu 60 litre benzin alabiliyor. Araba her 100 kilometrede 8 litre benzin tüketiyor. Depo tamamen doluyken araba kaç kilometre yol alabilir?"},
]

input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt"
).to(model.device)

terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = model.generate(
    input_ids,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
response = outputs[0][input_ids.shape[-1]:]
print(tokenizer.decode(response, skip_special_tokens=True))

🤝 Acknowledgments

Thanks to the generous support from the Hugging Face team, it is possible to download models from their S3 storage 🤗
Computing resources used in this work were provided by the National Center for High Performance Computing of Turkey (UHeM) under grant numbers 1016912023 and 1018512024
Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC)

Contact

COSMOS AI Research Group, Yildiz Technical University Computer Engineering Department
https://cosmos.yildiz.edu.tr/
cosmos@yildiz.edu.tr

📄 Citation

@inproceedings{kesgin2024optimizing,
  title={Optimizing Large Language Models for Turkish: New Methodologies in Corpus Selection and Training},
  author={Kesgin, H Toprak and Yuce, M Kaan and Dogan, Eren and Uzun, M Egemen and Uz, Atahan and {\.I}nce, Elif and Erdem, Yusuf and Shbib, Osama and Zeer, Ahmed and Amasyali, M Fatih},
  booktitle={2024 Innovations in Intelligent Systems and Applications Conference (ASYU)},
  pages={1--6},
  year={2024},
  organization={IEEE}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご