Dictalm2.0-instruct Open-Source Large Language Model - Free Support for Hebrew and English Chat Conversations

Dictalm2.0 Instruct

Developed by dicta-il

An instruction fine-tuned large language model based on DictaLM-2.0, specifically optimized for Hebrew and English chat scenarios

Large Language Model

Transformers

Supports Multiple Languages

Open Source License: Apache-2.0

#Hebrew Optimization #Instruction Fine-tuning #Multilingual Dialogue

Downloads 9,977

Release Time: 4/14/2024

Model Overview

This is the full-precision, instruction-tuned model for chat. It was fine-tuned on a variety of conversation datasets, with particular attention to Hebrew.

Model Features

Bilingual Support

Specifically optimized for bilingual processing capabilities in Hebrew and English

Instruction Fine-tuning

Fine-tuned through various dialogue datasets to optimize chat interaction experience

Enhanced Vocabulary

Expanded vocabulary and instruction datasets specifically for Hebrew

Model Capabilities

Text Generation

Multi-turn Dialogue

Bilingual Processing

Instruction Understanding

Use Cases

Chat Applications

Intelligent Dialogue Assistant

Can be used to build bilingual chatbots in Hebrew and English

Capable of natural and smooth multi-turn conversations

Recipe Generation

Generates cooking recipes based on user requests

Provides detailed ingredient lists and preparation steps

Language Learning

Hebrew Learning Assistant

Helps learners practice Hebrew conversations

Provides natural Hebrew interaction experience

🚀 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities

The DictaLM-2.0-Instruct Large Language Model (LLM) is an instruction fine-tuned version of the [DictaLM-2.0](https://huggingface.co/dicta-il/dictalm2.0) generative model, trained on a variety of conversation datasets. It is part of an effort to adapt large language models to Hebrew, with an enhanced vocabulary and instruction capabilities.

For full details of this model, please read our [release blog post](https://dicta.org.il/dicta-lm) or the technical report.

This is the instruct-tuned full-precision model for chat. You can try the model out on a live demo [here](https://huggingface.co/spaces/dicta-il/dictalm2.0-instruct-demo).

You can view and access the full collection of base/instruct unquantized/quantized versions of DictaLM-2.0 [here](https://huggingface.co/collections/dicta-il/dicta-lm-20-collection-661bbda397df671e4a430c27).

✨ Features

Instruction Format

To leverage instruction fine-tuning, surround your prompt with the [INST] and [/INST] tokens. The very first instruction should begin with the beginning-of-sentence id; subsequent instructions should not. The assistant's generation ends with the end-of-sentence token id.

E.g.

text = """<s>[INST] איזה רוטב אהוב עליך? [/INST]
טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!</s>[INST] האם יש לך מתכונים למיונז? [/INST]"""

This format is available as a chat template via the apply_chat_template() method.
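
As a minimal sketch of the format described above, the hypothetical helper below (build_prompt is not part of transformers or the model repository) renders a list of alternating user/assistant messages into the same string layout as the example. The exact whitespace is an assumption taken from that example; the tokenizer's own chat template remains the authoritative source.

```python
# Special tokens as they appear in the example prompt above.
BOS, EOS = "<s>", "</s>"


def build_prompt(messages):
    """Render alternating user/assistant messages in the [INST] format.

    Hypothetical helper for illustration only; whitespace follows the
    example in this document and may differ from the real chat template.
    """
    parts = [BOS]  # only the very first instruction is preceded by BOS
    for i, msg in enumerate(messages):
        expected = "user" if i % 2 == 0 else "assistant"
        if msg["role"] != expected:
            raise ValueError(f"message {i} should have role {expected!r}")
        if msg["role"] == "user":
            # User turns are wrapped in the instruction tokens.
            parts.append(f"[INST] {msg['content']} [/INST]")
        else:
            # Assistant turns start on a new line and end with EOS.
            parts.append(f"\n{msg['content']}{EOS}")
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt([{"role": "user", "content": "האם יש לך מתכונים למיונז?"}]))
```

If you tokenize such a string yourself, check whether your tokenizer also prepends a beginning-of-sentence token, so that it does not end up in the input twice.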

💻 Usage Examples

Basic Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

device = "cuda" # the device to load the model onto

model = AutoModelForCausalLM.from_pretrained("dicta-il/dictalm2.0-instruct", torch_dtype=torch.bfloat16, device_map=device)
tokenizer = AutoTokenizer.from_pretrained("dicta-il/dictalm2.0-instruct")

messages = [
    {"role": "user", "content": "איזה רוטב אהוב עליך?"},
    {"role": "assistant", "content": "טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!"},
    {"role": "user", "content": "האם יש לך מתכונים למיונז?"}
]

encoded = tokenizer.apply_chat_template(messages, return_tensors="pt").to(device)

generated_ids = model.generate(encoded, max_new_tokens=50, do_sample=True)
decoded = tokenizer.batch_decode(generated_ids)
print(decoded[0])
# <s> [INST] איזה רוטב אהוב עליך? [/INST]
# טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!</s>  [INST] האם יש לך מתכונים למיונז? [/INST]
# בטח, הנה מתכון בסיסי וקל להכנת מיונז ביתי!
# 
# מרכיבים:
# - 2 חלמונים גדולים
# - 1 כף חומץ יין לבן
# (it stopped early because we set max_new_tokens=50)
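
The decoded text above repeats the whole conversation before the new reply, because the sequences returned by generate() in this example begin with the prompt tokens. A minimal sketch of keeping only the reply is to drop the first prompt-length tokens of each sequence. The helper name strip_prompt is hypothetical, and plain lists with made-up ids stand in for token-id tensors so the snippet runs without loading the model.

```python
def strip_prompt(generated_ids, prompt_length):
    """Return, for each generated sequence, only the tokens after the prompt.

    Assumes every sequence starts with the same prompt of `prompt_length`
    tokens, as in the single-conversation example above.
    """
    return [row[prompt_length:] for row in generated_ids]


if __name__ == "__main__":
    # Toy illustration: arbitrary ids, not real tokenizer output.
    prompt = [1, 10, 11, 12]
    generated = [prompt + [20, 21, 2]]
    print(strip_prompt(generated, len(prompt)))  # [[20, 21, 2]]
```

With the variables from the example above, this would correspond to strip_prompt(generated_ids, encoded.shape[1]), followed by tokenizer.decode(reply, skip_special_tokens=True) for each reply; that assumes encoded is the 2-D tensor of prompt ids used in that example.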

🔧 Technical Details

Model Architecture

DictaLM-2.0-Instruct follows the [Zephyr-7B-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) recipe for fine-tuning an instruct model, with an extended instruct dataset for Hebrew.

📚 Documentation

Limitations

The DictaLM 2.0 Instruct model is a demonstration that the base model can be fine-tuned to achieve compelling performance. It has no moderation mechanisms. We look forward to engaging with the community on ways to make the model respect guardrails, allowing for deployment in environments that require moderated outputs.

📄 License

This model is licensed under the Apache-2.0 license.

📚 Citation

If you use this model, please cite:

@misc{shmidman2024adaptingllmshebrewunveiling,
      title={Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities}, 
      author={Shaltiel Shmidman and Avi Shmidman and Amir DN Cohen and Moshe Koppel},
      year={2024},
      eprint={2407.07080},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.07080}, 
}
