🚀 英文到印地式英语语言翻译器项目
本项目旨在开发一个高性能的语言翻译模型,能够将标准英语翻译成印地式英语(一种在印度非正式交流中常用的印地语和英语的混合语言)。
🚀 快速开始
本项目包含一个由作者精心策划和制作的数据集。你可以通过以下链接购买该数据集:购买链接
✨ 主要特性
- 多语言支持:支持英语和印地式英语的翻译。
- 基于强大模型:基于
unsloth/llama-3-8b-Instruct-bnb-4bit
模型进行微调。
- 开源项目:采用
apache-2.0
许可证,可自由使用和修改。
📦 安装指南
!pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
!pip install --no-deps xformers trl peft accelerate bitsandbytes
💻 使用示例
基础用法
from unsloth import FastLanguageModel
import torch
max_seq_length = 2048
dtype = None
load_in_4bit = True
model, tokenizer = FastLanguageModel.from_pretrained(
model_name = "suyash2739/English_to_Hinglish_fintuned_lamma_3_8b_instruct",
max_seq_length = max_seq_length,
dtype = dtype,
load_in_4bit = load_in_4bit,
)
高级用法
def pipe(text):
prompt = """Translate the input from English to Hinglish to give the response.
### Input:
{}
### Response:
"""
inputs = tokenizer(
[
prompt.format(text),
], return_tensors = "pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens = 2048, use_cache = True)
raw_text = tokenizer.batch_decode(outputs)[0]
return raw_text.split("### Response:\n")[1].split("<|eot_id|>")[0]
text = "This is a fine-tuned Hinglish translation model using Llama 3."
print(pipe(text))
📚 详细文档
模型对比
English = """Finance Minister Nirmala Sitharaman said, "There used to be a poverty index...a human development index and all of them continue, but today what is keenly watched is VIX, the volatility index of the markets." Stability of the government is important for markets to be efficient, she stated. PM Narendra Modi's third term will make markets function with stability, she added."""
Gpt 4o = """ Finance Minister Nirmala Sitharaman ne kaha, "Pehle ek poverty index hota tha...ek human development index hota tha aur yeh sab ab bhi hain, lekin aaj jo sabse zyada dekha ja raha hai, woh hai VIX, jo markets ka volatility index hai." Unhone kaha ki sarkar ki stability markets ke efficient hone ke liye zaroori hai. PM Narendra Modi ka teesra term markets ko stability ke saath function karne mein madad karega, unhone joda."""
LLama model = Finance Minister Nirmala Sitharaman ne kaha, "Pehle ek poverty index hota tha... ek human development index hota tha aur sab kuch ab bhi chal raha hai, lekin aaj jo kaafi zyada dekha ja raha hai, woh VIX hai, jo markets ki volatility ka index hai." Unhone kaha ki markets ke liye sarkar ki stability zaroori hai. PM Narendra Modi ke teesre term se markets stability ke saath function karenge, unhone joda.
模型相关信息
属性 |
详情 |
模型类型 |
基于unsloth/llama-3-8b-Instruct-bnb-4bit 微调的印地式英语翻译模型 |
训练数据 |
suyash2739/News_Hinglish_English |
开发者 |
suyash2739 |
许可证 |
apache-2.0 |
模型训练加速
这个Llama模型使用Unsloth和Huggingface的TRL库进行训练,训练速度提升了2倍。
作者信息
支持作者

模型图片
