EEVE-Korean-Instruct-10.8B-v1.0开源大模型 - 专注韩语理解与生成任务

首页

EEVE Korean Instruct 10.8B V1.0 Gguf

由 teddylee777 开发

EEVE-Korean-Instruct-10.8B-v1.0 是一个韩语指令微调的大型语言模型，基于 yanolja/EEVE-Korean-10.8B-v1.0 基础模型开发，专注于韩语理解和生成任务。

大型语言模型开源协议:Apache-2.0 #韩语指令优化 #多轮对话生成 #高质量反馈训练

下载量 626

发布时间 : 4/25/2024

模型简介

该模型是一个10.8B参数规模的韩语指令微调模型，主要用于韩语对话和指令理解任务。它基于 llama.cpp 进行了量化优化，适合在本地环境中部署使用。

模型特点

韩语优化

专门针对韩语理解和生成任务进行了优化，在韩语处理方面表现优异

指令微调

经过指令微调，能够更好地理解和执行用户指令

量化支持

支持通过 llama.cpp 进行量化，便于在资源有限的环境中部署

模型能力

韩语文本生成

指令理解与执行

对话系统

知识问答

使用案例

对话系统

智能客服

可用于构建韩语智能客服系统，处理用户咨询

个人助手

作为个人数字助手，回答用户问题和执行简单任务

教育

语言学习

辅助韩语学习者进行语言练习和答疑

🚀 yanolja/EEVE-Korean-Instruct-10.8B-v1.0

该项目基于yanolja/EEVE-Korean-10.8B-v1.0模型，使用llama.cpp进行量化处理，适用于韩语对话场景，为用户提供智能问答服务。

🚀 快速开始

原模型为 yanolja/EEVE-Korean-Instruct-10.8B-v1.0，并使用 llama.cpp 进行量化。

Ollama 的 Modelfile 配置

FROM EEVE-Korean-Instruct-10.8B-v1.0-Q8_0.gguf

TEMPLATE """{{- if .System }}
<s>{{ .System }}</s>
{{- end }}
<s>Human:
{{ .Prompt }}</s>
<s>Assistant:
"""

SYSTEM """A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions."""

PARAMETER temperature 0
PARAMETER num_predict 3000
PARAMETER num_ctx 4096
PARAMETER stop <s>
PARAMETER stop </s>

📦 训练数据

属性	详情
训练数据	Open-Orca/SlimOrca-Dedup 的韩语翻译版本；argilla/ultrafeedback-binarized-preferences-cleaned 的韩语翻译版本；未使用其他数据集

📄 许可证

本项目采用 Apache-2.0 许可证。

📚 引用

@misc{kim2024efficient,
      title={Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models}, 
      author={Seungduk Kim and Seungtaek Choi and Myeongho Jeong},
      year={2024},
      eprint={2402.14714},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

@misc{cui2023ultrafeedback,
      title={UltraFeedback: Boosting Language Models with High-quality Feedback}, 
      author={Ganqu Cui and Lifan Yuan and Ning Ding and Guanming Yao and Wei Zhu and Yuan Ni and Guotong Xie and Zhiyuan Liu and Maosong Sun},
      year={2023},
      eprint={2310.01377},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

@misc{SlimOrcaDedup,
  title = {SlimOrca Dedup: A Deduplicated Subset of SlimOrca},
  author = {Wing Lian and Guan Wang and Bleys Goodson and Eugene Pentland and Austin Cook and Chanvichet Vong and "Teknium" and Nathan Hoos},
  year = {2023},
  publisher = {HuggingFace},
  url = {https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup/}
}

@misc{mukherjee2023orca,
      title={Orca: Progressive Learning from Complex Explanation Traces of GPT-4}, 
      author={Subhabrata Mukherjee and Arindam Mitra and Ganesh Jawahar and Sahaj Agarwal and Hamid Palangi and Ahmed Awadallah},
      year={2023},
      eprint={2306.02707},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}