🚀 Mixtral-8x22B-Instruct-v0.1-GGUF
This repository provides GGUF quantized versions of the mistralai/Mixtral-8x22B-Instruct-v0.1 model. The models are intended for text-generation tasks and support several languages, including French, English, Spanish, Italian, and German.
🚀 Quick Start
Downloading the model
You can download only the quantization you need instead of cloning the entire repository, for example:
huggingface-cli download MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-GGUF --local-dir . --include '*Q2_K*gguf'
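If you prefer to do this from Python, huggingface_hub offers an equivalent filtered download. The sketch below mirrors the CLI example above; the Q2_K pattern is just an illustration, pick whichever quantization you need.

# Minimal sketch using huggingface_hub (pip install huggingface_hub).
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-GGUF",
    local_dir=".",
    allow_patterns=["*Q2_K*gguf"],  # download only the Q2_K shards
)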
Loading sharded models
The llama_load_model_from_file function automatically detects the number of shard files and loads the additional tensors from the remaining ones, so it is enough to point it at the first shard. Example command:
llama.cpp/main -m Mixtral-8x22B-Instruct-v0.1.Q2_K-00001-of-00005.gguf -p "Building a website can be done in 10 simple steps:\nStep 1:" -n 1024 -e
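The same first shard can also be loaded from Python via the llama-cpp-python bindings. This is only a sketch: it assumes your installed bindings wrap a llama.cpp build with split-GGUF support, and the context size and GPU offload values are placeholders.

# Sketch using llama-cpp-python (pip install llama-cpp-python).
# Pointing model_path at the first shard lets llama.cpp pick up the remaining files.
from llama_cpp import Llama

llm = Llama(
    model_path="Mixtral-8x22B-Instruct-v0.1.Q2_K-00001-of-00005.gguf",
    n_ctx=4096,       # placeholder context window
    n_gpu_layers=-1,  # offload all layers if a GPU is available
)

out = llm("Building a website can be done in 10 simple steps:\nStep 1:", max_tokens=256)
print(out["choices"][0]["text"])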
💻 Usage Examples
Basic usage
import torch
from transformers import AutoModelForCausalLM
from mistral_common.protocol.instruct.messages import (
AssistantMessage,
UserMessage,
)
from mistral_common.protocol.instruct.tool_calls import (
Tool,
Function,
)
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.tokens.instruct.normalize import ChatCompletionRequest
device = "cuda" # the device to load the model onto
tokenizer_v3 = MistralTokenizer.v3()
mistral_query = ChatCompletionRequest(
    tools=[
        Tool(
            function=Function(
                name="get_current_weather",
                description="Get the current weather",
                parameters={
                    "type": "object",
                    "properties": {
                        "location": {
                            "type": "string",
                            "description": "The city and state, e.g. San Francisco, CA",
                        },
                        "format": {
                            "type": "string",
                            "enum": ["celsius", "fahrenheit"],
                            "description": "The temperature unit to use. Infer this from the users location.",
                        },
                    },
                    "required": ["location", "format"],
                },
            )
        )
    ],
    messages=[
        UserMessage(content="What's the weather like today in Paris"),
    ],
    model="test",
)
# encode_chat_completion returns plain token ids; wrap them in a batched tensor
encodeds = tokenizer_v3.encode_chat_completion(mistral_query).tokens
model = AutoModelForCausalLM.from_pretrained("mistralai/Mixtral-8x22B-Instruct-v0.1")
model_inputs = torch.tensor([encodeds], device=device)
model.to(device)
generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
# decode with the underlying SentencePiece tokenizer from mistral_common
sp_tokenizer = tokenizer_v3.instruct_tokenizer.tokenizer
decoded = sp_tokenizer.decode(generated_ids[0].tolist())
print(decoded)
Advanced usage
# Advanced example: compare the instruct tokenizer with the HF chat template
from mistral_common.protocol.instruct.messages import (
AssistantMessage,
UserMessage,
)
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.tokens.instruct.normalize import ChatCompletionRequest
from transformers import AutoTokenizer
tokenizer_v3 = MistralTokenizer.v3()
mistral_query = ChatCompletionRequest(
messages=[
UserMessage(content="How many experts ?"),
AssistantMessage(content="8"),
UserMessage(content="How big ?"),
AssistantMessage(content="22B"),
UserMessage(content="Noice 🎉 !"),
],
model="test",
)
hf_messages = mistral_query.model_dump()['messages']
tokenized_mistral = tokenizer_v3.encode_chat_completion(mistral_query).tokens
tokenizer_hf = AutoTokenizer.from_pretrained('mistralai/Mixtral-8x22B-Instruct-v0.1')
tokenized_hf = tokenizer_hf.apply_chat_template(hf_messages, tokenize=True)
assert tokenized_hf == tokenized_mistral
📚 Documentation
Instruct tokenizer
The HuggingFace tokenizer included in this release should match our own. To compare the two, install mistral-common with the command below and then run the comparison shown in the Advanced usage example above:
pip install mistral-common
Function calling and special tokens
This tokenizer includes additional special tokens related to function calling:
- [TOOL_CALLS]
- [AVAILABLE_TOOLS]
- [/AVAILABLE_TOOLS]
- [TOOL_RESULTS]
- [/TOOL_RESULTS]
If you want to use this model with function calling, please make sure to apply it the same way it is done in our SentencePieceTokenizerV3.
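As a quick, self-contained illustration (a sketch only, assuming the Tokenized object returned by encode_chat_completion exposes a text field), you can render a request with a tool attached and inspect where these special tokens appear in the prompt:

# Sketch: see how the v3 tokenizer renders a tool schema and the special tokens.
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.tool_calls import Tool, Function
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.tokens.instruct.normalize import ChatCompletionRequest

tokenizer_v3 = MistralTokenizer.v3()
request = ChatCompletionRequest(
    tools=[
        Tool(
            function=Function(
                name="get_current_weather",
                description="Get the current weather",
                parameters={
                    "type": "object",
                    "properties": {"location": {"type": "string"}},
                    "required": ["location"],
                },
            )
        )
    ],
    messages=[UserMessage(content="What's the weather like today in Paris")],
    model="test",
)
tokenized = tokenizer_v3.encode_chat_completion(request)

# The rendered prompt wraps the JSON tool schema in [AVAILABLE_TOOLS] ... [/AVAILABLE_TOOLS];
# a completion that calls the tool starts with [TOOL_CALLS] followed by a JSON list of calls.
print(tokenized.text)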
Team members
Albert Jiang, Alexandre Sablayrolles, Alexis Tacnet, Antoine Roux, Arthur Mensch, Audrey Herblin-Stoop, Baptiste Bout, Baudouin de Monicault, Blanche Savary, Bam4d, Caroline Feldman, Devendra Singh Chaplot, Diego de las Casas, Eleonore Arcelin, Emma Bou Hanna, Etienne Metzger, Gianna Lengyel, Guillaume Bour, Guillaume Lample, Harizo Rajaona, Jean-Malo Delignon, Jia Li, Justus Murke, Louis Martin, Louis Ternon, Lucile Saulnier, Lélio Renard Lavaud, Margaret Jennings, Marie Pellat, Marie Torelli, Marie-Anne Lachaux, Nicolas Schuhl, Patrick von Platen, Pierre Stock, Sandeep Subramanian, Sophia Yang, Szymon Antoniak, Teven Le Scao, Thibaut Lavril, Timothée Lacroix, Théophile Gervet, Thomas Wang, Valera Nemychnikova, William El Sayed, William Marshall
📄 License
This project is released under the Apache-2.0 license.
📦 Model Information
| Attribute | Details |
|---|---|
| Model type | Mixtral-8x22B-Instruct-v0.1-GGUF |
| Base model | mistralai/Mixtral-8x22B-Instruct-v0.1 |
| Model creator | MaziyarPanahi |
| Quantized by | MaziyarPanahi |
| Supported languages | French, English, Spanish, Italian, German |
| Quantization types | 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit, GGUF |
| Task type | Text generation |



