Mixtral 8x22B V0.1 GGUF
Mixtral 8x22B是由MistralAI發佈的1760億參數混合專家模型,支持多語言文本生成任務。
下載量 170.27k
發布時間 : 4/10/2024
模型概述
這是一個基於混合專家架構的大規模語言模型,支持法語、英語、西班牙語、意大利語和德語等多種語言的文本生成。模型採用Apache 2.0許可證發佈,支持量化以降低資源需求。
模型特點
大規模混合專家架構
採用1760億參數的混合專家架構,其中約350億參數在推理時活躍,平衡了性能與效率
多語言支持
原生支持法語、英語、西班牙語、意大利語和德語等多種語言的文本生成
量化支持
提供從2比特到16比特的多種量化版本,顯著降低硬件需求
長上下文處理
支持長達65k tokens的上下文窗口,適合處理長文檔和複雜任務
模型能力
多語言文本生成
長文本處理
創意寫作
技術文檔生成
內容摘要
問答系統
使用案例
內容創作
網站內容生成
自動生成網站建設指南和內容
如示例所示,能生成詳細的步驟指南
技術文檔編寫
自動生成技術文檔和教程
商業應用
多語言客服
構建支持多種語言的自動客服系統
🚀 Mixtral-8x22B-v0.1-GGUF
Mixtral-8x22B-v0.1-GGUF是基於特定模型進行量化處理後的版本。它以Mixtral 8x22B模型為基礎,具備多方面的特性,如不同的量化比特位數,可在不同硬件條件下使用。該模型適用於文本生成任務,為相關應用提供了有力支持。
🚀 快速開始
下載模型
你可以僅下載所需的量化模型,而無需克隆整個倉庫,具體命令如下:
huggingface-cli download MaziyarPanahi/WizardLM-2-8x22B-GGUF --local-dir . --include '*Q2_K*gguf'
加載分片模型
llama_load_model_from_file
會自動檢測文件數量,並從其餘文件中加載額外的張量。
llama.cpp/main -m Mixtral-8x22B-v0.1.Q2_K-00001-of-00005.gguf -p "Building a website can be done in 10 simple steps:\nStep 1:" -n 1024 -e
✨ 主要特性
- 模型基礎信息:於4月10日,@MistralAI 發佈了名為 “Mixtral 8x22B” 的模型,這是一個176B的混合專家模型(MoE)。
- 參數規模:141B MoE,約35B活躍。
- 上下文長度:65k 標記。
- 微調能力:基礎模型可進行微調。
- 顯存要求:在fp16格式下需要約260GB VRAM,在int4格式下需要73GB。
- 許可協議:根據其Discord信息,該模型採用Apache 2.0許可協議。
- 可用平臺:可在 @huggingface(社區)獲取。
- 分詞器:使用與之前模型類似的分詞器。
- 量化模型:這裡的GGUF和量化模型基於 v2ray/Mixtral-8x22B-v0.1 模型。支持2-bit、3-bit、4-bit、5-bit、6-bit、8-bit、16-bit等多種量化比特位數。
💻 使用示例
基礎用法
huggingface-cli download MaziyarPanahi/WizardLM-2-8x22B-GGUF --local-dir . --include '*Q2_K*gguf'
此命令用於從Hugging Face下載指定的量化模型文件。
高級用法
llama.cpp/main -m Mixtral-8x22B-v0.1.Q2_K-00001-of-00005.gguf -p "Building a website can be done in 10 simple steps:\nStep 1:" -n 1024 -e
該命令用於加載分片模型並進行推理,輸入提示信息為構建網站的步驟,生成1024個標記的輸出。
輸出示例
system_info: n_threads = 64 / 128 | AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | MATMUL_INT8 = 0 |
sampling:
repeat_last_n = 64, repeat_penalty = 1.000, frequency_penalty = 0.000, presence_penalty = 0.000
top_k = 40, tfs_z = 1.000, top_p = 0.950, min_p = 0.050, typical_p = 1.000, temp = 0.800
mirostat = 0, mirostat_lr = 0.100, mirostat_ent = 5.000
sampling order:
CFG -> Penalties -> top_k -> tfs_z -> typical_p -> top_p -> min_p -> temperature
generate: n_ctx = 512, n_batch = 2048, n_predict = 1024, n_keep = 1
Building a website can be done in 10 simple steps:
Step 1: Pick a domain name
The domain name is your address on the Internet. It’s what people type into the browser to get to your website. It’s important to pick a domain name that is easy to remember and relates to your business. For example, if you were a plumber, you could register a domain like fixitplumbing.com. You can check the availability of a domain name with the WHOIS lookup tool. If your domain is available, you can register it at a domain registrar like GoDaddy.com or Domain.com.
Step 2: Sign up for a web hosting account
Web hosting is the service that stores your website files and makes them available to people on the Internet. It’s important to pick a web hosting provider that is reliable and has good customer service. Some popular web hosting providers include Bluehost, Hostgator, and Dreamhost.
Step 3: Create a website template
A website template is a pre-designed website that you can use as a starting point for your own website. There are many free website templates available online. Once you’ve found a template you like, you can download it and start customizing it to fit your needs.
Step 4: Add your content
Once you’ve chosen a template, you’ll need to add your own content to it. This includes things like your company logo, contact information, and text about your business. You can also add photos and videos to make your website more engaging.
Step 5: Test your website
Before you make your website live, it’s important to test it out. This includes checking for broken links, typos, and making sure that all of your content is correct. You can also ask friends or family to test your website and give you feedback.
Step 6: Launch your website
Once you’re happy with your website, you can make it live on the Internet. This process is called “launching” your website. You’ll need to upload your website files to your web hosting account and then point your domain name to your hosting account. Once you’ve done this, your website will be available to people on the Internet.
Step 7: Promote your website
Just because you’ve built a website doesn’t mean people will automatically find it. You need to promote your website to get people to visit it. This includes things like search engine optimization (SEO) and social media marketing.
Step 8: Track your website’s progress
Once you’ve built your website, you need to track its progress. This includes things like traffic, search engine rankings, and conversion rates. By tracking your website’s progress, you can make sure that it’s working properly and that people are finding it.
Step 9: Keep your website up-to-date
Just because you’ve built your website doesn’t mean you’re done. You need to keep your website up-to-date by adding new content and fixing any errors that occur. By keeping your website up-to-date, you can make sure that it’s always available to people on the Internet.
Step 10: Repeat steps 1-10
Once you’ve built your website, you need to promote it so that people can find it. You can do this by adding your website to search engines, directories, and social media sites. You can also promote your website by word-of-mouth and by giving people your business card.
Once you’ve promoted your website, you need to keep track of how it’s doing. You can do this by using website analytics tools. These tools will help you see how many people are visiting your website, where they’re coming from, and what they’re doing on your site.
If you want to keep your website up-to-date, you need to add new content on a regular basis. You can do this by writing blog posts, creating infographics, or recording videos. You can also add new content by updating your website’s design and by adding new features.
By following these steps, you can build a website that’s available to people on the Internet. You can also keep your website
由於這似乎是一個基礎模型,它會持續生成內容。
📄 許可證
該模型採用Apache 2.0許可協議。
🔗 致謝
- MistralAI 開放模型權重。
- v2ray 下載、轉換並與社區共享 Mixtral-8x22B-v0.1 模型。
- philschmid 在其Twitter上分享的圖片。
Phi 2 GGUF
其他
Phi-2是微軟開發的一個小型但強大的語言模型,具有27億參數,專注於高效推理和高質量文本生成。
大型語言模型 支持多種語言
P
TheBloke
41.5M
205
Roberta Large
MIT
基於掩碼語言建模目標預訓練的大型英語語言模型,採用改進的BERT訓練方法
大型語言模型 英語
R
FacebookAI
19.4M
212
Distilbert Base Uncased
Apache-2.0
DistilBERT是BERT基礎模型的蒸餾版本,在保持相近性能的同時更輕量高效,適用於序列分類、標記分類等自然語言處理任務。
大型語言模型 英語
D
distilbert
11.1M
669
Llama 3.1 8B Instruct GGUF
Meta Llama 3.1 8B Instruct 是一個多語言大語言模型,針對多語言對話用例進行了優化,在常見的行業基準測試中表現優異。
大型語言模型 英語
L
modularai
9.7M
4
Xlm Roberta Base
MIT
XLM-RoBERTa是基於100種語言的2.5TB過濾CommonCrawl數據預訓練的多語言模型,採用掩碼語言建模目標進行訓練。
大型語言模型 支持多種語言
X
FacebookAI
9.6M
664
Roberta Base
MIT
基於Transformer架構的英語預訓練模型,通過掩碼語言建模目標在海量文本上訓練,支持文本特徵提取和下游任務微調
大型語言模型 英語
R
FacebookAI
9.3M
488
Opt 125m
其他
OPT是由Meta AI發佈的開放預訓練Transformer語言模型套件,參數量從1.25億到1750億,旨在對標GPT-3系列性能,同時促進大規模語言模型的開放研究。
大型語言模型 英語
O
facebook
6.3M
198
1
基於transformers庫的預訓練模型,適用於多種NLP任務
大型語言模型
Transformers

1
unslothai
6.2M
1
Llama 3.1 8B Instruct
Llama 3.1是Meta推出的多語言大語言模型系列,包含8B、70B和405B參數規模,支持8種語言和代碼生成,優化了多語言對話場景。
大型語言模型
Transformers 支持多種語言

L
meta-llama
5.7M
3,898
T5 Base
Apache-2.0
T5基礎版是由Google開發的文本到文本轉換Transformer模型,參數規模2.2億,支持多語言NLP任務。
大型語言模型 支持多種語言
T
google-t5
5.4M
702
精選推薦AI模型
Llama 3 Typhoon V1.5x 8b Instruct
專為泰語設計的80億參數指令模型,性能媲美GPT-3.5-turbo,優化了應用場景、檢索增強生成、受限生成和推理任務
大型語言模型
Transformers 支持多種語言

L
scb10x
3,269
16
Cadet Tiny
Openrail
Cadet-Tiny是一個基於SODA數據集訓練的超小型對話模型,專為邊緣設備推理設計,體積僅為Cosmo-3B模型的2%左右。
對話系統
Transformers 英語

C
ToddGoldfarb
2,691
6
Roberta Base Chinese Extractive Qa
基於RoBERTa架構的中文抽取式問答模型,適用於從給定文本中提取答案的任務。
問答系統 中文
R
uer
2,694
98