# 🚀 CausalLM 34B β

CausalLM 34B β is a powerful language model. A demo is available on Hugging Face, and the model expects a specific prompt format. Note that the current weights have some precision issues, which will be fixed in the next update.
## 🚀 Quick Start

A demo of CausalLM 34B β is available on Hugging Face at the following link:
## ✨ Features

### Prompt Format

The model uses the ChatML prompt format.
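As a minimal sketch (the system message here is only an example, not the model's required one), a ChatML prompt can be assembled like this:

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Build a ChatML prompt: each turn is wrapped in
    <|im_start|>{role} ... <|im_end|> markers, and the string ends
    with an open assistant turn for the model to complete."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

prompt = build_chatml_prompt("You are a helpful assistant.", "Hello!")
print(prompt)
```

Most chat templates shipped with ChatML models produce exactly this layout, so `tokenizer.apply_chat_template` can be used instead when available.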
## Model Issues and Precautions
- Precision Issues: The current model weights have some precision issues. In the next version update, we will roll back some progress and retrain to fix them as soon as possible.
- Inference Framework: Please do not use "accelerated inference frameworks" such as vLLM for now; use Transformers for inference instead. Otherwise, precision issues will significantly degrade output quality. If you need faster inference, you can temporarily use the q8_0 quantization with llama.cpp (for this model only, it is faster and better than bf16 under vLLM) or wait for the official version.
- Repetition Penalty: Do not set a repetition_penalty.
- Quantization Calibration: Please do not use wikitext for quantization calibration, because all wikitext content has been re-aligned on a synthetic dataset, and its distribution differs significantly from the original wikitext.
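Putting the precautions above together, a minimal Transformers-based loading sketch might look like the following (the Hub id `CausalLM/34b-beta` is taken from the contamination table; the dtype and `device_map` choices are assumptions, not official settings):

```python
def load_causallm(model_id: str = "CausalLM/34b-beta"):
    """Load the model with plain Transformers (not vLLM), per the
    precision caveats above. Defined but not called here, since the
    weights are tens of gigabytes."""
    # Imports are local so this sketch can be read without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="bfloat16",  # assumption: bf16 checkpoint
        device_map="auto",
    )
    return tokenizer, model

# Example sampling settings: note there is deliberately no
# repetition_penalty, as advised above.
generation_kwargs = {"max_new_tokens": 512, "do_sample": True, "temperature": 0.7}
```

The returned tokenizer and model can then be used with `model.generate(**inputs, **generation_kwargs)` as usual.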
## MT-Bench Score

The model achieves a score of 8.5 on MT-Bench.

## Contamination Detection
| Models | MMLU (ref: llama7b) | TBA |
| --- | --- | --- |
| microsoft/Orca-2-7b | 0.77 | |
| mistralai/Mistral-7B-v0.1 | 0.46 | |
| CausalLM/34b-beta | 0.38 | |
| 01-ai/Yi-6B-200K | 0.3 | |
The data is from https://huggingface.co/spaces/Yeyito/llm_contamination_detector. The model should be safe: it was not trained on the benchmark itself, but some contamination of the training dataset is unavoidable due to cost constraints.
## 📄 License

This project is licensed under the GPL-3.0 license.