Mamba-2.8B-HF Open-Source Language Model - Free to Use, Compatible with HuggingFace for Diverse Conversations

Mamba 2.8b Hf

Developed by state-spaces

A 2.8 billion parameter language model based on the Mamba architecture, compatible with HuggingFace Transformers library

Large Language Model

Transformers

#Efficient Sequence Modeling #Lightweight Fine-tuning #Long Text Generation

Downloads 8,731

Release Time: 3/5/2024

Model Overview

An efficient sequence model built on the Mamba architecture for causal language modeling

Model Features

Efficient Architecture

Uses the Mamba architecture, a selective state-space model whose compute scales linearly with sequence length, rather than quadratically as attention does in traditional Transformers

Optimization Support

Uses the optional causal-conv1d and mamba-ssm packages for CUDA-accelerated kernels when they are installed

PEFT Compatibility

Supports parameter-efficient fine-tuning techniques like LoRA

Model Capabilities

Text Generation

Language Understanding

Dialogue Systems

Use Cases

Dialogue Systems

Chatbot

Building natural and fluent dialogue systems

Capable of generating coherent dialogue responses

Content Generation

Text Continuation

Generating coherent text content based on prompts

Can produce contextually appropriate natural language text

🚀 Mamba

This repository provides the transformers-compatible mamba-2.8b model. The original checkpoints are untouched; the full config.json and tokenizer have been added alongside them.


📦 Installation

Until transformers 4.39.0 is released, you need to install transformers from main:

pip install git+https://github.com/huggingface/transformers@main

We also recommend installing both causal-conv1d and mamba-ssm (the version specifier is quoted so the shell does not treat ">=" as a redirect):

pip install "causal-conv1d>=1.2.0"
pip install mamba-ssm

If either of these packages is not installed, the slower "eager" implementation is used. If both are installed, the optimised CUDA kernels are used instead.
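To see ahead of time which of the two paths is likely to apply, a minimal sketch can probe for the optional packages. It assumes their import names are `causal_conv1d` and `mamba_ssm` (the pip names differ slightly), and the helper `pick_mamba_backend` is hypothetical, not part of transformers. It only checks whether the packages are importable; the real selection happens inside transformers.

```python
from importlib.util import find_spec

# Import names of the optional kernel packages (assumed; the pip names
# are "causal-conv1d" and "mamba-ssm").
OPTIONAL_KERNEL_MODULES = ("causal_conv1d", "mamba_ssm")


def pick_mamba_backend(is_installed=None):
    """Return (backend, missing) where backend is "cuda-kernels" only if
    every optional package is importable, and "eager" otherwise.

    `is_installed` is a hypothetical hook for testing; by default the
    check uses importlib and does not import the packages themselves.
    """
    if is_installed is None:
        is_installed = lambda name: find_spec(name) is not None
    missing = [m for m in OPTIONAL_KERNEL_MODULES if not is_installed(m)]
    backend = "eager" if missing else "cuda-kernels"
    return backend, missing


if __name__ == "__main__":
    backend, missing = pick_mamba_backend()
    print(f"expected implementation: {backend}")
    print(f"missing packages: {missing or 'none'}")
```

Running this before loading the model gives a quick hint as to why generation might be slower than expected on a given machine.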

💻 Usage Examples

Basic Usage

You can use the classic generate API:

>>> from transformers import MambaConfig, MambaForCausalLM, AutoTokenizer
>>> import torch

>>> tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-2.8b-hf")
>>> model = MambaForCausalLM.from_pretrained("state-spaces/mamba-2.8b-hf")
>>> input_ids = tokenizer("Hey how are you doing?", return_tensors="pt")["input_ids"]

>>> out = model.generate(input_ids, max_new_tokens=10)
>>> print(tokenizer.batch_decode(out))
["Hey how are you doing?\n\nI'm doing great.\n\nI"]

Advanced Usage

To fine-tune with the peft library, we recommend keeping the model in float32:

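from datasets import load_dataset
from trl import SFTTrainer
from peft import LoraConfig
from transformers import AutoTokenizer, AutoModelForCausalLM, TrainingArguments

tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-2.8b-hf")
# No torch_dtype is passed, so the weights load in float32 (the from_pretrained default).
model = AutoModelForCausalLM.from_pretrained("state-spaces/mamba-2.8b-hf")
# Small quotes dataset; the text to train on is in its "quote" column.
dataset = load_dataset("Abirate/english_quotes", split="train")

training_args = TrainingArguments(
    output_dir="./results",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    logging_dir="./logs",
    logging_steps=10,
    learning_rate=2e-3,
)

# Rank-8 LoRA adapters on the Mamba projection layers and the embeddings.
lora_config = LoraConfig(
    r=8,
    target_modules=["x_proj", "embeddings", "in_proj", "out_proj"],
    task_type="CAUSAL_LM",
    bias="none",
)

# SFTTrainer applies the LoRA config to the model and trains on the "quote" field.
# Argument names follow the trl release this example was written against;
# later releases may expect some of them elsewhere.
trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    args=training_args,
    peft_config=lora_config,
    train_dataset=dataset,
    dataset_text_field="quote",
)
trainer.train()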
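The `r=8` setting in the LoRA config determines how many parameters are actually trained. As a rough sketch of the arithmetic for a single linear layer: LoRA freezes the original weight of shape (d_out, d_in) and learns two small matrices, A of shape (r, d_in) and B of shape (d_out, r). The layer dimensions below are hypothetical and chosen only for illustration; they are not the actual Mamba-2.8B shapes, and the helper names are not part of peft.

```python
def lora_added_params(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one linear layer.

    A has shape (r, d_in) and B has shape (d_out, r).
    """
    return r * d_in + d_out * r


def frozen_params(d_in: int, d_out: int) -> int:
    """Parameters in the original (frozen) weight matrix, bias excluded."""
    return d_in * d_out


if __name__ == "__main__":
    # Hypothetical layer shape, for illustration only.
    d_in, d_out, r = 1024, 4096, 8
    added = lora_added_params(d_in, d_out, r)
    frozen = frozen_params(d_in, d_out)
    print(f"frozen weights:  {frozen}")
    print(f"LoRA parameters: {added}")
    print(f"trainable share: {added / frozen:.2%}")
```

Because the added parameter count grows linearly with `r`, doubling the rank doubles the trainable parameters per targeted layer while the frozen weights stay the same size.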
