# Falcon-H1

Falcon-H1 is a family of hybrid-head language models developed by TII, offering high-performance solutions across multiple languages and a variety of NLP tasks.
## Quick Start

Currently, you can use this model with Hugging Face `transformers`, vLLM, or a custom fork of the `llama.cpp` library. Make sure to install the latest version of `transformers` or vLLM; you can install these packages from source if needed.
## Features

- **Multilingual Support**: Supports a wide range of languages including Arabic, Czech, German, English, Spanish, French, Hindi, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Romanian, Russian, Swedish, Urdu, and Chinese.
- **Hybrid Architecture**: Utilizes a hybrid Transformers + Mamba architecture.
- **Causal Decoder-Only**: A causal decoder-only model type.
## Installation

Install `transformers` from source:

```bash
pip install git+https://github.com/huggingface/transformers.git
```

Refer to the official vLLM documentation for more details on building vLLM from source.
## Usage Examples

### Basic Usage

#### Using transformers

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tiiuae/Falcon-H1-1B-Base"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Generate a short completion from a prompt
inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
#### Using vLLM

```bash
# pip install vllm
vllm serve tiiuae/Falcon-H1-1B-Instruct --tensor-parallel-size 2 --data-parallel-size 1
```
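Once the server is up, vLLM exposes an OpenAI-compatible HTTP API. A minimal sketch of a chat-completion request body is shown below; `http://localhost:8000/v1` is vLLM's default endpoint, and the prompt is purely illustrative:

```python
import json

# Build the JSON body for a POST to http://localhost:8000/v1/chat/completions
# (localhost:8000 is vLLM's default bind address; adjust if you changed it).
payload = {
    "model": "tiiuae/Falcon-H1-1B-Instruct",
    "messages": [
        {"role": "user", "content": "Explain what a hybrid Transformer + Mamba model is."}
    ],
    "max_tokens": 128,
}
body = json.dumps(payload)
print(body)

# To actually send it, e.g. with the standard library:
# import urllib.request
# req = urllib.request.Request(
#     "http://localhost:8000/v1/chat/completions",
#     data=body.encode(),
#     headers={"Content-Type": "application/json"},
# )
# print(urllib.request.urlopen(req).read().decode())
```

Any OpenAI-compatible client (for example the `openai` Python package pointed at the local base URL) works the same way.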
#### Using llama.cpp

You can install the fork of the library and use it directly: https://github.com/tiiuae/llama.cpp-Falcon-H1. Follow the same installation guidelines as for upstream `llama.cpp`.
## Documentation

### Model Details

| Property | Details |
| --- | --- |
| Developed by | https://www.tii.ae |
| Model Type | Causal decoder-only |
| Architecture | Hybrid Transformers + Mamba architecture |
| Language(s) (NLP) | English, Multilingual |
| License | Falcon-LLM License |
### Training Details

For more details about the training protocol of this model, please refer to the Falcon-H1 technical blogpost.
### Evaluation

The Falcon-H1 series performs very well on a variety of tasks, including reasoning tasks.
| Tasks | Falcon-H1-1.5B | Qwen3-1.7B | Qwen2.5-1.5B | Gemma3-1B | Llama3.2-1B | Falcon3-1B |
| --- | --- | --- | --- | --- | --- | --- |
| **General** | | | | | | |
| BBH | 46.57 | 43.05 | 40.55 | 30.26 | 30.72 | 35.24 |
| MMLU | 61.81 | 62.46 | 61.13 | 26.33 | 32.39 | 45.14 |
| ARC-C | 53.24 | 55.72 | 54.27 | 39.33 | 39.42 | 47.87 |
| HellaSwag | 66.76 | 67.09 | 67.86 | 62.94 | 65.73 | 62.3 |
| Winogrande | 65.59 | 66.3 | 64.56 | 62.59 | 62.75 | 61.17 |
| **Math** | | | | | | |
| GSM8k | 52.01 | 70.74 | 63.0 | 2.2 | 7.05 | 34.95 |
| MATH lvl5 | 20.39 | 16.39 | 8.84 | 1.21 | 0.98 | 3.4 |
| **Science** | | | | | | |
| GPQA | 29.11 | 29.45 | 28.36 | 24.66 | 23.57 | 27.85 |
| MMLU-Pro | 35.53 | 33.81 | 28.72 | 11.31 | 11.8 | 16.11 |
| MMLU-stem | 63.37 | 61.53 | 54.93 | 27.59 | 30.19 | 40.06 |
| **Code** | | | | | | |
| HumanEval | 50.0 | 67.68 | 35.37 | 6.71 | 18.9 | 10.37 |
| HumanEval+ | 42.68 | 60.98 | 29.27 | 5.49 | 16.46 | 9.15 |
| MBPP | 65.08 | 67.72 | 60.05 | 12.7 | 35.98 | 12.43 |
| MBPP+ | 55.03 | 58.99 | 49.47 | 9.52 | 29.89 | 9.52 |
You can find more detailed benchmarks in our release blogpost.
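As a quick sanity check, the per-category numbers from the table above can be averaged in a few lines. The scores below are copied from the General rows for Falcon-H1-1.5B; the average is just an illustrative aggregate, not an officially reported metric:

```python
# Falcon-H1-1.5B scores from the "General" rows of the benchmark table
general = {
    "BBH": 46.57,
    "MMLU": 61.81,
    "ARC-C": 53.24,
    "HellaSwag": 66.76,
    "Winogrande": 65.59,
}
avg = sum(general.values()) / len(general)
print(f"Falcon-H1-1.5B average (General): {avg:.2f}")  # → 58.79
```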
## Useful Links
## License

This model is under the Falcon-LLM License.
## Citation

If the Falcon-H1 family of models was helpful to your work, feel free to cite it:

```bibtex
@misc{tiifalconh1,
    title = {Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance},
    url = {https://falcon-lm.github.io/blog/falcon-h1},
    author = {Falcon-LLM Team},
    month = {May},
    year = {2025}
}
```