🚀 LAION LeoLM: Linguistically Enhanced Open Language Model
Meet LeoLM, the first open and commercially available German Foundation Language Model built on Llama-2. It extends Llama-2's capabilities into German through continued pretraining on a large corpus of German-language and mostly locality-specific text.
🚀 Quick Start
Thanks to a compute grant at HessianAI's supercomputer 42, we release two 8k context-length foundation models, [LeoLM/leo-hessianai-7b](https://huggingface.co/LeoLM/leo-hessianai-7b) and [LeoLM/leo-hessianai-13b](https://huggingface.co/LeoLM/leo-hessianai-13b), under the [Llama-2 community license](https://huggingface.co/meta-llama/Llama-2-70b/raw/main/LICENSE.txt) (70b is also coming soon!). We hope this release will open new opportunities for German open-source and commercial LLM research and accelerate adoption. Read our blog post or paper (preprint coming soon) for more details.
A project by Björn Plüster and Christoph Schuhmann in collaboration with LAION and HessianAI.
✨ Features
- Extends Llama-2's capabilities to the German language through continued pretraining on a large German-language corpus.
- Released two 8k context-length foundation models, facilitating German open-source and commercial LLM research.
📦 Installation
Install Direct Dependencies

```bash
pip install transformers torch sentencepiece
```
Install Dependencies for Faster Inference with Flash-Attention 2

```bash
pip install packaging ninja
pip install flash-attn==v2.1.1 --no-build-isolation
pip install git+https://github.com/HazyResearch/flash-attention.git@v2.1.1#subdirectory=csrc/rotary
```
💻 Usage Examples
Basic Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

tokenizer = AutoTokenizer.from_pretrained("LeoLM/leo-hessianai-13b")
model = AutoModelForCausalLM.from_pretrained(
    "LeoLM/leo-hessianai-13b",  # first positional argument: model name or path
    device_map="auto",
    torch_dtype=torch.float16,
    trust_remote_code=True,  # enables the repository's custom model code
)
```
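Once the model is loaded, the typical next step is plain text continuation (LeoLM is a foundation model, not a chat model). The sketch below is a minimal, illustrative example: the sampling parameters are assumptions rather than the authors' recommended settings, and actually running `generate` requires downloading the weights and having sufficient GPU memory.

```python
# Minimal generation sketch for a foundation model (no chat template).
# Assumptions: the LeoLM weights are downloadable and a capable GPU is
# available; top_p/max_new_tokens values are illustrative only.

MODEL_ID = "LeoLM/leo-hessianai-13b"

def build_prompt(text: str) -> str:
    """Foundation-model prompting is plain text to be continued;
    we only strip stray whitespace here."""
    return text.strip()

def generate(prompt: str, max_new_tokens: int = 128) -> str:
    # Heavy dependencies are imported lazily so the helper above stays cheap.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        device_map="auto",
        torch_dtype=torch.float16,
        trust_remote_code=True,
    )
    inputs = tokenizer(build_prompt(prompt), return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=True, top_p=0.9
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("Die Hauptstadt von Hessen heißt"))
```

Because this is a base model, expect free-form continuations of the prompt rather than instruction-following answers.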
📚 Documentation
Model Details
| Property | Details |
|----------|---------|
| Finetuned from | [meta-llama/Llama-2-13b-hf](https://huggingface.co/meta-llama/Llama-2-13b-hf) |
| Model Type | Causal decoder-only transformer language model |
| Language | English and German |
| License | [LLAMA 2 COMMUNITY LICENSE AGREEMENT](https://huggingface.co/meta-llama/Llama-2-70b/raw/main/LICENSE.txt) |
| Contact | LAION Discord or Björn Plüster |
Training parameters

Benchmarks

📄 License
This model is released under the [LLAMA 2 COMMUNITY LICENSE AGREEMENT](https://huggingface.co/meta-llama/Llama-2-70b/raw/main/LICENSE.txt).