C

Croissantllmchat V0.1

Developed by croissantllm
CroissantLLM is a 1.3B-parameter language model trained on 3T English-French bilingual tokens, designed for consumer hardware with fluent bilingual processing capabilities.
Downloads 3,812
Release Time : 1/24/2024

Model Overview

This model is part of the CroissantLLM initiative, trained for 190K steps (2.99T tokens) with a final chat fine-tuning phase, supporting text generation tasks in both French and English.

Model Features

Bilingual Support
Uses a 1:1 English-French pre-training data ratio, specifically optimized for French and English processing.
Efficient Operation
Designed to run smoothly on consumer hardware, suitable for research and industrial applications.
High-Quality French Corpus
Training data includes manually curated, high-quality, and diverse French corpora.
Transparent Open Source
Publicly released codebase, multiple checkpoints, fine-tuned chat models, and translation models, achieving an 81% transparency standard compliance rate.

Model Capabilities

Text Generation
Bilingual Translation
Chat Dialogue
Code Generation

Use Cases

Language Processing
French Q&A
Answer questions about French culture, history, or current events.
Performs well in writing tasks and internal knowledge retrieval.
English-French Translation
Perform translation tasks between English and French.
Excels particularly in translation tasks.
Code Assistance
Code Generation
Generate simple code snippets.
Limited coding capability, suitable for basic code generation.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase