🚀 PAGnol: An Extra-Large French Generative Model
PAGnol is a collection of large French language models for free - form text generation. With 1.5 billion parameters, it's based on the GPT architecture. It was the first language model trained by LightOn in cooperation with the ALMAnaCH team of Inria.
🚀 Quick Start
You can quickly start using PAGnol for text generation. Here are some useful links:
✨ Features
- Large - scale French models: PAGnol consists of large - scale French language models suitable for free - form text generation.
- Based on GPT architecture: It is built upon the well - known GPT architecture.
- Reproducibility: Trained following the scaling laws in early 2021 with the same training data as CamemBERT on CCNet.
💻 Usage Examples
Basic Usage
from transformers import pipeline
generator = pipeline('text-generation', model='lightonai/pagnol-xl', trust_remote_code=True)
output = generator(
"Salut PAGnol, comment ça va ?",
max_length=50,
do_sample=True,
temperature=0.7,
)[0]["generated_text"]
>>> "Très bien! Les jours d’été sont là ! Bientôt les premiers festivals..."
📦 Available Models
📄 License
PAGnol is made available under the MIT licence. By downloading the models available below, you agree with the terms of the MIT licence agreement. Under no circumstances will LightOn and/or Inria be held responsible or liable in any way for any claims, damages, losses, expenses, costs or liabilities whatsoever (including, without limitation, any direct or indirect damages for loss of profits, business interruption or loss of information) resulting or arising directly or indirectly from your use of or inability to use PAGnol.
📚 Citation
@inproceedings{launay-etal-2022-pagnol,
title = "{PAG}nol: An Extra-Large {F}rench Generative Model",
author = "Launay, Julien and
Tommasone, E.l. and
Pannier, Baptiste and
Boniface, Fran{\c{c}}ois and
Chatelain, Am{\'e}lie and
Cappelli, Alessandro and
Poli, Iacopo and
Seddah, Djam{\'e}",
editor = "Calzolari, Nicoletta and
B{\'e}chet, Fr{\'e}d{\'e}ric and
Blache, Philippe and
Choukri, Khalid and
Cieri, Christopher and
Declerck, Thierry and
Goggi, Sara and
Isahara, Hitoshi and
Maegaard, Bente and
Mariani, Joseph and
Mazo, H{\'e}l{\`e}ne and
Odijk, Jan and
Piperidis, Stelios",
booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
month = jun,
year = "2022",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2022.lrec-1.455",
pages = "4275--4284",
}
📞 Contact
- Research enquiries: pagnol@lighton.ai
- Business enquiries: customer.relations@lighton.ai