L

Llama PLLuM 8B Chat

Developed by CYFRAGOVPL
PLLuM is a large - language - model family focused on Polish and other Slavic/Baltic languages, while incorporating English data to achieve broader generalization ability.
Downloads 2,618
Release Time : 2/7/2025

Model Overview

The PLLuM series of models aims to generate contextually coherent text, assist with various tasks (such as question - answering and summarization), and lay the foundation for specific - domain applications (such as intelligent assistants for specific domains).

Model Features

Extensive data collection
A large - scale, high - quality Polish text dataset (approximately 150 billion tokens after cleaning and deduplication) was collected, along with additional text in Slavic, Baltic, and English languages.
Organic instruction dataset
The largest manually created collection of Polish 'organic instructions' (approximately 40,000 prompt - response pairs) was carefully curated, covering a range of subtle aspects that automated methods in supervised fine - tuning might overlook.
Polish preference corpus
The first Polish preference corpus was created, containing prompts and multiple model responses manually evaluated by an annotation team with different demographic characteristics.
Evaluation benchmark
A custom benchmark was developed to evaluate the model's performance on tasks related to Polish public administration. PLLuM achieved the highest score among all tested models.

Model Capabilities

Text generation
Question - answering
Summarization
Retrieval - Augmented Generation (RAG)
Multilingual support

Use Cases

General language tasks
Text generation
Generate contextually coherent text, such as poems and articles.
Generate high - quality Polish text suitable for various scenarios.
Question - answering
Answer questions posed by users based on the provided documents or general knowledge.
Provide accurate and context - relevant answers.
Specific - domain assistants
Public administration
Provide professional support for Polish public administration, such as information retrieval and question - answering on legal or bureaucratic topics.
Perform excellently in complex information retrieval and question - answering.
Research and development
Downstream AI applications
Serve as a basic building block for downstream AI applications that require proficiency in Polish.
Provide strong language - model support for academic or industrial environments.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase