Gemma 2 9B It SPPO Iter3
An 8.9 billion parameter language model developed in the third iteration using self-play preference optimization, starting from google/gemma-2-9b-it and fine-tuned with the UltraFeedback dataset
Downloads 6,704
Release Time : 6/29/2024
Model Overview
This model employs self-play preference optimization for alignment and is primarily used for English text generation tasks
Model Features
Self-Play Preference Optimization
Utilizes SPPO method for three rounds of iterative optimization to enhance model performance
High-Quality Dataset
Trained using the UltraFeedback dataset and synthetic data
Iterative Improvement
Performance improved with each of the three iterations
Model Capabilities
English Text Generation
Dialogue Systems
Content Creation
Use Cases
Dialogue Systems
Intelligent Customer Service
Used for building English intelligent customer service dialogue systems
Content Generation
Article Writing
Assists in English article writing and content generation
Featured Recommended AI Models
Š 2025AIbase