AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Preference data augmentation

# Preference data augmentation

Gemma 2 9b It WPO HB
A large language model fine-tuned from the gemma-2-9b-it model using the Weighted Preference Optimization (WPO) method, enhancing the effectiveness of off-policy preference optimization.
Large Language Model Transformers
G
wzhouad
15
36
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase