PAL-B-Large-opt-350m
A personalized reward model for diverse alignment, built on facebook/opt-350m and aimed at text summarization tasks.
Downloads 37
Release Time: 2/28/2025
Model Overview
PAL-B-Large-opt-350m is a personalized reward model for diverse alignment that focuses on the diversity of human preferences. Its modular design enables efficient few-shot adaptation to new users' preferences, and it is suited to tasks such as text summarization.
Model Features
Diverse Alignment
The model handles diverse user preferences rather than assuming that all users share the same preferences.
Modular Design
Leverages commonalities among users while still meeting individual needs, which enables efficient few-shot adaptation to new users' preferences.
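One way to picture a modular design of this kind is a small set of shared reward components plus a few user-specific mixing weights. The sketch below is an illustration only and is not taken from the model's actual implementation: the names `user_reward` and `adapt_user`, the use of precomputed "prototype scores", and the Bradley-Terry fitting loop are all assumptions made for the example. It shows why adapting to a new user can be cheap: only a handful of weights are fitted from a few preference pairs, while the shared components stay frozen.

```python
import math


def softmax(logits):
    """Turn arbitrary logits into non-negative weights that sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def user_reward(weights, proto_scores):
    """Personalized reward: a user-specific mix of shared prototype scores."""
    return sum(w * s for w, s in zip(weights, proto_scores))


def adapt_user(pairs, num_protos, steps=200, lr=0.5):
    """Fit only one user's mixture weights from a few preference pairs.

    Each pair is (chosen_scores, rejected_scores): the shared prototype
    scores of the summary the user preferred and of the one they rejected.
    The shared prototypes are assumed frozen (hypothetical setup).
    """
    logits = [0.0] * num_protos
    for _ in range(steps):
        w = softmax(logits)
        grad = [0.0] * num_protos
        for chosen, rejected in pairs:
            diff = [c - r for c, r in zip(chosen, rejected)]
            margin = sum(wk * dk for wk, dk in zip(w, diff))
            # Bradley-Terry: P(chosen is preferred) = sigmoid(margin)
            p = 1.0 / (1.0 + math.exp(-margin))
            for k in range(num_protos):
                # Gradient of log p w.r.t. logit k, chained through softmax
                grad[k] += (1.0 - p) * w[k] * (diff[k] - margin)
        logits = [l + lr * g / len(pairs) for l, g in zip(logits, grad)]
    return softmax(logits)


# Toy data: prototype 0 likes terse summaries, prototype 1 likes detailed ones.
# This user consistently prefers the summary that prototype 1 scores higher.
few_shot_pairs = [
    ([0.2, 0.9], [0.8, 0.1]),
    ([0.3, 0.7], [0.6, 0.2]),
]
weights = adapt_user(few_shot_pairs, num_protos=2)
print(weights)
```

In this toy setup the fitted weights shift toward the second prototype, so the same shared components yield a different reward for this user than for one who prefers terse summaries.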
High Efficiency
On the Reddit TL;DR summarization task, it achieves 1.7% higher accuracy than the previous best method for known users and 36% higher for unknown users, while using 100x fewer parameters.
Model Capabilities
Text Summarization
Personalized Reward Modeling
Few-shot Learning
Use Cases
Text Processing
Reddit TL;DR Summarization
As a reward model, it scores candidate summaries of Reddit posts while accounting for different users' preferences.
Achieves 1.7% higher accuracy than the previous best method for known users and 36% higher for unknown users.
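The page does not say how accuracy is measured. A common convention for preference reward models, assumed here, is pairwise accuracy: the fraction of preference pairs in which the summary the user chose receives a higher reward than the one they rejected. The sketch below uses a hypothetical `toy_reward` function and made-up pairs purely to show the calculation; it does not reproduce the reported numbers.

```python
def pairwise_accuracy(reward_fn, pairs):
    """Fraction of (chosen, rejected) pairs where chosen gets the higher reward."""
    if not pairs:
        raise ValueError("need at least one preference pair")
    correct = sum(
        1 for chosen, rejected in pairs if reward_fn(chosen) > reward_fn(rejected)
    )
    return correct / len(pairs)


def toy_reward(summary):
    """Hypothetical stand-in for a reward model: prefers shorter summaries."""
    return -len(summary.split())


# Made-up preference pairs: (summary the user chose, summary the user rejected)
toy_pairs = [
    ("short summary", "a much longer and rambling summary"),
    ("a long winded one here", "tiny"),
    ("brief", "not so brief at all"),
]
accuracy = pairwise_accuracy(toy_reward, toy_pairs)
print(f"pairwise accuracy: {accuracy:.3f}")
```

For a personalized reward model, the same calculation would be run per user, with `reward_fn` replaced by that user's adapted reward.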