PAL-B-Large-opt-350m Open Source Model - Personalized Reward Modeling for Text Summarization Tasks

Pal B Large Opt 350m

Developed by daiweichen

This model is a personalized reward model for diverse alignment, trained based on facebook/opt-350m for text summarization tasks.

Text Generation

Transformers

English

Open Source License: MIT

#Diverse Preference Alignment #Personalized Reward Modeling #Few-shot User Adaptation

Downloads 37

Release Time: 2/28/2025

Model Overview

PAL-B-Large-opt-350m is a personalized reward model for diverse alignment that focuses on handling the diversity of human preferences. Its modular design enables efficient few-shot adaptation to new users' preferences, and it is suited to tasks such as text summarization.

Model Features

Diverse Alignment

The model handles diverse user preferences rather than assuming that all users share the same preferences.

Modular Design

Leverages commonalities among users while meeting individual personalized needs, enabling efficient few-shot adaptation to new users' preferences.
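One way to picture the modular design is a small set of shared "prototype" rewards combined with per-user mixture weights, so that adapting to a new user means fitting only those few weights. The sketch below is illustrative only and is not the released PAL-B implementation: the function names, the softmax mixture, and the Bradley-Terry style preference probability are assumptions made for this example, and the prototype rewards are toy numbers standing in for model outputs.

```python
import math

def softmax(logits):
    """Turn unconstrained logits into weights that are positive and sum to 1."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def user_reward(prototype_rewards, user_logits):
    """Personalized reward: convex combination of K shared prototype rewards."""
    weights = softmax(user_logits)
    return sum(w * r for w, r in zip(weights, prototype_rewards))

def preference_prob(reward_a, reward_b):
    """Bradley-Terry style probability that option A is preferred over option B."""
    return 1.0 / (1.0 + math.exp(-(reward_a - reward_b)))

def adapt_user(pairs, num_prototypes, steps=200, lr=0.5):
    """Few-shot adaptation (hypothetical): fit only the new user's mixture logits.

    pairs: list of (chosen, rejected), where each element is a list holding the
    K prototype rewards of that summary. The shared prototypes stay frozen.
    """
    logits = [0.0] * num_prototypes
    for _ in range(steps):
        weights = softmax(logits)
        grad = [0.0] * num_prototypes
        for chosen, rejected in pairs:
            diff = [c - r for c, r in zip(chosen, rejected)]
            margin = sum(w * d for w, d in zip(weights, diff))
            p = preference_prob(margin, 0.0)
            # Gradient of -log(p) with respect to each mixture logit.
            for k in range(num_prototypes):
                grad[k] -= (1.0 - p) * weights[k] * (diff[k] - margin)
        logits = [l - lr * g / len(pairs) for l, g in zip(logits, grad)]
    return logits

# Two comparisons from a new user whose choices agree with prototype 0.
few_shot_pairs = [([1.0, -1.0], [0.0, 0.0]), ([2.0, 0.0], [0.0, 1.0])]
fitted = adapt_user(few_shot_pairs, num_prototypes=2)
print(softmax(fitted))  # mixture weights after adaptation
```

Because only K numbers are learned per user, a handful of comparisons is enough to move the weights toward the prototype that matches that user's choices.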

High Efficiency

In the Reddit TL;DR summarization task, it achieves 1.7% higher accuracy than the previous best method for known users and 36% higher for unknown users, with 100x fewer parameters.
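For a reward model, accuracy is commonly measured on held-out pairwise comparisons: a comparison counts as correct when the summary the user preferred receives the higher reward. The sketch below assumes that definition; the function name and the toy scores are made up for illustration and are not taken from the paper's evaluation code.

```python
def pairwise_accuracy(scored_pairs):
    """Fraction of comparisons where the preferred summary gets the higher reward.

    scored_pairs: iterable of (reward_chosen, reward_rejected) tuples.
    """
    scored_pairs = list(scored_pairs)
    if not scored_pairs:
        raise ValueError("need at least one comparison")
    correct = sum(1 for chosen, rejected in scored_pairs if chosen > rejected)
    return correct / len(scored_pairs)

# Toy rewards: the second comparison is ranked the wrong way round.
example = [(0.9, 0.1), (0.2, 0.4), (1.5, 1.0), (0.3, -0.2)]
print(pairwise_accuracy(example))  # 0.75
```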

Model Capabilities

Text Summarization

Personalized Reward Modeling

Few-shot Learning

Use Cases

Text Processing

Reddit TL;DR Summarization

Scores candidate summaries of Reddit posts while taking different users' preferences into account.

Achieves 1.7% higher accuracy than the previous best method for known users and 36% higher for unknown users.

🚀 PAL-B-Large-opt-350m

This is a personalized reward model for pluralistic alignment, demonstrating the effectiveness of the proposed Pluralistic Alignment method in outperforming the standard homogeneous reward model.

🚀 Quick Start

This model serves as a demonstration for our paper. In our experiments, the proposed Pluralistic Alignment method outperforms the standard homogeneous reward model.

If you're interested in our PAL (Pluralistic ALignment) method, we encourage you to explore our project page and repository.

📚 Documentation

Intro

To quote the abstract of our official paper:

Foundation models trained on internet-scale data benefit from extensive alignment to human preferences before deployment. However, existing methods typically assume a homogeneous preference shared by all individuals, overlooking the diversity inherent in human values. In this work, we propose a general reward modeling framework for pluralistic alignment (PAL), which incorporates diverse preferences from the ground up. PAL has a modular design that leverages commonalities across users while catering to individual personalization, enabling efficient few-shot localization of preferences for new users. Extensive empirical evaluation demonstrates that PAL matches or outperforms state-of-the-art methods on both text-to-text and text-to-image tasks: on Reddit TL;DR Summary, PAL is 1.7% more accurate for seen users and 36% more accurate for unseen users compared to the previous best method, with 100× less parameters. On Pick-a-Pic v2, PAL is 2.5% more accurate than the best method with 156× fewer learned parameters. Finally, we provide theoretical analysis for generalization of rewards learned via PAL framework showcasing the reduction in number of samples needed per user.

Model Details

We train the PAL-B-Large model (using facebook/opt-350m as the base model) on a variant of the Reddit TL;DR summary dataset that incorporates feedback from the 10 most active users.
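Restricting a feedback dataset to its most active users can be sketched as below. This is a minimal illustration under assumptions, not the authors' preprocessing: the record layout and the `user_id` field name are hypothetical, and the toy rows stand in for real comparison records.

```python
from collections import Counter

def most_active_users(comparisons, top_n=10):
    """Return the top_n user ids ranked by number of comparisons they provided."""
    counts = Counter(row["user_id"] for row in comparisons)
    return [user for user, _ in counts.most_common(top_n)]

def filter_to_users(comparisons, users):
    """Keep only the comparisons labeled by the given users."""
    keep = set(users)
    return [row for row in comparisons if row["user_id"] in keep]

# Toy records; "user_id" is a hypothetical field name.
rows = [{"user_id": u} for u in ["a", "a", "a", "b", "b", "c"]]
top = most_active_users(rows, top_n=2)
subset = filter_to_users(rows, top)
print(top, len(subset))
```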

Model Sources

Repository: RamyaLab/pluralistic-alignment

📄 License

The license for this project is MIT.

📦 Model Information

Property	Details
Base Model	facebook/opt-350m
Datasets	CarperAI/openai_summarize_tldr
Language	en
Library Name	transformers
Pipeline Tag	summarization
