S

Summllama3 8B

Developed by DISLab
SummLlama3-8B is a text summarization model initialized from Llama3-8B-Instruct, optimized through large-scale summarization feedback via DPO training, demonstrating excellent performance in faithfulness, completeness, and conciseness.
Downloads 15
Release Time : 10/11/2024

Model Overview

Specializes in generating cross-domain text summaries that align with human preferences, supporting seven scenarios including news, healthcare, and meetings, outperforming larger models like Llama3-70B and GPT-4o.

Model Features

Cross-domain Optimization
Covers 7 domains including news/healthcare/meetings, adaptable to both conversational and non-conversational texts.
Three Balanced Metrics
Leads comprehensively in faithfulness (0.98), completeness (0.697), and conciseness (0.959).
Efficient Inference
8B parameter scale achieves better performance than 70B models with faster inference speed.
LLM Feedback Training
Utilizes over 100,000 LLM-generated feedback summaries for DPO training, avoiding manual annotation costs.

Model Capabilities

Multi-domain text summarization
Conversation content condensation
Key information extraction
Long-text structured compression

Use Cases

Media Industry
News Brief Generation
Automatically extracts core facts from news
Reduces text volume by 70% while maintaining event context
Healthcare
Medical Record Summarization
Extracts key diagnostic information
Improves accuracy by 12% compared to baseline
Enterprise Office
Meeting Minutes Generation
Automatically summarizes discussion points and resolutions
Captures action items completely without redundant information
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase