Bart Large Cnn Samsum
模型简介
该模型用于生成英语对话的简洁摘要,基于BART-large-cnn架构在Samsung/samsum数据集上微调
模型特点
高质量对话摘要
能够从对话中提取关键信息生成简洁准确的摘要
特定领域优化
针对日常对话场景特别优化,在Samsung/samsum数据集上表现优异
预训练模型微调
基于强大的BART-large-cnn预训练模型进行微调
模型能力
英语对话理解
关键信息提取
文本摘要生成
使用案例
对话分析
客服对话摘要
自动生成客服对话的关键内容摘要
提高客服效率,便于后续分析
会议记录摘要
从会议对话中提取决策和行动项
节省会议记录整理时间
🚀 英文对话摘要模型
本模型用于英文对话的摘要提取,基于BART - large - cnn进行预训练。它能有效总结英文对话内容,为用户提供简洁的对话概要。
🚀 快速开始
安装与使用
你可以通过以下代码加载模型和分词器:
tokenizer = BartTokenizer.from_pretrained('facebook/bart-large-cnn')
model = BartForConditionalGeneration.from_pretrained('jngan/bart-large-cnn-samsum')
实现和微调脚本可在以下链接找到:https://github.com/joceyngan/conversation_summarization
✨ 主要特性
- 基于BART - large - cnn预训练,在英文对话摘要任务上表现良好。
- 提供了详细的评估结果,展示模型的实际表现。
📚 详细文档
评估结果
参考摘要为数据集中的真实摘要,生成摘要为该模型生成的摘要。
示例1
- 原文
- A: Hi Tom, are you busy tomorrow’s afternoon?
- B: I’m pretty sure I am. What’s up?
- A: Can you go with me to the animal shelter?.
- B: What do you want to do?
- A: I want to get a puppy for my son.
- B: That will make him so happy.
- A: Yeah, we’ve discussed it many times. I think he’s ready now.
- B: That’s good. Raising a dog is a tough issue. Like having a baby ;-)
- A: I'll get him one of those little dogs.
- B: One that won't grow up too big;-)
- A: And eat too much;-))
- B: Do you know which one he would like?
- A: Oh, yes, I took him there last Monday. He showed me one that he really liked.
- B: I bet you had to drag him away.
- A: He wanted to take it home right away ;-).
- B: I wonder what he'll name it.
- A: He said he’d name it after his dead hamster – Lemmy - he's a great Motorhead fan :-)))
- 参考摘要:A will go to the animal shelter tomorrow to get a puppy for her son. They already visited the shelter last Monday and the son chose the puppy.
- 生成摘要:A wants to get a puppy for his son. B will go with him to the animal shelter tomorrow. A will get him one of those little dogs. A took him there last Monday and he liked it a lot. A wants to name it Lemmy.
示例2
- 原文
- Emma: I’ve just fallen in love with this advent calendar! Awesome! I wanna one for my kids!
- Rob: I used to get one every year as a child! Loved them!
- Emma: Yeah, i remember! they were filled with chocolates!
- Lauren: they are different these days! much more sophisticated! Haha!
- Rob: yeah, they can be fabric/ wooden, shop bought/ homemade, filled with various stuff
- Emma: what do you fit inside?
- Lauren: small toys, Christmas decorations, creative stuff, hair bands & clips, stickers, pencils & rubbers, small puzzles, sweets
- Emma: WOW! That’s brill! X
- Lauren: i add one more very special thing as well- little notes asking my children to do something nice for someone else
- Rob: i like that! My sister adds notes asking her kids questions about christmas such as What did the 3 wise men bring? etc
- Lauren: i reckon it prepares them for Christmas
- Emma: and makes it more about traditions and being kind to other people
- Lauren: my children get very excited every time they get one!
- Emma: i can see why! :)
- 参考摘要:Emma and Rob love the advent calendar. Lauren fits inside calendar various items, for instance, small toys and Christmas decorations. Her children are excited whenever they get the calendar.
- 生成摘要:Emma loves the advent calendar. Rob used to get one every year as a child. Emma would like to buy one for her kids. Rob's sister asks her kids questions about Christmas. Lauren's children get very excited every time they get one.
示例3
- 原文
- Jackie: Madison is pregnant
- Jackie: but she doesn't wanna talk about it
- Iggy: why
- Jackie: I don't know why because she doesn't wanna talk about it
- Iggy: ok
- Jackie: I wanted to prepare you for it because people get super excited and ask lots of questions
- Jackie: and she looked way more anxious than excited
- Iggy: she's probably worrying about it
- Iggy: she's taking every commitment really seriously
- Jackie: it could be money problems or relationship problems
- Iggy: or maybe she wants an abortion
- Jackie: it could be all of the above
- Iggy: but you know what?
- Iggy: once my friend was pregnant and I couldn't bring myself to be happy about it
- Jackie: why?
- Iggy: I felt they were immature and I couldn't picture this couple as parents
- Jackie: I felt similar way on Patricia's wedding
- Iggy: Patricia Stevens?
- Jackie: yes
- Iggy: so we're talking about the same person
- Jackie: what a coincidence
- Jackie: so she's pregnant?
- Iggy: she thought she was
- Jackie: damn...
- 参考摘要:Madison is pregnant but she doesn't want to talk about it. Patricia Stevens got married and she thought she was pregnant.
- 生成摘要:Madison is pregnant. She doesn't want to talk about it. Iggy's friend Patricia Stevens was pregnant with Patricia Stevens and she didn't like it much. She felt they were immature and she couldn't picture them as parents.
示例4
- 原文
- Marla: <file_photo>
- Marla: look what I found under my bed
- Kiki: lol
- Tamara: is that someone's underwear?
- Marla: it certainly isn't mine, my ass is big but it isn't huge
- Kiki: it looks like male underwear
- Tamara: not necessarily, maybe some butch had fun in your room while you were gone
- Marla: ok but how can you leave your underwear after hooking up? wtf is wrong with people
- Kiki: she or he could be too wasted to notice
- Tamara: or maybe someone put their pants there to piss you off
- Marla: that makes no sense
- Marla: it's so fucking childish
- Kiki: if it's childish then it must have been your sister's idea
- Marla: she's 13, she doesn't have underwear that isn't pink
- Tamara: maybe it belonged to one of your exes?
- Kiki: she would have recognized it
- Marla: lol we're doing total CSI investigation on one pair of boxers :D
- Kiki: <file_gif>
- Tamara: lol
- Tamara: I think your sister convinced someone to put their underwear in your room as a dare
- Marla: sounds legit
- Kiki: Tamara, you just cracked the case!
- Tamara: <file_gif>
- Tamara: always happy to help
- 参考摘要:Marla found a pair of boxers under her bed.
- 生成摘要:Marla found a pair of boxers under her bed. Kiki, Tamara, Marla and Tamara are laughing at the fact that someone left their underwear under Marla's bed after hooking up. Marla is convinced that her sister convinced someone to put their underwear in her room.
示例5
- 原文
- Robert: Hey give me the address of this music shop you mentioned before
- Robert: I have to buy guitar cable
- Fred: <file_other>
- Fred: Catch it on google maps
- Robert: thx m8
- Fred: ur welcome
- 参考摘要:Robert wants Fred to send him the address of the music shop as he needs to buy guitar cable.
- 生成摘要:Fred gives Robert the address of the music shop where he needs to buy guitar cable. Robert can find it on google maps. Robert has to buy a guitar cable at the shop. Fred sends him a link to the address.
微调参数
{
output_dir=str(results_path),
evaluation_strategy="epoch",
save_strategy="epoch",
logging_steps=10,
learning_rate=2e-5,
per_device_train_batch_size=2,
per_device_eval_batch_size=2,
num_train_epochs=3,
weight_decay=0.01,
report_to="tensorboard",
save_total_limit=3,
load_best_model_at_end=True,
metric_for_best_model="eval_loss",
greater_is_better=False,
logging_dir=str(results_path),
}
📄 许可证
本项目采用MIT许可证。
Bart Large Cnn
MIT
基于英语语料预训练的BART模型,专门针对CNN每日邮报数据集进行微调,适用于文本摘要任务
文本生成 英语
B
facebook
3.8M
1,364
Parrot Paraphraser On T5
Parrot是一个基于T5的释义框架,专为加速训练自然语言理解(NLU)模型而设计,通过生成高质量释义实现数据增强。
文本生成
Transformers

P
prithivida
910.07k
152
Distilbart Cnn 12 6
Apache-2.0
DistilBART是BART模型的蒸馏版本,专门针对文本摘要任务进行了优化,在保持较高性能的同时显著提升了推理速度。
文本生成 英语
D
sshleifer
783.96k
278
T5 Base Summarization Claim Extractor
基于T5架构的模型,专门用于从摘要文本中提取原子声明,是摘要事实性评估流程的关键组件。
文本生成
Transformers 英语

T
Babelscape
666.36k
9
Unieval Sum
UniEval是一个统一的多维评估器,用于自然语言生成任务的自动评估,支持多个可解释维度的评估。
文本生成
Transformers

U
MingZhong
318.08k
3
Pegasus Paraphrase
Apache-2.0
基于PEGASUS架构微调的文本复述模型,能够生成语义相同但表达不同的句子。
文本生成
Transformers 英语

P
tuner007
209.03k
185
T5 Base Korean Summarization
这是一个基于T5架构的韩语文本摘要模型,专为韩语文本摘要任务设计,通过微调paust/pko-t5-base模型在多个韩语数据集上训练而成。
文本生成
Transformers 韩语

T
eenzeenee
148.32k
25
Pegasus Xsum
PEGASUS是一种基于Transformer的预训练模型,专门用于抽象文本摘要任务。
文本生成 英语
P
google
144.72k
198
Bart Large Cnn Samsum
MIT
基于BART-large架构的对话摘要模型,专为SAMSum语料库微调,适用于生成对话摘要。
文本生成
Transformers 英语

B
philschmid
141.28k
258
Kobart Summarization
MIT
基于KoBART架构的韩语文本摘要模型,能够生成韩语新闻文章的简洁摘要。
文本生成
Transformers 韩语

K
gogamza
119.18k
12
精选推荐AI模型
Llama 3 Typhoon V1.5x 8b Instruct
专为泰语设计的80亿参数指令模型,性能媲美GPT-3.5-turbo,优化了应用场景、检索增强生成、受限生成和推理任务
大型语言模型
Transformers 支持多种语言

L
scb10x
3,269
16
Cadet Tiny
Openrail
Cadet-Tiny是一个基于SODA数据集训练的超小型对话模型,专为边缘设备推理设计,体积仅为Cosmo-3B模型的2%左右。
对话系统
Transformers 英语

C
ToddGoldfarb
2,691
6
Roberta Base Chinese Extractive Qa
基于RoBERTa架构的中文抽取式问答模型,适用于从给定文本中提取答案的任务。
问答系统 中文
R
uer
2,694
98