Distilbert Punctuator En
D
Distilbert Punctuator En
Developed by Qishuai
A DistilBERT fine-tuned model for English text punctuation restoration, specifically designed to add punctuation to lowercase English text without punctuation.
Downloads 55
Release Time : 3/2/2022
Model Overview
This model can automatically add punctuation marks such as commas, periods, question marks, and exclamation marks to lowercase English text without punctuation, improving text readability.
Model Features
Efficient and Lightweight
Based on the DistilBERT architecture, it reduces model size and computational resource requirements while maintaining high performance.
Multi-source Training Data
Integrates text data from three different sources—BBC News, news articles, and TED Talks—to enhance model generalization.
Punctuation Type Coverage
Supports the restoration of four common English punctuation marks: commas, periods, question marks, and exclamation marks.
Model Capabilities
English text punctuation restoration
Unpunctuated text processing
Lowercase text normalization
Use Cases
Text Preprocessing
Post-processing for Speech-to-Text
Adds punctuation to unpunctuated text output from speech recognition systems.
Improves the readability of transcribed text and subsequent processing effects.
News Text Normalization
Processes unpunctuated news text scraped from the web.
Makes news content more compliant with publishing standards.
Writing Assistance
Quick Writing Assistance
Automatically adds punctuation to quickly input unpunctuated text.
Improves writing efficiency and reduces post-editing work.
Featured Recommended AI Models