Byt5 Small Historic English Span20
B
Byt5 Small Historic English Span20
Developed by hmbyt5
Historical multilingual and monolingual ByT5 base model, currently supporting English (British Library corpus - books).
Downloads 18
Release Time : 4/30/2023
Model Overview
hmByT5 is a base language model based on the ByT5 architecture, primarily used for text processing tasks, supporting English.
Model Features
Multilingual support
The model is designed to support multilingual processing, currently covering English.
Optimized noise span length
Pre-trained with mean_noise_span_length=20, making the pre-training task more challenging compared to the default value of 3.
TPU training
Pre-trained using Google's TPU Research Cloud (TRC) provided v3-8 TPU.
Model Capabilities
Text generation
Text processing
Use Cases
Text processing
English text processing
Suitable for processing English text, such as book content from the British Library corpus.
After fine-tuning on the English AjMC dataset, the average performance reached 85.82 ± 0.79.
Featured Recommended AI Models