
RoBERTa Med Small 1M 1

Developed by nyu-mll
A RoBERTa model pretrained on a small-scale dataset of 1M tokens, using the MED-SMALL architecture, suitable for text understanding tasks.
Downloads: 23
Release date: 3/2/2022

Model Overview

This model is a small-scale pretrained language model based on the RoBERTa architecture, focused on language representation learning from limited data.
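As a minimal sketch, the checkpoint can be loaded with the Hugging Face transformers library like any other RoBERTa masked language model. The Hub identifier "nyu-mll/roberta-med-small-1M-1" used below is an assumption based on the developer and model name above.

```python
# Minimal sketch: load the checkpoint and run a fill-mask sanity check.
# The model identifier is an assumption, not confirmed by this card.
from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline

model_name = "nyu-mll/roberta-med-small-1M-1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

# RoBERTa uses "<mask>" as its mask token.
fill = pipeline("fill-mask", model=model, tokenizer=tokenizer)
for candidate in fill("The capital of France is <mask>."):
    print(candidate["token_str"], round(candidate["score"], 4))
```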

Model Features

Small-scale data pretraining
Specifically designed for effective pretraining on small-scale datasets ranging from 1M to 1B tokens.
Multiple scale options
Provides model versions with different training scales from 1M to 1B tokens.
Optimized architecture
MED-SMALL architecture (6 layers, 512 hidden dimensions) adjusted for small-scale data.
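The MED-SMALL shape can be expressed as a RobertaConfig. In the sketch below, only the layer count and hidden size come from the feature list above; the attention-head count and feed-forward size are illustrative assumptions, not documented values.

```python
# Sketch of the MED-SMALL shape as a RobertaConfig (hedged reconstruction).
from transformers import RobertaConfig, RobertaModel

config = RobertaConfig(
    num_hidden_layers=6,     # "6 layers" from the feature list
    hidden_size=512,         # "512 hidden dimensions" from the feature list
    num_attention_heads=8,   # assumption: 8 heads of 64 dimensions each
    intermediate_size=2048,  # assumption: 4x hidden_size feed-forward
)
model = RobertaModel(config)  # randomly initialized, architecture only
print(sum(p.numel() for p in model.parameters()), "parameters")
```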

Model Capabilities

Text representation learning
Context understanding
Language modeling
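For the representation-learning capability, a short sketch of extracting contextual token embeddings follows, again assuming the nyu-mll/roberta-med-small-1M-1 identifier.

```python
# Sketch: use the encoder's last hidden states as contextual embeddings.
import torch
from transformers import AutoTokenizer, AutoModel

model_name = "nyu-mll/roberta-med-small-1M-1"  # assumed Hub identifier
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

inputs = tokenizer("Small models can still learn useful representations.",
                   return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Shape is (batch, sequence_length, 512) for the MED-SMALL hidden size.
print(outputs.last_hidden_state.shape)
```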

Use Cases

Educational research
Small-scale data language model research
Used to study the performance of language models under limited data conditions.
Validation perplexity: 134.18-153.38 (see the evaluation sketch after this list)
Resource-constrained environments
Low-resource NLP applications
Suitable for environments with limited computational resources or training data.
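Since RoBERTa is a masked language model, one common way to probe it with a perplexity-style number is masked-LM pseudo-perplexity: mask each token in turn and score the original token. The sketch below illustrates that idea; it is not the exact evaluation protocol behind the 134.18-153.38 figure above, and the model identifier is again an assumption.

```python
# Hedged sketch: masked-LM pseudo-perplexity for a single sentence.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "nyu-mll/roberta-med-small-1M-1"  # assumed Hub identifier
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name).eval()

def pseudo_perplexity(text: str) -> float:
    enc = tokenizer(text, return_tensors="pt")
    input_ids = enc["input_ids"][0]
    nlls = []
    # Mask each non-special token in turn and score the original token.
    for i in range(1, input_ids.size(0) - 1):
        masked = input_ids.clone()
        masked[i] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = model(masked.unsqueeze(0)).logits
        log_probs = torch.log_softmax(logits[0, i], dim=-1)
        nlls.append(-log_probs[input_ids[i]].item())
    # Exponentiated mean negative log-likelihood over masked positions.
    return float(torch.exp(torch.tensor(nlls).mean()))

print(pseudo_perplexity("Language models trained on 1M tokens are small."))
```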