B

Bert Base Romanian Uncased V1

Developed by dumitrescustefan
Romanian BERT base uncased model, trained on 15GB corpus, optimized for Romanian NLP tasks
Downloads 2,294
Release Time : 3/2/2022

Model Overview

This is a BERT base model optimized for Romanian language, case-insensitive, suitable for various natural language processing tasks.

Model Features

Romanian-specific
Specifically trained for Romanian language, outperforms multilingual BERT models
Character normalization
Requires input text to replace s and t letters with cedilla diacritics to comma-below variants for optimal performance
Comprehensive evaluation
Thoroughly evaluated on multiple NLP tasks including UPOS, XPOS, NER and LAS

Model Capabilities

Text encoding
Named Entity Recognition
Part-of-speech tagging
Dependency parsing

Use Cases

Natural Language Processing
Romanian text analysis
Used for processing and analyzing Romanian text
Outperforms multilingual BERT models on various NLP tasks
Named Entity Recognition
Identifying named entities in Romanian text
Achieves 85.26 F1 score on RONEC dataset
Featured Recommended AI Models
ยฉ 2025AIbase