M

Morphbert Large Morpheme Segmentation Ru

Developed by CrabInHoney
A large Russian morpheme segmentation model based on Transformer architecture, capable of classifying each character of Russian words into one of 25 morpheme categories
Downloads 16
Release Time : 4/7/2025

Model Overview

This model is specifically fine-tuned for Russian word morpheme segmentation tasks, capable of parsing the constituent morphemes of Russian words, suitable for character-level morphological analysis predictions.

Model Features

High-precision morpheme segmentation
Achieves approximately 0.99 character-level accuracy on evaluation datasets, capable of accurately identifying morpheme boundaries in Russian words.
Rich morpheme categories
Supports classification into 25 morpheme categories, including roots, prefixes, suffixes, connectors, and various morphological structures.
Large Transformer architecture
Adopts a complex architecture comparable to bert-base, offering higher parsing accuracy than smaller versions.

Model Capabilities

Russian word morpheme segmentation
Character-level token classification
Morphological analysis prediction

Use Cases

Linguistic analysis
Russian word morphological decomposition
Decompose Russian words into morpheme components such as roots, prefixes, and suffixes
For example, decomposing 'масляный' into 'масл:ROOT / ян:SUFF / ый:END'
Compound word analysis
Parse the structure of compound words containing hyphens
For example, decomposing 'сине-белый' into 'син:ROOT / е:LINK / -:HYPH / бел:ROOT1 / ый:END'
Natural language processing
Russian NLP preprocessing
Provide word morphological structure information for Russian NLP tasks
Can be used for downstream tasks such as lemmatization and stemming
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase