# SentencePiece tokenization

Bidi Eng Pol
Transformer-based bidirectional machine translation model that translates between English and Polish in both directions
Machine Translation Transformers Supports Multiple Languages
allegro
185
1
SBE
A sentence similarity model for Russian e-commerce search, designed to distinguish between product search queries
Text Embedding Other
fkrasnov2
15
2
Translate Ar En V1.0 Hplt Opus
Arabic-English machine translation model trained on OPUS and HPLT data, available in both Marian and Hugging Face formats.
Machine Translation Transformers Supports Multiple Languages
HPLT
20
2
Opus Mt En Ar
Apache-2.0
This is a Transformer-based English to Arabic translation model that supports multiple Arabic dialect variants.
Machine Translation Supports Multiple Languages
Nextcloud-AI
19
2
Reazonspeech Nemo V2
Apache-2.0
Japanese automatic speech recognition model trained on the ReazonSpeech v2.0 corpus, supporting long audio inference
Speech Recognition Japanese
reazon-research
3,897
27
Stt Kr Conformer Transducer Large
This is a large-scale Korean automatic speech recognition model based on the Conformer-Transducer architecture, trained on the KsponSpeech dataset, suitable for Korean speech transcription tasks.
Speech Recognition Other
eesungkim
129
9
Opus Mt En He
Apache-2.0
This is an English-to-Hebrew machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team as part of the Tatoeba Challenge project.
Machine Translation Supports Multiple Languages
tiedeman
19
1
Bert Base Ja
BERT base model trained on a Japanese Wikipedia dataset, suitable for masked language modeling tasks on Japanese text
Large Language Model Transformers Japanese
colorfulscoop
16
1
Gpt2 Persian
Apache-2.0
A Persian language model based on the GPT2 architecture, specifically designed for Persian text generation with enhanced poetry processing capabilities.
Large Language Model Other
bolbolzaban
691
27
T5 Base Dutch
Apache-2.0
This is a Dutch pre-trained model based on the T5 architecture, with 222 million parameters, trained on the cleaned Dutch mC4 dataset.
Large Language Model Other
yhavinga
102
6
Opus Mt En De
An English-to-German translation model developed by the Language Technology Research Group at the University of Helsinki, built on the OPUS-MT framework, supporting high-quality machine translation tasks.
Machine Translation Supports Multiple Languages
Helsinki-NLP
232.60k
39
Opus Mt Mul En
Apache-2.0
This is a Transformer-based multilingual-to-English machine translation model supporting over 100 languages.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
173.61k
77
Opus Mt Nl Fr
Apache-2.0
This is a Dutch-to-French machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team and trained using the OPUS dataset.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
6,776
0
Opus Mt Es De
Apache-2.0
opus-mt-es-de is a Transformer-based Spanish-to-German machine translation model developed by the University of Helsinki NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
1,719
0
Opus Mt Ru Uk
Apache-2.0
A Transformer-based machine translation model from Russian to Ukrainian, developed by the Helsinki-NLP team, supporting standardized preprocessing and SentencePiece tokenization.
Machine Translation Transformers Other
Helsinki-NLP
748
3
Opus Mt Pl En
Apache-2.0
A Transformer-based Polish-to-English machine translation model developed by the Helsinki-NLP team, trained using the OPUS dataset.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
99.67k
22
Opus Tatoeba Es Zh
Apache-2.0
This is a Transformer-based machine translation model from Spanish to Chinese, supporting multiple Chinese dialects and writing forms.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
399
5
Opus Mt En Zlw
Apache-2.0
This is a multilingual translation model based on the Transformer architecture, supporting translation from English to West Slavic languages such as Czech and Polish.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
391
0
Opus Mt Es Pl
Apache-2.0
A Transformer-based Spanish-to-Polish machine translation model developed by the Helsinki-NLP team, trained on the OPUS dataset.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
208
0
Opus Mt Es Bg
Apache-2.0
This is a Spanish-to-Bulgarian machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
75
0
Opus Mt Bg Tr
Apache-2.0
This is a Bulgarian-to-Turkish machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Other
Helsinki-NLP
25
0
Opus Mt Fi Mt
Apache-2.0
This is a Finnish-to-Maltese machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Other
Helsinki-NLP
20
0
Opus Mt Ru Et
Apache-2.0
This is a machine translation model from Russian to Estonian based on the Transformer architecture, developed by the Helsinki-NLP team as part of the Tatoeba Challenge project.
Machine Translation Transformers Other
Helsinki-NLP
17
0
Opus Mt Tl En
Apache-2.0
A Transformer-based Filipino-to-English machine translation model developed by the Helsinki-NLP team; the source text is Filipino written in Latin script.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
13.99k
0
Opus Mt En Vi
Apache-2.0
This is a Transformer-based machine translation model from English to Vietnamese (including Han character variants), developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
18.94k
9
Opus Mt Bn En
Apache-2.0
This is a Bengali-to-English machine translation model based on the Transformer architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
2,518
8
Opus Mt Vi Es
Apache-2.0
This is a Vietnamese-to-Spanish machine translation model based on the transformer-align architecture, developed by the Helsinki-NLP team and released under the Tatoeba-Challenge project.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
122
1
Opus Mt Ilo En
Apache-2.0
This is a machine translation model from Ilocano to English based on the transformer-align architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
52
1
Opus Mt Cus En
Apache-2.0
This is a machine translation model based on the Transformer architecture, specifically designed for translation tasks from Cushitic languages (particularly Somali) to English.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
46
1
Opus Mt En Cpp
Apache-2.0
This is a machine translation model based on the Transformer architecture, supporting translation tasks from English to various Portuguese-based Creoles and Pidgin languages.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
29
1
Opus Mt Nl Af
Apache-2.0
This is a Transformer-based machine translation model from Dutch to Afrikaans, released by the Tatoeba-Challenge project.
Machine Translation Transformers Other
Helsinki-NLP
14
0
Opus Mt Ja Pl
Apache-2.0
This is a Japanese-to-Polish machine translation model based on the transformer-align architecture, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
44
0
Opus Mt En Sal
Apache-2.0
This is a machine translation model based on the Transformer architecture, supporting translation tasks from English to Salishan languages.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
40
0
Opus Mt En Ee
Apache-2.0
This is a Transformer-based machine translation model for English to Ewe, developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
38
0
Opus Mt De It
Apache-2.0
OPUS-MT German-to-Italian machine translation model based on the Transformer architecture, with SentencePiece tokenization
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
1,488
0
Opus Mt Lg En
Apache-2.0
This is a Transformer-based machine translation model for Luganda (lg) to English (en), developed by the Helsinki-NLP team.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
907
2
Opus Mt En Bg
Apache-2.0
This is a machine translation model based on the Transformer architecture, designed for translating English into Bulgarian, including its Latin-script form.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
671
0
Opus Mt Gmq En
Apache-2.0
This is a multilingual translation model based on the Transformer architecture, supporting translation from North Germanic languages (such as Danish, Norwegian Bokmål, and Swedish) to English.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
1,625
1
Opus Mt Ine Ine
Apache-2.0
A Transformer model supporting translation between 136 Indo-European languages, trained by the Helsinki-NLP team on OPUS multilingual data.
Machine Translation Transformers Supports Multiple Languages
Helsinki-NLP
67
0
Opus Mt Pt Ca
Apache-2.0
This is a Transformer-based machine translation model from Portuguese to Catalan, developed by the Helsinki-NLP team.
Machine Translation Transformers Other
Helsinki-NLP
63
0
© 2025 AIbase