# Multi-domain Adaptation

**Deepseek R1T Chimera** (MIT)
DeepSeek-R1T-Chimera is an open-weights model that combines the intelligence of DeepSeek-R1 with the token efficiency of DeepSeek-V3.
Task: Large Language Model · Tags: Transformers
Author: tngtech · Downloads: 491 · Likes: 158

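A minimal loading sketch with the Transformers library, assuming the repo id `tngtech/DeepSeek-R1T-Chimera` and enough GPU memory to shard a model of this size; verify both against the model card:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tngtech/DeepSeek-R1T-Chimera"  # assumed repo id; check the model card

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # shard the weights across available GPUs
    torch_dtype="auto",      # keep the checkpoint's native precision
    trust_remote_code=True,  # DeepSeek-style architectures may ship custom code
)

messages = [{"role": "user", "content": "Explain token efficiency in one paragraph."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```
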
**Multi2convai Quality En Bert** (MIT)
This is a fine-tuned BERT-based model for English quality-related text classification tasks, part of the Multi2ConvAI project.
Task: Text Classification · Tags: Transformers, English
Author: inovex · Downloads: 116 · Likes: 0

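A usage sketch with the Transformers `pipeline` API; the repo id is an assumption based on the project naming, and the label set comes from the model's own config:

```python
from transformers import pipeline

# Assumed repo id; confirm the exact name on the Hub.
classifier = pipeline("text-classification", model="inovex/multi2convai-quality-en-bert")

result = classifier("The delivery was late and the packaging was damaged.")
print(result)  # [{'label': ..., 'score': ...}] using the project's quality labels
```
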
**PEG**
PEG is a model that achieves robust text retrieval through progressive learning, adjusting loss weights based on the difficulty levels of negative samples.
Task: Text Embedding · Tags: Transformers, Chinese
Author: TownsWu · Downloads: 36 · Likes: 29

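A retrieval-style embedding sketch, assuming the repo id `TownsWu/PEG` and CLS-token pooling (a common choice for BERT-style retrieval models; the model card may specify otherwise):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "TownsWu/PEG"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

texts = ["如何更换手机屏幕", "手机屏幕碎了怎么办"]
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    # CLS pooling is an assumption; check the model card for the intended pooling.
    embeddings = model(**batch).last_hidden_state[:, 0]
embeddings = torch.nn.functional.normalize(embeddings, dim=-1)
print(float(embeddings[0] @ embeddings[1]))  # cosine similarity of the two queries
```
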
**Modernbert Base Tr Uncased** (MIT)
A Turkish pre-trained model based on the ModernBERT architecture, supporting an 8192-token context length and performing well across multiple domains.
Task: Large Language Model · Tags: Transformers, Other
Author: artiwise-ai · Downloads: 159 · Likes: 9

**Beaverai MN 2407 DSK QwQify V0.1 12B GGUF** (Apache-2.0)
A 12B-parameter large language model in GGUF format for text generation tasks, released under the Apache-2.0 license.
Task: Large Language Model
Author: bartowski · Downloads: 1,547 · Likes: 5

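GGUF builds like this one target llama.cpp-compatible runtimes rather than Transformers. A sketch with `llama-cpp-python`, where the repo id and quantization filename are assumptions to replace with the file actually downloaded:

```python
from llama_cpp import Llama

# Repo id and filename pattern are assumptions; pick the quant you downloaded.
llm = Llama.from_pretrained(
    repo_id="bartowski/BeaverAI_MN-2407-DSK-QwQify-v0.1-12B-GGUF",
    filename="*Q4_K_M.gguf",  # glob matching a mid-size quantization
    n_ctx=4096,
)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Introduce yourself in two sentences."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```
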
**Deepseek Ai.deepseek R1 Distill Llama 8B GGUF**
DeepSeek-R1-Distill-Llama-8B is an 8B-parameter large language model based on the Llama architecture, optimized through distillation training for text generation tasks.
Task: Large Language Model
Author: DevQuasar · Downloads: 320 · Likes: 3

**Wangchan Sentiment Thai Text Model**
A Thai sentiment analysis model fine-tuned from WangchanBERTa, used to analyze the sentiment polarity of Thai text.
Task: Text Classification · Tags: Transformers, Other
Author: phoner45 · Downloads: 199 · Likes: 1

**Enhancermodel** (Apache-2.0)
Hugging Face Transformers is a library providing pre-trained deep learning models, supporting tasks in natural language processing, computer vision, and other domains.
Task: Large Language Model · Tags: Transformers
Author: ayjays132 · Downloads: 52 · Likes: 2

**Exaone3 Instructrans V2 Enko 7.8b**
An English-Korean translation model trained from exaone-3-7.8B-it, focused on translating English instruction datasets.
Task: Machine Translation · Tags: Transformers, Multilingual
Author: Translation-EnKo · Downloads: 45 · Likes: 7

**Llama 3.1 PersianQA** (Apache-2.0)
A Persian question-answering model fine-tuned from Llama 3.1, capable of accurately answering Persian questions from the given context.
Task: Question Answering · Tags: Multilingual
Author: zpm · Downloads: 474 · Likes: 6

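Since this is a Llama-based generative model, question answering runs through chat-style text generation rather than the extractive QA pipeline. A sketch with an assumed repo id:

```python
from transformers import pipeline

# Assumed repo id; confirm the exact name on the Hub.
qa = pipeline("text-generation", model="zpm/Llama-3.1-PersianQA", device_map="auto")

# Provide the context and the question together in the user turn (in Persian).
messages = [{"role": "user", "content": "متن: ...\nسوال: ..."}]
reply = qa(messages, max_new_tokens=128)
print(reply[0]["generated_text"][-1]["content"])  # the assistant's answer
```
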
**Ltgbert 100m 2024** (MIT)
An open-source language model released under the MIT license.
Task: Large Language Model · Tags: Transformers
Author: babylm · Downloads: 7,150 · Likes: 1

**Llama3 Instructrans Enko 8b**
An English-Korean translation model trained on Llama-3-8B-it, specifically designed for translating English instruction datasets.
Task: Machine Translation · Tags: Transformers, Multilingual
Author: nayohan · Downloads: 84 · Likes: 27

**Few Shot Learning Classification Bert Sm 500**
A text classification model trained with AutoTrain, suitable for few-shot learning scenarios and capable of efficiently classifying news articles.
Task: Text Classification · Tags: Transformers
Author: pravin691983 · Downloads: 25 · Likes: 1

**GNER T5 Large V2** (Apache-2.0)
GNER-T5-large is a generative named entity recognition model based on the Flan-T5-large architecture, focusing on improving zero-shot recognition capabilities in unseen entity domains.
Task: Sequence Labeling · Tags: Transformers, English
Author: dyyyyyyyy · Downloads: 28 · Likes: 1

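GNER models are driven by instruction-style text-to-text generation: the sentence and the candidate entity types go into the prompt, and the model emits a labeled sequence. The repo id and the instruction wording below are placeholders; the real template ships with the GNER project:

```python
from transformers import pipeline

gner = pipeline("text2text-generation", model="dyyyyyyyy/GNER-T5-large-v2")  # assumed repo id

# Placeholder instruction; use the official GNER prompt template in practice.
prompt = (
    "Please label each word in the sentence with one of the entity types "
    "[person, location, organization, else]. "
    "Sentence: Steve Jobs founded Apple in Cupertino."
)
print(gner(prompt, max_new_tokens=128)[0]["generated_text"])
```
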
**Intent Classifier**
An intent classification model fine-tuned from Flan-T5-Base, used to categorize customer queries into predefined categories.
Task: Text Classification · Tags: Transformers
Author: Serj · Downloads: 364 · Likes: 4

**Tinyllama V1.1 Math Code** (Apache-2.0)
TinyLlama is a compact language model with 1.1 billion parameters, adopting the same architecture and tokenizer as Llama 2, suitable for applications with limited computational and memory resources.
Task: Large Language Model · Tags: Transformers, English
Author: TinyLlama · Downloads: 3,436 · Likes: 11

**Tinyllama V1.1** (Apache-2.0)
TinyLlama is a small language model with 1.1 billion parameters, adopting the same architecture and tokenizer as Llama 2, suitable for resource-constrained application scenarios.
Task: Large Language Model · Tags: Transformers, English
Author: TinyLlama · Downloads: 42.11k · Likes: 92

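A plain text-generation sketch; `TinyLlama/TinyLlama_v1.1` matches the listing's naming but should still be verified on the Hub:

```python
from transformers import pipeline

generator = pipeline("text-generation", model="TinyLlama/TinyLlama_v1.1")
print(generator("The capital of France is", max_new_tokens=32)[0]["generated_text"])
```
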
**GNER T5 Xl** (Apache-2.0)
GNER-T5-xl is a generative named entity recognition model based on Flan-T5-xl, significantly enhancing zero-shot recognition capabilities through negative instance training.
Task: Sequence Labeling · Tags: Transformers, English
Author: dyyyyyyyy · Downloads: 38 · Likes: 1

**Parakeet Tdt 1.1b**
Parakeet TDT 1.1B is an automatic speech recognition (ASR) model jointly developed by NVIDIA NeMo and Suno.ai, which transcribes English speech into lowercase text.
Task: Speech Recognition · Tags: English
Author: nvidia · Downloads: 12.27k · Likes: 90

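Parakeet checkpoints load through NVIDIA NeMo rather than Transformers. A minimal transcription sketch (the return type of `transcribe` varies slightly across NeMo versions):

```python
# pip install "nemo_toolkit[asr]"
import nemo.collections.asr as nemo_asr

asr_model = nemo_asr.models.ASRModel.from_pretrained(model_name="nvidia/parakeet-tdt-1.1b")
transcripts = asr_model.transcribe(["sample.wav"])  # expects 16 kHz mono audio
print(transcripts[0])
```
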
**SFR Embedding Mistral**
A text embedding model developed by Salesforce Research, trained on E5-mistral-7b-instruct and Mistral-7B-v0.1, primarily used for text retrieval tasks.
Task: Text Embedding · Tags: Transformers, English
Author: Salesforce · Downloads: 34.75k · Likes: 277

**Parakeet Ctc 0.6b**
Parakeet CTC 0.6B is an automatic speech recognition model jointly developed by NVIDIA NeMo and Suno.ai, based on the FastConformer architecture with approximately 600 million parameters, supporting English speech transcription.
Task: Speech Recognition · Tags: English
Author: nvidia · Downloads: 6,528 · Likes: 13

**Parakeet Rnnt 0.6b**
Parakeet RNNT 0.6B is an automatic speech recognition model jointly developed by NVIDIA NeMo and Suno.ai, based on the FastConformer architecture with approximately 600 million parameters, specifically designed for transcribing English speech into text.
Task: Speech Recognition · Tags: English
Author: nvidia · Downloads: 92.27k · Likes: 8

**Belle Whisper Large V2 Zh** (Apache-2.0)
A Chinese speech recognition model fine-tuned from whisper-large-v2, achieving a 30-70% relative performance improvement on multiple Chinese speech recognition benchmarks.
Task: Speech Recognition · Tags: Transformers
Author: BELLE-2 · Downloads: 140 · Likes: 33

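Since the model is a Whisper fine-tune, the standard speech-recognition pipeline applies; the repo id is assumed from the listing:

```python
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="BELLE-2/Belle-whisper-large-v2-zh",  # assumed repo id
    chunk_length_s=30,  # chunked decoding for long-form audio
)
print(asr("mandarin_sample.wav")["text"])
```
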
**Robbert 2023 Dutch Large** (MIT)
RobBERT-2023 is a Dutch language model based on the RoBERTa architecture, developed by KU Leuven, Ghent University, and TU Berlin, and is one of the state-of-the-art language models for Dutch.
Task: Large Language Model · Tags: Transformers, Other
Author: DTAI-KULeuven · Downloads: 627 · Likes: 20

**Sheared LLaMA 2.7B** (Apache-2.0)
Sheared-LLaMA-2.7B is a lightweight language model derived from Llama-2-7b through pruning and continued pretraining, consuming only a 50B-token budget.
Task: Large Language Model · Tags: Transformers
Author: princeton-nlp · Downloads: 1,131 · Likes: 60

**Dragon Plus Query Encoder**
This is a sentence encoder model based on sentence-transformers, capable of converting text into 768-dimensional vector representations, suitable for tasks such as semantic search and sentence similarity calculation.
Task: Text Embedding · Tags: Transformers
Author: nthakur · Downloads: 149 · Likes: 1

**Gte Small** (MIT)
GTE-small is a general text embedding model trained by Alibaba DAMO Academy, based on the BERT framework, suitable for tasks such as information retrieval and semantic text similarity.
Task: Text Embedding · Tags: Transformers, English
Author: Supabase · Downloads: 481.27k · Likes: 89

**Tts Thai** (MIT)
A Thai text-to-speech model based on the Tacotron2 architecture, trained using a modified Common Voice Thai dataset.
Task: Speech Synthesis · Tags: Other
Author: lunarlist · Downloads: 54 · Likes: 1

**Rut5 Base Summ**
A Russian text and dialogue summarization model fine-tuned from ruT5-base, supporting multi-domain Russian text summarization tasks.
Task: Text Generation · Tags: Transformers, Multilingual
Author: d0rj · Downloads: 207 · Likes: 22

**Whisper Small Ko** (Apache-2.0)
A Korean speech recognition model based on the Whisper Small architecture, fine-tuned on multi-domain Korean datasets.
Task: Speech Recognition · Tags: Transformers, Korean
Author: SungBeom · Downloads: 524 · Likes: 13

**Keyphrase Mpnet V1**
A sentence transformer model optimized for phrases, mapping phrases into a 768-dimensional dense vector space, suitable for tasks like clustering or semantic search.
Task: Text Embedding · Tags: Transformers
Author: uclanlp · Downloads: 4,278 · Likes: 2

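Because the model is published as a sentence-transformers checkpoint, encoding is a one-liner; the repo id is assumed from the listing:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("uclanlp/keyphrase-mpnet-v1")  # assumed repo id
phrases = ["neural machine translation", "machine translation", "speech synthesis"]
embeddings = model.encode(phrases)  # one 768-dimensional vector per phrase
print(util.cos_sim(embeddings[0], embeddings[1]))  # related phrases score high
```
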
**Pegasus X Sumstew** (Apache-2.0)
An English long-text summarization model fine-tuned from Pegasus-x-large, supporting abstractive summarization of complex texts such as academic manuscripts and meeting minutes.
Task: Text Generation · Tags: Transformers, English
Author: Joemgu · Downloads: 31 · Likes: 1

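A long-document summarization sketch with the Transformers pipeline; the repo id is assumed from the listing:

```python
from transformers import pipeline

summarizer = pipeline("summarization", model="Joemgu/pegasus-x-sumstew")  # assumed repo id

with open("meeting_minutes.txt") as f:
    long_text = f.read()
print(summarizer(long_text, max_length=256, min_length=64)[0]["summary_text"])
```
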
**Eurekaqa Model**
EurekaQA is an AI question-answering model that answers questions by extracting information from text data.
Task: Question Answering · Tags: Transformers, English
Author: Kaludi · Downloads: 32 · Likes: 2

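Assuming EurekaQA is an extractive question-answering model (as its description suggests), it would be used through the standard QA pipeline; the repo id is a guess from the listing:

```python
from transformers import pipeline

qa = pipeline("question-answering", model="Kaludi/eurekaQA-model")  # assumed repo id
result = qa(
    question="Who developed RobBERT-2023?",
    context="RobBERT-2023 is a Dutch language model developed by KU Leuven, "
            "Ghent University, and TU Berlin.",
)
print(result["answer"], round(result["score"], 3))
```
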
**Randeng Pegasus 523M Summary Chinese V1**
A Chinese PEGASUS-large model specialized in text summarization tasks, fine-tuned on multiple Chinese summarization datasets.
Task: Text Generation · Tags: Transformers, Chinese
Author: IDEA-CCNL · Downloads: 95 · Likes: 5

**Jasmine 350M**
JASMINE is a series of Arabic GPT models designed for few-shot learning, with parameters ranging from 300 million to 6.7 billion, pretrained on 235GB of text data.
Task: Large Language Model · Tags: Transformers
Author: UBC-NLP · Downloads: 81 · Likes: 5

**Randeng Pegasus 238M Summary Chinese**
A specialized Chinese version of PEGASUS-base optimized for Chinese text summarization tasks, fine-tuned on multiple Chinese summarization datasets.
Task: Text Generation · Tags: Transformers, Chinese
Author: IDEA-CCNL · Downloads: 1,222 · Likes: 46

**Randeng Pegasus 523M Summary Chinese**
A Chinese PEGASUS-large model specialized in text summarization tasks, fine-tuned on multiple Chinese summarization datasets.
Task: Text Generation · Tags: Transformers, Chinese
Author: IDEA-CCNL · Downloads: 9,549 · Likes: 58

**Opus Mt Tc Big Hu En**
This is a neural machine translation model for translating from Hungarian to English, part of the OPUS-MT project.
Task: Machine Translation · Tags: Transformers, Multilingual
Author: Helsinki-NLP · Downloads: 371 · Likes: 3

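OPUS-MT checkpoints work directly with the translation pipeline; the repo id below follows the project's published naming convention:

```python
from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-tc-big-hu-en")
print(translator("Szia, hogy vagy?")[0]["translation_text"])
```
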
**Opus Mt Tc Big En Ro**
This is a large-scale neural machine translation model based on the Transformer architecture, specifically designed for translating English to Romanian.
Task: Machine Translation · Tags: Transformers, Multilingual
Author: Helsinki-NLP · Downloads: 70 · Likes: 4

**Opus Mt Tc Big En Lv**
This is a neural machine translation model for English to Latvian translation, part of the OPUS-MT project, using the transformer-big architecture.
Task: Machine Translation · Tags: Transformers, Multilingual
Author: Helsinki-NLP · Downloads: 78 · Likes: 0