# Multi-Dataset Training

TIPO 500M Ft
Other
TIPO is a text-to-image prompt optimization system based on text pre-sampling, which enhances the quality and usability of generative models by optimizing user input prompts through large language models.
Text-to-Image English
T
KBlueLeaf
10.92k
29
Vitpose Base Coco Aic Mpii
Apache-2.0
ViTPose is a human pose estimation model based on Vision Transformer, achieving outstanding performance on benchmarks like MS COCO through simple architectural design.
Pose Estimation Transformers English
V
usyd-community
38
1
F5 Hindi 24KHz
A Hindi text-to-speech model trained from scratch based on the F5 architecture, developed by the SPRING Lab at the Indian Institute of Technology Madras.
Speech Synthesis Other
F
SPRINGLab
1,430
18
TIPO 500M
Other
TIPO is a 500-million-parameter model based on the LLaMA architecture, specifically designed for prompt optimization in text-to-image generation.
Text-to-Image English
T
KBlueLeaf
2,013
51
Test Push
Apache-2.0
distilvit is an image-to-text model based on a VIT image encoder and a distilled GPT-2 text decoder, capable of generating textual descriptions of images.
Image-to-Text Transformers
T
tarekziade
17
0
Pix2text Table Rec
MIT
A table structure recognition model developed based on Microsoft's Table Transformer for table detection and recognition tasks in documents
Text Recognition Transformers
P
breezedeus
1,124
2
Japanese Reranker Cross Encoder Large V1
MIT
A high-performance cross-encoder model optimized for Japanese text reranking tasks, featuring a 24-layer architecture with 1024 hidden units
Text Embedding Japanese
J
hotchpotch
2,959
15
Japanese Bge Reranker V2 M3 V1
MIT
This is a Japanese Reranker (Cross-Encoder) model for text ranking tasks, featuring 24 layers and a hidden layer size of 1024.
Text Embedding Japanese
J
hotchpotch
1,151
15
Japanese Reranker Cross Encoder Small V1
MIT
This is a Japanese-trained Reranker (Cross-Encoder) model for text ranking tasks.
Text Embedding Japanese
J
hotchpotch
209
3
Japanese Reranker Cross Encoder Xsmall V1
MIT
This is a Japanese-trained Reranker (Cross-Encoder) model for text ranking tasks.
Text Embedding Japanese
J
hotchpotch
7,041
7
Pairrm
MIT
PairRM is an efficient pairwise reward model for comparing and ranking output candidates from large language models, supporting various applications such as RLHF and Best-N sampling.
Large Language Model Transformers English
P
llm-blender
6,004
198
Ag Nli DeTS Sentence Similarity V1
Apache-2.0
This model is trained using the Cross-Encoder class from SentenceTransformers to predict the semantic similarity score between two sentences.
Text Embedding Transformers Supports Multiple Languages
A
abbasgolestani
982
0
All MiniLM L6 V2 Ct2 Int8
Apache-2.0
This is a sentence embedding model based on the MiniLM architecture, capable of mapping text to a 384-dimensional vector space, suitable for semantic search and text similarity tasks.
Text Embedding English
A
jncraton
40
0
Binarization Segformer B3
Openrail
A document image binarization model fine-tuned based on the SegFormer-B3 architecture, excelling in DIBCO evaluation metrics
Image Segmentation Transformers
B
DiTo97
85
1
Reward Model Deberta V3 Large V2
MIT
This reward model is trained to predict which generated answer humans would prefer for a given question. Suitable for QA evaluation, RLHF reward scoring, and toxic answer detection.
Large Language Model Transformers English
R
OpenAssistant
11.15k
219
Sbert All MiniLM L6 With Pooler
Apache-2.0
An ONNX model based on sentence-transformers that maps text to a 384-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding English
S
optimum
3,867
6
Bert Base Cased Qa Evaluator
A BERT-base-cased based QA pair evaluation model for determining semantic relevance between questions and answers
Question Answering System
B
iarfmoose
122.54k
9
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase