Access Global AI Models - Power Next-Gen Apps
From General to Specialized AI - All Models in One Platform

2673 models match the criteria

Indonesian Roberta Base Posp Tagger
MIT
A part-of-speech (POS) tagging model fine-tuned from Indonesian RoBERTa on the indonlu dataset for tagging Indonesian text.
Sequence Labeling Transformers Other
w11wo
2.2M
7
Gender Classification
An image classification model built with PyTorch and HuggingPics for recognizing gender in images
Image Classification Transformers
rizvandwiki
1.8M
48
Wav2vec2 Base Finetuned Speech Commands V0.02
Apache-2.0
A voice command recognition model fine-tuned from facebook/wav2vec2-base on the speech_commands dataset, achieving an accuracy of 97.59%.
Audio Classification Transformers
0xb1
1.2M
0
Filipino Wav2vec2 L Xls R 300m Official
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on Filipino speech datasets
Speech Recognition Transformers
Khalsuu
1.2M
1
Gender Classification 2
An image classification model for gender classification, built with PyTorch and generated using HuggingPics.
Image Classification Transformers
rizvandwiki
906.98k
32
Bert Base Arabertv02
AraBERT is an Arabic pre-trained language model based on the BERT architecture, specifically optimized for Arabic language understanding tasks.
Large Language Model Arabic
aubmindlab
666.17k
35
Bloomz 560m
Openrail
A 560M-parameter multilingual model from the BLOOMZ series, suitable for various natural language processing tasks.
Large Language Model Transformers Supports Multiple Languages
bigscience
593.72k
122
Whisper Medium Fleurs Lang Id
Apache-2.0
A speech language identification model fine-tuned from OpenAI Whisper-medium, achieving 88.05% accuracy on the FLEURS dataset
Audio Classification Transformers
sanchit-gandhi
590.30k
14
Distil Large V3
MIT
Distil-Whisper is a knowledge-distilled version of Whisper large-v3, focusing on English automatic speech recognition, offering faster inference speeds while maintaining accuracy close to the original model.
Speech Recognition English
distil-whisper
417.11k
311
Distilroberta Finetuned Financial News Sentiment Analysis
Apache-2.0
A financial news sentiment analysis model fine-tuned from DistilRoBERTa, achieving an accuracy of 98.23%.
Text Classification Transformers
mrm8488
310.81k
386
Wikineural Multilingual Ner
A multilingual named entity recognition model combining neural networks and knowledge bases, supporting 9 languages
Sequence Labeling Transformers Supports Multiple Languages
Babelscape
258.08k
142
Whisper Small Ft Common Language Id
Apache-2.0
A general language identification model fine-tuned from openai/whisper-small, achieving 88.6% accuracy on the evaluation dataset
Audio Classification Transformers
sanchit-gandhi
256.20k
2
Distil Medium.en
MIT
Distil-Whisper is a distilled version of the Whisper model, 6 times faster than the original, with a 49% reduction in size, while maintaining performance close to the original in English speech recognition tasks.
Speech Recognition English
distil-whisper
186.85k
120
Skin Type
An image classification model for categorizing human skin types, committed to fairness to ensure accurate performance across all skin tones.
Image Classification Transformers
driboune
182.21k
3
Ibert Roberta Base Abusive Or Threatening Speech
A fine-tuned version of ibert-roberta-base for detecting abusive or threatening speech.
Text Classification Transformers
DunnBC22
174.14k
3
Wavlm Libri Clean 100h Base Plus
An automatic speech recognition model fine-tuned from microsoft/wavlm-base-plus on the LIBRISPEECH_ASR - CLEAN dataset
Speech Recognition Transformers
patrickvonplaten
126.17k
3
Classify News Category Iptc
A multilingual news classification model that classifies Norwegian, Swedish, and English news content according to IPTC news codes, supporting 16 predefined categories.
Text Classification Transformers
ilsilfverskiold
125.81k
1
Bpmn Information Extraction V2
Apache-2.0
A BPMN process information extraction model fine-tuned from bert-base-cased, used to extract key elements such as executors and tasks from textual process descriptions
Sequence Labeling Transformers
jtlicardo
112.15k
14
Nb Wav2vec2 1b Nynorsk
Apache-2.0
A Nynorsk automatic speech recognition model fine-tuned from Facebook/Meta's XLS-R feature extractor, achieving a WER of 11.32% on the NPSC test set.
Speech Recognition Transformers Other
NbAiLab
96.58k
0
CLIP Convnext Large D 320.laion2B S29b B131k Ft Soup
MIT
CLIP model based on ConvNeXt-Large architecture, trained on LAION-2B dataset, supporting zero-shot image classification and image-text retrieval tasks
Text-to-Image TensorBoard
laion
83.56k
19
CLIP Convnext Large D.laion2b S26b B102k Augreg
MIT
Large-scale ConvNeXt-Large CLIP model trained on LAION-2B dataset, supporting zero-shot image classification and image-text retrieval tasks
Text-to-Image TensorBoard
laion
80.74k
5
CLIP ViT L 14 Laion2b S32b B82k
MIT
A vision-language model trained on the English subset of LAION-2B using the OpenCLIP framework, supporting zero-shot image classification and image-text retrieval
Text-to-Image TensorBoard
laion
79.01k
48
Nb Wav2vec2 300m Nynorsk
Apache-2.0
A 300M-parameter speech recognition model fine-tuned from the VoxRex feature extractor, optimized for Nynorsk (New Norwegian), achieving a WER of 12.22% on the NPSC test set
Speech Recognition Transformers Other
NbAiLab
73.53k
0
Yolov8m Table Extraction
An object detection model based on YOLOv8m, specifically designed for table extraction tasks, capable of detecting both bordered and borderless tables.
Object Detection TensorBoard
keremberke
69.06k
40
Yolov5n License Plate
Lightweight license plate detection model based on YOLOv5n, optimized for license plate recognition tasks
Object Detection TensorBoard
keremberke
68.64k
17
Table Detection And Extraction
A table detection model based on YOLOv8s, capable of accurately identifying bordered and borderless tables in images.
Object Detection TensorBoard English
foduucom
55.45k
88
Distilbert NER
Apache-2.0
A lightweight named entity recognition model fine-tuned from DistilBERT, balancing performance and efficiency
Sequence Labeling Transformers English
dslim
48.95k
34
Distil Large V2
MIT
Distil-Whisper is a distilled version of the Whisper model, achieving 6x speedup and 49% size reduction with only a 1% WER difference on out-of-distribution evaluation sets.
Speech Recognition English
distil-whisper
42.65k
508
CLIP Convnext Base W Laion2b S13b B82k Augreg
MIT
CLIP model based on ConvNeXt-Base architecture, trained on a subset of LAION-5B using OpenCLIP, focusing on zero-shot image classification tasks
Text-to-Image TensorBoard
laion
40.86k
7
Wav2vec2 Lg Xlsr En Speech Emotion Recognition
Apache-2.0
A speech emotion recognition model fine-tuned from Wav2Vec 2.0, capable of identifying 8 English emotions with an accuracy of 82.23% on the RAVDESS dataset
Audio Classification Transformers
ehcalabres
39.83k
221
Gender Classification
Apache-2.0
A gender classification model fine-tuned from distilbert-base-uncased, achieving an accuracy of 1.0 (100%) on the evaluation set
Text Classification Transformers
padmajabfrl
39.68k
29
Distil Small.en
MIT
Distil-Whisper is a distilled version of the Whisper model, 6x faster and 49% smaller, performing within 1% WER of the original on out-of-distribution evaluation sets.
Speech Recognition Transformers English
distil-whisper
33.51k
97
English Filipino Wav2vec2 L Xls R Test 09
Apache-2.0
English-Filipino speech recognition model fine-tuned from jonatasgrosman/wav2vec2-large-xlsr-53-english, achieving a WER of 0.5750 (57.50%) on the evaluation set
Speech Recognition Transformers
Khalsuu
29.03k
1
Yolov8s Signature Detector
A YOLOv8s fine-tuned model specialized for locating signatures in document images
Object Detection TensorBoard
tech4humans
28.14k
15
Nb Whisper Tiny Verbatim
Apache-2.0
Norwegian automatic speech recognition model developed by the National Library of Norway based on OpenAI Whisper, specifically optimized for verbatim transcription scenarios, outputting all-lowercase, punctuation-free text
Speech Recognition Supports Multiple Languages
NbAiLabBeta
24.54k
2
Nb Wav2vec2 1b Bokmaal
Apache-2.0
Norwegian Bokmål automatic speech recognition model fine-tuned from Facebook/Meta's XLS-R feature extractor, achieving a WER of 6.33% on the NPSC test set
Speech Recognition Transformers Other
NbAiLab
23.95k
3
Biomistral 7B
Apache-2.0
BioMistral is an open-source large language model optimized for the medical domain based on the Mistral architecture, further pre-trained on PubMed Central open-access text data, supporting multilingual medical question-answering tasks.
Large Language Model Transformers Supports Multiple Languages
BioMistral
22.59k
428
Aragpt2 Base
AraGPT2 is a pre-trained Transformer-based Arabic text generation model developed by AUB MIND Lab, available in multiple variants of different sizes.
Large Language Model Arabic
aubmindlab
21.26k
25
Cner Base
The CNER model is a named entity recognition model based on the DeBERTa-v3-base architecture, capable of jointly identifying and classifying concepts and named entities with fine-grained labels.
Sequence Labeling Transformers English
Babelscape
20.66k
6
Fullstop Punctuation Multilingual Base
MIT
FullStop is a Transformer-based multilingual punctuation prediction model that supports English, German, French, Italian, Dutch, and other languages.
Sequence Labeling Transformers Supports Multiple Languages
oliverguhr
19.41k
6
Spelling Correction English Base
MIT
This is an experimental model designed to correct spelling errors and punctuation in English text.
Text Generation Transformers English
oliverguhr
17.59k
76
Vit Base Patch16 224 In21k Finetuned Cifar10
Apache-2.0
A pre-trained model based on Google's Vision Transformer (ViT) architecture, fine-tuned on the CIFAR-10 dataset for image classification tasks.
Image Classification Transformers
aaraki
16.69k
10
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025 AIbase