# Low Error Rate
Vntl Llama3 8b V2 Imatrix Gguf
An 8B-parameter QLoRA fine-tune of LLaMA3 Youko, optimized for translating Japanese visual novels into English
Machine Translation Supports Multiple Languages
Casual-Autopsy
311
1
Reverb Diarization V2
Other
Reverb Speaker Diarization V2 is a speaker diarization model based on pyannote-audio, outperforming the baseline pyannote3.0 model on multiple test sets.
Audio Processing
Revai
4,073
45
Trocr Base Printed License Plates Ocr
A fine-tuned printed license plate OCR model based on microsoft/trocr-base-printed, with a character error rate of 0.037 on the evaluation set
Text Recognition
Transformers

artbreguez
163
1
Fine Tashkeel
MIT
A precise Arabic diacritization system built on fine-tuned byte-level pre-trained models that automatically restores diacritics in Arabic text.
Large Language Model
Transformers Arabic

basharalrfooh
335
5
Wavlm Base 960h Asv19 Deepfake
A deepfake audio detection model fine-tuned from Microsoft's WavLM-base, reaching 99.79% accuracy on the ASVspoof 2019 dataset
Audio Classification
Transformers

abhishtagatya
16
0
Hubert Base 960h Asv19 Deepfake
Apache-2.0
An audio classification model based on the HuBERT architecture, specifically designed for detecting deepfake audio and audio spoofing
Audio Classification
Transformers

abhishtagatya
15
2
Belle Whisper Large V3 Zh
Apache-2.0
A Chinese speech recognition model fine-tuned from whisper-large-v3, with significant improvements on multiple Chinese speech benchmarks
Speech Recognition
Transformers

BELLE-2
1,666
112
Trocr Large Spanish
MIT
A Transformer-based OCR model for printed Spanish text; it is optimized for printed fonts and does not support handwriting recognition
Image-to-Text
Transformers Supports Multiple Languages

qantev
298
11
Trocr Base Printed License Plates Ocr
An OCR model fine-tuned from microsoft/trocr-base-printed, specifically designed for recognizing printed license plate numbers.
Text Recognition
Transformers

mariovigliar
202
1
Trocr Base Printed License Plates Ocr Timestamp
An OCR model fine-tuned from microsoft/trocr-base-printed, specifically designed for recognizing license plates and timestamp information
Text Recognition
Transformers

PQAshwin
132
1
Wespeaker Voxceleb Resnet293 LM
A speaker embedding model based on ResNet293 architecture, optimized with large margin fine-tuning, supporting tasks such as speaker recognition, similarity calculation, and speech segmentation
Speaker Analysis English
Wespeaker
108
3
Whisper Large V3 German
Apache-2.0
A German speech recognition model fine-tuned from Whisper Large v3
Speech Recognition
Transformers German

primeline
8,745
70
Trocr Base Printed Captcha Ocr
A captcha recognition model fine-tuned from Microsoft's trocr-base-printed, specifically designed for OCR tasks involving printed text
Text Recognition
Transformers

chanelcolgate
33
1
Whisper Base Japanese
Apache-2.0
A Japanese speech recognition model fine-tuned from openai/whisper-base on the Common Voice, JVS, and JSUT datasets.
Speech Recognition
Transformers Japanese

Ivydata
137
3
Trocr Handwritten Math
This model can convert images of handwritten mathematical expressions into corresponding LaTeX sequences, suitable for mathematical formula recognition and digital processing.
Text Recognition
Transformers

Azu
46
5