N

Navaistt V1 Medium

Developed by islomov
Uzbek speech recognition model fine-tuned based on Whisper medium, supports Tashkent dialect, trained on approximately 700 hours of data
Downloads 3,081
Release Time : 5/2/2025

Model Overview

An automatic speech recognition model optimized for Uzbek, specifically enhanced for the Tashkent dialect, suitable for audio transcription tasks

Model Features

Tashkent Dialect Optimization
Special focus on Tashkent dialect audio materials, ensuring excellent performance on this dialect
Diverse Training Data
Uses approximately 700 hours of diverse audio data, including podcasts, audiobooks, and Common Voice corpus
Mixed-Quality Data Training
Hybrid training strategy with 60% human-transcribed and 40% pseudo-transcribed materials (generated by Gemini 2.5 Pro)

Model Capabilities

Uzbek speech recognition
Tashkent dialect recognition
Audio transcription
Short audio processing within 30 seconds

Use Cases

Speech Transcription
Podcast Content Transcription
Automatically convert Uzbek podcast content into text
Word Error Rate ~13%
Audiobook Transcription
Convert Uzbek audiobooks into text format
Voice Assistants
Uzbek Voice Input
Add Uzbek voice input functionality to applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase