W

Whisper Base.kk

Developed by akuzdeuov
Whisper-base is an automatic speech recognition (ASR) model optimized for low-resource Kazakh language, fine-tuned on the Kazakh Speech Corpus 2 with over 1,000 hours of annotated data.
Downloads 43
Release Time : 8/14/2024

Model Overview

This is a speech recognition model supporting only the Kazakh language, based on the Whisper architecture, specifically optimized for Kazakh speech-to-text tasks.

Model Features

Low-resource language optimization
Specifically optimized for low-resource languages like Kazakh, achieving good performance with limited data.
Industrial-grade corpus training
Trained using over 1,000 hours of industrial-grade Kazakh speech corpus (KSC2).
Long audio processing
Supports processing of arbitrarily long audio inputs through chunking algorithms.

Model Capabilities

Kazakh speech recognition
Long audio transcription
Batch speech processing

Use Cases

Speech transcription
Kazakh meeting minutes
Automatically transcribe Kazakh meeting recordings into text records.
Test set WER 15.36%
Media content subtitle generation
Automatically generate subtitles for Kazakh video content.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase