Monsoon-Whisper-Medium-GigaSpeech2

Developed by scb10x
Monsoon-Whisper-Medium-GigaSpeech2 is a Thai automatic speech recognition (ASR) model, based on Whisper-Medium and fine-tuned on the GigaSpeech2 dataset, suitable for speech recognition in real-world scenarios.
Downloads 546
Release Time: 7/12/2024

Model Overview

This model targets Thai automatic speech recognition and is reported to perform well on YouTube audio and on speech recorded in noisy environments.

Model Features

Thai speech recognition
Built for Thai speech recognition, with an emphasis on real-world audio rather than clean studio recordings.
Fine-tuned from Whisper-Medium
Uses the Whisper-Medium architecture, fine-tuned on the GigaSpeech2 dataset.
High performance
Reported to outperform comparable models on word error rate (WER) and character error rate (CER); lower is better for both.
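
For readers unfamiliar with the two metrics named above, the following is a minimal sketch of how WER and CER are commonly computed: the edit distance between reference and hypothesis, divided by the reference length. The function names are illustrative and not taken from the model's evaluation code. Note that Thai is written without spaces between words, so WER depends on a word segmentation step that this sketch assumes has already been done.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref_words, hyp_words):
    """Word error rate over pre-segmented word lists (assumption: the caller tokenizes)."""
    if not ref_words:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def cer(ref_text, hyp_text):
    """Character error rate, ignoring spaces."""
    ref_chars = ref_text.replace(" ", "")
    hyp_chars = hyp_text.replace(" ", "")
    if not ref_chars:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


if __name__ == "__main__":
    # Toy English example; one substituted word out of three.
    print(wer(["the", "cat", "sat"], ["the", "cat", "sit"]))
    print(cer("the cat sat", "the cat sit"))
```

Because CER does not need word boundaries, it is often the more stable of the two metrics for Thai.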

Model Capabilities

Thai speech recognition
Speech recognition in noisy environments

Use Cases

Speech recognition
YouTube audio transcription
Suitable for transcribing Thai speech content in YouTube videos.
Speech recognition in noisy environments
Maintains high recognition accuracy even in noisy environments.
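
A transcription call for the use cases above might look like the sketch below, which uses the Hugging Face transformers speech recognition pipeline. The repository id, the helper names, and the 30-second chunk length are assumptions for illustration and are not stated on this page; check the developer's model card for the recommended settings.

```python
# Assumed Hugging Face repository id; not confirmed by this page.
MODEL_ID = "scb10x/monsoon-whisper-medium-gigaspeech2"


def build_transcriber(model_id=MODEL_ID, device=-1):
    """Create an ASR pipeline (device=-1 runs on CPU)."""
    # Imported lazily so the module loads even if transformers is not installed.
    from transformers import pipeline
    return pipeline("automatic-speech-recognition", model=model_id, device=device)


def transcribe(audio_path, transcriber, language="th"):
    """Transcribe one audio file and return the recognized text."""
    result = transcriber(
        audio_path,
        chunk_length_s=30,  # assumption: split long audio into 30 s chunks
        generate_kwargs={"language": language, "task": "transcribe"},
    )
    return result["text"]


if __name__ == "__main__":
    asr = build_transcriber()
    # "clip.wav" is a hypothetical local file.
    print(transcribe("clip.wav", asr))
```

Passing the language explicitly skips Whisper's automatic language detection, which can help on short or noisy clips.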