M

Malaysian Whisper Base

Developed by mesolitica
Whisper base model fine-tuned on Malaysian datasets, supporting Malay and English speech recognition
Downloads 143
Release Time : 1/1/2024

Model Overview

This model is a speech recognition model based on the Whisper architecture, specifically fine-tuned for Malay and English in the Malaysian context, suitable for speech-to-text tasks involving Malaysian accents and dialects.

Model Features

Malaysian Language Optimization
Specifically optimized for Malay and English accents in Malaysia, including standard Malay and dialects
Multi-source Training Data
Trained using various data sources including IMDA speech-to-text datasets and pseudo-labeled Malaysian YouTube video datasets
Bilingual Support
Supports both Malay and English speech recognition, including Manglish (Malaysian English)
Timestamp Support
Capable of generating transcriptions with timestamps

Model Capabilities

Malay speech recognition
English speech recognition
Timestamped transcription
Malaysian accent recognition

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings in Malaysia into text
Accurately recognizes Malay and English with Malaysian accents
Media Content Subtitling
Automatically generate subtitles for Malaysian YouTube videos
Supports recognition of dialects and local accents
Speech Analysis
Speech Data Analysis
Analyze speech data from Malaysia to gain insights
Capable of processing language variants unique to Malaysia
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase