M

Meralion AudioLLM Whisper SEA LION

Developed by MERaLiON
A speech-to-text large language model customized for Singapore's multilingual and multicultural environment, integrating Whisper-large-v2 speech encoder and SEA-LION V3 text decoder
Downloads 2,828
Release Time : 11/22/2024

Model Overview

Optimized for diverse linguistic nuances of Singaporean local accents and dialects, supporting multiple speech-to-text conversion tasks

Model Features

Localized Optimization
Specifically optimized for Singaporean local accents, dialects, and code-switching
Multitask Support
Supports 6 different speech-to-text conversion tasks
Efficient Inference
Supports vLLM framework for lightning-fast inference speed
Large-Scale Training
Trained on 260,000 hours of speech audio data

Model Capabilities

Speech Recognition
Speech Translation
Spoken Question Answering
Dialogue Summarization
Voice Command Understanding
Paralinguistic Analysis

Use Cases

Speech Transcription
Sentence-Level Speech Recognition
Convert single-sentence speech into text
Accurately transcribes Singaporean-accented English
Dialogue-Level Speech Recognition
Convert conversational speech into text with speaker labels
Supports multi-speaker identification and code-switching
Speech Understanding
Spoken Dialogue Summarization
Extract key information from conversational speech to generate summaries
Accurately captures the core content of dialogues
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase