L

Llama3.1 Typhoon2 Audio 8b Instruct

Developed by scb10x
Typhoon 2-Audio Edition is an end-to-end speech-to-speech model architecture capable of processing audio, speech, and text inputs while simultaneously generating both text and speech outputs. The model is specifically optimized for Thai language while also supporting English.
Downloads 664
Release Time : 12/13/2024

Model Overview

A speech-to-speech model based on the Typhoon 2 large language model, supporting Thai and English speech input and output with text generation and speech synthesis capabilities.

Model Features

Multimodal input/output
Supports audio, speech, and text inputs while simultaneously generating both text and speech outputs
Thai language optimization
Specifically optimized for Thai language, providing high-quality Thai speech processing capabilities
End-to-end architecture
Complete speech-to-speech processing pipeline without requiring additional intermediate steps
Multi-turn dialogue support
Supports complex multi-turn dialogue interactions while maintaining contextual consistency

Model Capabilities

Speech recognition
Speech synthesis
Text generation
Speech-to-speech
Multilingual processing
Dialogue system

Use Cases

Voice assistant
Thai voice assistant
Building Thai voice assistants supporting voice input and output
Achieved 7.19/10 in Thai speech quality evaluation
Speech transcription
Thai speech transcription
Transcribing Thai speech content into text
14.04% WER for Thai ASR
Speech translation
English-Thai speech translation
Translating English speech to Thai text or speech
27.15 BLEU score for English-to-Thai translation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase