P

Pathumma Llm Audio 1.0.0

Developed by nectec
Pathumma-llm-audio-1.0.0 is an 8-billion-parameter Thai large language model specifically designed for audio comprehension tasks, capable of processing various audio inputs including speech, general audio, and music.
Downloads 333
Release Time : 10/24/2024

Model Overview

This model combines the OpenThaiLLM-DoodNiLT-V1.0.0-Beta-7B language model with the Pathumma-whisper-th-large-v3 speech encoder to convert audio into meaningful text representations.

Model Features

Multi-type audio processing
Capable of processing various types of audio inputs including speech, general audio, and music.
Thai language optimization
Specially designed for Thai, with optimized capabilities for Thai speech and text conversion.
Efficient inference
Supports LoRA inference mode, suitable for operation with limited resources.

Model Capabilities

Audio transcription
Speech comprehension
Text generation

Use Cases

Speech transcription
Thai speech-to-text
Convert Thai speech into text output.
Audio comprehension
General audio analysis
Analyze general audio content and generate descriptive text.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase