S

Shuka 1

Developed by sarvamai
Shuka v1 is a language model natively supporting Indian language audio understanding, combining a self-developed audio encoder with the Llama3-8B-Instruct decoder, enabling zero-shot multilingual question-answering tasks.
Downloads 729
Release Time : 8/8/2024

Model Overview

Shuka v1 is an audio-to-text model specifically designed for Indian languages, supporting English and Hindi while excelling in other Indian languages.

Model Features

Multilingual Support
Natively supports English and Hindi while excelling in other Indian languages.
Efficient Training
Trained with less than 100 hours of audio data, fine-tuning only the projector weights.
Zero-shot Question Answering
Performs exceptionally well in zero-shot question-answering tasks for other Indian languages.

Model Capabilities

Audio-to-Text
Multilingual Audio Understanding
Zero-shot Question Answering

Use Cases

Speech Recognition
Hindi Speech-to-Text
Convert Hindi audio into text
Highly accurate text output
Multilingual Question Answering
Multilingual Zero-shot Question Answering
Perform question-answering tasks in languages not specifically trained on
Exceptional performance
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase