
Vakyansh Wav2vec2 Tamil Tam 250

Developed by Harveenchadha
Tamil automatic speech recognition (ASR) model based on the Wav2Vec2 architecture, developed by Harveen Chadha and fine-tuned on 4200 hours of annotated speech data
Downloads 1,843
Release Time: 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system for Tamil. It is based on Facebook's Wav2Vec2 architecture and fine-tuned from the multilingual pretrained model CLSRIL-23.
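
A minimal sketch of how a fine-tuned Wav2Vec2 checkpoint like this one is commonly loaded through the Hugging Face `transformers` library. The hub identifier `MODEL_ID` is an assumption inferred from the model name and developer listed above, and the helper `transcribe` is a hypothetical name introduced only for illustration. Check the identifier against the actual model repository before relying on it.

```python
from typing import Sequence

# Assumption: hub identifier inferred from the model name and developer; verify before use.
MODEL_ID = "Harveenchadha/vakyansh-wav2vec2-tamil-tam-250"
SAMPLE_RATE = 16_000  # the model expects 16 kHz input


def transcribe(waveform: Sequence[float], model_id: str = MODEL_ID) -> str:
    """Transcribe one mono 16 kHz waveform using greedy (argmax) decoding."""
    # Heavy dependencies are imported lazily so the module can be imported without them.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    # Normalise the raw samples and wrap them in a batch of size one.
    inputs = processor(waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits

    # Pick the most likely token per frame; the processor collapses them into text.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe(samples)` with a list of floating-point samples downloads the weights on first use, so it needs network access plus the `torch` and `transformers` packages.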

Model Features

Multilingual Pretraining Foundation
Fine-tuned from the multilingual CLSRIL-23 model, so it benefits from cross-lingual transfer learning
Large-scale Training Data
Trained using 4200 hours of annotated speech data
Language Model Independence
Outputs recognition results directly, without requiring an external language model
Open Source Availability
Complete training code and model weights are open-sourced
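
To illustrate what decoding without an external language model can look like, here is a sketch of greedy CTC decoding. It assumes the model uses a CTC output head, which is the usual setup for Wav2Vec2 models fine-tuned for ASR but is not stated on this page. The function name `ctc_greedy_decode`, the toy vocabulary, and the blank index are all hypothetical.

```python
from typing import List, Sequence


def ctc_greedy_decode(
    frame_scores: Sequence[Sequence[float]],
    vocab: Sequence[str],
    blank_id: int = 0,
) -> str:
    """Greedy CTC decoding: argmax per frame, collapse repeats, drop blanks."""
    # Step 1: pick the highest-scoring token index for every audio frame.
    best_ids = [max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores]

    # Steps 2 and 3: skip consecutive duplicates and the blank token.
    pieces: List[str] = []
    previous = None
    for token_id in best_ids:
        if token_id != previous and token_id != blank_id:
            pieces.append(vocab[token_id])
        previous = token_id
    return "".join(pieces)


# Toy vocabulary (hypothetical); index 0 plays the role of the CTC blank.
vocab = ["<pad>", "அ", "ம", "்", "ா"]
frame_ids = [1, 1, 0, 2, 3, 2, 4, 4]
# One-hot scores standing in for the per-frame model outputs.
scores = [[1.0 if v == i else 0.0 for v in range(len(vocab))] for i in frame_ids]
print(ctc_greedy_decode(scores, vocab))  # prints "அம்மா"
```

No word-level prior is applied at any step, which is why a model used this way can show a higher word error rate than the same model combined with a language model.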

Model Capabilities

Tamil speech recognition
16kHz audio processing
End-to-end speech-to-text
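
Because the model processes 16 kHz audio, recordings at other sample rates need to be resampled first. The sketch below uses plain linear interpolation to keep the idea visible. It is an illustrative simplification: it applies no anti-aliasing filter, so a dedicated audio library resampler is the better choice for real use. The function name `resample_linear` is hypothetical.

```python
from typing import List, Sequence

TARGET_RATE = 16_000  # sample rate expected by the model


def resample_linear(
    samples: Sequence[float], src_rate: int, dst_rate: int = TARGET_RATE
) -> List[float]:
    """Resample a mono signal by linear interpolation (no anti-aliasing filter)."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if src_rate == dst_rate or len(samples) == 0:
        return list(samples)

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    last = len(samples) - 1
    out: List[float] = []
    for k in range(n_out):
        pos = k * step
        i = min(int(pos), last)   # left neighbour
        j = min(i + 1, last)      # right neighbour, clamped at the end
        frac = pos - i
        out.append(samples[i] * (1.0 - frac) + samples[j] * frac)
    return out


# 48 source samples at 48 kHz become 16 samples at 16 kHz.
ramp = [float(n) for n in range(48)]
print(len(resample_linear(ramp, 48_000)))  # prints 16
```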

Use Cases

Speech Transcription
Tamil Speech Transcription
Converts Tamil speech into text
Word error rate (WER): 53.64% on the Common Voice test set
Voice Assistants
Tamil Voice Command Recognition
Provides basic recognition capability for Tamil voice assistants
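
The word error rate quoted above for the Common Voice test set is conventionally the number of word substitutions, deletions, and insertions divided by the number of reference words, so 53.64% means roughly 54 word errors per 100 reference words. Here is a sketch of that calculation for a single utterance; the function name `word_error_rate` is hypothetical, and reported figures are usually aggregated over a whole test set rather than averaged per utterance.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution (b -> x) and one deletion (d) over four reference words.
print(word_error_rate("a b c d", "a x c"))  # prints 0.5
```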