wav2vec2-xlsr-Basaa Open-source Automatic Speech Recognition Model - Precise Recognition of Basaa Language Speech

Wav2vec2 Xlsr Basaa

Developed by sammy786

This model is an automatic speech recognition model fine-tuned on the Common Voice 8 Basaa dataset based on facebook/wav2vec2-xls-r-1b.

Speech Recognition

Transformers

OtherOpen Source License:Apache-2.0 #Basaa speech recognition #Multi-dialect support #Low-resource optimization

Downloads 20

Release Time : 3/2/2022

Model Overview

This is a model for automatic speech recognition in Basaa, fine-tuned on the Common Voice 8 dataset based on the wav2vec2-xls-r-1b architecture.

Model Features

High-performance Basaa recognition

Fine-tuned on the Common Voice 8 Basaa dataset, achieving a word error rate (WER) of 41.23 and a character error rate (CER) of 13.54.

Based on large-scale pre-trained model

Fine-tuned from the facebook/wav2vec2-xls-r-1b model, inheriting its powerful speech feature extraction capabilities.

Robust speech processing

Capable of handling conversational scenarios and inputs with varying speech quality.

Model Capabilities

Basaa speech recognition

Automatic speech-to-text

Processing conversational speech

Use Cases

Speech transcription

Basaa speech transcription

Convert Basaa speech content into text

Word error rate 41.23%, character error rate 13.54%

Voice assistant

Basaa voice interaction

Used for developing Basaa voice assistants and dialogue systems

🚀 sammy786/wav2vec2-xlsr-basaa

This model is a fine - tuned version of facebook/wav2vec2-xls-r-1b on the MOZILLA - FOUNDATION/COMMON_VOICE_8_0 - bas dataset. It's designed for automatic speech recognition, offering a solution for converting speech to text with specific performance metrics.

🚀 Quick Start

This model is a fine - tuned version of facebook/wav2vec2-xls-r-1b on the MOZILLA - FOUNDATION/COMMON_VOICE_8_0 - bas dataset. It achieves the following results on the evaluation set (which is 10 percent of the train data set merged with other and dev datasets):

Loss: 21.39
Wer: 30.99

✨ Features

Fine - Tuned Model: Based on "facebook/wav2vec2-xls-r-1b", fine - tuned on the specific bas dataset of MOZILLA - FOUNDATION/COMMON_VOICE_8_0.
Performance Metrics: Achieves certain loss and WER values on the evaluation set, indicating its effectiveness in speech recognition.

📚 Documentation

Model description

"facebook/wav2vec2-xls-r-1b" was finetuned.

Intended uses & limitations

More information needed

Training and evaluation data

Training data - Common voice Finnish train.tsv, dev.tsv and other.tsv

Training procedure

For creating the train dataset, all possible datasets were appended and a 90 - 10 split was used.

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.000045637994662983496
train_batch_size: 16
eval_batch_size: 16
seed: 13
gradient_accumulation_steps: 2
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e - 08
lr_scheduler_type: cosine_with_restarts
lr_scheduler_warmup_steps: 500
num_epochs: 70
mixed_precision_training: Native AMP

Training results

Step	Training Loss	Validation Loss	Wer
200	6.734100	1.605006	0.980456
400	1.011200	0.364686	0.442997
600	0.709300	0.300204	0.377850
800	0.469800	0.315612	0.405537
1000	0.464700	0.352494	0.372964
1200	0.421900	0.342533	0.368078
1400	0.401900	0.351398	0.343648
1600	0.429800	0.350570	0.348534
1800	0.352600	0.356601	0.358306
2000	0.387200	0.355814	0.356678
2200	0.362400	0.345573	0.355049

Framework versions

Transformers 4.16.0.dev0
Pytorch 1.10.0+cu102
Datasets 1.17.1.dev0
Tokenizers 0.10.3

Evaluation Commands

To evaluate on mozilla - foundation/common_voice_8_0 with split test

python eval.py --model_id sammy786/wav2vec2-xlsr-basaa --dataset mozilla - foundation/common_voice_8_0 --config bas --split test

📄 License

This project is under the Apache - 2.0 license.

Property	Details
Model Type	Fine - tuned version of "facebook/wav2vec2-xls-r-1b" on MOZILLA - FOUNDATION/COMMON_VOICE_8_0 - bas dataset
Training Data	Common voice Finnish train.tsv, dev.tsv and other.tsv

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご