wav2vec2_s-f-o_8batch_5sec_0.0001lr_unfrozen Open Source Speech Processing Model - Free Support for Speech Recognition Tasks

Home

Wav2vec2 S F O 8batch 5sec 0.0001lr Unfrozen

Developed by reralle

A speech processing model fine-tuned based on facebook/wav2vec2-large, supporting speech recognition tasks

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Feature Extraction #Mini-batch Optimization #Short Audio Processing

Downloads 21

Release Time : 5/5/2023

Model Overview

This model is a fine-tuned version based on the facebook/wav2vec2-large architecture, primarily used for speech-related tasks, achieving 66.67% accuracy and 67.42% F1 score on the evaluation set.

Model Features

Efficient Fine-tuning

Fine-tuned based on the pre-trained wav2vec2-large model, fully leveraging the advantages of large-scale pre-training

Optimized Training

Trained with a batch size of 8 and a learning rate of 0.0001, ensuring training stability

Linear Learning Rate Scheduling

Uses a linear learning rate scheduler with a warm-up ratio of 0.003, optimizing the training process

Model Capabilities

Speech Recognition

Audio Feature Extraction

Use Cases

Speech Processing

Speech-to-Text

Convert speech signals into text content

Achieved 66.67% accuracy on the evaluation set

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
2.1376	1.0	131	2.1461	0.15	0.0802
1.3836	2.0	262	1.8662	0.4	0.3888
1.2382	2.99	393	1.6891	0.45	0.4245
0.8998	4.0	525	1.4406	0.6	0.5890
0.5064	5.0	656	1.2466	0.7	0.6632
0.5248	6.0	787	1.1712	0.7	0.6705
0.5376	6.99	918	1.3778	0.6667	0.6620
0.4291	8.0	1050	2.0535	0.6167	0.5799
0.4947	9.0	1181	1.3218	0.7333	0.7250
0.5743	10.0	1312	1.7264	0.6667	0.6534
0.3847	10.99	1443	1.9041	0.6333	0.6319
0.6198	12.0	1575	1.3526	0.7167	0.6856

Property	Details
Model Type	Fine - tuned version of facebook/wav2vec2-large
Metrics	Accuracy, F1
License	Apache - 2.0

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 S F O 8batch 5sec 0.0001lr Unfrozen

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2_s-f-o_8batch_5sec_0.0001lr_unfrozen

📚 Documentation

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License