W

Wav2vec2 Large Xlsr 53 Toy Train Data Augment 0.1.csv

Developed by scasutt
This model is a speech recognition model fine-tuned from facebook/wav2vec2-base, trained using data augmentation techniques
Downloads 22
Release Time : 3/25/2022

Model Overview

A speech recognition model based on the wav2vec2 architecture, suitable for automatic speech-to-text tasks, supporting XLSR-53 multilingual features

Model Features

Data Augmentation Training
Trained using data augmentation techniques (augmentation ratio of 0.1), potentially improving model robustness
Multilingual Features
Based on XLSR-53 architecture, potentially capable of cross-language transfer learning

Model Capabilities

Speech recognition
Automatic speech-to-text conversion

Use Cases

Speech transcription
Automatic meeting minutes transcription
Automatically convert meeting recordings into text transcripts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase