Wav2vec2 Large Xlsr Italian

Developed by joaoalvarenga
An Italian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53. It achieves a word error rate of 13.91% on the Common Voice Italian test set.
Downloads 27
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Italian. It was fine-tuned from Facebook's wav2vec2-large-xlsr-53 checkpoint on the Italian portion of the Common Voice dataset.
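
A minimal sketch of how a fine-tuned wav2vec2 checkpoint of this kind is typically run with the Hugging Face transformers library. The Hub identifier `joaoalvarenga/wav2vec2-large-xlsr-italian`, the helper name `transcribe`, and the file name in the usage note are assumptions for illustration and are not confirmed by this page. It also assumes `torch`, `torchaudio`, and `transformers` are installed.

```python
MODEL_ID = "joaoalvarenga/wav2vec2-large-xlsr-italian"  # assumed Hub id, not confirmed by this page
TARGET_SAMPLE_RATE = 16_000  # wav2vec2 XLSR checkpoints expect 16 kHz audio


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with greedy CTC decoding (no language model)."""
    # Heavy dependencies are imported lazily so importing this module stays cheap.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    waveform, sample_rate = torchaudio.load(audio_path)  # shape: (channels, frames)
    waveform = waveform.mean(dim=0)  # downmix to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(), sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (batch, frames, vocab)

    # Pick the most likely token per frame; the processor collapses repeats and blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("riunione.wav")` (a hypothetical file) would download the checkpoint on first use and return the transcript as a string.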

Model Features

High-precision Italian recognition
Achieves a word error rate of 13.91% on the Common Voice Italian test set
Based on XLSR architecture
Builds on Cross-Lingual Speech Representation (XLSR) learning: the base model was pretrained on unlabeled speech in 53 languages before being fine-tuned on Italian
No language model required
Can be used directly without additional language model support
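
"No language model required" usually means the transcript is read straight off the acoustic model's per-frame outputs with greedy CTC decoding. The sketch below illustrates that idea in plain Python under stated assumptions: the toy vocabulary, the blank token at index 0, and the function name `ctc_greedy_decode` are hypothetical and are not the model's actual tokenizer.

```python
def ctc_greedy_decode(frame_scores, vocab, blank_id=0):
    """Greedy CTC decoding: best token per frame, merge repeats, drop blanks."""
    # 1. Take the highest-scoring token id in every frame.
    best_ids = [max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores]

    # 2. Collapse consecutive duplicates and remove the blank token.
    decoded = []
    previous = None
    for token_id in best_ids:
        if token_id != previous and token_id != blank_id:
            decoded.append(vocab[token_id])
        previous = token_id
    return "".join(decoded)


def one_hot(index, size):
    """Toy score vector in which one token clearly wins (illustration only)."""
    return [1.0 if k == index else 0.0 for k in range(size)]


# Hypothetical toy vocabulary; index 0 plays the role of the CTC blank.
VOCAB = ["<blank>", "c", "i", "a", "o", "m"]

# Frame-level ids c c _ i a a _ o collapse to "ciao".
frames = [one_hot(i, len(VOCAB)) for i in [1, 1, 0, 2, 3, 3, 0, 4]]
print(ctc_greedy_decode(frames, VOCAB))
```

A blank between two identical tokens is what lets doubled letters survive: the id sequence m a m _ m a decodes to "mamma" rather than "mama".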

Model Capabilities

Italian speech-to-text
Audio content transcription
Voice command recognition

Use Cases

Speech transcription
Automated meeting minutes
Automatically convert Italian meeting recordings into text transcripts
Approximately 86% word accuracy, corresponding to the 13.91% word error rate on the Common Voice Italian test set
Voice assistant development
Build voice interaction applications supporting Italian
Educational technology
Language learning applications
Help learners practice Italian pronunciation and listening
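
The 13.91% figure quoted for this model is a word error rate (WER), and the approximate 86% accuracy listed under the use cases appears to be 100% minus that value. As a rough sketch of how WER is commonly computed (word-level edit distance divided by the number of reference words), assuming simple whitespace tokenization; the function name `word_error_rate` and the example sentences are illustrative, not the evaluation script used for this model:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row in memory.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(
                min(
                    previous_row[j] + 1,                      # deletion
                    current_row[j - 1] + 1,                   # insertion
                    previous_row[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous_row = current_row
    return previous_row[-1] / len(ref_words)


# One dropped word out of five reference words gives a WER of 0.2 (20%).
wer = word_error_rate("il gatto dorme sul divano", "il gatto dorme divano")
print(f"WER: {wer:.2%}")
```

Because insertions count as errors, WER can exceed 100%, so "accuracy = 1 - WER" is only a convenient approximation.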