Wav2Vec2 Large XLSR-53 Italian

Developed by Facebook
Large-scale Italian automatic speech recognition (ASR) model released by Facebook, based on the Wav2Vec2 architecture and fine-tuned on the Common Voice dataset.
Downloads 4,013
Release Time: 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system based on the Wav2Vec2 architecture and optimized specifically for Italian. It converts Italian speech audio into text.

Model Features

Large-scale Pretraining
Based on XLSR-53, a large-scale multilingual speech representation learning model
Italian Language Optimization
Specifically fine-tuned for Italian to improve recognition accuracy
Efficient Speech Processing
Accepts audio input sampled at 16 kHz, which suits common speech application scenarios
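
Because the model takes 16 kHz input, recordings captured at other rates need to be resampled first. The following is a minimal, dependency-free sketch of linear-interpolation resampling. The function name resample_linear and the constant TARGET_RATE are hypothetical and do not come from the model card. Linear interpolation applies no anti-aliasing filter, so for real recordings a dedicated audio library would normally be a better choice; this only illustrates the idea.

```python
TARGET_RATE = 16_000  # sample rate the model card says the model takes


def resample_linear(samples, src_rate, dst_rate=TARGET_RATE):
    """Resample a mono signal (sequence of floats) by linear interpolation.

    Illustrative only: no low-pass filtering is applied, so downsampling
    can introduce aliasing.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples:
        return []
    if src_rate == dst_rate:
        return list(samples)

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # distance between output samples, in input samples
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        # Past the final input sample there is nothing to interpolate towards.
        frac = pos - lo if lo < last else 0.0
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out
```

For example, resample_linear(audio, 44_100) would bring a 44.1 kHz mono recording down to 16 kHz before it is passed to the model.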

Model Capabilities

Italian audio-to-text conversion
Speech recognition
Speech transcription
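
A minimal sketch of how the transcription capability might be invoked, assuming the checkpoint is published on the Hugging Face Hub as facebook/wav2vec2-large-xlsr-53-italian (an assumption, since this page does not state the identifier) and that the torch and transformers packages are installed. The function name transcribe is hypothetical. The heavy imports sit inside the function so the sample-rate check runs before anything is downloaded.

```python
# Assumed Hub identifier; not stated on this page.
MODEL_ID = "facebook/wav2vec2-large-xlsr-53-italian"
EXPECTED_RATE = 16_000  # the model takes 16 kHz audio


def transcribe(waveform, sample_rate):
    """Transcribe a mono waveform (sequence of floats) to Italian text.

    Sketch only: loads the model on every call and uses greedy CTC decoding.
    """
    if sample_rate != EXPECTED_RATE:
        raise ValueError(
            f"expected {EXPECTED_RATE} Hz audio, got {sample_rate} Hz; resample first"
        )

    # Imported lazily so that the check above does not require these packages.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID)
    model.eval()

    inputs = processor(
        waveform, sampling_rate=sample_rate, return_tensors="pt", padding=True
    )
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy decoding: take the most likely token at each frame, then let the
    # processor collapse repeats and blanks into text.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

In an application the processor and model would typically be loaded once and reused across calls rather than reloaded each time.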

Use Cases

Speech Transcription
Italian Meeting Minutes
Automatically convert Italian meeting recordings into written transcripts
Reported result: 22.1% word error rate (WER) on the Common Voice test set
Voice Assistants
Provide speech recognition capabilities for Italian voice assistants
Accessibility Applications
Real-time Caption Generation
Generate real-time captions for Italian video content
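
The 22.1% figure quoted under Speech Transcription is a word error rate (WER): the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The sketch below shows that computation in plain Python. The function name word_error_rate is hypothetical, and the sketch applies no text normalization (casing, punctuation), which published evaluations usually do in some form.

```python
def word_error_rate(reference, hypothesis):
    """Return WER = (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only one previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion
                    curr[j - 1] + 1,     # insertion
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One word of five is missing from the hypothesis.
    print(word_error_rate("il gatto dorme sul divano", "il gatto dorme divano"))
```

A WER of 22.1% therefore corresponds to roughly 22 word-level errors per 100 reference words.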