Wav2vec2 Xls R 300m Italian Robust

Developed by dbdmg
An automatic speech recognition model based on facebook/wav2vec2-xls-r-300m and fine-tuned on multiple Italian speech datasets
Downloads 28
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Italian. It is based on the XLS-R architecture, fine-tuned on public datasets such as Common Voice, and can be combined with a language model for improved recognition.
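As a rough illustration of how a model like this is typically used, the sketch below wraps the Hugging Face transformers ASR pipeline in a small helper. The Hub identifier is an assumption inferred from the developer and model names on this page, and the helper name transcribe is hypothetical. If the checkpoint ships a language model and the optional decoding packages are installed, the pipeline may use it for decoding, but that behavior is not verified here.

```python
# Assumed Hub identifier, inferred from the developer and model names on this page.
MODEL_ID = "dbdmg/wav2vec2-xls-r-300m-italian-robust"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcript of a single audio file (hypothetical helper)."""
    # Imported lazily so the module loads even where transformers is not installed.
    from transformers import pipeline

    # Builds the ASR pipeline; the checkpoint is downloaded on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # When given a file path, the pipeline decodes the audio with ffmpeg
    # and resamples it to the rate the feature extractor expects.
    result = asr(audio_path)
    return result["text"]
```

A call such as transcribe("riunione.wav") would return the recognized text as a plain string; the file name is only a placeholder.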

Model Features

Multi-dataset Training
Fine-tuned on multiple Italian speech datasets such as Common Voice, LibriSpeech, and TED to improve model robustness
Language Model Enhancement
Supports decoding with a language model, which reduces word error rate (WER) by approximately 30%
Cross-scenario Adaptation
Performs well on the Robust Speech Event datasets, which suggests it adapts to different recording environments
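The error-rate figures quoted on this page (WER here, CER under Use Cases) are edit-distance metrics. The following is a minimal sketch of how they are usually computed, together with the arithmetic behind a relative reduction. All function names are illustrative, no text normalization is applied, and the numbers in the usage note are made up for the example, not results of this model.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitution, insertion, deletion cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def relative_reduction(baseline: float, improved: float) -> float:
    """Fraction by which an error rate drops relative to the baseline."""
    return (baseline - improved) / baseline
```

For instance, wer("il gatto dorme sul divano", "il gato dorme sul divano") gives 0.2, since one of five words is wrong. A hypothetical drop from 10% to 7% WER gives relative_reduction(0.10, 0.07) of about 0.3, which is the kind of change a "30% reduction" describes when it is read as a relative figure.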

Model Capabilities

Italian Speech-to-Text
Enhanced Recognition with Language Model
Multiple Accent Recognition

Use Cases

Speech Transcription
Meeting Minutes
Convert Italian meeting recordings into text transcripts
Character error rate (CER): 3.52% (with language model)
Media Subtitle Generation
Automatically generate subtitles for Italian video content
Voice Interaction
Voice Assistant
Supports Italian voice command recognition
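For the subtitle use case, one possible approach is to request word-level timestamps from the recognizer and group the words into SubRip (SRT) cues. The sketch below assumes the input is a list of dictionaries with "text" and "timestamp" keys holding start and end times in seconds, which is the shape the transformers ASR pipeline returns under "chunks" when called with return_timestamps="word". The function names and the fixed words-per-cue rule are illustrative choices, not part of the model.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def chunks_to_srt(chunks, max_words: int = 7) -> str:
    """Group word-level chunks into numbered SRT cues of at most max_words words."""
    cues = []
    for start in range(0, len(chunks), max_words):
        group = chunks[start:start + max_words]
        begin = group[0]["timestamp"][0]   # start time of the first word
        end = group[-1]["timestamp"][1]    # end time of the last word
        text = " ".join(chunk["text"].strip() for chunk in group)
        index = len(cues) + 1
        cues.append(
            f"{index}\n{format_timestamp(begin)} --> {format_timestamp(end)}\n{text}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```

A real subtitle workflow would likely split cues on pauses or punctuation instead of a fixed word count; the fixed count keeps the sketch short.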