W

Wav2vec2 Large Xlsr Greek 2

Developed by skylord
A speech recognition model fine-tuned on the Greek Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, balancing the training set with synthesized female voice data
Downloads 15
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Greek, fine-tuned from Facebook's XLSR-53 large model, specifically addressing gender imbalance issues in Greek speech data

Model Features

Gender-balanced training data
Solved the male-dominated issue in the original dataset by synthesizing female voice data using Google TTS
Multi-stage fine-tuning
Adopted a phased fine-tuning strategy, first training on original data, then continuing training with synthesized data
Greek language optimization
Specifically optimized for Greek speech characteristics, handling unique Greek pronunciations and intonations

Model Capabilities

Greek speech recognition
16kHz audio processing
Direct inference without language model

Use Cases

Speech-to-text
Greek speech transcription
Convert Greek speech content into text
Achieved 45.05% WER on Common Voice test set
Voice assistants
Greek voice command recognition
Basic speech recognition component for Greek voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase