SKYLy Open-source Speech Recognition Model - Free Deployment for Accurate Speech Content Recognition

Skyly

Developed by Siyam

SKYLy is a speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate (WER) of 0.4083 on the evaluation set.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Speech Recognition #Multilingual Support #Low Word Error Rate

Downloads 26

Release Time : 5/1/2022

Model Overview

This model is a speech recognition (ASR) model, primarily used to convert speech into text. It is fine-tuned based on the wav2vec2-large-xlsr-53 architecture and supports multilingual speech recognition.

Model Features

Low Word Error Rate

Achieved a word error rate (WER) of 0.4083 on the evaluation set, demonstrating excellent performance

Based on wav2vec2 Architecture

Uses facebook's wav2vec2-large-xlsr-53 as the base model, featuring robust speech feature extraction capabilities

Multilingual Support

Trained on the common_voice multilingual dataset, supporting speech recognition in multiple languages

Model Capabilities

Speech-to-Text

Multilingual Speech Recognition

Real-time Speech Processing

Use Cases

Speech Transcription

Automatic Meeting Transcription

Automatically converts meeting recordings into text transcripts

Approximately 60% accuracy (inferred based on WER 0.4)

Voice Assistant

Used as the speech recognition module for voice control systems

Accessibility Applications

Hearing Assistance Tool

Provides real-time speech-to-text services for the hearing impaired

Training Loss	Epoch	Step	Validation Loss	Wer
4.4215	4.26	400	1.6323	0.9857
0.5716	8.51	800	0.6679	0.5107
0.1721	12.77	1200	0.6935	0.4632
0.1063	17.02	1600	0.7533	0.4432
0.0785	21.28	2000	0.7208	0.4255
0.0608	25.53	2400	0.7481	0.4117
0.0493	29.79	2800	0.7645	0.4083

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Skyly

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 SKYLy

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License