fb-vindata-vi-large: Open-Source Vietnamese Speech Recognition Model

Fb Vindata Vi Large

Developed by phongdtd

This is a Vietnamese automatic speech recognition model, obtained by fine-tuning facebook/wav2vec2-large-xlsr-53 on the PHONGDTD/VINDATAVLSP - NA dataset.

Speech Recognition

Transformers

Open Source License: Apache-2.0

#Vietnamese speech recognition #XLSR-53 fine-tuning #Multi-GPU training

Downloads 29

Release Time: 3/2/2022

Model Overview

An automatic speech recognition model for Vietnamese, fine-tuned from the wav2vec2-large-xlsr-53 checkpoint.

Model Features

Vietnamese optimization

Specially fine-tuned for Vietnamese speech recognition tasks

Based on wav2vec2 architecture

Uses facebook's wav2vec2-large-xlsr-53 as the base model

Multi-GPU training

Distributed training using 2 GPUs
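With data-parallel training on 2 GPUs, each optimizer step sees the combined batch from both devices. The sketch below shows that arithmetic only. The per-device batch size and gradient accumulation values are hypothetical placeholders, since this page does not list the actual training hyperparameters.

```python
def effective_batch_size(per_device_batch_size: int,
                         num_gpus: int = 2,
                         grad_accum_steps: int = 1) -> int:
    """Number of samples contributing to one optimizer step.

    num_gpus defaults to 2 because the page states that 2 GPUs were used;
    the other arguments are illustrative and not taken from the model card.
    """
    return per_device_batch_size * num_gpus * grad_accum_steps


# Hypothetical example: 8 samples per GPU, no gradient accumulation.
print(effective_batch_size(8))
```

When comparing this model with a single-GPU run, the combined figure is the one to match, not the per-device one.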

Model Capabilities

Vietnamese speech recognition

Speech-to-text
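wav2vec2 checkpoints fine-tuned for speech-to-text are usually trained with a CTC head, which emits one token per audio frame. This page does not state the decoding method, so the following is a minimal sketch under that assumption. It shows greedy CTC decoding: collapse repeated frame predictions, drop the blank token, and turn the word delimiter into spaces. The toy vocabulary, the blank id of 0, and the "|" delimiter are illustrative choices, not values read from this model.

```python
from itertools import groupby
from typing import Sequence


def ctc_greedy_decode(frame_ids: Sequence[int],
                      vocab: Sequence[str],
                      blank_id: int = 0,
                      word_delimiter: str = "|") -> str:
    """Turn per-frame token ids into text using greedy CTC rules."""
    # 1. Collapse runs of identical consecutive ids into a single id.
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # 2. Remove the blank token, which only separates repeated characters.
    chars = [vocab[token_id] for token_id in collapsed if token_id != blank_id]
    # 3. Replace the word delimiter with spaces.
    return "".join(chars).replace(word_delimiter, " ").strip()


# Toy vocabulary (hypothetical); index 0 plays the role of the CTC blank.
toy_vocab = ["<pad>", "|", "x", "i", "n", "c", "h", "à", "o"]
frames = [2, 2, 0, 3, 4, 4, 1, 5, 6, 6, 0, 7, 8, 8]
print(ctc_greedy_decode(frames, toy_vocab))
```

In practice the model's own tokenizer performs this step, so the function is only meant to show what happens between frame-level predictions and the final transcript.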

Use Cases

Speech transcription

Vietnamese speech transcription

Convert Vietnamese speech content into text
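A transcription call might look like the sketch below, which uses the `automatic-speech-recognition` pipeline from the Transformers library. The Hub id `phongdtd/fb-vindata-vi-large` is an assumption formed from the developer and model names on this page, and `clip.wav` is a hypothetical file. wav2vec2-large-xlsr-53 was pretrained on 16 kHz audio, so input is expected at that sampling rate; when given a file path, the pipeline decodes and resamples it, which requires ffmpeg to be installed.

```python
from typing import Any, Callable, Optional

# Assumed Hub id (developer name + model name); verify before use.
MODEL_ID = "phongdtd/fb-vindata-vi-large"


def load_transcriber(model_id: str = MODEL_ID) -> Callable[[str], Any]:
    """Build a speech-to-text pipeline; downloads weights on first use."""
    # Imported lazily so the rest of the module works without the heavy
    # dependencies (pip install transformers torch).
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str,
               transcriber: Optional[Callable[[str], Any]] = None) -> str:
    """Return the transcript for one audio file."""
    if transcriber is None:
        transcriber = load_transcriber()
    # The ASR pipeline returns a dict whose "text" field holds the transcript.
    return transcriber(audio_path)["text"].strip()


# Example call (needs network access and a real audio file):
# print(transcribe("clip.wav"))
```

Passing a `transcriber` explicitly lets one pipeline instance be reused across many files, which avoids reloading the model for every call.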
