fb-youtube-vi-large Open-source Automatic Speech Recognition Model - Accurately Identify Vietnamese YouTube Audio Content

Fb Youtube Vi Large

Developed by phongdtd

This model is an automatic speech recognition model fine-tuned on Vietnamese YouTube informal audio datasets, based on facebook/wav2vec2-large-xlsr-53.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Vietnamese speech recognition #Multi-GPU fine-tuning #YouTube scenario optimization

Downloads 31

Release Time : 3/2/2022

Model Overview

An automatic speech recognition model optimized for informal Vietnamese speech, suitable for processing everyday conversational speech on platforms like YouTube.

Model Features

Vietnamese Optimization

Specially fine-tuned for informal Vietnamese speech scenarios

Multi-GPU Training

Uses multi-GPU distributed training to improve training efficiency

Efficient Training

Optimizes the training process with mixed-precision training (AMP)

Model Capabilities

Vietnamese speech recognition

Informal speech processing

YouTube audio transcription

Use Cases

Speech Transcription

YouTube Video Caption Generation

Automatically generates captions for Vietnamese YouTube videos

Daily Conversation Transcription

Transcribes Vietnamese daily conversation content

Property	Details
learning_rate	2e-05
train_batch_size	4
eval_batch_size	8
seed	42
distributed_type	multi - GPU
num_devices	2
total_train_batch_size	8
total_eval_batch_size	16
optimizer	Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type	linear
lr_scheduler_warmup_steps	200
num_epochs	25.0
mixed_precision_training	Native AMP

Property	Details
Transformers	4.17.0.dev0
Pytorch	1.10.0+cu111
Datasets	1.18.3
Tokenizers	0.10.3

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Fb Youtube Vi Large

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 fb-youtube-vi-large

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

📄 License