F

Fb Youtube Vi Large

Developed by phongdtd
This model is an automatic speech recognition model fine-tuned on Vietnamese YouTube informal audio datasets, based on facebook/wav2vec2-large-xlsr-53.
Downloads 31
Release Time : 3/2/2022

Model Overview

An automatic speech recognition model optimized for informal Vietnamese speech, suitable for processing everyday conversational speech on platforms like YouTube.

Model Features

Vietnamese Optimization
Specially fine-tuned for informal Vietnamese speech scenarios
Multi-GPU Training
Uses multi-GPU distributed training to improve training efficiency
Efficient Training
Optimizes the training process with mixed-precision training (AMP)

Model Capabilities

Vietnamese speech recognition
Informal speech processing
YouTube audio transcription

Use Cases

Speech Transcription
YouTube Video Caption Generation
Automatically generates captions for Vietnamese YouTube videos
Daily Conversation Transcription
Transcribes Vietnamese daily conversation content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase