Fb Youtube Vi Large
F
Fb Youtube Vi Large
Developed by phongdtd
This model is an automatic speech recognition model fine-tuned on Vietnamese YouTube informal audio datasets, based on facebook/wav2vec2-large-xlsr-53.
Downloads 31
Release Time : 3/2/2022
Model Overview
An automatic speech recognition model optimized for informal Vietnamese speech, suitable for processing everyday conversational speech on platforms like YouTube.
Model Features
Vietnamese Optimization
Specially fine-tuned for informal Vietnamese speech scenarios
Multi-GPU Training
Uses multi-GPU distributed training to improve training efficiency
Efficient Training
Optimizes the training process with mixed-precision training (AMP)
Model Capabilities
Vietnamese speech recognition
Informal speech processing
YouTube audio transcription
Use Cases
Speech Transcription
YouTube Video Caption Generation
Automatically generates captions for Vietnamese YouTube videos
Daily Conversation Transcription
Transcribes Vietnamese daily conversation content
Featured Recommended AI Models
Š 2025AIbase