F

Fsmn Vad

Developed by funasr
FunASR is a foundational toolkit dedicated to bridging academic research and industrial applications in speech recognition, supporting various functions such as speech recognition, voice activity detection, and punctuation restoration.
Downloads 107
Release Time : 2/1/2024

Model Overview

FunASR provides full-stack speech processing capabilities, including speech recognition (ASR), voice activity detection (VAD), punctuation restoration, language models, etc., supporting both inference and fine-tuning of pre-trained models.

Model Features

Industrial-Grade Model Support
Provides pre-trained models trained on industrial data, ready for direct deployment in production environments.
Full-Stack Speech Processing
Integrates complete speech processing workflows including ASR, VAD, punctuation restoration, and speaker verification.
Efficient Inference
The Paraformer model combines high accuracy with efficiency, making it suitable for real-time applications.

Model Capabilities

Speech Recognition
Voice Activity Detection
Punctuation Restoration
Speaker Verification
Multi-Speaker Recognition
Timestamp Prediction

Use Cases

Speech Transcription
Automatic Meeting Minutes Generation
Automatically transcribes meeting recordings into text with punctuation and speaker information.
Accuracy can exceed 90% (dependent on audio quality).
Real-Time Speech Processing
Real-Time Captioning
Provides real-time captions for live streams or video conferences.
Latency can be controlled within 600ms.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase