Whisper Ner V1
WhisperNER is a novel model capable of simultaneous speech transcription and entity recognition, supporting open-type named entity recognition (NER).
Speech Recognition
Safetensors Supports Multiple LanguagesOpen Source License:MIT#Speech Entity Recognition#Open Type NER#Multi-task ASR
Downloads 174
Release Time : 9/23/2024
Model Overview
WhisperNER is a powerful foundational model suitable for downstream tasks in automatic speech recognition (ASR) with NER, and its performance can be enhanced by fine-tuning on specific datasets.
Model Features
Joint Speech Transcription and Entity Recognition
Capable of simultaneous speech transcription and entity recognition, supporting open-type named entity recognition (NER).
Open-Type NER Support
Able to recognize diverse and evolving entities during inference.
Fine-Tunable Foundational Model
Suitable for downstream tasks in automatic speech recognition (ASR) with NER, and its performance can be enhanced by fine-tuning on specific datasets.
Model Capabilities
Speech Transcription
Named Entity Recognition
Open-Type Entity Recognition
Use Cases
Speech-to-Text and Entity Extraction
Meeting Minutes and Entity Extraction
Convert meeting recordings into text and extract key entities (e.g., names, companies, locations).
Enhances the efficiency and searchability of meeting records.
News Audio Analysis
Analyze news broadcast audio to extract key figures, organizations, and locations.
Quickly generates news summaries and entity indexes.
Featured Recommended AI Models
Š 2025AIbase