O

Owls 4B 180K

Developed by espnet
OWLS is a suite of Whisper-style models designed to help researchers understand the scaling properties of speech models, supporting multilingual speech recognition and translation.
Downloads 40
Release Time : 2/14/2025

Model Overview

The OWLS model is developed using ESPnet and supports multilingual speech recognition, speech translation, utterance-level alignment, long-form transcription, and language identification.

Model Features

Multilingual Support
Supports speech recognition and translation tasks in multiple languages.
Large-scale Training
Trained on up to 360K hours of publicly available speech data.
Diverse Task Support
Supports various tasks such as speech recognition, speech translation, utterance-level alignment, long-form transcription, and language identification.
Open-source Toolkit
Developed using ESPnet, fully open-source, facilitating use and extension by researchers.

Model Capabilities

Speech recognition
Speech translation
Utterance-level alignment
Long-form transcription
Language identification

Use Cases

Speech Processing
Multilingual Speech Recognition
Convert speech in multiple languages into text.
Cross-language Speech Translation
Translate speech from one language into text in another language.
Speech Analysis
Utterance-level Alignment
Analyze utterance boundaries and temporal alignment in speech.
Language Identification
Identify the language type in speech.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase