P

Psst Base Rep

Developed by birgermoell
A baseline speech recognition model trained on the PSST dataset based on the Wav2vec2-small architecture
Downloads 30
Release Time : 4/1/2022

Model Overview

This model is a reproduction of the Wav2vec2-small architecture on the PSST dataset, primarily used for speech recognition tasks, supporting phoneme and character-level recognition.

Model Features

Efficient Speech Recognition
Based on the Wav2vec2-small architecture, providing efficient speech recognition capabilities.
Phoneme and Character-Level Recognition
Supports evaluation of Phoneme Error Rate (PER) and Word Error Rate (WER).

Model Capabilities

Speech Recognition
Phoneme Recognition
Character-Level Recognition

Use Cases

Speech Transcription
Speech-to-Text
Convert speech content into text, suitable for meeting minutes, voice notes, and other scenarios.
Word Error Rate (WER): 10.4%
Speech Analysis
Phoneme Analysis
Analyze the phoneme composition in speech, suitable for linguistic research or speech training.
Phoneme Error Rate (PER): 23.1%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase