Soundwave is a groundbreaking speech-to-text model that bridges the gap between speech and text, demonstrating exceptional performance in speech translation and AIR-Bench speech tasks with just 10,000 hours of training data.
Speech Recognition English