J

Jets

Developed by imdanboy
A JETS text-to-speech model trained on the ESPnet framework, using the LJSpeech dataset, supporting English speech synthesis.
Downloads 15
Release Time : 5/28/2022

Model Overview

This is a text-to-speech model based on the JETS architecture, capable of converting English text into natural speech. The model employs adversarial training strategies, combining Transformer encoders and HiFiGAN discriminators to produce high-quality speech output.

Model Features

High-Quality Speech Synthesis
Utilizes the JETS architecture combined with HiFiGAN discriminators to generate natural and fluent speech
Adversarial Training Strategy
Employs Generative Adversarial Network (GAN) training methods to enhance speech quality
End-to-End Training
An end-to-end training pipeline from text directly to speech waveforms
Multi-Scale Discriminator
Uses Multi-Scale Multi-Period Discriminators to improve generation quality

Model Capabilities

English Text-to-Speech
High-Quality Speech Synthesis
Speech Feature Control (Pitch, Energy)

Use Cases

Speech Synthesis Applications
Audiobook Generation
Converts e-book text into natural speech
Produces speech close to human narration
Voice Assistants
Provides speech output capabilities for virtual assistants
Natural and fluent conversational speech
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase