J

Japanese Hubert Base Phoneme Ctc

Developed by prj-beatrice
This model is a fine-tuned model for Japanese phoneme recognition using CTC based on rinna/japanese-hubert-base, which can effectively improve the accuracy of Japanese speech recognition.
Downloads 144
Release Time : 6/21/2025

Model Overview

This model is fine-tuned using the ReazonSpeech v2 dataset, with the phoneme labels generated by pyopenjtalk-plus as the correct answers, focusing on improving the accuracy of Japanese speech recognition tasks.

Model Features

Efficient fine-tuning
Based on the rinna/japanese-hubert-base model, efficient fine-tuning is performed using CTC, focusing on Japanese phoneme recognition tasks.
High-quality dataset
Trained using the ReazonSpeech v2 dataset and phoneme labels generated by pyopenjtalk-plus to ensure data quality.
Optimal selection
After about 0.3 epochs of learning, the checkpoint with the best accuracy on the JSUT corpus is selected to ensure model performance.

Model Capabilities

Japanese phoneme recognition
Speech-to-text
Japanese speech processing

Use Cases

Speech recognition
Japanese speech-to-text
Convert Japanese speech into a phoneme sequence for subsequent processing and analysis.
The output is a phoneme sequence, such as'm i z u o m a r e e sh i a k a r a k a w a n a k U t e w a n a r a n a i n o d e s U'
Speech processing
Japanese speech analysis
Used to analyze the phoneme distribution and patterns in Japanese speech.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase