Donut Base Japanese Visual Novel

Developed by oshizo
This model is a fine-tune of naver-clova-ix/donut-base, trained on a synthetic dataset of visual novel-style images and designed to recognize the text and options shown in visual novels.
Downloads 14
Release Time: 5/3/2023

Model Overview

This is a Donut model fine-tuned to recognize text in visual novel-style images, including dialogue, options, and character names.
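
As a rough illustration of how a fine-tuned Donut checkpoint like this one is usually run, the sketch below uses the Hugging Face transformers classes DonutProcessor and VisionEncoderDecoderModel. The model_id and task_prompt arguments are assumptions that must be filled in from the model's own page, since neither value is stated here. The function names read_screenshot and clean_sequence are illustrative only.

```python
import re


def clean_sequence(sequence: str, eos_token: str, pad_token: str) -> str:
    """Drop EOS/padding tokens and the leading task-start token."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    # Remove only the first tag, which is the task prompt fed to the decoder.
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def read_screenshot(image_path: str, model_id: str, task_prompt: str) -> dict:
    """Run one screenshot through a Donut checkpoint and return parsed JSON.

    model_id and task_prompt are NOT given in this document; take both
    from the model's published page before calling this function.
    """
    # Imported lazily so clean_sequence works without these packages.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        output_ids = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            use_cache=True,
        )

    sequence = processor.batch_decode(output_ids)[0]
    sequence = clean_sequence(
        sequence, processor.tokenizer.eos_token, processor.tokenizer.pad_token
    )
    # token2json turns Donut's tag-style output into a nested dict.
    return processor.token2json(sequence)
```

The keys of the returned dictionary depend on how the checkpoint was trained, so inspect one result before relying on any particular field name.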

Model Features

Specialized for Visual Novels
Optimized specifically for visual novel-style images, accurately recognizing dialogues, options, and character names.
Layout Adaptation
The training data covers common visual novel layouts and their variants, so the model can handle different formatting styles.
Furigana Filtering
Designed to ignore furigana (phonetic annotations) and focus on accurately reading the main text content.
UI Element Filtering
Designed to avoid reading non-dialogue UI elements such as SAVE and LOAD buttons and date displays.

Model Capabilities

Visual Novel Image Recognition
Japanese Text Extraction
Dialogue Option Parsing
Character Name Recognition

Use Cases

Game Development
Visual Novel Text Extraction
Automatically recognizes dialogue content and options in visual novel game screenshots
Outputs dialogue information as structured JSON
Game Testing Automation
Used for automated testing of text display in visual novel games
Verifies whether game text is displayed correctly
Localization Tools
Translation Assistance
Extracts visual novel text for translation work
Provides accurate extraction of text to be translated
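
For the testing use case above, one possible check is to compare the text extracted from each screenshot with the line the script expects. The sketch below is a minimal, model-independent example. It assumes the extracted text is already available as plain strings keyed by a screenshot id; the names normalize, text_matches, and find_mismatches are hypothetical helpers, not part of the model or any library.

```python
import unicodedata


def normalize(text: str) -> str:
    """NFKC-normalize and drop all whitespace, so line wrapping is ignored."""
    folded = unicodedata.normalize("NFKC", text)
    return "".join(ch for ch in folded if not ch.isspace())


def text_matches(extracted: str, expected: str) -> bool:
    """True when both strings agree after normalization."""
    return normalize(extracted) == normalize(expected)


def find_mismatches(extracted: dict[str, str], expected: dict[str, str]) -> list[str]:
    """Return the screenshot ids whose extracted text differs from the script.

    A screenshot with no extracted text at all is reported as a mismatch.
    """
    return [
        shot_id
        for shot_id, script_line in expected.items()
        if not text_matches(extracted.get(shot_id, ""), script_line)
    ]
```

NFKC normalization folds full-width and half-width character variants together, which keeps punctuation width from producing false failures. Whether that tolerance is desirable depends on what the test is meant to catch.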