Luke Japanese Wordpiece Base
A LUKE model improved from Japanese BERT, specifically optimized for Japanese named entity recognition tasks
Downloads 16
Release Time : 8/10/2023
Model Overview
This model is a Japanese language model based on the LUKE architecture, enhanced for Japanese named entity recognition by switching the base model to Japanese BERT and updating the training data.
Model Features
Improved Base Model
Switched the base model from RoBERTa to Japanese BERT, and accordingly changed the tokenizer from Sentencepiece to WordPiece
Updated Training Data
Pretrained using Japanese Wikipedia data up to July 1, 2023
Enhanced Entity Handling
Added support for handling `[UNK]` (unknown) entities
Compatibility Optimization
Fixed compatibility issues with higher versions of transformers and adjusted tokenizer output to comply with BERT specifications
Model Capabilities
Japanese Text Understanding
Named Entity Recognition
Handling Unknown Entities
Use Cases
Natural Language Processing
Japanese Text Entity Recognition
Identify entities such as person names, place names, and organizations in Japanese text
Capable of accurately recognizing various named entities, including unknown entities
Featured Recommended AI Models