Ja Core News Lg
spaCy's CPU-optimized Japanese processing pipeline, including tokenization, part-of-speech tagging, dependency parsing, named entity recognition, and more
Downloads 53
Release Time : 3/2/2022
Model Overview
This is a Japanese natural language processing model trained on the Universal Dependencies Japanese corpus, supporting tasks such as part-of-speech tagging, dependency parsing, and named entity recognition. The model is optimized for CPU usage, making it suitable for Japanese text analysis tasks.
Model Features
CPU Optimization
The model is specifically optimized for CPU usage, making it suitable for running in environments without GPUs
Comprehensive NLP Features
Provides a complete set of natural language processing functions, from basic tokenization to advanced named entity recognition
High-Quality Word Vectors
Includes 480,443 300-dimensional word vectors based on the chiVe word embedding model
Model Capabilities
Japanese tokenization
Part-of-speech tagging
Named entity recognition
Dependency parsing
Lemmatization
Sentence segmentation
Use Cases
Text Analysis
Japanese News Analysis
Analyze Japanese news texts to extract entities, relations, and events
NER F-score reached 71.19%
Japanese Text Preprocessing
Prepare Japanese text data for machine learning tasks
Part-of-speech tagging accuracy reached 97.42%
Language Learning
Japanese Grammar Analysis
Help learners analyze Japanese sentence structures
Dependency parsing LAS score 90.90
Featured Recommended AI Models
ยฉ 2025AIbase