En Core Web Sm
A CPU-optimized small English language processing pipeline provided by spaCy, including core functionalities such as tokenization, part-of-speech tagging, dependency parsing, and named entity recognition
Downloads 2,707
Release Time : 3/2/2022
Model Overview
This is an English natural language processing model primarily used for basic NLP tasks such as text tokenization, part-of-speech tagging, dependency parsing, and named entity recognition. The model is optimized for CPU usage, making it suitable for lightweight application scenarios.
Model Features
CPU Optimization
Specially optimized for CPU usage scenarios, suitable for resource-limited environments
Multi-task Processing
Single pipeline simultaneously handles tokenization, part-of-speech tagging, dependency parsing, and named entity recognition
Lightweight
Small model size without word vectors, suitable for quick deployment
High Accuracy
Achieves high accuracy on standard datasets such as OntoNotes 5
Model Capabilities
Text Tokenization
Part-of-speech Tagging
Dependency Parsing
Named Entity Recognition
Sentence Segmentation
Lemmatization
Use Cases
Text Analysis
Information Extraction
Extract entity information such as person names, locations, and organizations from text
F1 score 84.56%
Syntax Analysis
Analyze sentence grammatical structure and word dependency relationships
Dependency Parsing UAS 91.75%
Content Processing
Text Preprocessing
Prepare text data for machine learning models, including tokenization and part-of-speech tagging
Tokenization accuracy 99.86%
Featured Recommended AI Models
Š 2025AIbase