E

En Core Web Sm

Developed by spacy
A CPU-optimized small English language processing pipeline provided by spaCy, including core functionalities such as tokenization, part-of-speech tagging, dependency parsing, and named entity recognition
Downloads 2,707
Release Time : 3/2/2022

Model Overview

This is an English natural language processing model primarily used for basic NLP tasks such as text tokenization, part-of-speech tagging, dependency parsing, and named entity recognition. The model is optimized for CPU usage, making it suitable for lightweight application scenarios.

Model Features

CPU Optimization
Specially optimized for CPU usage scenarios, suitable for resource-limited environments
Multi-task Processing
Single pipeline simultaneously handles tokenization, part-of-speech tagging, dependency parsing, and named entity recognition
Lightweight
Small model size without word vectors, suitable for quick deployment
High Accuracy
Achieves high accuracy on standard datasets such as OntoNotes 5

Model Capabilities

Text Tokenization
Part-of-speech Tagging
Dependency Parsing
Named Entity Recognition
Sentence Segmentation
Lemmatization

Use Cases

Text Analysis
Information Extraction
Extract entity information such as person names, locations, and organizations from text
F1 score 84.56%
Syntax Analysis
Analyze sentence grammatical structure and word dependency relationships
Dependency Parsing UAS 91.75%
Content Processing
Text Preprocessing
Prepare text data for machine learning models, including tokenization and part-of-speech tagging
Tokenization accuracy 99.86%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase