T

Text2vec Base Chinese

Developed by shibing624
A Chinese text embedding model based on the CoSENT (Cosine Sentence) model, which can map sentences to a 768-dimensional dense vector space and is suitable for tasks such as sentence embedding, text matching, or semantic search.
Downloads 605.98k
Release Time : 3/2/2022

Model Overview

This model is trained using the CoSENT method and is obtained by training on Chinese STS-B data based on hfl/chinese-macbert-base. It performs excellently in the evaluation of the Chinese STS-B test set.

Model Features

Efficient Chinese semantic matching
Performs excellently in Chinese text matching tasks and is suitable for general semantic matching scenarios.
Based on the CoSENT method
Trained using the Cosine Sentence (CoSENT) method to optimize the similarity calculation of sentence embeddings.
768-dimensional dense vector
Maps sentences to a 768-dimensional dense vector space, suitable for downstream task processing.

Model Capabilities

Sentence embedding
Text matching
Semantic search

Use Cases

Text similarity calculation
Question-answering system
Used to calculate the semantic similarity between questions and candidate answers
Improve the accuracy of question-answering matching
Information retrieval
Enhance the semantic understanding ability of search engines
Improve the relevance of search results
Natural language processing
Text clustering
Used for the automatic clustering of similar texts
Text classification
Used as input features for text classification tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase