B

Bertopic ArXiv

Developed by MaartenGr
A pre-trained topic modeling model based on the BERTopic framework, trained using approximately 30,000 abstracts of ArXiv papers, supporting multi-dimensional topic representation and classification
Downloads 231
Release Time : 5/30/2023

Model Overview

BERTopic is a flexible and modular topic modeling framework that can generate easily interpretable topic classifications from massive data. This model demonstrates the combined application of multiple topic representation methods in BERTopic.

Model Features

Multi-dimensional topic representation
Generate rich topic representations by combining multiple technologies such as part-of-speech tagging, KeyBERT heuristics, and MMR
ChatGPT enhancement
Use ChatGPT to generate topic labels and summaries to improve interpretability
Modular design
Support flexible combination of different topic representation and clustering algorithms

Model Capabilities

Text classification
Topic extraction
Keyword generation
Topic summary generation

Use Cases

Academic research
Paper topic analysis
Perform topic mining and classification on academic paper libraries such as ArXiv
Identify 107 different topics
Content analysis
Document clustering
Automatically perform topic clustering on large-scale document collections
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase