Bertopic Test
BERTopic is a flexible and modular topic modeling framework capable of generating easily interpretable topics from large datasets.
Downloads 33
Release Time : 6/25/2023
Model Overview
BERTopic is a BERT-based topic modeling tool for automatically discovering and classifying topics from text data.
Model Features
Modular design
Supports custom embedding models, dimensionality reduction and clustering algorithms
Interpretability
Generates easily interpretable topics with representative keywords
Automatic topic discovery
No preset topic count required, automatically identifies topic structures in data
Model Capabilities
Text topic classification
Topic keyword extraction
Topic visualization
Large-scale text analysis
Use Cases
Social media analysis
Cryptocurrency forum topic analysis
Analyze main topic distributions in cryptocurrency-related discussions
Identified 50 distinct topics (e.g. Litecoin, SEC regulations, etc.)
Market research
Consumer feedback classification
Automatically categorize main discussion themes in product reviews
tags:
- bertopic library_name: bertopic pipeline_tag: text-classification
bertopic-test
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("ahessamb/bertopic-test")
topic_model.get_topic_info()
Topic overview
- Number of topics: 50
- Number of training documents: 1570
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
0 | liquidations - forcefully - betting - liquidation - contracts | 8 | 0_liquidations_forcefully_betting_liquidation |
1 | litecoin - wsm - presale - 77 - near | 94 | 1_litecoin_wsm_presale_77 |
2 | sec - court - terraform - dismiss - lawyers | 49 | 2_sec_court_terraform_dismiss |
3 | huobi - hkvac - bsl - web3 - code | 12 | 3_huobi_hkvac_bsl_web3 |
4 | lucie - shiba - susbarium - puppynet - portals | 3 | 4_lucie_shiba_susbarium_puppynet |
5 | 000006819 - shiba - accuracy - finbold - estimates | 27 | 5_000006819_shiba_accuracy_finbold |
6 | tokens - sec - binance - securities - coinbase | 45 | 6_tokens_sec_binance_securities |
7 | mckinsey - ai - nanjing - productivity - diffusion | 43 | 7_mckinsey_ai_nanjing_productivity |
8 | resistance - swing - fib - zone - ltc | 32 | 8_resistance_swing_fib_zone |
9 | brinkman - tategpt - bitcoin - artists - wealth | 26 | 9_brinkman_tategpt_bitcoin_artists |
10 | stablecoin - stablecoins - decline - redemptions - tusd | 2 | 10_stablecoin_stablecoins_decline_redemptions |
11 | mutant - mayc - bayc - club - mcmullen | 64 | 11_mutant_mayc_bayc_club |
12 | xrp - ema - ripple - bullish - cryptocurrencies | 43 | 12_xrp_ema_ripple_bullish |
13 | tether - cbdcs - loans - federal - nafcu | 27 | 13_tether_cbdcs_loans_federal |
14 | rate - tradingview - bnb - breakout - coinmarketcap | 85 | 14_rate_tradingview_bnb_breakout |
15 | 26 - bulls - rsi - ceiling - 300 | 2 | 15_26_bulls_rsi_ceiling |
16 | lowest - jump - week - wallet - staggering | 3 | 16_lowest_jump_week_wallet |
17 | xrp - ripple - mekras - sbi - institutions | 56 | 17_xrp_ripple_mekras_sbi |
18 | debt - mortgages - trillion - government - suspends | 3 | 18_debt_mortgages_trillion_government |
19 | longitude - chronometer - bitcoin - ships - graffiti | 2 | 19_longitude_chronometer_bitcoin_ships |
20 | volumes - piggy - aud - xrp - usdt | 15 | 20_volumes_piggy_aud_xrp |
21 | root - ledger - stakers - sidechains - compatibility | 4 | 21_root_ledger_stakers_sidechains |
22 | astra - letter - concerns - investors - bitwise | 4 | 22_astra_letter_concerns_investors |
23 | gold - governments - manipulated - stocks - mined | 10 | 23_gold_governments_manipulated_stocks |
24 | tether - sygnum - documents - bank - coindesk | 9 | 24_tether_sygnum_documents_bank |
25 | rewards - governance - lido - proposal - june | 45 | 25_rewards_governance_lido_proposal |
26 | listings - coin - fairerc20 - bittrex - withdrawals | 68 | 26_listings_coin_fairerc20_bittrex |
27 | peaq - ordibots - cosmos - fetch - machine | 81 | 27_peaq_ordibots_cosmos_fetch |
28 | uniswap - v4 - orders - hooks - differing | 23 | 28_uniswap_v4_orders_hooks |
29 | price - neo - matic - rise - altcoin | 92 | 29_price_neo_matic_rise |
30 | emptydoc - staff - policy - binance - workspaces | 2 | 30_emptydoc_staff_policy_binance |
31 | lunc - synthetix - terra - perps - staking | 33 | 31_lunc_synthetix_terra_perps |
32 | tweet - dogecoin - chart - meme - negative | 3 | 32_tweet_dogecoin_chart_meme |
33 | binance - securities - exchange - cz - regulators | 63 | 33_binance_securities_exchange_cz |
34 | bitmart - sale - xrp - discount - event | 4 | 34_bitmart_sale_xrp_discount |
35 | yuan - event - olympics - canadians - organizers | 49 | 35_yuan_event_olympics_canadians |
36 | gusd - fidelity - bitcoin - proposal - blackrock | 52 | 36_gusd_fidelity_bitcoin_proposal |
37 | bills - mcglone - markets - stablecoins - liquidity | 56 | 37_bills_mcglone_markets_stablecoins |
38 | asset - gain - drop - trading - hours | 2 | 38_asset_gain_drop_trading |
39 | epstein - hamsterwheel - vulnerability - bounty - certick | 28 | 39_epstein_hamsterwheel_vulnerability_bounty |
40 | pyth - transparency - data - terra - oracle | 19 | 40_pyth_transparency_data_terra |
41 | shiba - inu - weighted - collapse - recovery | 2 | 41_shiba_inu_weighted_collapse |
42 | neo - opensea - carey - security - impersonators | 24 | 42_neo_opensea_carey_security |
43 | balancer - zkevm - liquidity - defi - 8020 | 3 | 43_balancer_zkevm_liquidity_defi |
44 | reed - battle - platform - argument - trading | 22 | 44_reed_battle_platform_argument |
45 | ada - cardano - whale - sell - investors | 4 | 45_ada_cardano_whale_sell |
46 | uk - coinbase - hong - crypto - regulatory | 65 | 46_uk_coinbase_hong_crypto |
47 | ethereum - tvl - defi - arbitrum - airdrop | 54 | 47_ethereum_tvl_defi_arbitrum |
48 | swyftx - shibarium - token - shibaswap - shiba | 54 | 48_swyftx_shibarium_token_shibaswap |
49 | bitcoin - mining - gain - miners - difficulty | 54 | 49_bitcoin_mining_gain_miners |
Training hyperparameters
- calculate_probabilities: False
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 1)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 10
- verbose: False
Framework versions
- Numpy: 1.22.4
- HDBSCAN: 0.8.29
- UMAP: 0.5.3
- Pandas: 1.5.3
- Scikit-Learn: 1.2.2
- Sentence-transformers: 2.2.2
- Transformers: 4.30.2
- Numba: 0.56.4
- Plotly: 5.13.1
- Python: 3.10.12
Distilbert Base Uncased Finetuned Sst 2 English
Apache-2.0
Text classification model fine-tuned on the SST-2 sentiment analysis dataset based on DistilBERT-base-uncased, with 91.3% accuracy
Text Classification English
D
distilbert
5.2M
746
Xlm Roberta Base Language Detection
MIT
Multilingual detection model based on XLM-RoBERTa, supporting text classification in 20 languages
Text Classification
Transformers Supports Multiple Languages

X
papluca
2.7M
333
Roberta Hate Speech Dynabench R4 Target
This model improves online hate detection through dynamic dataset generation, focusing on learning from worst-case scenarios to enhance detection effectiveness.
Text Classification
Transformers English

R
facebook
2.0M
80
Bert Base Multilingual Uncased Sentiment
MIT
A multilingual sentiment analysis model fine-tuned based on bert-base-multilingual-uncased, supporting sentiment analysis of product reviews in 6 languages
Text Classification Supports Multiple Languages
B
nlptown
1.8M
371
Emotion English Distilroberta Base
A fine-tuned English text emotion classification model based on DistilRoBERTa-base, capable of predicting Ekman's six basic emotions and neutral category.
Text Classification
Transformers English

E
j-hartmann
1.1M
402
Robertuito Sentiment Analysis
Spanish tweet sentiment analysis model based on RoBERTuito, supporting POS(positive)/NEG(negative)/NEU(neutral) three-class sentiment classification
Text Classification Spanish
R
pysentimiento
1.0M
88
Finbert Tone
FinBERT is a BERT model pre-trained on financial communication texts, specializing in the field of financial natural language processing. finbert-tone is its fine-tuned version for financial sentiment analysis tasks.
Text Classification
Transformers English

F
yiyanghkust
998.46k
178
Roberta Base Go Emotions
MIT
A multi-label sentiment classification model based on RoBERTa-base, trained on the go_emotions dataset, supporting recognition of 28 emotion labels.
Text Classification
Transformers English

R
SamLowe
848.12k
565
Xlm Emo T
XLM-EMO is a multilingual sentiment analysis model fine-tuned based on the XLM-T model, supporting 19 languages and specifically designed for sentiment prediction in social media texts.
Text Classification
Transformers Other

X
MilaNLProc
692.30k
7
Deberta V3 Base Mnli Fever Anli
MIT
DeBERTa-v3 model trained on MultiNLI, Fever-NLI, and ANLI datasets, excelling in zero-shot classification and natural language inference tasks
Text Classification
Transformers English

D
MoritzLaurer
613.93k
204
Featured Recommended AI Models