Projects in Awesome Lists tagged with bertopic
A curated list of projects in awesome lists tagged with bertopic.
https://github.com/kstrassheim/active-learning-with-deep-learning-for-nlp
We present our concept of a new type of active learning for deep learning with NLP text classification and experimentally demonstrate its performance against random sampling, as well as its runtime performance, on the security threat dataset from CySecAlert. These new active learning algorithms are based on Sentence-BERT and BERTopic, which allow us to generate fixed-length tokens for whole sentences so that they can be compared with each other. The tokens are then clustered using K-Means or HDBSCAN to obtain diverse clusters from which the samples are picked.
active-learning bertopic deep-learning hdbscan k-means-clustering matplotlib natural-language-processing pandas python3 pytorch sentence-bert
Last synced: 03 Apr 2025
https://github.com/gcalcedo/clusview
Build interactive topic modeling pipelines.
bertopic clustering large-language-models topic-modeling
Last synced: 14 May 2025
https://github.com/nazir20/twitter-topic-modeling-with-lsa-lda-bertopic-top2vec-and-nmf
Text Mining Final Project about Twitter Topic Modeling with different models
bertopic lda-model lsa-model nmf python top2vec topic-modeling
Last synced: 27 Jan 2025
https://github.com/andrewdarnall/the-observer
A big data processing pipeline with a topic modeling model (BERTopic) using Mastodon data
apache-kafka apache-spark bertopic dataengineering mastodon tapunict
Last synced: 12 Apr 2025
https://github.com/ruoheng-du/topic-modeling-sentiment-analysis
Analysis of Chinese Financial Discourse Based on Topic Clustering and Emotional Evolution | Fall 2023 - Spring 2024
bertopic finbert sentiment-analysis stock-return-predictions topic-modeling
Last synced: 15 Jan 2025
https://github.com/lugolbis/projet-in304
Inpoda is a Python program that processes and analyzes data from a database stored in JSON format. Inpoda's main goal is to extract the 'Topics' and sentiments of tweets from an international database.
bertopic gradio pandas pandas-dataframe python python3 textblob
Last synced: 25 Mar 2025
https://github.com/alisonmitchell/biomedical-knowledge-graph
Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.
arxiv bern2 bertopic biomedical coreference-resolution drug-repurposing europe-pmc fastcoref grobid groq kazu knowledge-graph langchain llamaindex melodi-presto named-entity-recognition pdfminersix relation-extraction spacy-scispacy unsloth
Last synced: 03 Apr 2025
https://github.com/jonfairbanks/bert-scraper
Scrape website content and extract topics
Last synced: 19 Feb 2025
https://github.com/chris-santiago/bookmarks-topics
Using unsupervised learning and language modeling to cluster and reorganize web bookmarks.
bert-embeddings bertopic bookmarks clustering generative-modeling hdbscan hydra llm openai taskfile umap unsupervised-learning
Last synced: 27 Mar 2025
https://github.com/huacenxu/pri-insights-chatbot
This project leverages machine learning and large language models to compute similarity scores, generate actionable insights, and enable AI-powered question-answering for enhanced data interaction.
ai-chatbot bertopic llm machine-learning natural-language-processing python sentence-similarity streamlit transformers visualization
Last synced: 01 Mar 2025
https://github.com/marionchaff/topic-modelling
NLP: topic modelling using BERTopic + LSA + pLSA + LDA
bertopic customer-reviews e-commerce lda lsa machine-learning natural-language-processing nlp plsa python text-analysis topic-modelling
Last synced: 12 Apr 2025
https://github.com/shakleen/news-outlet-freedom-detection
Identify the freedom of a local news outlet by comparing sentiment and stance of published news against international outlets.
bertopic clustering data-mining llama2 llm python3 sentiment-analysis stance-detection topic-modeling
Last synced: 26 Feb 2025
https://github.com/relostar-devil/cis-509-analytics-unstructured-data-yelp-data-analysis
In Florida and Pennsylvania, Yelp reviews paint a vivid picture of dining experiences across American, Chinese, and Italian cuisines. Using sentiment analysis and topic modeling, we uncover key themes that shape customer satisfaction. From flavor and service to ambiance and value, one factor stands above all—food quality.
bertopic matplotlib nltk pandas python seaborn sentiment-analysis tokenization yelp-dataset
Last synced: 13 Mar 2025
https://github.com/micheldpd24/cust_review_bertopic
Topic Modeling of Customer Reviews using BERTopic
bertopic coherence-score dash dashboard docker embeddings similarity-score topic-modeling transformers
Last synced: 12 Apr 2025
https://github.com/michaelkinfu/etd-topic-modeling
The electronic theses and dissertations topic modeling project was conducted by the Chinese University of Hong Kong Library.
bertopic clustering digital-scholarship topic-modeling
Last synced: 05 Mar 2025
https://github.com/rlvtick/topic-modeling-shopee-reviews
Topic modeling on Shopee's 1-star reviews to uncover insights and prevalent topics within the reviews.
bertopic lda natural-language-processing nmf-matrix-factorization topic-modeling
Last synced: 24 Feb 2025