An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with bertopic

A curated list of projects in awesome lists tagged with bertopic .

https://github.com/alisonmitchell/biomedical-knowledge-graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

arxiv bern2 bertopic biomedical coreference-resolution drug-repurposing europe-pmc fastcoref grobid groq kazu knowledge-graph langchain llamaindex melodi-presto named-entity-recognition pdfminersix relation-extraction spacy-scispacy unsloth

Last synced: 17 May 2026

https://github.com/kstrassheim/active-learning-with-deep-learning-for-nlp

We present our concept of a new type of Active-Learning for Deep Learning with NLP text classification and experimentally prove its performance against Random Sampling as well as its runtime performance on the Security Threat dataset from CySecAlert. These new Active Learning algorithms are based on Sentence-BERT and BERTopic clustering algorithms with allow us to generate fixed length tokens for whole sentences to make them comparable to each other. Further the Tokens are Clustered using K-Means or HDBScan to get diverse clusters to pick the samples out of them.

active-learning bertopic deep-learning hdbscan k-means-clustering matplotlib natural-language-processing pandas python3 pytorch sentence-bert

Last synced: 03 Apr 2025

https://github.com/gcalcedo/clusview

Build interactive topic modeling pipelines.

bertopic clustering large-language-models topic-modeling

Last synced: 14 May 2025

https://github.com/nazir20/twitter-topic-modeling-with-lsa-lda-bertopic-top2vec-and-nmf

Text Mining Final Project about Twitter Topic Modeling with different models

bertopic lda-model lsa-model nmf python top2vec topic-modeling

Last synced: 09 Oct 2025

https://github.com/ruoheng-du/topic-modeling-sentiment-analysis

Analysis of Chinese Financial Discourse Based on Topic Clustering and Emotional Evolution | Fall 2023 - Spring 2024

bertopic finbert sentiment-analysis stock-return-predictions topic-modeling

Last synced: 24 Sep 2025

https://github.com/andrewdarnall/the-observer

A Big Data processing pipeline wich a topic modeling model (BERTopic) using Mastodon data

apache-kafka apache-spark bertopic dataengineering mastodon tapunict

Last synced: 17 Feb 2026

https://github.com/lugolbis/projet-in304

Inpoda est un programme python qui traite et analyse les données d'une base de données stocké au format JSON. L'enjeux principal d'Inpoda est d'extraire les 'Topics' et sentiments des tweets d'une base de données internationale.

bertopic gradio pandas pandas-dataframe python python3 textblob

Last synced: 17 Apr 2026

https://github.com/chris-santiago/bookmarks-topics

Using unsupervised learning and language modeling to cluster and reorganize web bookmarks.

bert-embeddings bertopic bookmarks clustering generative-modeling hdbscan hydra llm openai taskfile umap unsupervised-learning

Last synced: 11 Apr 2026

https://github.com/jonfairbanks/bert-scraper

Scrape website content and extract topics

bertopic webscraper

Last synced: 29 Nov 2025

https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms

Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.

all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization

Last synced: 11 May 2026

https://github.com/shakleen/news-outlet-freedom-detection

Identify the freedom of a local news outlet by comparing sentiment and stance of published news against international outlets.

bertopic clustering data-mining llama2 llm python3 sentiment-analysis stance-detection topic-modeling

Last synced: 06 May 2026

https://github.com/pranavsp108/reddit-ai-workforce-analytics

NLP and GenAI-assisted analytics platform that analyzes Reddit discussions on AI-driven workforce disruption, career anxiety, hiring-market shifts, and adaptation strategies.

bertopic data-science machine-learning nlp python reddit-api sentiment-analysis streamlit topic-modeling workforce-analytics

Last synced: 16 Jun 2026

https://github.com/michaelkinfu/etd-topic-modeling

The electronic theses and dissertations topic modeling project was conducted by the Chinese University of Hong Kong Library.

bertopic clustering digital-scholarship topic-modeling

Last synced: 29 Oct 2025

https://github.com/terencicp/mastodon-topics

Topic modeling of Mastodon posts using BERTopic and LLM summarization. Results can be viewed in a Streamlit app.

bertopic mastodon streamlit summarization topic-modeling

Last synced: 21 Apr 2026

https://github.com/relostar-devil/cis-509-analytics-unstructured-data-yelp-data-analysis

In Florida and Pennsylvania, Yelp reviews paint a vivid picture of dining experiences across American, Chinese, and Italian cuisines. Using sentiment analysis and topic modeling, we uncover key themes that shape customer satisfaction. From flavor and service to ambiance and value, one factor stands above all—food quality.

bertopic matplotlib nltk pandas python seaborn sentiment-analysis tokenization yelp-dataset

Last synced: 14 Apr 2026

https://github.com/hassanzouhar/v0lur

Text Analysis Pipeline with interactive CLI UI. Features quote-aware processing, topic discovery, sentiment analysis, and data exports.

bert bertopic cli fault-tolerant memory-safe nlp textual ui

Last synced: 23 Jan 2026

https://github.com/vishrut-b/mod-lisation-nlp-de-la-presse-fran-aise

Pipeline NLP complet pour analyser 500+ articles de presse français. Collecte via NewsAPI, nettoyage et lemmatisation avec spaCy, embeddings CamemBERT, réduction UMAP, clustering BERTopic. Évaluation par cohérence sémantique, garantissant des thèmes précis et pertinents.

bertopic deep-learning french-language machine natural-language-processing topic-modeling

Last synced: 07 Feb 2026

https://github.com/huacenxu/pri-insights-chatbot

This project leverages machine learning and large language models to compute similarity scores, generate actionable insights, and enable AI-powered question-answering for enhanced data interaction.

ai-chatbot bertopic llm machine-learning natural-language-processing python sentence-similarity streamlit transformers visualization

Last synced: 19 May 2026

https://github.com/rlvtick/topic-modeling-shopee-reviews

Topic modeling on Shopee's 1-star reviews to uncover insights and prevalent topics within the reviews.

bertopic lda natural-language-processing nmf-matrix-factorization topic-modeling

Last synced: 15 Jun 2026

https://github.com/foteinipapadopoulou/covidvaccinesyoutubenlp

Unveiling Sentiments and Topics in COVID-19 Vaccine Comments on YouTube Over Time: from the First Vaccine Approval to the Post-Pandemic Era

bertopic sentiment-analysis text-mining topic-modeling

Last synced: 13 Sep 2025