An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with bertopic

A curated list of projects in awesome lists tagged with bertopic .

https://github.com/kstrassheim/active-learning-with-deep-learning-for-nlp

We present our concept of a new type of Active-Learning for Deep Learning with NLP text classification and experimentally prove its performance against Random Sampling as well as its runtime performance on the Security Threat dataset from CySecAlert. These new Active Learning algorithms are based on Sentence-BERT and BERTopic clustering algorithms with allow us to generate fixed length tokens for whole sentences to make them comparable to each other. Further the Tokens are Clustered using K-Means or HDBScan to get diverse clusters to pick the samples out of them.

active-learning bertopic deep-learning hdbscan k-means-clustering matplotlib natural-language-processing pandas python3 pytorch sentence-bert

Last synced: 03 Apr 2025

https://github.com/gcalcedo/clusview

Build interactive topic modeling pipelines.

bertopic clustering large-language-models topic-modeling

Last synced: 14 May 2025

https://github.com/nazir20/twitter-topic-modeling-with-lsa-lda-bertopic-top2vec-and-nmf

Text Mining Final Project about Twitter Topic Modeling with different models

bertopic lda-model lsa-model nmf python top2vec topic-modeling

Last synced: 27 Jan 2025

https://github.com/andrewdarnall/the-observer

A big data processing pipeline wich a topic modeling model (BERTopic) using Mastodon data

apache-kafka apache-spark bertopic dataengineering mastodon tapunict

Last synced: 12 Apr 2025

https://github.com/ruoheng-du/topic-modeling-sentiment-analysis

Analysis of Chinese Financial Discourse Based on Topic Clustering and Emotional Evolution | Fall 2023 - Spring 2024

bertopic finbert sentiment-analysis stock-return-predictions topic-modeling

Last synced: 15 Jan 2025

https://github.com/lugolbis/projet-in304

Inpoda est un programme python qui traite et analyse les données d'une base de données stocké au format JSON. L'enjeux principal d'Inpoda est d'extraire les 'Topics' et sentiments des tweets d'une base de données internationale.

bertopic gradio pandas pandas-dataframe python python3 textblob

Last synced: 25 Mar 2025

https://github.com/alisonmitchell/biomedical-knowledge-graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

arxiv bern2 bertopic biomedical coreference-resolution drug-repurposing europe-pmc fastcoref grobid groq kazu knowledge-graph langchain llamaindex melodi-presto named-entity-recognition pdfminersix relation-extraction spacy-scispacy unsloth

Last synced: 03 Apr 2025

https://github.com/jonfairbanks/bert-scraper

Scrape website content and extract topics

bertopic webscraper

Last synced: 19 Feb 2025

https://github.com/chris-santiago/bookmarks-topics

Using unsupervised learning and language modeling to cluster and reorganize web bookmarks.

bert-embeddings bertopic bookmarks clustering generative-modeling hdbscan hydra llm openai taskfile umap unsupervised-learning

Last synced: 27 Mar 2025

https://github.com/huacenxu/pri-insights-chatbot

This project leverages machine learning and large language models to compute similarity scores, generate actionable insights, and enable AI-powered question-answering for enhanced data interaction.

ai-chatbot bertopic llm machine-learning natural-language-processing python sentence-similarity streamlit transformers visualization

Last synced: 01 Mar 2025

https://github.com/shakleen/news-outlet-freedom-detection

Identify the freedom of a local news outlet by comparing sentiment and stance of published news against international outlets.

bertopic clustering data-mining llama2 llm python3 sentiment-analysis stance-detection topic-modeling

Last synced: 26 Feb 2025

https://github.com/relostar-devil/cis-509-analytics-unstructured-data-yelp-data-analysis

In Florida and Pennsylvania, Yelp reviews paint a vivid picture of dining experiences across American, Chinese, and Italian cuisines. Using sentiment analysis and topic modeling, we uncover key themes that shape customer satisfaction. From flavor and service to ambiance and value, one factor stands above all—food quality.

bertopic matplotlib nltk pandas python seaborn sentiment-analysis tokenization yelp-dataset

Last synced: 13 Mar 2025

https://github.com/michaelkinfu/etd-topic-modeling

The electronic theses and dissertations topic modeling project was conducted by the Chinese University of Hong Kong Library.

bertopic clustering digital-scholarship topic-modeling

Last synced: 05 Mar 2025

https://github.com/rlvtick/topic-modeling-shopee-reviews

Topic modeling on Shopee's 1-star reviews to uncover insights and prevalent topics within the reviews.

bertopic lda natural-language-processing nmf-matrix-factorization topic-modeling

Last synced: 24 Feb 2025