Projects in Awesome Lists tagged with topic-modeling
A curated list of projects in awesome lists tagged with topic-modeling .
https://github.com/ddbourgin/numpy-ml
Machine learning, in numpy
attention bayesian-inference gaussian-mixture-models gaussian-processes good-turing-smoothing gradient-boosting hidden-markov-models knn lstm machine-learning mfcc neural-networks reinforcement-learning resnet topic-modeling vae wavenet wgan-gp word2vec
Last synced: 12 May 2025
https://github.com/piskvorky/gensim
Topic Modelling for Humans
data-mining data-science document-similarity fasttext gensim information-retrieval machine-learning natural-language-processing neural-network nlp python topic-modeling word-embeddings word-similarity word2vec
Last synced: 11 Dec 2025
https://github.com/maartengr/bertopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers
Last synced: 12 May 2025
https://github.com/MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers
Last synced: 09 Apr 2025
https://github.com/ddangelov/top2vec
Top2Vec learns jointly embedded topic, document and word vectors.
bert document-embedding pre-trained-language-models semantic-search sentence-encoder sentence-transformers text-search text-semantic-similarity top2vec topic-modeling topic-modelling topic-search topic-vector word-embeddings
Last synced: 23 Apr 2025
https://github.com/ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
bert document-embedding pre-trained-language-models semantic-search sentence-encoder sentence-transformers text-search text-semantic-similarity top2vec topic-modeling topic-modelling topic-search topic-vector word-embeddings
Last synced: 30 Mar 2025
https://github.com/baidu/familia
A Toolkit for Industrial Topic Modeling
lda nlp sentence-lda topic-modeling topic-models twe
Last synced: 14 Apr 2025
https://github.com/baidu/Familia
A Toolkit for Industrial Topic Modeling
lda nlp sentence-lda topic-modeling topic-models twe
Last synced: 08 Apr 2025
https://github.com/jasonkessler/scattertext
Beautiful visualizations of how language differs among document types.
computational-social-science d3 eda exploratory-data-analysis japanese-language machine-learning natural-language-processing nlp scatter-plot semiotic-squares sentiment stylometric stylometry text-as-data text-mining text-visualization topic-modeling visualization word-embeddings word2vec
Last synced: 09 Apr 2025
https://github.com/JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
computational-social-science d3 eda exploratory-data-analysis japanese-language machine-learning natural-language-processing nlp scatter-plot semiotic-squares sentiment stylometric stylometry text-as-data text-mining text-visualization topic-modeling visualization word-embeddings word2vec
Last synced: 27 Mar 2025
https://github.com/contextlab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization
Last synced: 29 Jan 2026
https://github.com/ContextLab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization data-wrangling high-dimensional-data python text-vectorization time-series topic-modeling visualization
Last synced: 07 Apr 2025
https://github.com/nomic-ai/nomic
Interact, analyze and structure massive text, image, embedding, audio and video datasets
clustering duplicate-detection embeddings python text topic-modeling unstructured-data
Last synced: 13 May 2025
https://github.com/owlbarn/owl
Owl - OCaml Scientific Computing @ https://ocaml.xyz
algorithmic-differentation autograd automatic-differentiation gsl linear-algebra machine-learning maths matrix mcmc ndarray neural-network numerical-calculations optimization plotting regression scientific-computing sparse-linear-systems statistical-functions statistics topic-modeling
Last synced: 14 May 2025
https://github.com/MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
bert embeddings multilingual-models multilingual-topic-models neural-topic-models nlp nlp-library nlp-machine-learning text-as-data topic-coherence topic-modeling transformer
Last synced: 03 Apr 2025
https://github.com/dselivanov/text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
glove latent-dirichlet-allocation natural-language-processing text-mining topic-modeling vectorization word-embeddings word2vec
Last synced: 16 May 2025
https://github.com/MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
bayesian-optimization evaluation-metrics hyperparameter-optimization hyperparameter-search hyperparameter-tuning latent-dirichlet-allocation latent-semantic-analysis natural-language-processing neural-topic-models nlp nlp-library nlproc non-negative-matrix-factorization topic-modeling topic-models
Last synced: 03 May 2025
https://github.com/bigartm/bigartm
Fast topic modeling platform
bigartm bigdata c-plus-plus machine-learning python python-api regularizer text-mining topic-modeling
Last synced: 08 Apr 2025
https://github.com/gregversteeg/corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
information-theory machine-learning python topic-modeling unsupervised-learning
Last synced: 10 Feb 2026
https://github.com/bab2min/tomotopy
Python package of Tomoto, the Topic Modeling Tool
correlated-topic-model dirichlet-multinomial-regression hierarchical-dirichlet-processes latent-dirichlet-allocation nlp pachinko-allocation python-library supervised-lda topic-modeling topic-models
Last synced: 17 Jan 2026
https://github.com/cpsievert/ldavis
R package for web-based interactive topic model visualization.
javascript r text-mining topic-modeling visualization
Last synced: 08 Apr 2025
https://github.com/cpsievert/LDAvis
R package for web-based interactive topic model visualization.
javascript r text-mining topic-modeling visualization
Last synced: 15 Mar 2025
https://github.com/vi3k6i5/guidedlda
semi supervised guided topic model with custom guidedLDA
data-science guided-topic-modeling guidedlda machine-learning seededlda topic-modeling
Last synced: 12 Apr 2025
https://github.com/vi3k6i5/GuidedLDA
semi supervised guided topic model with custom guidedLDA
data-science guided-topic-modeling guidedlda machine-learning seededlda topic-modeling
Last synced: 03 May 2025
https://github.com/stephenhky/PyShortTextCategorization
Various Algorithms for Short Text Mining
algorithm machine-learning natural-language-processing neural-network package python python-library text-mining topic-modeling
Last synced: 03 May 2025
https://github.com/stephenhky/pyshorttextcategorization
Various Algorithms for Short Text Mining
algorithm machine-learning natural-language-processing neural-network package python python-library text-mining topic-modeling
Last synced: 14 May 2025
https://github.com/jmartinezheras/2018-MachineLearning-Lectures-ESA
Machine Learning Lectures at the European Space Agency (ESA) in 2018
anomaly-detection clustering decision-trees deep-learning lecture-material lecture-slides lecture-videos lectures linear-regression machine-learning machinelearning neural-network pca random-forest support-vector-machines text-mining tf-idf topic-modeling
Last synced: 21 Apr 2025
https://github.com/primaryobjects/lda
LDA topic modeling for node.js
ai artificial-intelligence javascript keywords language lda machine-learning natural-language-processing nlp node node-js nodejs topic-modeling topics
Last synced: 04 Apr 2025
https://github.com/chtmp223/topicGPT
TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)
llm nlp openai python topic-modeling vllm
Last synced: 09 May 2025
https://github.com/yangliuy/LDAGibbsSampling
Open Source Package for Gibbs Sampling of LDA
gibbs-sampling java lda topic topic-modeling
Last synced: 03 May 2025
https://github.com/cohere-ai/sandbox-topically
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
machine-learning nlp python topic-modeling
Last synced: 13 Apr 2025
https://github.com/maartengr/concept
Concept Modeling: Topic Modeling on Images and Text
computer-vision image-processing nlp topic-modeling
Last synced: 16 May 2025
https://github.com/dice-group/palmetto
Palmetto is a quality measuring tool for topics
evaluation topic-coherence topic-modeling
Last synced: 24 Jun 2025
https://github.com/MaartenGr/Concept
Concept Modeling: Topic Modeling on Images and Text
computer-vision image-processing nlp topic-modeling
Last synced: 05 Apr 2025
https://github.com/WZBSocialScienceCenter/tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
evaluation nlp parallel-processing python socialscience text-processing topic-modeling
Last synced: 03 May 2025
https://github.com/charlesdedampierre/BunkaTopics
🗺️ Data Cleaning and Textual Data Visualization 🗺️
cartography data-cleaning explainability fine-tuning llms machine-learning natural-language-processing nlp summarization topic-modeling
Last synced: 30 Aug 2025
https://github.com/maxent-ai/converse
Conversational text Analysis using various NLP techniques
callcenter-analysis conversational-ai emotion-recognition huggingface machine-learning nlp nlu pytorch scikit-learn sentiment-analysis spacy speech-to-text text text-mining topic-modeling transformers
Last synced: 15 Feb 2026
https://github.com/datquocnguyen/LFTM
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
gibbs-sampling short-text topic-modeling word-embeddings
Last synced: 03 May 2025
https://github.com/qiang2100/STTM
Short Text Topic Modeling, JAVA
btm gpudmm gpupdmm gsdmm lda ptm satm short-text short-text-clustering topic-modeling
Last synced: 03 May 2025
https://github.com/joewandy/hlda
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
gibbs-sampler hierarchical-topic-models lda topic-hierarchies topic-modeling
Last synced: 27 Mar 2026
https://github.com/yuewang-cuhk/takg
The official implementation of ACL 2019 paper "Topic-Aware Neural Keyphrase Generation for Social Media Language"
keyphrase-generation nlp social-media topic-modeling
Last synced: 06 Oct 2025
https://github.com/osainz59/ask2transformers
A Framework for Textual Entailment based Zero Shot text classification
deep-learning mnli natural-language-processing nlp nlp-tool nlp-tools pytorch relation-extraction text-classification topic-classification topic-modeling transformers zero-shot
Last synced: 31 Aug 2025
https://github.com/machine-intelligence-laboratory/TopicNet
Interface for easier topic modelling.
bigartm-library custom-score document-representation modalities multimodal-data multimodal-learning pypi topic-modeling topic-modelling
Last synced: 03 May 2025
https://github.com/dirkhovy/text_analysis_for_social_science
Code for the CUP Elements on text analysis in Python for social scientists
analysis classification-models clustering data-analysis embeddings neural-networks prediction predictive-modeling social-sciences text-analysis text-classification topic-modeling
Last synced: 15 Mar 2025
https://github.com/dondealban/learning-stm
Learning structural topic modeling using the stm R package.
automated-content-analysis machine-learning stm text-analysis topic-modeling
Last synced: 15 Mar 2025
https://github.com/JoeZJH/Labeled-LDA-Python
Implement of L-LDA Model(Labeled Latent Dirichlet Allocation Model) with python
gibbs-sampling incremental-update l-lda labeled-lda llda llda-model python python2 python27 python3 topic-model topic-modeling
Last synced: 03 May 2025
https://github.com/joezjh/labeled-lda-python
Implement of L-LDA Model(Labeled Latent Dirichlet Allocation Model) with python
gibbs-sampling incremental-update l-lda labeled-lda llda llda-model python python2 python27 python3 topic-model topic-modeling
Last synced: 11 Jul 2025
https://github.com/dipanjans/learning-social-media-analytics-with-r
This repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
analytics facebook flickr foursquare ggplot2 github guardian news r sentiment-analysis social-data social-media social-network-analysis stackexchange stackoverflow text-mining topic-modeling twitter
Last synced: 15 Apr 2025
https://github.com/cschwem2er/stminsights
A Shiny Application for Inspecting Structural Topic Models
natural-language-processing r shiny topic-modeling
Last synced: 09 Apr 2025
https://github.com/lmcinnes/enstop
Ensemble topic modelling with pLSA
dimensionality-reduction matrix-factorization plsa topic-modeling
Last synced: 30 Apr 2025
https://github.com/senderle/topic-modeling-tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
data-science digital-humanities mallet text-analytics topic-modeling
Last synced: 21 Jan 2026
https://github.com/lettier/lda-topic-modeling
A PureScript, browser-based implementation of LDA topic modeling.
bayesian bulma bulma-css clustering data-science functional-programming gibbs-sampling latent-dirichlet-allocation lda machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning purescript reactive reactive-programming text-mining thermite topic-modeling
Last synced: 07 Oct 2025
https://github.com/x-tabdeveloping/topicwizard
Powerful topic model visualization in Python
dash machine mantine plotly plotly-dash scikit-learn sklearn tailwindcss topic-modeling visualization
Last synced: 10 Apr 2025
https://github.com/bnosac/btm
Biterm Topic Modelling for Short Text with R
biterm-topic-modelling natural-language-processing r topic-modeling
Last synced: 12 Sep 2025
https://github.com/bnosac/BTM
Biterm Topic Modelling for Short Text with R
biterm-topic-modelling natural-language-processing r topic-modeling
Last synced: 03 May 2025
https://github.com/madrugado/attention-based-aspect-extraction
Code for unsupervised aspect extraction, using Keras and its Backends
aspect-extraction deep-learning keras topic-modeling unsupervised-learning
Last synced: 04 Oct 2025
https://github.com/ddangelov/restful-top2vec
Expose a Top2Vec model with a REST API.
document-embedding fastapi rest-api restful-api semantic-search semantic-search-engine text-search text-similarity top2vec topic-model topic-modeling word-embedding
Last synced: 01 May 2025
https://github.com/AdrienGuille/TOM
A library for topic modeling and browsing
Last synced: 15 Mar 2025
https://github.com/datquocnguyen/jLDADMM
A Java package for the LDA and DMM topic models
gibbs-sampling lda nlp short-text topic-modeling topic-models
Last synced: 03 May 2025
https://github.com/titipata/paper-reviewer-matcher
Linear programming solver for paper-reviewer matching and mind-matching
ccn-conference paper-reviewer-matcher papers python reviewers topic-modeling
Last synced: 26 Mar 2025
https://github.com/yao8839836/PTM
A Topic Modeling Approach for Traditional Chinese Medicine Prescriptions. TKDE 2018
herbs knowledge prescriptions symptoms topic-modeling traditional-chinese-medicine
Last synced: 03 May 2025
https://github.com/ahmedbesbes/how-to-mine-newsfeed-data-and-extract-interactive-insights-in-python
A practical guide to topic mining and interactive visualizations
bokeh crontab gensim kmeans latent-dirichlet-allocation natural-language-processing newsapi newsapi-python nlp nlp-keywords-extraction nlp-machine-learning plots sklearn text-mining tf-idf topic-modeling tsne-algorithm tsne-plot
Last synced: 01 Jul 2025
https://github.com/yao8839836/ptm
A Topic Modeling Approach for Traditional Chinese Medicine Prescriptions. TKDE 2018
herbs knowledge prescriptions symptoms topic-modeling traditional-chinese-medicine
Last synced: 24 Apr 2025
https://github.com/andrewtavis/kwx
BERT, LDA, and TFIDF based keyword extraction in Python
bert data-analysis data-science data-visualization keyword-extraction latent-dirichlet-allocation lda machine-learning multilingual natural-language-processing nlp open-source python python3 text-analysis text-classification text-mining tfidf topic-modeling unsupervised-learning
Last synced: 14 Apr 2025
https://github.com/ankane/tomoto-ruby
High performance topic modeling for Ruby
latent-dirichlet-allocation lda topic-modeling
Last synced: 17 Nov 2025
https://github.com/DARIAH-DE/Topics
A Python library for topic modeling and visualization
data-science digital-humanities lda machine-learning natural-language-processing python3 text-mining topic-modeling
Last synced: 03 May 2025
https://github.com/bunyaminergen/callytics
Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.
denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition
Last synced: 03 Apr 2025
https://github.com/alexeyev/abae-pytorch
PyTorch implementation of 'An Unsupervised Neural Attention Model for Aspect Extraction' by He et al. ACL2017'
aspect-extraction autoencoder nlp pytorch pytorch-implementation topic-modeling unsupervised-machine-learning
Last synced: 26 Apr 2025
https://github.com/DARIAH-DE/TopicsExplorer
Explore your own text collection with a topic model – without prior knowledge.
digital-humanities flask-application gui lda pyqt5 standalone-executables topic-modeling topics-explorer
Last synced: 03 May 2025
https://github.com/wesslen/Topic-Modeling-Workshop-with-R
A workshop on analyzing topic modeling (LDA, CTM, STM) using R
Last synced: 15 Mar 2025
https://github.com/wesslen/topic-modeling-workshop-with-r
A workshop on analyzing topic modeling (LDA, CTM, STM) using R
Last synced: 27 Oct 2025
https://github.com/bnosac/etm
Topic Modelling in Semantic Embedding Spaces
embeddings lda topic-modeling word-embeddings word2vec
Last synced: 29 Apr 2025
https://github.com/jarmoza/twic
Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models
d3 digital-humanities interactive-visualization topic-modeling visualization
Last synced: 18 Jan 2026
https://github.com/nzw0301/lightlda
fast sampling algorithm based on CGS
lda machine-learning nlp python topic-modeling
Last synced: 30 Apr 2025
https://github.com/nzw0301/lightLDA
fast sampling algorithm based on CGS
lda machine-learning nlp python topic-modeling
Last synced: 03 Apr 2025
https://github.com/yumeng5/CatE
[WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding
discriminative-topic-mining topic-mining topic-modeling word-embeddings
Last synced: 03 May 2025
https://github.com/x-tabdeveloping/turftopic
Robust and fast topic models with sentence-transformers.
contextual llm topic-modeling transformers
Last synced: 08 May 2025
https://github.com/benedekrozemberczki/nmfadmm
A sparsity aware implementation of "Alternating Direction Method of Multipliers for Non-Negative Matrix Factorization with the Beta-Divergence" (ICASSP 2014).
admm beta-divergence deepwalk embedding factorization feature-extraction lda matrix-factorization nmf node2vec optimization pca principal-component-analysis principal-components sparse-matrix topic-modeling unsupervised-learning unsupervised-machine-learning word-embedding word2vec
Last synced: 27 Jul 2025
https://github.com/js1010/cusim
Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)
cuda gensim gpu lda topic-modeling w2v word-embedding
Last synced: 30 Apr 2025
https://github.com/wesslen/topicapp
A simple Shiny App for Topic Modeling in R
r shiny structural-topic-modeling topic-modeling visualization
Last synced: 02 Mar 2026
https://github.com/silviatti/topic-model-diversity
A collection of topic diversity measures for topic modeling
evaluation-metrics gensim latent-dirichlet-allocation lda topic-diversity topic-diversity-measures topic-model topic-modeling topic-modeling-analysis topic-models
Last synced: 03 May 2025
https://github.com/wesslen/topicApp
A simple Shiny App for Topic Modeling in R
r shiny structural-topic-modeling topic-modeling visualization
Last synced: 03 May 2025
https://github.com/mast-group/tassal
Tree-based Autofolding Software Summarization Algorithm
machine-learning ml4code topic-modeling
Last synced: 15 Nov 2025
https://github.com/seinecle/nocodefunctions-web-app
The code base of the front-end of nocodefunctions.com
data-processing data-science jakarta-faces java network-analysis nlp nocode pdf-to-text pdf2text sentiment-analysis text-mining topic-modeling webapp
Last synced: 24 Jan 2026
https://github.com/yao8839836/kge-lda
Knowledge Graph Embedding LDA. AAAI 2017
Last synced: 23 Aug 2025
https://github.com/yao8839836/KGE-LDA
Knowledge Graph Embedding LDA. AAAI 2017
Last synced: 03 May 2025
https://github.com/vlukiyanov/pt-avitm
PyTorch implementation of AVITM (Autoencoding Variational Inference For Topic Models)
autoencoder avitm pytorch topic-modeling
Last synced: 23 Apr 2025
https://github.com/wang-h/werss
WeRSS - 微信公众号热度分析系统
keyword-extraction topic-modeling wechat
Last synced: 04 Apr 2026
https://github.com/centre-for-humanities-computing/tweetopic
Blazing fast topic modelling for short texts.
dirichlet-process-mixtures dmm gibbs-sampling gsdmm machine-learning mcmc nlp python scikit-learn topic-modeling tweet tweet-analysis visualization
Last synced: 10 Oct 2025
https://github.com/m-clark/sem
:white_medium_small_square: <- :white_circle: Structural Equation Modeling from a broader context.
bayesian-nonparametric-models graphical-models growth-curves irt latent-variable-models lavaan mixture-model r sem structural-equation-modeling topic-modeling
Last synced: 30 Apr 2025
https://github.com/ahoho/kd-topic-models
Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"
knowledge-distillation topic-modeling topic-models
Last synced: 14 Jul 2025
https://github.com/david-cortes/ctpfrec
Python implementation of "Content-based recommendations with poisson factorization", with some extensions
cold-start collaborative-topic-factorization poisson-factorization topic-modeling
Last synced: 14 Jan 2026
https://github.com/maxent-ai/lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
chainer deep-learning embeddings lda nlp python3 sklearn text text-mining topic-modeling word-embeddings word2vec
Last synced: 07 Oct 2025
https://github.com/kafkasl/contextuallstm
Contextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning
lstm-neural-networks nlp-tasks topic-modeling word-embeddings word-prediction
Last synced: 09 Oct 2025
https://github.com/ecoronado92/towards_data_science
Repo containing code for Towards Data Science articles
bayesian non-parametric python3 r regression-models topic-modeling
Last synced: 10 Sep 2025
https://github.com/packtworkshops/the-unsupervised-learning-workshop
An Interactive Approach to Understanding Unsupervised Learning Algorithms
agglomerative-clustering dbscan-clustering hierarchical-clustering hotspot-analysis k-means market-basket-analysis principal-component-analysis topic-modeling
Last synced: 22 Jul 2025
https://github.com/m-clark/text-analysis-with-r
:book: :books: :newspaper: Workshop that demonstrates using and analyzing text in R.
barthelme carver character factor pos-tagging quanteda r regex sentiment-analysis shakespeare text text2vec tidytext topic-modeling wordembeddings
Last synced: 30 Apr 2025
https://github.com/dayyass/latent-semantic-analysis
Pipeline for training LSA models using Scikit-Learn.
data-science hacktoberfest latent-semantic-analysis lsa machine-learning natural-language-processing nlp pipeline python topic-modeling
Last synced: 13 Apr 2025