An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with lda

A curated list of projects in awesome lists tagged with lda .

https://github.com/baidu/familia

A Toolkit for Industrial Topic Modeling

lda nlp sentence-lda topic-modeling topic-models twe

Last synced: 14 Apr 2025

https://github.com/baidu/Familia

A Toolkit for Industrial Topic Modeling

lda nlp sentence-lda topic-modeling topic-models twe

Last synced: 08 Apr 2025

https://github.com/kimmeen/weibo-analyst

Social media (Weibo) comments analyzing toolbox in Chinese 微博评论分析工具, 实现功能: 1.微博评论数据爬取; 2.分词与关键词提取; 3.词云与词频统计; 4.情感分析; 5.主题聚类

crawler lda sentiment-analysis weibo word-clouds

Last synced: 04 Apr 2025

https://github.com/yongzhuo/nlg-yongzhuo

中文文本摘要(text summarization)工具包, 抽取式中文文本摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。(graph,feature,topic model,summarize tool or tookit)

lda lead3 lsi nlg nmf text-summarization textrank textteaser tookit word-significance

Last synced: 10 Jun 2025

https://github.com/yangliuy/LDAGibbsSampling

Open Source Package for Gibbs Sampling of LDA

gibbs-sampling java lda topic topic-modeling

Last synced: 03 May 2025

https://github.com/joewandy/hlda

Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model

gibbs-sampler hierarchical-topic-models lda topic-hierarchies topic-modeling

Last synced: 30 Dec 2025

https://github.com/tatevkaren/data-science-popular-algorithms

Data Science algorithms and topics that you must know. (Newly Designed) Recommender Systems, Decision Trees, K-Means, LDA, RFM-Segmentation, XGBoost in Python, R, and Scala.

data-science-portfolio datascience decision-tree kmeans lda lda-model linear-discriminant-analysis machine-learning movie-recommender python3 recommendation-system scala xgboost

Last synced: 10 Apr 2025

https://github.com/ArtificiAI/Multilingual-Latent-Dirichlet-Allocation-LDA

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

clustering english french latent-dirichlet-allocation lda machine-learning multilingual natural-language-processing

Last synced: 27 Mar 2025

https://github.com/datquocnguyen/jLDADMM

A Java package for the LDA and DMM topic models

gibbs-sampling lda nlp short-text topic-modeling topic-models

Last synced: 03 May 2025

https://github.com/mattdeitke/cvpr2019

Displays all the 2019 CVPR Accepted Papers in a way that they are easy to parse.

computer-vision cvpr2019 imagemagick lda python web-crawler web-crawler-python

Last synced: 13 Apr 2025

https://github.com/ankane/tomoto-ruby

High performance topic modeling for Ruby

latent-dirichlet-allocation lda topic-modeling

Last synced: 17 Nov 2025

https://github.com/DARIAH-DE/TopicsExplorer

Explore your own text collection with a topic model – without prior knowledge.

digital-humanities flask-application gui lda pyqt5 standalone-executables topic-modeling topics-explorer

Last synced: 03 May 2025

https://github.com/sdq/deepvis

machine learning algorithms in Swift

hierarchical-clustering kmeans lda machine-learning pca unsupervised-learning

Last synced: 17 Aug 2025

https://github.com/wesslen/Topic-Modeling-Workshop-with-R

A workshop on analyzing topic modeling (LDA, CTM, STM) using R

lda r stm topic-modeling

Last synced: 15 Mar 2025

https://github.com/bnosac/etm

Topic Modelling in Semantic Embedding Spaces

embeddings lda topic-modeling word-embeddings word2vec

Last synced: 29 Apr 2025

https://github.com/wesslen/topic-modeling-workshop-with-r

A workshop on analyzing topic modeling (LDA, CTM, STM) using R

lda r stm topic-modeling

Last synced: 27 Oct 2025

https://github.com/nzw0301/lightLDA

fast sampling algorithm based on CGS

lda machine-learning nlp python topic-modeling

Last synced: 03 Apr 2025

https://github.com/nzw0301/lightlda

fast sampling algorithm based on CGS

lda machine-learning nlp python topic-modeling

Last synced: 30 Apr 2025

https://github.com/js1010/cusim

Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)

cuda gensim gpu lda topic-modeling w2v word-embedding

Last synced: 30 Apr 2025

https://github.com/yao8839836/kge-lda

Knowledge Graph Embedding LDA. AAAI 2017

embeddings lda topic-modeling

Last synced: 23 Aug 2025

https://github.com/yao8839836/KGE-LDA

Knowledge Graph Embedding LDA. AAAI 2017

embeddings lda topic-modeling

Last synced: 03 May 2025

https://github.com/wri-dssg-omdena/policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

active-learning bert data-science document-classification environmental huggingface incentives landscape-restoration lda machine-learning nlp policy sbert scraping scrapy sentence-transformers spyder text-classification topic transformers

Last synced: 27 Mar 2025

https://github.com/dataxujing/nlp-paper

:art: :art:NLP 自然语言处理教程 :art::art: https://dataxujing.github.io/NLP-paper/

albert attention-mechanism bert crf elmo fasttext glove gpt lad2vec lda lsa pagerank plsa seq2seq seq2seq-attention textcnn textrank transformer word2vec xlnet

Last synced: 26 Oct 2025

https://github.com/avivace/reviews-sentiment

Data analytics, exploration, sentiment analysis and topic analysis (LDA) on Amazon customer reviews. And cool interactive plots.

amazon-reviews analytics customer-data data-analytics data-visualization data-visualization-project lda plots sentiment-analysis topic-analysis user-review

Last synced: 30 Apr 2025

https://github.com/maxent-ai/lda2vec

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019

chainer deep-learning embeddings lda nlp python3 sklearn text text-mining topic-modeling word-embeddings word2vec

Last synced: 07 Oct 2025

https://github.com/lejon/PartiallyCollapsedLDA

Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LDA and a very efficient Polya-Urn LDA

lda machine-learning machine-learning-algorithms machinelearning

Last synced: 03 May 2025

https://github.com/weecology/ldats

Latent Dirichlet Allocation coupled with Bayesian Time Series analyses

changepoint lda parallel-tempering portal softmax

Last synced: 22 Oct 2025

https://github.com/lucastheis/trlda

Implementations of various online inference algorithms for LDA, with Python interface.

lda machine-learning python topic-modeling variational-inference

Last synced: 03 May 2025

https://github.com/abuchmueller/twitmo

Collect Twitter data and create topic models with R

ctm geospatial lda nlp r r-package rstats stm topic-modeling twitter twitter-api

Last synced: 22 Oct 2025

https://github.com/lucko515/dataset-dimensionality-reduction-python

Here I've demonstrated how and why should we use PCA, KernelPCA, LDA and t-SNE for dimensionality reduction when we work with higher dimensional datasets.

dimensionality-reduction kernel-pca lda machine-learning pca tsne

Last synced: 14 Apr 2025

https://github.com/lucastheis/logistic_lda

Basic tensorflow implementation of logistic latent Dirichlet allocation

classification lda machine-learning tensorflow topic-modeling

Last synced: 03 May 2025

https://github.com/contefranz/optop

Optimal topic identification from a pool of Latent Dirichlet Allocation models

latent-dirichlet-allocation lda model-selection natural-language-processing nlp text-mining topic-modeling

Last synced: 05 Apr 2025

https://github.com/parvvaresh/agricultural-products-classification

This pipeline is designed to classify agricultural products using satellite data from two satellites, SENTINEL-1 and SENTINEL-2

classification docker google-earth-engine lda ml pca satellite standarization

Last synced: 16 Oct 2025

https://github.com/contefranz/OpTop

Optimal topic identification from a pool of Latent Dirichlet Allocation models

latent-dirichlet-allocation lda model-selection natural-language-processing nlp text-mining topic-modeling

Last synced: 13 Jul 2025

https://github.com/worldbank/wb-nlp-apps

This repository contains the NLP modeling components and web application implementations of a project for knowledge and data discovery funded by the Knowledge for Change Program (KCP) and the Joint Data Center on Forced Displacement (JDC).

data-discovery lda machine-learning nlp python topic-modeling word2vec

Last synced: 02 Sep 2025

https://github.com/valentinarho/lda-rest

REST web service to compute and query Latent Dirichlet Allocation models

docker latent-dirichlet-allocation lda lda-model mongo-database

Last synced: 09 Apr 2025

https://github.com/byukan/chatbots-nlp

Chatbots and other NLP applications: Topic Modeling on text from Codechef and OkCupid

lda machine-learning nlp nmf tfidf topic-modeling

Last synced: 15 Jul 2025

https://github.com/thakur-nandan/topic-modeling

This repository contains as intuitive example on topic-modeling using regular LDA, and how GuidedLDA is better than regular LDA

gensim guidedlda latent latent-dirichlet-allocation lda nlp nlp-machine-learning nltk python seededlda text topic-modeling

Last synced: 14 Apr 2025

https://github.com/rena95/Loss-Distribution-Approach

The repo contains the main topics carried out in my master's thesis on operational risk. In particular, it is described how to implement the so called Loss Distribution Approach (LDA), which is considered the state-of-the-art method to compute capital charge among large banks.

copula copula-models extreme-value-statistics lda loss-distribution loss-distribution-approach operational-risk r risk-management value-at-risk

Last synced: 29 Jul 2025

https://github.com/thucdx/news-trending

Finding trends in news article with Spark (MLLIB, LDA), Spark-Solr, Solr

lda spark-solr topicmodelling

Last synced: 12 Apr 2025

https://github.com/andreped/adverse-events

IEEE BIBM 2021: Bayesian optimization-guided topic modeling for automatic detection of sepsis-related events from free text

adverse-events bayesian-optimization classification data-set detection ieee-bibm latent-dirichlet-allocation lda machine-learning natural-language-processing sepsis

Last synced: 13 Apr 2025

https://github.com/prrao87/topic-modelling

Comparing the scalability and quality of topic models in Gensim and PySpark

data-mining gensim lda natural-language-processing nlp pyspark python topic-modeling topic-models

Last synced: 20 Jun 2025

https://github.com/doaa-altarawy/lascad

LASCAD: Language-Agnostic Software Categorization and Similar Application Detection

hierarchical-clustering lda mining-software-repositories software-engineering topic-modeling

Last synced: 13 Apr 2025

https://github.com/andreaferretti/lda

Latent Dirichlet Allocation

lda nim topic-modeling

Last synced: 12 Oct 2025

https://github.com/adirthaborgohain/bert-text-analysis

Text Analysis done on a business text dataset using KeyBERT and BERTopic

bert eda keybert lda nlp transformers

Last synced: 07 May 2025

https://github.com/m-clark/topic-models-demo

Demonstration of a standard topic model approach

demo latent-dirichlet-allocation lda r structural-topic-modeling topic-modeling

Last synced: 30 Apr 2025

https://github.com/luc99hen/user-review-clustering

使用sklearn对用户评论数据进行聚类

cluster lda python3 sklearn tf-idf

Last synced: 27 Oct 2025

https://github.com/iaja/scalaLDAvis

Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation

apache lda machine-learning scala spark visulization

Last synced: 03 May 2025

https://github.com/dokato/bci-challange

Medicon 2019 BCI competition

bci erp lda machine-learning

Last synced: 15 Oct 2025

https://github.com/tariqulislam/nlp_research

This is a Natural language processing for semi supervised mechine learning technique to create Document classification

document-classification gensim lda natural-language-processing nlp-machine-learning nltk pymongo pypdf2 python

Last synced: 12 Apr 2025

https://github.com/miteshputhran/reuters_2012-17_news_headline_analysis

Topic-Modelling Reuters news headlines to find topic clusters for the past 7 years using LDA and t-SNE

analysis kaggle lda python text-mining topic-modeling

Last synced: 30 Jul 2025

https://github.com/kylase/topicdiff

TopicDiff visualises the difference in topic coverage between 2 content. This project is done for a technical demonstration of building a web application with Python.

d3 flask gensim lda python3

Last synced: 06 Apr 2025

https://github.com/koldim2001/emotion_classifier

Проведение бинарной и многоклассовой классификаций эмоций людей на фотографиях

binary-classification classic-machine-learning discriminant-analysis emotion-recognition face-detection gabor-feature-extraction lda multiclass-classification pca

Last synced: 24 Feb 2025

https://github.com/tanyokwok/topicmodel

分布式流数据大规模主题模型的实现

lda parameter-server spark-streaming

Last synced: 05 Dec 2025

https://github.com/titsuki/raku-algorithm-lda

A Raku Latent Dirichlet Allocation implementation

lda perl6 raku rakulang

Last synced: 09 Apr 2025

https://github.com/omarsar/math_comp_project

:gem: Matlab implementation and visualization of PCA, LDA, and K-means :gem:

clustering kmeans lda matlab pca

Last synced: 10 Jul 2025

https://github.com/mpuig/news-topics

News topic discovery using LDA (Latent Dirichlet Allocation)

lda news-topic personal-project topic-discovery

Last synced: 12 Apr 2025

https://github.com/christoph/robics

Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.

gensim lda natural-language-processing nmf robust-parametrizations scikit-learn topic-modeling topic-models

Last synced: 13 Apr 2025

https://github.com/diptodas8/lda-topic-modeling

CSC635 project on topic modeling with lda

gibbs-sampling lda topic-modeling

Last synced: 21 Feb 2025

https://github.com/mwoss/mors

Application of topic models for information retrieval and search engine optimization.

common-crawl crawler django doc2vec gensim hacktoberfest lda python scrapy search search-engine tfidf

Last synced: 13 Jun 2025

https://github.com/sylhare/simple-lda

:bookmark: simple lda - latent dirichlet allocation

language-processing latent-dirichlet-allocation lda python

Last synced: 28 Oct 2025

https://github.com/bean5/paper-thesis

My published paper on the application of LDA on documents. Base corpus: Thousands of LDS General Conference articles spanning decades.

ai docker general-conference latex lda paper recommendation-system recommender-system research topic-modelling topic-models whitepaper whitepapers

Last synced: 21 Feb 2025

https://github.com/maehr/simple-topic-modeling

A simple topic modeling tool that runs in your browser.

lda pyodide streamlit topic-modeling

Last synced: 29 Oct 2025

https://github.com/yash22222/terrorist-activity-forecasting-and-risk-assessment-system

In an era marked by global security challenges, the "TAFRAS" emerges as a cutting-edge solution to tackle the ever-evolving threat of terrorism. The project is grounded in the urgent need for predictive systems that can anticipate, assess, and mitigate potential terrorist activities.

corpora data-vizualisation folium-maps gensim global-terrorism-database lda machine-learning matplotlib networkx nltk nmf numpy pandas python random-forest-classifier seaborn sklearn spacy textblob vader-sentiment-analysis

Last synced: 24 Feb 2025

https://github.com/knightron0/topicgeneration

Get current topic from captions using Latent Dirichlet Allocation.

captions lda sentences topic

Last synced: 26 Feb 2025

https://github.com/subhadarship/text-clustering

Clustering text data (data mining fall 2019)

bert clustering glove-embeddings lda nlp roberta topic-modeling visualization

Last synced: 18 Mar 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 30 Mar 2025

https://github.com/labrijisaad/dimreduce-healthanalytics

A project showcasing the application of various dimensionality reduction techniques for visualizing and analyzing simulated health diagnostics data in 2D and 3D.

2d-vis 3d-visualization dimensionality-reduction lda pca tsne umap

Last synced: 28 Apr 2025

https://github.com/bean5/nlp-topic-explorer

LDA is a machine learning algorithm. If you use the mallet toolkit, it results in some files which are great for NLP programs, but are not immediately human-friendly. This makes it human friendly. Take a look!

lda lda-algorithm machine-learning ml nlp nlp-machine-learning

Last synced: 21 Feb 2025