{"id":13696910,"url":"https://github.com/koheiw/seededlda","last_synced_at":"2025-07-23T21:38:50.670Z","repository":{"id":38011284,"uuid":"192859007","full_name":"koheiw/seededlda","owner":"koheiw","description":"LDA for semisupervised topic modeling","archived":false,"fork":false,"pushed_at":"2025-01-22T00:33:59.000Z","size":164194,"stargazers_count":76,"open_issues_count":11,"forks_count":15,"subscribers_count":6,"default_branch":"master","last_synced_at":"2025-06-11T08:48:21.955Z","etag":null,"topics":["semi-supervised-learning","text-classification"],"latest_commit_sha":null,"homepage":"https://koheiw.github.io/seededlda/","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/koheiw.png","metadata":{"files":{"readme":"README.RMD","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2019-06-20T06:06:53.000Z","updated_at":"2025-03-29T08:12:34.000Z","dependencies_parsed_at":"2024-03-21T09:54:03.236Z","dependency_job_id":"bac51325-d4e0-4918-81d0-59d806ea3e7c","html_url":"https://github.com/koheiw/seededlda","commit_stats":{"total_commits":434,"total_committers":1,"mean_commits":434.0,"dds":0.0,"last_synced_commit":"338509e7bebae69690aee24fc21dac7fc9bf2711"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/koheiw/seededlda","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/koheiw%2Fseededlda","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/koheiw%2Fseededlda/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/koheiw%2Fseededlda/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/koheiw%2Fseededlda/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/koheiw","download_url":"https://codeload.github.com/koheiw/seededlda/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/koheiw%2Fseededlda/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265419644,"owners_count":23761848,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["semi-supervised-learning","text-classification"],"created_at":"2024-08-02T18:00:49.739Z","updated_at":"2025-07-23T21:38:50.643Z","avatar_url":"https://github.com/koheiw.png","language":"R","funding_links":[],"categories":["Models"],"sub_categories":["Miscellaneous topic models"],"readme":"---\noutput: github_document\n---\n\n```{r, echo=FALSE}\nknitr::opts_chunk$set(\n  collapse = TRUE,\n  comment = \"##\",\n  fig.path = \"man/images/\"\n)\n```\n\n# seededlda: the package for semi-supervised topic modeling\n\n\u003c!-- badges: start --\u003e\n\n[![CRAN\nVersion](https://www.r-pkg.org/badges/version/seededlda)](https://CRAN.R-project.org/package=seededlda)\n[![Downloads](https://cranlogs.r-pkg.org/badges/seededlda)](https://CRAN.R-project.org/package=seededlda)\n[![Total\nDownloads](https://cranlogs.r-pkg.org/badges/grand-total/seededlda?color=orange)](https://CRAN.R-project.org/package=seededlda)\n[![R build\nstatus](https://github.com/koheiw/seededlda/workflows/R-CMD-check/badge.svg)](https://github.com/koheiw/seededlda/actions)\n[![codecov](https://codecov.io/gh/koheiw/seededlda/branch/master/graph/badge.svg)](https://codecov.io/gh/koheiw/seededlda)\n\u003c!-- badges: end --\u003e\n\n**seededlda** is an R package that implements Seeded LDA (Latent Dirichlet Allocation) for semi-supervised topic modeling based on **quanteda**. Initially, the package was a simple wrapper around the **topicmodels** package, but it was fully rewritten in C++ using the [GibbsLDA++ library](http://gibbslda.sourceforge.net/) and submitted to CRAN as version 0.5 in 2020. The package was further developed to add the sequential classification (Sequential LDA) and parallel computing (Distributed LDA) capabilities and released as version 1.0 in 2023.\n\n## Installation\n\nFrom CRAN:\n\n```{r, eval=FALSE}\ninstall.packages(\"seededlda\")\n```\n\nFrom Github:\n\n```{r, eval=FALSE}\ndevtools::install_github(\"koheiw/seededlda\")\n```\n\n## Examples\n\nPlease visit the package website for examples:\n\n- [Introduction](https://koheiw.github.io/seededlda/articles/pkgdown/basic.html): basic functions of the package\n- [Distributed LDA](https://koheiw.github.io/seededlda/articles/pkgdown/distributed.html): topic modeling with parallel computing\n- [Seeded LDA](https://koheiw.github.io/seededlda/articles/pkgdown/seeded.html): semi-supervised topic modeling\n- [Sequential LDA](https://koheiw.github.io/seededlda/articles/pkgdown/sequential.html): sentence-level topic modeling\n\nPlease read the following papers on the algorithms.\n\n- Watanabe, K., \u0026 Baturo, A. (2023). Seeded Sequential LDA: A Semi-Supervised Algorithm for Topic-Specific Analysis of Sentences. Social Science Computer Review. https://doi.org/10.1177/08944393231178605\n- Watanabe, K. (2023). Speed Up Topic Modeling: Distributed Computing and Convergence Detection for LDA, [working paper](https://blog.koheiw.net/wp-content/uploads/2023/05/Distributed-LDA-02.pdf).\n\n\n## Other publications\n\nPlease read the following papers for how to apply seeded-LDA in social science research:\n\n- Curini, L., \u0026 Vignoli, V. (2021). Committed Moderates and Uncommitted Extremists: Ideological Leaning and Parties’ Narratives on Military Interventions in Italy. Foreign Policy Analysis, 17(3), 1–20. https://doi.org/10.1093/fpa/orab016\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkoheiw%2Fseededlda","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkoheiw%2Fseededlda","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkoheiw%2Fseededlda/lists"}