{"id":13697281,"url":"https://github.com/senderle/topic-modeling-tool","last_synced_at":"2026-01-21T08:15:29.820Z","repository":{"id":88178478,"uuid":"47996186","full_name":"senderle/topic-modeling-tool","owner":"senderle","description":"A point-and-click tool for creating and analyzing topic models produced by MALLET.","archived":false,"fork":false,"pushed_at":"2021-03-01T20:42:05.000Z","size":168632,"stargazers_count":106,"open_issues_count":19,"forks_count":22,"subscribers_count":15,"default_branch":"main","last_synced_at":"2024-11-13T01:34:04.845Z","etag":null,"topics":["data-science","digital-humanities","mallet","text-analytics","topic-modeling"],"latest_commit_sha":null,"homepage":"https://senderle.github.io/topic-modeling-tool/documentation/2017/01/06/quickstart.html","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/senderle.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2015-12-14T19:22:12.000Z","updated_at":"2024-07-31T03:04:09.000Z","dependencies_parsed_at":null,"dependency_job_id":"2fd637b3-c243-4c40-bd0d-92f539a58c51","html_url":"https://github.com/senderle/topic-modeling-tool","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/senderle%2Ftopic-modeling-tool","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/senderle%2Ftopic-modeling-tool/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/senderle%2Ftopic-modeling-tool/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/senderle%2Ftopic-modeling-tool/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/senderle","download_url":"https://codeload.github.com/senderle/topic-modeling-tool/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252252473,"owners_count":21718748,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","digital-humanities","mallet","text-analytics","topic-modeling"],"created_at":"2024-08-02T18:00:55.024Z","updated_at":"2026-01-21T08:15:29.764Z","avatar_url":"https://github.com/senderle.png","language":"Java","funding_links":[],"categories":["Visualizations"],"sub_categories":["Embedding based Topic Models"],"readme":"\n[![DOI](https://zenodo.org/badge/47996186.svg)](https://zenodo.org/badge/latestdoi/47996186)\n\n# Topic Modeling Tool\n\n## An updated GUI for MALLET's implementation of LDA.[*](#acknowledgements)\n\n### New features:\n\n* **Metadata integration**\n* **Automatic file segmentation**\n* **Custom CSV delimiters**\n* **Alpha/Beta optimization**\n* **Custom regex tokenization**\n* **Multicore processor support**\n\n## Getting Started:\n\nTo start using some of these new features right away, consult the \n[quickstart guide](https://senderle.github.io/topic-modeling-tool/documentation/2017/01/06/quickstart.html).\nFor tinkerers, there's a guide to the tool's \n[optional settings](https://senderle.github.io/topic-modeling-tool/documentation/2018/09/27/optional-settings.html).\nYou may also find useful information in the discussion threads under \n[documentation](https://github.com/senderle/topic-modeling-tool/labels/documentation) \nissues.\n\n## Requirements:\n\nThe Topic Modeling Tool now has native Windows and Mac apps, and because of\nunicode issues, these are currently the best options for installation. \nJust follow the instructions for your operating system. **Do not try\nto install by clicking on [Clone or download] \u003e [Download ZIP].**\nIt [won't work](https://github.com/senderle/topic-modeling-tool/issues/63#issuecomment-360933035).\n\n**For Macs**: \n* Download [`TopicModelingTool.dmg`](https://github.com/senderle/topic-modeling-tool/raw/master/TopicModelingTool.dmg).\n* Open it by double-clicking.\n* Drag the app into your `Applications` folder -- or into any folder at all.\n* Run the app by double-clicking.\n\n**For Windows PCs**:\n* Download [`TopicModelingTool.zip`](https://github.com/senderle/topic-modeling-tool/raw/master/TopicModelingTool.zip).\n    * NOTE: The native PC build is out-of-date. [Help wanted](https://github.com/senderle/topic-modeling-tool/issues/70).\n* Extract the files into any folder and open it.\n* Double-click on the file called `TopicModelingTool.exe` to run it.\n\nIf you want to run the \n[plain `.jar` file](https://github.com/senderle/topic-modeling-tool/raw/master/TopicModelingTool.jar), \nyou'll need to have a fairly recent version of Java; the version that \ncame with your computer may not work, especially if your computer is \na Mac. Whatever your operating system, you can install an updated \nversion of Java by following the instructions for your operating \nsystem [here](https://java.com/en/download/help/download_options.xml).\n\n## Windows Unicode Support:\n\nWindows and Java don't play very well together when it comes to unicode \ntext. If you are using the `.jar` build, and non-ascii characters are \ngetting garbled on a Windows machine, there's a quick fix involving \n[environment variables](https://github.com/senderle/topic-modeling-tool/issues/48#issuecomment-274331463)\nthat may make things work.\n\nAgain, the best answer may just be to use the native app. It should\nnow work correctly at every stage with UTF-8-encoded text. (If it \ndoesn't, let us know and we will moan and gnash our teeth some more.)\n\n## Reporting and Replicating Bugs and Other Issues:\n\nIf you hadn't already guessed, most testing for this tool happens on a Mac. \nThere are bound to be errors happening on other platforms that have slipped\nthrough the cracks. We need you to report them so we can keep improving the\ntool! But we cannot fix a problem that we don't fully understand, so...\n\n**When posting a bug report, please include vast amounts of detail.**\n\n_Copy and paste everything from the tool's console output_ if you can, _tell us\nyour operating system and version_, and _let us know the other tools you're\nusing to create and view input and output_. It also helps if you verify that the\nbug still exists in the most recent build of the tool (i.e. the one contained in \nthe `.jar`, `.dmg`, or `.zip` files in the root directory).\n\nWe know that there are substantial problems with Windows support for\nunicode text; if you see problems, please post *detailed* information under\nthe [main issue](https://github.com/senderle/topic-modeling-tool/issues/48)\nso that we can start isolating and fixing these bugs. \n\nWe love getting new issues because it means the tool is improving! But\nagain, **when posting a bug report, please include vast amounts of detail**. \n\n## Building the Development Version:\n\nIf you feel adventurous, you might want to modify the code and compile your \nown version. To do so, you'll need to install [Apache Maven](https://maven.apache.org/) \nas well as the Java Development Kit. On Macs, [Homebrew](http://brew.sh/) \nis the best way to do so; simply install homebrew as described on the Homebrew \nsite, and then type `brew install maven` at the command line. On Windows PCs -- \nyou're on your own! But we did it and it wasn't terribly hard. You just need an\nup-to-date JDK and maven package, with their `bin` folders in your `PATH`. \n\nWith maven installed, simply use the terminal to navigate to the `TopicModelingTool` folder:\n\n    $ cd topic-modeling-tool/TopicModelingTool\n    \nThen use maven's `package` command:\n\n    $ mvn package\n\nWe now have experimental support for compiling the tool as a native app using\nthe [javafx plugin](https://github.com/javafx-maven-plugin/javafx-maven-plugin) \nfor maven. This will build a native package able to run on your operating system.\nThis has been tested on both Macs and Windows PCs.\n\n    $ mvn jfx:native\n    \n___\n\n#### Acknowledgements:\u003ca name=\"acknowledgements\"\u003e\u003c/a\u003e\n\nThis version of the tool was forked from the \n[original version](https://github.com/arunbg/Topic-Modeling-Tool) \nby [David Newman](https://scholar.google.com/citations?user=3z-mSpYAAAAJ\u0026hl=en) and \n[Arun Balagopalan](https://github.com/arunbg).\n\nPrevious work on the GUI for MALLET has been supported by a National Leadership \nGrant (LG-06-08-0057-08) from the Institute of Museum and Library Services to \nYale University, the University of Michigan, and the University of California, \nIrvine. The Institute of Museum and Library Services is the primary source of \nfederal support for the nation's 123,000 libraries and 17,500 museums. The \nInstitute's mission is to create strong libraries and museums that connect \npeople to information and ideas.\n\nWork on this version of the tool has benefited from the support of \n[Penn Libraries](http://www.library.upenn.edu/) and the the University of \nPennsylvania's [Price Lab for Digital Humanities](https://pricelab.sas.upenn.edu/).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsenderle%2Ftopic-modeling-tool","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsenderle%2Ftopic-modeling-tool","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsenderle%2Ftopic-modeling-tool/lists"}