https://github.com/sudar/Yahoo_LDA
Yahoo!'s topic modelling framework using Latent Dirichlet Allocation
- Host: GitHub
- URL: https://github.com/sudar/Yahoo_LDA
- Owner: sudar
- License: apache-2.0
- Created: 2011-05-25T09:37:49.000Z (about 15 years ago)
- Default Branch: master
- Last Pushed: 2011-09-21T06:33:05.000Z (over 14 years ago)
- Last Synced: 2025-04-09T19:18:48.427Z (about 1 year ago)
- Language: C++
- Homepage:
- Size: 20.6 MB
- Stars: 337
- Watchers: 39
- Forks: 119
- Open Issues: 12
Metadata Files:
- Readme: README
- License: LICENSE
Awesome Lists containing this project
- awesome-topic-models - Yahoo-LDA - Yahoo!'s topic modelling framework [:page_facing_up:](https://dl.acm.org/doi/pdf/10.1145/2124295.2124312) (Models / Latent Dirichlet Allocation (LDA) [:page_facing_up:](https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf))
README
The Yahoo_LDA project uses several third-party open source libraries and tools.
This file summarizes the tools used, their purpose, and the licenses under which they are released.
Except as specifically stated below, the third-party software packages are not distributed as part of
this project; they are downloaded and built separately on the developer’s machine as a
pre-build step.
* Ice-3.4.1 (GNU GENERAL PUBLIC LICENSE)
  * An efficient inter-process communication framework, used for the distributed storage of (topic, word) tables.
  * http://www.zeroc.com/
* cppunit-1.12.1 (GNU LESSER GENERAL PUBLIC LICENSE)
  * C++ unit testing framework, used for the project's unit tests.
  * http://cppunit.sourceforge.net
* glog-0.3.0 (BSD)
  * Google's logging library, used for log file generation.
  * http://code.google.com/p/google-glog/
* mcpp-2.7.2 (BSD)
  * C++ preprocessor.
  * http://mcpp.sourceforge.net/
* tbb22_20090809oss (GNU GENERAL PUBLIC LICENSE)
  * Intel Threading Building Blocks, a multithreaded processing library that is much easier to use than pthreads. We use the pipeline class.
  * http://threadingbuildingblocks.org
* bzip2-1.0.5 (BSD)
  * Data compression.
  * http://www.bzip.org/
* gflags-1.2 (BSD)
  * Google's flag processing library, used for command-line options.
  * http://code.google.com/p/google-gflags/
* protobuf-2.2.0a (BSD)
  * Protocol Buffers, Google's serialization library, used for serializing data to disk and as the key internal data structure.
  * http://code.google.com/p/protobuf/
* boost-1.46.0 (Boost Software License - Version 1.0 - August 17th, 2003)
  * Boost libraries (various datatypes).
  * http://www.boost.org/
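The list above mentions that the project uses TBB's pipeline class. For readers unfamiliar with the pattern, the following is a minimal sketch of a staged pipeline written with the C++ standard library only. It is not TBB's API and is not taken from the Yahoo_LDA sources: the `Channel` helper, the `count_tokens_pipelined` function, and the choice of stages (read, tokenize, sum) are all hypothetical and serve only to illustrate how stages connected by queues can run concurrently.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <queue>
#include <sstream>
#include <string>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical helper: a minimal blocking queue connecting two pipeline stages.
template <typename T>
class Channel {
public:
    // Hands one item to the next stage.
    void push(T value) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            assert(!closed_ && "push after close");
            items_.push(std::move(value));
        }
        ready_.notify_one();
    }

    // Signals that no more items will arrive.
    void close() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            closed_ = true;
        }
        ready_.notify_all();
    }

    // Blocks until an item is available; returns false once closed and drained.
    bool pop(T& out) {
        std::unique_lock<std::mutex> lock(mutex_);
        ready_.wait(lock, [this] { return closed_ || !items_.empty(); });
        if (items_.empty()) return false;
        out = std::move(items_.front());
        items_.pop();
        return true;
    }

private:
    std::mutex mutex_;
    std::condition_variable ready_;
    std::queue<T> items_;
    bool closed_ = false;
};

// Hypothetical three-stage pipeline: read documents -> count tokens -> sum counts.
// Each stage runs on its own thread and passes results downstream via a Channel.
std::size_t count_tokens_pipelined(const std::vector<std::string>& documents) {
    Channel<std::string> raw;
    Channel<std::size_t> counts;
    std::size_t total = 0;

    // Stage 1: feed documents into the pipeline.
    std::thread reader([&] {
        for (const std::string& doc : documents) raw.push(doc);
        raw.close();
    });

    // Stage 2: split each document on whitespace and count its tokens.
    std::thread tokenizer([&] {
        std::string doc;
        while (raw.pop(doc)) {
            std::istringstream in(doc);
            std::string token;
            std::size_t n = 0;
            while (in >> token) ++n;
            counts.push(n);
        }
        counts.close();
    });

    // Stage 3: accumulate the per-document counts.
    std::thread reducer([&] {
        std::size_t n = 0;
        while (counts.pop(n)) total += n;
    });

    reader.join();
    tokenizer.join();
    reducer.join();
    return total;  // Safe to read: the reducer thread has been joined.
}
```

Calling `count_tokens_pipelined({"a b c", "d e", ""})` returns 5. Depending on the toolchain, the program may need to be built with thread support enabled (for example `-pthread` with GCC or Clang). A library such as TBB additionally handles scheduling, buffering limits, and parallel stages, which this sketch leaves out.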
For more information, see the HTML documentation at docs/html/index.html or the PDF documentation at docs/latex/refman.pdf.