{"id":22309505,"url":"https://github.com/polymorpher/bittiger","last_synced_at":"2025-07-24T09:32:27.041Z","repository":{"id":104148827,"uuid":"87985326","full_name":"polymorpher/bittiger","owner":"polymorpher","description":null,"archived":false,"fork":false,"pushed_at":"2017-05-03T03:44:38.000Z","size":12799,"stargazers_count":1,"open_issues_count":0,"forks_count":2,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-01-30T23:17:47.837Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/polymorpher.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-04-11T22:34:45.000Z","updated_at":"2019-06-16T21:44:48.000Z","dependencies_parsed_at":null,"dependency_job_id":"7faf5295-e0a1-4c5b-95bb-b425b3ebcb88","html_url":"https://github.com/polymorpher/bittiger","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/polymorpher%2Fbittiger","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/polymorpher%2Fbittiger/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/polymorpher%2Fbittiger/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/polymorpher%2Fbittiger/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/polymorpher","download_url":"https://codeload.github.com/polymorpher/bittiger/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245573841,"owners_count":20637670,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-03T20:43:10.615Z","updated_at":"2025-03-26T01:43:58.886Z","avatar_url":"https://github.com/polymorpher.png","language":"Scala","funding_links":[],"categories":[],"sub_categories":[],"readme":"An example LDA implementation for Scala\n\nThis is a demo version made in 2 hours so execution speed is a bit slow. However you can check against [my C++ implementation](https://github.com/polymorpher/aliaslda) to figure out how to improve the speed.\n\nDesigned to supplement AI course on BitTiger [https://www.bittiger.io/livecourses/YQCMuXwL7fhHuQT5K](https://www.bittiger.io/livecourses/YQCMuXwL7fhHuQT5K)\n\nRequirements:\n- Scala \u003e2.11.8\n- SBT \n\n\nTo run, enter `sbt run` and select the demo you want to run\n\n- NLPCore: an example of using simple NLP processing pipeline on two sentences\n\n- SNAPReaderDemo: an example of reading data file line by line and parse each line as JSON.\n\n- TextDemo: Read `src/main/resources/text/SanDiskUltra64GB.txt`, perform NLP preprocessing, and LDA. Result are displayed for every iteration\n\n- BOWDemo: You need to specify dataset as parameter. Instead of running `sbt run`, run `sbt \"runMain BOWDemo \u003cdataset\u003e\"`, where `\u003cdataset\u003e` can be either `kos` or `nips`. Don't forget the double quotes!\n \n \nData files:\n```$xslt\nsrc/main/resources\n  /bow\n    /docword.kos.txt      // Daily Kos blog data, preprocessed, extracted from UCL bag of words dataset\n    /docword.nips.txt     // NIPS paper abstract data, extracted from UCL bag of words dataset\n    /vocab.kos.txt        // Vocabulary for Daily Kos\n    /vocab.nips.txt       // Vocabulary for NIPS\n  /text\n    /SanDiskUltra64GB.txt // Amazon product reviews for SanDisk Ultra 64GB until 2014\n```","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpolymorpher%2Fbittiger","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpolymorpher%2Fbittiger","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpolymorpher%2Fbittiger/lists"}