{"id":21978405,"url":"https://github.com/zhaipro/word2vec","last_synced_at":"2025-03-23T00:42:04.387Z","repository":{"id":90571835,"uuid":"47535673","full_name":"zhaipro/word2vec","owner":"zhaipro","description":"Automatically exported from code.google.com/p/word2vec","archived":false,"fork":false,"pushed_at":"2015-12-07T07:15:16.000Z","size":125,"stargazers_count":0,"open_issues_count":35,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-01-28T03:50:00.270Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"C","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zhaipro.png","metadata":{"files":{"readme":"README.txt","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-12-07T07:12:41.000Z","updated_at":"2015-12-07T07:13:30.000Z","dependencies_parsed_at":"2023-03-27T15:09:33.859Z","dependency_job_id":null,"html_url":"https://github.com/zhaipro/word2vec","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhaipro%2Fword2vec","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhaipro%2Fword2vec/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhaipro%2Fword2vec/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zhaipro%2Fword2vec/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zhaipro","download_url":"https://codeload.github.com/zhaipro/word2vec/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245040188,"owners_count":20551299,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-29T16:23:43.769Z","updated_at":"2025-03-23T00:42:04.360Z","avatar_url":"https://github.com/zhaipro.png","language":"C","funding_links":[],"categories":[],"sub_categories":[],"readme":"Tools for computing distributed representtion of words\n------------------------------------------------------\n\nWe provide an implementation of the Continuous Bag-of-Words (CBOW) and the Skip-gram model (SG), as well as several demo scripts.\n\nGiven a text corpus, the word2vec tool learns a vector for every word in the vocabulary using the Continuous\nBag-of-Words or the Skip-Gram neural network architectures. The user should to specify the following:\n - desired vector dimensionality\n - the size of the context window for either the Skip-Gram or the Continuous Bag-of-Words model\n - training algorithm: hierarchical softmax and / or negative sampling\n - threshold for downsampling the frequent words \n - number of threads to use\n - the format of the output word vector file (text or binary)\n\nUsually, the other hyper-parameters such as the learning rate do not need to be tuned for different training sets. \n\nThe script demo-word.sh downloads a small (100MB) text corpus from the web, and trains a small word vector model. After the training\nis finished, the user can interactively explore the similarity of the words.\n\nMore information about the scripts is provided at https://code.google.com/p/word2vec/\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzhaipro%2Fword2vec","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzhaipro%2Fword2vec","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzhaipro%2Fword2vec/lists"}