{"id":13418745,"url":"https://github.com/percyliang/brown-cluster","last_synced_at":"2025-03-15T04:30:27.862Z","repository":{"id":4065393,"uuid":"5169420","full_name":"percyliang/brown-cluster","owner":"percyliang","description":"C++ implementation of the Brown word clustering algorithm.","archived":false,"fork":false,"pushed_at":"2023-09-10T05:05:39.000Z","size":59,"stargazers_count":424,"open_issues_count":15,"forks_count":136,"subscribers_count":32,"default_branch":"master","last_synced_at":"2024-07-31T22:43:58.816Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":"iBotPeaches/Apktool","license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/percyliang.png","metadata":{"files":{"readme":"README","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2012-07-24T18:23:25.000Z","updated_at":"2024-07-05T20:18:53.000Z","dependencies_parsed_at":"2024-01-07T18:10:32.195Z","dependency_job_id":null,"html_url":"https://github.com/percyliang/brown-cluster","commit_stats":{"total_commits":16,"total_committers":8,"mean_commits":2.0,"dds":0.625,"last_synced_commit":"123bff38de44592d32a5eec588ea46a290a77ae6"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/percyliang%2Fbrown-cluster","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/percyliang%2Fbrown-cluster/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/percyliang%2Fbrown-cluster/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/percyliang%2Fbrown-cluster/manifests","owner_url":"h
ttps://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/percyliang","download_url":"https://codeload.github.com/percyliang/brown-cluster/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":221536580,"owners_count":16839535,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-07-30T22:01:06.493Z","updated_at":"2025-03-15T04:30:27.852Z","avatar_url":"https://github.com/percyliang.png","language":"C++","funding_links":[],"categories":["TODO scan for Android support in followings","Software"],"sub_categories":["Utilities"],"readme":"Implementation of the Brown hierarchical word clustering algorithm.\nPercy Liang\nRelease 1.3\n2012.07.24\n\nInput: a sequence of words separated by whitespace (see input.txt for an example).\nOutput: for each word type, its cluster (see output.txt for an example).\n        In particular, each line is:\n  \u003ccluster represented as a bit string\u003e \u003cword\u003e \u003cnumber of times word occurs in input\u003e\n\nRuns in $O(N C^2)$, where $N$ is the number of word types and $C$\nis the number of clusters.\n\nReferences:\n\n  Brown, et al.: Class-Based n-gram Models of Natural Language\n    http://acl.ldc.upenn.edu/J/J92/J92-4003.pdf\n\n  Liang: Semi-supervised learning for natural language processing\n    http://cs.stanford.edu/~pliang/papers/meng-thesis.pdf\n\nCompile:\n\n  make\n\nRun:\n\n  # Clusters input.txt into 50 clusters:\n  ./wcluster --text input.txt --c 50\n  # Output in 
input-c50-p1.out/paths\n\n============================================================\nChange Log\n\n1.3: Compatibility updates for newer versions of g++ (courtesy of Chris Dyer).\n1.2: Made compatible with MacOS (replaced timespec with timeval and changed the order of linking).\n1.1: Removed deprecated operators so it works with GCC 4.3.\n\n============================================================\n(C) Copyright 2007-2012, Percy Liang\n\nhttp://cs.stanford.edu/~pliang\n\nPermission is granted for anyone to copy, use, or modify these programs and\naccompanying documents for purposes of research or education, provided this\ncopyright notice is retained, and note is made of any changes that have been\nmade.\n\nThese programs and documents are distributed without any warranty, express or\nimplied.  As the programs were written for research purposes only, they have\nnot been tested to the degree that would be advisable in any important\napplication.  All use of these programs is entirely at the user's own risk.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpercyliang%2Fbrown-cluster","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpercyliang%2Fbrown-cluster","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpercyliang%2Fbrown-cluster/lists"}