{"id":20637573,"url":"https://github.com/long-gong/datasets-e2h","last_synced_at":"2025-07-19T11:35:11.763Z","repository":{"id":176486997,"uuid":"229824521","full_name":"long-gong/datasets-E2H","owner":"long-gong","description":"Datasets Euclidean to Hamming Conversion","archived":false,"fork":false,"pushed_at":"2020-08-25T20:21:03.000Z","size":195140,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-09T08:41:00.855Z","etag":null,"topics":["cpp","datasets","eigen3","euclidean2hamming","hdf5","simhash"],"latest_commit_sha":null,"homepage":"https://github.com/long-gong/datasets","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/long-gong.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-12-23T21:20:56.000Z","updated_at":"2020-08-25T20:21:05.000Z","dependencies_parsed_at":null,"dependency_job_id":"e151bd98-bde9-4837-a0d8-5535906b1105","html_url":"https://github.com/long-gong/datasets-E2H","commit_stats":null,"previous_names":["long-gong/datasets-e2h"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/long-gong/datasets-E2H","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/long-gong%2Fdatasets-E2H","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/long-gong%2Fdatasets-E2H/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/long-gong%2Fdatasets-E2H/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/long-gong%2Fdatasets-E2H/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/long-gong","download_url":"https://codeload.github.com/long-gong/datasets-E2H/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/long-gong%2Fdatasets-E2H/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265926967,"owners_count":23850886,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cpp","datasets","eigen3","euclidean2hamming","hdf5","simhash"],"created_at":"2024-11-16T15:15:03.368Z","updated_at":"2025-07-19T11:35:11.755Z","avatar_url":"https://github.com/long-gong.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"# E2H: Euclidean Datasets to Hamming Datasets\n\n[![Build Status](https://travis-ci.org/long-gong/datasets-E2H.svg?branch=master)](https://travis-ci.org/long-gong/datasets-E2H)\n\nE2H implements the preprocessing tool used in our recent paper, \n\"Long Gong, Huayi Wang, Mitsunori Ogihara, and Jun Xu. 2020. IDEC: indexable distance estimating codes for approximate nearest neighbor search. \u003ci\u003eProc. VLDB Endow.\u003c/i\u003e 13, 9 (May 2020), 1483–1497. DOI:https://doi.org/10.14778/3397230.3397243.\" \nE2H is used to convert Euclidean datasets to Hamming datasets. \n\n## Install Dependecies\n\n```bash\n./install_deps.sh\n```\n\n## Usage\n\n```bash\nmake \u003cdataset\u003e\n./\u003cdataset\u003e m \n```\n\n`\u003cdataset\u003e`: audio|glove|mnist|enron|sift1m|gist1m|sift1b|gist80m \n`m`: dimension for Hamming data (suggested value: rounding original dim to multiples of 64)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flong-gong%2Fdatasets-e2h","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flong-gong%2Fdatasets-e2h","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flong-gong%2Fdatasets-e2h/lists"}