{"id":19288361,"url":"https://github.com/techascent/distinct-sort","last_synced_at":"2026-05-08T16:41:07.342Z","repository":{"id":202969501,"uuid":"708169118","full_name":"techascent/distinct-sort","owner":"techascent","description":"Testing various optimization pathways for a distinct-sort problem","archived":false,"fork":false,"pushed_at":"2023-11-02T18:18:18.000Z","size":17,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-02-23T23:28:57.591Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Clojure","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/techascent.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-10-21T18:20:02.000Z","updated_at":"2023-10-21T18:20:25.000Z","dependencies_parsed_at":"2023-10-29T16:22:29.418Z","dependency_job_id":"c2f17984-2225-4edd-9313-908bdff394e0","html_url":"https://github.com/techascent/distinct-sort","commit_stats":null,"previous_names":["techascent/distinct-sort"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/techascent/distinct-sort","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techascent%2Fdistinct-sort","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techascent%2Fdistinct-sort/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techascent%2Fdistinct-sort/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techascent%2Fdistinct-sort/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/techascent","download_url":"https://codeload.github.com/techascent/distinct-sort/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techascent%2Fdistinct-sort/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32788897,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-08T08:22:46.396Z","status":"ssl_error","status_checked_at":"2026-05-08T08:22:45.650Z","response_time":54,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-09T22:08:48.308Z","updated_at":"2026-05-08T16:41:07.322Z","avatar_url":"https://github.com/techascent.png","language":"Clojure","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Distinct Sort\n\nCompare various algorithms to achieve a sorted set of multiple columns of data.  Currently the data stored as a vector of maps, a tech.ml.dataset, and a duckdb in-memory table.\n\n\n## Usage\n\n```console\n\nCHRISs-MBP:distinct-sort chrispnuernberger$ scripts/perftest\nBuilding uberjar\n\nopenjdk version \"19.0.2\" 2023-01-17\nOpenJDK Runtime Environment Homebrew (build 19.0.2)\nOpenJDK 64-Bit Server VM Homebrew (build 19.0.2, mixed mode, sharing)\nrunning full benchmark ...\nOct 29, 2023 3:51:46 PM clojure.tools.logging$eval136$fn__139 invoke\nINFO: Attempting to load duckdb from \"/Users/chrispnuernberger/dev/tech.all/distinct-sort/binaries/libduckdb.dylib\"\nbenchmarking :xforms\nbenchmarking :load-dataset-cwise\nbenchmarking :parallel-load-dataset-cwise\nbenchmarking :via-reduce-kv\nbenchmarking :ds-set-sort\nbenchmarking :hamf-lznc\nbenchmarking :via-treeset\nbenchmarking :via-hamf-sort\nbenchmarking :ds-java-hashset-sort\nbenchmarking :ds-parallel-hashset-union\nbenchmarking :ds-parallel-concurrent-hashset\nbenchmarking :map-singlepass-concurrent-hashset\nbenchmarking :map-singlepass-hashset-union\nbenchmarking :map-range-hashset-union\nbenchmarking :map-range-concurrent-hashset\nbenchmarking :ds-cols-custom-singlepass\nbenchmarking :load-duckdb-data\nbenchmarking :duckdb-prepared\nOct 29, 2023 3:56:02 PM clojure.tools.logging$eval136$fn__139 invoke\nINFO: Reference thread starting\n_unnamed [18 3]:\n\n```\n\n|                              :name |     :mean-ms | :variance-ms |\n|------------------------------------|-------------:|-------------:|\n|                   :duckdb-prepared |  10.13111076 |   0.00076707 |\n|         :ds-cols-custom-singlepass |  10.81612538 |   0.00005542 |\n|       :parallel-load-dataset-cwise |  12.96070300 |   0.00000046 |\n|    :ds-parallel-concurrent-hashset |  20.75493793 |   0.00006997 |\n|      :map-range-concurrent-hashset |  22.48023100 |   0.00000740 |\n| :map-singlepass-concurrent-hashset |  22.96825737 |   0.00001997 |\n|         :ds-parallel-hashset-union |  24.28339487 |   0.00006296 |\n|                :load-dataset-cwise |  39.92357317 |   0.00001253 |\n|           :map-range-hashset-union |  56.45934617 |   0.00097360 |\n|      :map-singlepass-hashset-union |  58.28758933 |   0.00007669 |\n|                         :hamf-lznc |  64.34495042 |   0.00027312 |\n|                       :ds-set-sort | 105.37044383 |   0.00017169 |\n|              :ds-java-hashset-sort | 127.69116567 |   0.00052420 |\n|                            :xforms | 175.08987450 |   0.00344701 |\n|                     :via-reduce-kv | 180.04342917 |   0.00160358 |\n|                     :via-hamf-sort | 200.37273533 |   0.00244236 |\n|                       :via-treeset | 328.26164483 |   0.00063673 |\n|                  :load-duckdb-data | 560.03121417 |   0.00342610 |\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftechascent%2Fdistinct-sort","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftechascent%2Fdistinct-sort","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftechascent%2Fdistinct-sort/lists"}