{"id":20301584,"url":"https://github.com/zero-one-group/geni-performance-benchmark","last_synced_at":"2025-04-11T13:36:35.352Z","repository":{"id":56516394,"uuid":"285627689","full_name":"zero-one-group/geni-performance-benchmark","owner":"zero-one-group","description":null,"archived":false,"fork":false,"pushed_at":"2020-11-03T01:43:22.000Z","size":53,"stargazers_count":28,"open_issues_count":0,"forks_count":1,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-04-02T21:52:14.212Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Clojure","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zero-one-group.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-08-06T17:11:02.000Z","updated_at":"2024-09-27T13:14:10.000Z","dependencies_parsed_at":"2022-08-15T20:10:45.024Z","dependency_job_id":null,"html_url":"https://github.com/zero-one-group/geni-performance-benchmark","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zero-one-group%2Fgeni-performance-benchmark","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zero-one-group%2Fgeni-performance-benchmark/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zero-one-group%2Fgeni-performance-benchmark/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zero-one-group%2Fgeni-performance-benchmark/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zero-one-group","download_url":"https://codeload.github.
com/zero-one-group/geni-performance-benchmark/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248410200,"owners_count":21098772,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-14T16:26:37.406Z","updated_at":"2025-04-11T13:36:35.327Z","avatar_url":"https://github.com/zero-one-group.png","language":"Clojure","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Benchmark Results\n\n### Parquet\n\n| Language | Library                              | N=2,000,000 (secs) | N=24,000,000 (secs) |\n| --       | ---                                  | ---                | ---                 |\n| Python   | Pandas (with custom agg function)    | 587                | 1,132               |\n| R        | dplyr                                | 461                | 992                 |\n| Julia    | DataFrames (with Parquet)            | 87                 | 868                 |\n| Clojure  | tablecloth                           | 48                 | 151                 |\n| R        | data.table                           | 28                 | 143                 |\n| Julia    | DataFrames (with Feather)            | 16                 | 41                  |\n| Clojure  | tech.ml.dataset (optimised)          | 9                  | 61                  |\n| Clojure  | Geni                                 | 8                  | 39                  |\n| Python   | Pandas (with builtin agg functions)  | 3                  | 42                  |\n| Clojure  | tech.ml.dataset (optimised by 
Chris) | 3                  | 8                   |\n\n### Other Data Formats\n\n| Language | Library                              | Format  | N=2,000,000 (secs) | N=24,000,000 (secs) |\n| --       | ---                                  | ---     | ---                | ---                 |\n| Julia    | DataFrames                           | Feather | 16                 | 41                  |\n| Clojure  | tech.ml.dataset (optimised by Chris) | Arrow   | 1                  | 7                   |\n\n# Machine\n\n## CPU\n\n```\n$ lscpu\nArchitecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          12\nOn-line CPU(s) list:             0-11\nThread(s) per core:              2\nCore(s) per socket:              6\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           63\nModel name:                      Intel(R) Core(TM) i7-5930K CPU @ 3.50GHz\nStepping:                        2\nCPU MHz:                         1617.393\nCPU max MHz:                     3700.0000\nCPU min MHz:                     1200.0000\nBogoMIPS:                        6996.02\nVirtualization:                  VT-x\nL1d cache:                       192 KiB\nL1i cache:                       192 KiB\nL2 cache:                        1.5 MiB\nL3 cache:                        15 MiB\nNUMA node0 CPU(s):               0-11\n...\n```\n\n## RAM\n\n3 x 8GB of Corsair's DDR4 RAM:\n\n```\n$ sudo dmidecode -t memory | grep -i size\n        Size: 8192 MB\n        Size: 8192 MB\n        Size: 8192 MB\n        ...\n$ sudo dmidecode -t memory | grep -i speed\n        Speed: 2133 MT/s\n        Speed: 2133 MT/s\n        Speed: 2133 MT/s\n        ...\n```\n\n## SSD\n\n```\n$ sudo hwinfo --disk\n97: PCI 00.0: 
10600 Disk\n  [Created at block.245]\n  ...\n  Hardware Class: disk\n  Model: \"Samsung Electronics NVMe SSD Controller SM981/PM981\"\n  Vendor: pci 0x144d \"Samsung Electronics Co Ltd\"\n  Device: pci 0xa808 \"NVMe SSD Controller SM981/PM981\"\n  SubVendor: pci 0x144d \"Samsung Electronics Co Ltd\"\n  SubDevice: pci 0xa801\n  Driver: \"nvme\"\n  Driver Modules: \"nvme\"\n  Device File: /dev/nvme0n1\n  Device Files: /dev/nvme0n1, /dev/disk/by-id/nvme-Samsung_SSD_970_EVO_Plus_500GB_S4EVNG0M801407P, /dev/disk/by-id/nvme-eui.002538589150082c, /dev/disk/by-path/pci-0000:05:00.0-nvme-1\n  Device Number: block 259:0\n  BIOS id: 0x80\n  Geometry (Logical): CHS 476940/64/32\n  Size: 976773168 sectors a 512 bytes\n  Capacity: 465 GB (500107862016 bytes)\n  Config Status: cfg=new, avail=yes, need=no, active=unknown\n  Attached to: #67 (Non-Volatile memory controller)\n```\n\nWrite speed:\n\n```\n$ sync; dd if=/dev/zero of=tempfile bs=1M count=1024; sync\n1024+0 records in\n1024+0 records out\n1073741824 bytes (1.1 GB, 1.0 GiB) copied, 0.503625 s, 2.1 GB/s\n```\n\nRead speed:\n\n```\n$ dd if=tempfile of=/dev/null bs=1M count=1024\n1024+0 records in\n1024+0 records out\n1073741824 bytes (1.1 GB, 1.0 GiB) copied, 0.132842 s, 8.1 GB/s\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzero-one-group%2Fgeni-performance-benchmark","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzero-one-group%2Fgeni-performance-benchmark","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzero-one-group%2Fgeni-performance-benchmark/lists"}