{"id":28491073,"url":"https://github.com/dssgabriel/arm-sve-benchmarks","last_synced_at":"2026-05-01T02:32:06.851Z","repository":{"id":107156006,"uuid":"495952941","full_name":"dssgabriel/arm-sve-benchmarks","owner":"dssgabriel","description":"Performance comparison between small hand-written SVE kernels and compiler-generated ones. ","archived":false,"fork":false,"pushed_at":"2022-06-17T18:19:51.000Z","size":54,"stargazers_count":10,"open_issues_count":0,"forks_count":3,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-07-07T18:52:43.734Z","etag":null,"topics":["arm64","assembly","benchmarks","compiler","simd","sve"],"latest_commit_sha":null,"homepage":"","language":"C","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dssgabriel.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2022-05-24T19:01:07.000Z","updated_at":"2024-11-07T04:29:25.000Z","dependencies_parsed_at":null,"dependency_job_id":"57eb1e4b-19fa-4130-8e8e-b222b70808df","html_url":"https://github.com/dssgabriel/arm-sve-benchmarks","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/dssgabriel/arm-sve-benchmarks","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dssgabriel%2Farm-sve-benchmarks","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dssgabriel%2Farm-sve-benchmarks/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dssgabriel%2Farm-sve-benchmarks/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dssgabriel%2Farm-sve-benchmarks/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dssgabriel","download_url":"https://codeload.github.com/dssgabriel/arm-sve-benchmarks/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dssgabriel%2Farm-sve-benchmarks/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32483406,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-30T13:12:12.517Z","status":"online","status_checked_at":"2026-05-01T02:00:05.856Z","response_time":64,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["arm64","assembly","benchmarks","compiler","simd","sve"],"created_at":"2025-06-08T07:31:20.012Z","updated_at":"2026-05-01T02:32:06.846Z","avatar_url":"https://github.com/dssgabriel.png","language":"C","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ARM SVE Benchmarks\n\nThis repository gathers some small kernels to benchmark the performance of hand-written SVE code compared to compiler-generated one.\nImplemented kernels are:\n- Initialization (store);\n- Copy (load, store);\n- Reduction (load, add);\n- Dot product (load, load, mul, add);\n- DAXPY (load, load, load, mul, add, store);\n- Vector sum (load, load, load, add, store);\n- Vector scale (load, load, load, mul, store).\n\n## Usage\n*Note:* the provided Makefile uses the `armclang` compiler, however both `clang` and `gcc` have been tested and can be used as well.\nKeep in mind that the architecture specific flags (`AFLAGS`) might need to be changed depending on the chosen compiler.\nSee the [comparison between compiler flags across architectures](https://community.arm.com/arm-community-blogs/b/tools-software-ides-blog/posts/compiler-flags-across-architectures-march-mtune-and-mcpu) for more information.\n\nTo build the benchmarks:\n```\nmake build\n```\n\nYou can then execute one of the benchmarks presented above and specify the vectors' size (in bytes), number of iterations and error tolerance through the provided option flags.\n\nExample (reduction benchmark with 64KiB vectors, 100k iterations and an error tolerance of $10^{-14}$:\n```\ntarget/arm_bench -k reduc -s 8192 -i 100000 -e 1e-14\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdssgabriel%2Farm-sve-benchmarks","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdssgabriel%2Farm-sve-benchmarks","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdssgabriel%2Farm-sve-benchmarks/lists"}