{"id":19943221,"url":"https://github.com/root-project/opendata-benchmarks","last_synced_at":"2025-09-22T21:57:47.703Z","repository":{"id":43379151,"uuid":"182964833","full_name":"root-project/opendata-benchmarks","owner":"root-project","description":"Example repository showing benchmarks with open data","archived":false,"fork":false,"pushed_at":"2023-02-06T21:11:21.000Z","size":239,"stargazers_count":3,"open_issues_count":0,"forks_count":7,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-05-03T15:37:58.919Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/root-project.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2019-04-23T08:00:58.000Z","updated_at":"2022-09-11T16:45:52.000Z","dependencies_parsed_at":"2025-05-03T15:43:47.380Z","dependency_job_id":null,"html_url":"https://github.com/root-project/opendata-benchmarks","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/root-project/opendata-benchmarks","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/root-project%2Fopendata-benchmarks","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/root-project%2Fopendata-benchmarks/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/root-project%2Fopendata-benchmarks/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/root-project%2Fopendata-benchmarks/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/root-project","download_url":"https://codeload.github.com/root-project/opendata-benchmarks/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/root-project%2Fopendata-benchmarks/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":276481067,"owners_count":25649988,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-22T02:00:08.972Z","response_time":79,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-13T00:15:52.629Z","updated_at":"2025-09-22T21:57:47.674Z","avatar_url":"https://github.com/root-project.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Sofware benchmarks with open data\n\n**NOTE:** This repo contains two versions of the RDataFrame implementation. One version includes just-in-time compilation and the other is fully compiled and therefore optimized for performance. Also, running the files as C++ macro, via the ROOT interpreter cling, will result in a reduced runtime performance due to missing optimizations. To compile any of the benchmarks named `*_compiled.cxx`, source ROOT and run following line:\n\n```\ng++ -O3 -o task task.cxx $(root-config --cflags --libs)\n```\n\nTo run the macros (named `*_jitted.C`) efficiently with optimizations, please use the following command:\n\n```\nroot -l -b -q macro.C+\n```\n\nThe implementations of the benchmarks tracked by the ROOT team can be found [in root-project/rootbench](https://github.com/root-project/rootbench/blob/master/root/tree/dataframe/RDataFrameOpenDataBenchmarks.cxx).\n\nJust an example what is possible with the existing open data.\n\nList of tasks:\n\n1. Plot the Missing ET in an event (loop over events)\n2. Plot the Jet pT of all jets in an event (loop over an array in an event)\n3. Plot the Jet pT with eta cuts on the jet (loop over an array that is filtered)\n4. Plot the missing ET of events with at least 2 jets above 40 GeV (loop over array in event, aggregate those results to filter at event level).\n5. Plot missing et for events that have an opposite-sign muon pair of mass 60-120 GeV [looping on combinations of objects in one collection, 4-vector algebra]\n6. Plot the pt of the tri-jet system with mass closest to 172.5 GeV, and the leading b-tag discriminator among the 3 jets in the triplet [requires looping on combination of objects in the same collection, and extracting properties of a combination other than the key used to sort them]\n7. Plot sum of the pt of all jets of pt \u003e 30 GeV that are not within DR 0.4 from a lepton of pt \u003e 10 GeV [requires looping on two separate collections]\n8. In events with \u003e=3 leptons and a same-flavour opposite-sign lepton pair, find the best same-flavour opposite-sign lepton pair (mass closest to 91.2 GeV), and plot the transverse mass of the missing energy and the leading other lepton [ a task whose formulation in an imperative language is easy, but whose translations to a functional query language may be less clear and/or possibly inefficient]\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Froot-project%2Fopendata-benchmarks","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Froot-project%2Fopendata-benchmarks","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Froot-project%2Fopendata-benchmarks/lists"}