{"id":15023949,"url":"https://github.com/sap/json-data-and-query-generator","last_synced_at":"2025-10-19T18:31:02.175Z","repository":{"id":182027092,"uuid":"665437684","full_name":"SAP/json-data-and-query-generator","owner":"SAP","description":"A generator of JSON data and queries, e.g., for benchmarking JSON document stores.","archived":false,"fork":false,"pushed_at":"2024-09-09T19:37:45.000Z","size":57,"stargazers_count":5,"open_issues_count":3,"forks_count":3,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-30T00:51:11.309Z","etag":null,"topics":["benchmark","database","json","sample-code"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SAP.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-12T07:50:40.000Z","updated_at":"2025-01-09T21:42:43.000Z","dependencies_parsed_at":"2024-09-09T23:17:59.329Z","dependency_job_id":"cabba1e3-de59-44b7-99a9-0280c5fd5fa9","html_url":"https://github.com/SAP/json-data-and-query-generator","commit_stats":null,"previous_names":["sap/json-data-and-query-generator"],"tags_count":0,"template":false,"template_full_name":"SAP/repository-template","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAP%2Fjson-data-and-query-generator","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAP%2Fjson-data-and-query-generator/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAP%2Fjson-data-and-query-generator/releases","manife
sts_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAP%2Fjson-data-and-query-generator/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SAP","download_url":"https://codeload.github.com/SAP/json-data-and-query-generator/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":237191501,"owners_count":19269719,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["benchmark","database","json","sample-code"],"created_at":"2024-09-24T19:59:38.445Z","updated_at":"2025-10-19T18:30:56.857Z","avatar_url":"https://github.com/SAP.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## JSON Data and Query Generator\n\nThe growing popularity of JSON as exchange and storage format in business and analytical applications led to its rapid dissemination, thus making a timely storage and processing of JSON documents crucial for organizations. Consequently, specialized JSON document stores are ubiquitously used for diverse domain-specific workloads, while a JSON-specific benchmark is missing.\n\nIn this repository, we provide an example implementation of **DeepBench**, an extensible, scalable benchmark that addresses nested JSON data, as well as queries over JSON documents. DeepBench features configurable domain-independent (e. g., varying document sizes, concurrent users) and JSON-specific scale levels (e. 
g., object, array nesting).\n\nThe package `json_data_and_query_generator` contains tools to generate random `json` data and corresponding `SQL` queries.\nEach of these tools takes as input a configuration in the form of a `json` document describing the fixed structure of the data and the characteristics of the generated queries.\n\n## Setup\n\nInstall prerequisites:\n\n```\n  pip install .\n```\n\nTo execute data and query generation based on the example scenario in `examples` (default):\n\n```\n  python -m json_data_and_query_generator --num-proc 5\n```\n\nThis runs the generation with five processes.\n\nTo run other scenarios, specify the paths to `schema.txt`, `data.txt`, and `config.json` as described in `pipeline.py --help`.\n\n## Support, Feedback, Contributing\n\nThis project is open to feature requests/suggestions, bug reports, etc. via [GitHub issues](https://github.com/SAP/json-data-and-query-generator/issues). Contribution and feedback are encouraged and always welcome. For more information about how to contribute, the project structure, as well as additional contribution information, see our [Contribution Guidelines](CONTRIBUTING.md).\n\n## Code of Conduct\n\nWe as members, contributors, and leaders pledge to make participation in our community a harassment-free experience for everyone. By participating in this project, you agree to abide by its [Code of Conduct](https://github.com/SAP/.github/blob/main/CODE_OF_CONDUCT.md) at all times.\n\n## Licensing\n\nCopyright 2022-2023 SAP SE or an SAP affiliate company and \u003cyour-project\u003e contributors. Please see our [LICENSE](LICENSE) for copyright and license information. Detailed information including third-party components and their licensing/copyright information is available [via the REUSE tool](https://api.reuse.software/info/github.com/SAP/json-data-and-query-generator).\n\n## Citation\n\nFor more documentation, read the following documents. 
If you find this work useful for your research, please cite:\n\n```\n@inproceedings{DBLP:conf/dbtest-ws/Belloni0SR22,\n  author       = {Stefano Belloni and\n                  Daniel Ritter and\n                  Marco Schr{\\\"{o}}der and\n                  Nils R{\\\"{o}}rup},\n  editor       = {Manuel Rigger and\n                  Pinar T{\\\"{o}}z{\\\"{u}}n},\n  title        = {DeepBench: Benchmarking {JSON} Document Stores},\n  booktitle    = {DBTest@SIGMOD '22: Proceedings of the 9th International Workshop of\n                  Testing Database Systems, Philadelphia, PA, USA, 17 June 2022},\n  pages        = {1--9},\n  publisher    = {{ACM}},\n  year         = {2022},\n  url          = {https://doi.org/10.1145/3531348.3532176},\n  doi          = {10.1145/3531348.3532176},\n  timestamp    = {Sun, 02 Oct 2022 15:58:56 +0200},\n  biburl       = {https://dblp.org/rec/conf/dbtest-ws/Belloni0SR22.bib},\n  bibsource    = {dblp computer science bibliography, https://dblp.org}\n}\n```\nand / or the usage in systems:\n\n```\n@article{DBLP:journals/dbsk/BelloniR22,\n  author       = {Stefano Belloni and\n                  Daniel Ritter},\n  title        = {Benchmarking {JSON} Document Stores in Practice},\n  journal      = {Datenbank-Spektrum},\n  volume       = {22},\n  number       = {3},\n  pages        = {217--226},\n  year         = {2022},\n  url          = {https://doi.org/10.1007/s13222-022-00425-y},\n  doi          = {10.1007/s13222-022-00425-y},\n  timestamp    = {Sat, 25 Feb 2023 21:35:08 +0100},\n  biburl       = {https://dblp.org/rec/journals/dbsk/BelloniR22.bib},\n  bibsource    = {dblp computer science bibliography, 
https://dblp.org}\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsap%2Fjson-data-and-query-generator","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsap%2Fjson-data-and-query-generator","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsap%2Fjson-data-and-query-generator/lists"}