{"id":13709472,"url":"https://github.com/Messi-Q/Smart-Contract-Dataset","last_synced_at":"2025-05-06T16:31:45.864Z","repository":{"id":49254552,"uuid":"360530103","full_name":"Messi-Q/Smart-Contract-Dataset","owner":"Messi-Q","description":"Datasets for evaluating smart contract security analysis tools ( continuously updating... )","archived":false,"fork":false,"pushed_at":"2023-08-21T09:51:08.000Z","size":2362,"stargazers_count":138,"open_issues_count":9,"forks_count":26,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-11-13T19:40:26.435Z","etag":null,"topics":["dataset","ethereum","smart-contracts","vunerability"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Messi-Q.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-04-22T13:32:12.000Z","updated_at":"2024-11-13T05:59:08.000Z","dependencies_parsed_at":"2024-11-13T19:33:42.105Z","dependency_job_id":"012e92ee-2020-41f1-b20d-f0e00cf07f0e","html_url":"https://github.com/Messi-Q/Smart-Contract-Dataset","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Messi-Q%2FSmart-Contract-Dataset","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Messi-Q%2FSmart-Contract-Dataset/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Messi-Q%2FSmart-Contract-Dataset/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Messi-Q%2FSmart-Contract-Dataset/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Messi-Q","download_url":"https://codeload.github.com/Messi-Q/Smart-Contract-Dataset/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252721080,"owners_count":21793748,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","ethereum","smart-contracts","vunerability"],"created_at":"2024-08-02T23:00:39.873Z","updated_at":"2025-05-06T16:31:45.858Z","avatar_url":"https://github.com/Messi-Q.png","language":null,"funding_links":[],"categories":["With Vulnerabilities or Bugs"],"sub_categories":[],"readme":"# Smart Contract Dataset\n\n**This repository aims at releasing smart contract datasets used in our works, to facilitate community research. Also, we present instructions on how to label a certain type of vulnerability and show the detailed pattern designs of investigated vulnerabilities.**\n\n\n## Resource 1\n\n- This dataset consists of over 40K real world Ethereum smart contracts. \n- Download this resource at [Ethereum_smart_contract](https://drive.google.com/file/d/1yFJSCiUuoiSx4uWYNcCESUvsEs5DOGM9/view?usp=sharing). \n\n- Please cite one of the papers if you want to use the dataset in your paper.\n```\n@inproceedings{zhuangsmart,\n  title={Smart Contract Vulnerability Detection using Graph Neural Network},\n  author={Zhuang, Yuan and Liu, Zhenguang and Qian, Peng and Liu, Qi and Wang, Xiang and He, Qinming},\n  booktitle={IJCAI},\n  pages={3283--3290},\n  year={2020}\n}\n\n@inproceedings{liu2021smart,\n  title={Smart Contract Vulnerability Detection: From Pure Neural Network to Interpretable Graph Feature and Expert Pattern Fusion},\n  author={Liu, Zhenguang and Qian, Peng and Wang, Xiang and Zhu, Lei and He, Qinming and Ji, Shouling},\n   booktitle={IJCAI},\n  pages={2751--2759},\n  year={2021}\n}\n\n@article{liu2021combining,\n  title={Combining Graph Neural Networks with Expert Knowledge for Smart Contract Vulnerability Detection},\n  author={Liu, Zhenguang and Qian, Peng and Wang, Xiaoyang and Zhuang, Yuan and Qiu, Lin and Wang, Xun},\n  journal={IEEE Transactions on Knowledge and Data Engineering},\n  year={2021},\n  publisher={IEEE}\n}\n``` \n\n## Resource 2\n- This dataset concerns four types of vulnerabilities (i.e., reentrancy, timestamp dependency, integer overflow, dangerous delegatecall), where we give the preprocessing method. \n- Check [instructions](https://github.com/Messi-Q/Smart-Contract-Dataset/tree/master/instructions) for how to label these vulnerabilities.\n- Download this resource at [Dataset_preprocessing](https://drive.google.com/file/d/1UhHHevE9iDmvSB_k_lhyI58KAj7hnB1o/view?usp=share_link). \n\nPlease cite our paper if you want to use the dataset in your paper.\n```\n@inproceedings{10.1145/3543507.3583367,\nauthor = {Qian, Peng and Liu, Zhenguang and Yin, Yifang and He, Qinming},\ntitle = {Cross-Modality Mutual Learning for Enhancing Smart Contract Vulnerability Detection on Bytecode},\nyear = {2023},\nisbn = {9781450394161},\npublisher = {Association for Computing Machinery},\naddress = {New York, NY, USA},\nbooktitle = {Proceedings of the ACM Web Conference 2023},\npages = {2220–2229},\nnumpages = {10},\nlocation = {Austin, TX, USA},\nseries = {WWW '23}\n}\n```\n\n\n## Resource 3\n- This dataset contains over 12K Ethereum smart contracts (where inherited contracts are also included) and concerns eight types of vulnerabilities. \n- Check the [pattern](https://github.com/Messi-Q/Smart-Contract-Dataset/tree/master/patterns) design for more details.\n- Download this resource at [Dataset](https://drive.google.com/file/d/1iU2J-BIstCa3ooVhXu-GljOBzWi9gVrG/view?usp=share_link). \n\n- Please cite our paper if you want to use the dataset in your paper.\n```\n@article{liu2023rethinking,\n  title={Rethinking Smart Contract Fuzzing: Fuzzing With Invocation Ordering and Important Branch Revisiting},\n  author={Liu, Zhenguang and Qian, Peng and Yang, Jiaxu and Liu, Lingfeng and Xu, Xiaojun and He, Qinming and Zhang, Xiaosong},\n  journal={arXiv preprint arXiv:2301.03943},\n  year={2023}\n}\n```\n\n\n## Resource 4\n- Here, we present three datasets to evaluate the performance of smart contract analyzers.\n\n- The first dataset D1 (released by [1]) is used to measure the branch coverage of fuzzers. \nThe second dataset D2 (released by [2, 3, 4]) aims to evaluate the performance of vulnerability detection tools, \nwhile the purpose of the third dataset D3 (released by [5]) is to validate the effectiveness of our system in handling real-world contracts that involve large-scale transactions.\n\n- Download this resource at [Dataset](https://drive.google.com/file/d/1XFp3tZSMkWSkeLSHe_vrQjGZYZ3LzB2s/view?usp=sharing).\n\n- Please cite our paper if you want to use the dataset in your paper.\n```\nComing soon.\n```\n\n\n\n## Reference \n[1] Christof Ferreira Torres, et al. CONFUZZIUS: A Data Dependency-Aware Hybrid Fuzzer for Smart Contracts. EuroSP 2021.  \n\n[2] SmartBug: https://github.com/smartbugs/smartbugs-wild\n\n[3] VeriSmart: https://github.com/kupl/VeriSmart-benchmarks\n\n[4] SWC registry:  https://swcregistry.io\n\n[5] Jaeseung Choi, et al. SMARTIAN: Enhancing Smart Contract Fuzzing with Static and Dynamic Data-Flow Analyses. ASE 2021.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FMessi-Q%2FSmart-Contract-Dataset","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FMessi-Q%2FSmart-Contract-Dataset","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FMessi-Q%2FSmart-Contract-Dataset/lists"}