{"id":22793475,"url":"https://github.com/ctuavastlab/datasets","last_synced_at":"2026-01-11T02:11:26.225Z","repository":{"id":130441828,"uuid":"419681577","full_name":"CTUAvastLab/datasets","owner":"CTUAvastLab","description":"Datasets, currently containing: Mutagenesis","archived":false,"fork":false,"pushed_at":"2021-10-21T11:22:56.000Z","size":51,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-02-05T19:12:28.583Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc0-1.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/CTUAvastLab.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-10-21T10:41:58.000Z","updated_at":"2021-10-21T11:22:58.000Z","dependencies_parsed_at":"2023-04-13T21:03:09.891Z","dependency_job_id":null,"html_url":"https://github.com/CTUAvastLab/datasets","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CTUAvastLab%2Fdatasets","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CTUAvastLab%2Fdatasets/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CTUAvastLab%2Fdatasets/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CTUAvastLab%2Fdatasets/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/CTUAvastLab","download_url":"https://codeload.github.com/CTUAvastLab/datasets/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246351399,"owners_count":20763293,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-12T03:20:04.312Z","updated_at":"2026-01-11T02:11:26.178Z","avatar_url":"https://github.com/CTUAvastLab.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Datasets\n\nThis repository contains various datasets used in CTUAvastLab. \nCurrently it contains only mutagenesis dataset.\n\n## Mutagenesis dataset:\n\n### Summary\nThe dataset comprises of 230 molecules trialed for mutagenicity on Salmonella typhimurium. A subset of 188 molecules\n is learnable using linear regression. This subset was later termed the ”regression friendly” dataset. The remaining\n subset of 42 molecules is named the ”regression unfriendly” dataset. \n(taken from [relational.fit.cvut.cz/](https://relational.fit.cvut.cz/dataset/Mutagenesis)).\n\nCurrently, this repository contains only `Mutagenesis_188`.\n\n### Website\n[relational.fit.cvut.cz/](https://relational.fit.cvut.cz/dataset/Mutagenesis) where the original data is hosted as \nSQL database.\n[Original source](http://www.cs.ox.ac.uk/activities/machlearn/mutagenesis.html) \n\n### [License](LICENSE)\n\nsee separate file.\n\n### Data structure\n\n[mutagenesis/data.json](mutagenesis/data.json) contains data from dataset Mutagenesis_188, as list of 188 strucures, \neach representing one molecule, as a json.\n\n[mutagenesis/meta.json](mutagenesis/meta.json) contains metadata about the dataset, as a json.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fctuavastlab%2Fdatasets","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fctuavastlab%2Fdatasets","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fctuavastlab%2Fdatasets/lists"}