{"id":20696460,"url":"https://github.com/andreazoccatelli/tabular_data_augmentation_continuous","last_synced_at":"2026-04-21T00:02:55.173Z","repository":{"id":177824535,"uuid":"660957857","full_name":"AndreaZoccatelli/Tabular_data_augmentation_continuous","owner":"AndreaZoccatelli","description":"This repository contains the scripts used to write my master degree thesis project: \"Augmentation of tabular data with continuous features for binary imbalanced classification problems\"","archived":false,"fork":false,"pushed_at":"2023-07-09T16:24:17.000Z","size":685,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-01-17T18:25:52.295Z","etag":null,"topics":["cgan","copula","data-augmentation","imbalanced-classification","imbalanced-data","imbalanced-learning"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AndreaZoccatelli.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-01T10:39:21.000Z","updated_at":"2023-07-09T16:30:53.000Z","dependencies_parsed_at":null,"dependency_job_id":"5c26e8c0-ff70-4080-951b-234446d10a5c","html_url":"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous","commit_stats":null,"previous_names":["andreazoccatelli/tabular_data_augmentation_continuous"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AndreaZoccatelli%2FTabular_data_augmentation_continuous","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AndreaZoccatelli%2FTabular_data_augmentation_continuous/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AndreaZoccatelli%2FTabular_data_augmentation_continuous/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AndreaZoccatelli%2FTabular_data_augmentation_continuous/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AndreaZoccatelli","download_url":"https://codeload.github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":242961754,"owners_count":20213315,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cgan","copula","data-augmentation","imbalanced-classification","imbalanced-data","imbalanced-learning"],"created_at":"2024-11-17T00:14:03.758Z","updated_at":"2025-12-12T11:15:28.552Z","avatar_url":"https://github.com/AndreaZoccatelli.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Augmentation of tabular data with continuous features for binary imbalanced classification problems\n\nThe aim of this project is to augment the observations that belong to the minority class using copula sampling and conditional GANs in order to improve the performance of the classifiers for binary imbalanced classification problems.\n\n- For the augmentation based on copulas, my library, \u003ca href=\"https://github.com/AndreaZoccatelli/GenCopula\" target=\"_blank\"\u003eGenCopula\u003c/a\u003e has been used.\n``` r\nlibrary(devtools)\ninstall_github(\"AndreaZoccatelli/GenCopula\")\n```\n- The library used for the augmentation based on cGAN is \u003ca href=\"https://github.com/sdv-dev/CTGAN\" target=\"_blank\"\u003eCTGAN\u003c/a\u003e\n- To re-create the datasets used in the project run \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/Create_data.ipynb\" target=\"_blank\"\u003eCreate_data.ipynb\u003c/a\u003e\n\n- These notebooks report the results obtained on the different dataset:\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/BestCase.md\" target=\"_blank\"\u003eBest case\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/20_30Safe.md\" target=\"_blank\"\u003e20-30% Safe\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/less20Safe.md\" target=\"_blank\"\u003eLess 20% Safe\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/10perc_minority.md\" target=\"_blank\"\u003e10% Minority\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/5perc_minority.md\" target=\"_blank\"\u003e5% Minority\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/4features.md\" target=\"_blank\"\u003e4 Features\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/8features.md\" target=\"_blank\"\u003e8 Features\u003c/a\u003e\n    - \u003ca href=\"https://github.com/AndreaZoccatelli/Tabular_data_augmentation_continuous/blob/main/Default.md\" target=\"_blank\"\u003eDefault\u003c/a\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandreazoccatelli%2Ftabular_data_augmentation_continuous","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fandreazoccatelli%2Ftabular_data_augmentation_continuous","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandreazoccatelli%2Ftabular_data_augmentation_continuous/lists"}