Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/datasciencecampus/synthetic-data
Repo on generating synthetic data using GAN
https://github.com/datasciencecampus/synthetic-data
dsc-projects
Last synced: 2 months ago
JSON representation
Repo on generating synthetic data using GAN
- Host: GitHub
- URL: https://github.com/datasciencecampus/synthetic-data
- Owner: datasciencecampus
- Created: 2018-05-03T14:01:04.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-04-18T15:02:27.000Z (over 5 years ago)
- Last Synced: 2024-08-03T17:14:55.539Z (6 months ago)
- Topics: dsc-projects
- Language: Jupyter Notebook
- Homepage:
- Size: 26 MB
- Stars: 6
- Watchers: 2
- Forks: 3
- Open Issues: 34
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-data-synthesis - Repo on generating synthetic data using GAN - Repo on generating synthetic data using GAN (Data-driven methods / Tabular)
README
# Synthetic data
Repo on generating synthetic dataThe project involves the generation of synthetic data using machine learning in order to replace real data for the purpose of data processing. This is particularly useful in cases where the real data is sensitive (e.g. microdata, medical records, defence data). Additionally, the methods developed as part of the project can be used for imputation. Regarding data sources, publicly available data (open data) will be used initially. Once the developed methods have matured, they will be applied to ONS data such as Trade-ITIS, LFS and Census data.
The main machine learning methods investigated by our team for the generation of synthetic data are generative adversarial networks (GANs), variational autoencoders (VAEs) and auto-regressive models.
The Synthetic data project will result in a safer way to share data in cases where the real data is sensitive. Additionally, it will make sharing data between the research communities and ONS easier and faster. Furthermore, the project is linked to several current ONS Data Science projects (Trade, Housing, etc.).