{"id":28385516,"url":"https://github.com/kiprotect/data-privacy-for-data-scientists","last_synced_at":"2025-06-26T06:32:36.504Z","repository":{"id":53664562,"uuid":"141404339","full_name":"kiprotect/data-privacy-for-data-scientists","owner":"kiprotect","description":"A workshop on data privacy methods for data scientists. ","archived":false,"fork":false,"pushed_at":"2022-08-23T17:23:54.000Z","size":7448,"stargazers_count":70,"open_issues_count":3,"forks_count":19,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-05-30T12:20:10.717Z","etag":null,"topics":["anonymization","data-privacy","data-science","education","jupyter-notebooks","tutorial"],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kiprotect.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-07-18T08:23:30.000Z","updated_at":"2025-03-22T16:31:10.000Z","dependencies_parsed_at":"2022-09-04T22:00:49.738Z","dependency_job_id":null,"html_url":"https://github.com/kiprotect/data-privacy-for-data-scientists","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/kiprotect/data-privacy-for-data-scientists","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kiprotect%2Fdata-privacy-for-data-scientists","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kiprotect%2Fdata-privacy-for-data-scientists/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kiprotect%2Fdata-privacy-for-data-scientists/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kiprotect%2Fdata-privacy-for-data-scientists/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kiprotect","download_url":"https://codeload.github.com/kiprotect/data-privacy-for-data-scientists/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kiprotect%2Fdata-privacy-for-data-scientists/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":262014898,"owners_count":23245224,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["anonymization","data-privacy","data-science","education","jupyter-notebooks","tutorial"],"created_at":"2025-05-30T10:40:25.994Z","updated_at":"2025-06-26T06:32:36.487Z","avatar_url":"https://github.com/kiprotect.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Data Privacy for Data Scientists\n\nA workshop on data privacy methods for data scientists. \n\nThis workshop will be presented as part of [EuroPython 2018](https://ep2018.europython.eu/conference/talks/privacy-for-data-scientists). \n\n### Motivation\n\nAs data and information security become core components of managing user data, data scientists are keen to expand their knowledge and skills relating to data privacy and security basics. As of May 2018, the European General Data Protection Regulation affects how European residents can access and grant consent to use their data. As European data scientists, we now have an obligation as well as distinct motivation, to practice data science with attention to data privacy.\n\nIn this workshop, we will introduce some of the basics in terms of defining privacy within the realm of data collection, modeling and machine learning. A focus on practical knowledge and code, we will cover how one can implement some of these algorithms with Python. Students will be presented with these theories along with recent research on privacy-preserving models, so they can leave with a better understanding of how to apply privacy principles to data science in their work and study.\n\n### Installation\n\nPlease utilize the included `requirements.txt` to install your requirements using `pip` (you can also do so in `conda`. The notebooks have *only* been tested with Python 3. 🙌🏻\n\nWe recommend using [virtual environments](https://packaging.python.org/guides/installing-using-pip-and-virtualenv/) or [conda environments](https://conda.io/docs/user-guide/tasks/manage-environments.html). \n\n### Outline\n\nAgenda\n--------\n\n- Introduction and Motivation\n- Pseudonymization\n- K-Anonymity\n- Differential Privacy\n- Case Study\n- Wrap-Up and Q\u0026A\n\n### Recommended Reading\n\nEach notebook has its own section of recommended reading. We may update this README with additional reading of interest on this topic.\n\n### Questions?\n\nQuestions about getting set up or the content covered in the workshop? Feel free to reach out via email at: info /at/ kiprotect (d o t) com\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkiprotect%2Fdata-privacy-for-data-scientists","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkiprotect%2Fdata-privacy-for-data-scientists","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkiprotect%2Fdata-privacy-for-data-scientists/lists"}