Projects in Awesome Lists by sodascience
A curated list of projects in awesome lists by sodascience .
https://github.com/sodascience/metasyn
Transparent and privacy-friendly synthetic data generation
metadata open-data privacy synthetic-data
Last synced: 07 Apr 2025
https://github.com/sodascience/osmenrich
Enrich sf data with geographic features from OpenStreetMaps.
geospatial geospatial-data odissei osm sf utrecht-university
Last synced: 12 Apr 2025
https://github.com/sodascience/artscraper
Python package for downloading art and metadata of WikiArt and Google Arts & Culture
art download google-arts-and-culture odissei wikiart
Last synced: 12 Apr 2025
https://github.com/sodascience/social_science_inferences_with_llms
Addressing LLM-related measurement error in social science modeling research.
data-collection inference large-language-models llms
Last synced: 30 Jan 2026
https://github.com/sodascience/map-explorer
Map Explorer is a Vue.js web application for rendering GeoJSON maps with dynamic region coloring based on external data.
choropleth data-analysis data-visualization geojson
Last synced: 10 Feb 2026
https://github.com/sodascience/osmenrich_docker
Setup local OSRM and Overpass API endpoints using Docker
docker osrm overpass-api soda-science-team utrecht-university
Last synced: 13 Aug 2025
https://github.com/sodascience/regexmodel
Using regex to learn the structure of strings
graph-model python regex strings
Last synced: 12 Apr 2025
https://github.com/sodascience/workshop_ossc
This repository contains the code and presentation of the workshop on parallel computing & cluster computing using R in collaboration with SURF.
agent-based-modeling cluster-computing high-performance-computing odissei parallel-computing r simulation surf
Last synced: 30 Jul 2025
https://github.com/sodascience/port-whatsapp-datadonation
A project for extracting information from WhatsApp data download packages using an online platform(PORT)
data-donation data-download-package port privacy-protection whatsapp
Last synced: 18 May 2026
https://github.com/sodascience/kansenkaart_preprocessing
The processing pipeline for the Dutch Opportunity atlas
data-processing mapping opportunity
Last synced: 09 Sep 2025
https://github.com/sodascience/workshop_causal_impact_assessment
Repository containing materials for the ODISSEI workshop on causal impact assessment
causal-inference policy-evaluation workshop
Last synced: 04 Jan 2026
https://github.com/sodascience/fairprojects
Dashboard for FAIRness of organisation's projects
dashboard fair project-management
Last synced: 04 Mar 2026
https://github.com/sodascience/metasyn-disclosure-control
Plugin for metasyn that prevents data from leaking.
disclosure-control metasyn plugin privacy-protection synthetic-data
Last synced: 03 Oct 2025
https://github.com/sodascience/kansenkaart_analysis
Analysis code for the kansenkaart project
Last synced: 12 Apr 2025
https://github.com/sodascience/netcbs
Package to efficiently create network measures using CBS networks (POPNET) in the RA
Last synced: 11 Jun 2025
https://github.com/sodascience/generative_metadata_format
Generative metadata format for generating synthetic tabular datasets
standard statistical-metadata synthetic-data
Last synced: 20 Mar 2025
https://github.com/sodascience/ddi-synth
Synthesize dataverse data based on DDI metadata
dataverse metadata shiny-apps synthetic-data
Last synced: 28 Oct 2025
https://github.com/sodascience/synthpop.extract
Repository showing how to safely generate synthetic data based on a synthpop model
Last synced: 20 Mar 2025
https://github.com/sodascience/workshop_llm_data_collection
This repository contains the code and slides for our workshop on data collection and inference with Large Language Models
data-collection inference llm python r workshop workshop-materials
Last synced: 16 Apr 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/sodascience/nprz_synthetic_controls
Code Archive for Synthetic Control Analysis of the NPRZ Education Intervention
causal-inference education policy-interventions synthetic-control-method
Last synced: 20 Mar 2025
https://github.com/sodascience/metasyn-distribution-template
Template for adding distribution plugins for the metasyn library
Last synced: 20 Mar 2025
https://github.com/sodascience/bestworst_analysis
Bayesian analysis of best-worst scaling data
bayesian-inference best-worst-scaling ranking-data word-associations
Last synced: 20 Mar 2025
https://github.com/sodascience/presentation-osmenrich-sig
Presentation for the UU special interest group open statistical software
data-science geospatial software
Last synced: 20 Mar 2025
https://github.com/sodascience/python-package-boilerplate
Boilerplate with the recommended structure for a python package. It can be used as a template for your own project
Last synced: 16 Jun 2026
https://github.com/sodascience/website
The SoDa website
communications online-presence webpage website
Last synced: 20 Mar 2025
https://github.com/sodascience/remode
Finding modes in discrete distributions
modality mode-detection ordinal-data
Last synced: 20 Mar 2025
https://github.com/sodascience/workshop_cbs_microdata_computing
Presentation and code for CBS microdata meeting
cbs microdata national-statistics odissei r supercomputing
Last synced: 20 Mar 2025
https://github.com/sodascience/metasyn-privacy-template
Template for creating privacy plugins for the metasyn library
Last synced: 16 May 2026
https://github.com/sodascience/cbs_python
An example python environment to use in CBS
cbs conda environment python statistics-netherlands yml
Last synced: 13 Jul 2025
https://github.com/sodascience/workshop_syntheticdata_osf2022
Files for the synthetic data presentation at the Open Science Festival 2022
open-science privacy-protection synthetic-data-generation
Last synced: 04 Jan 2026
https://github.com/sodascience/disease_database_viewer
Viewer for historical disease database
disease-spread history mapping shiny-apps
Last synced: 01 Jul 2025
https://github.com/sodascience/disease_database
Historical disease database (19th-20th century) for municipalities in the Netherlands
demography geospatial-data health history
Last synced: 14 Jul 2025
https://github.com/sodascience/schools_synth_test
Test code for disaggregated synthetic control with simulated data
causal-inference simulation statistics synthetic-control
Last synced: 20 Mar 2025
https://github.com/sodascience/synthetic_youth_pilot
Synthetic data pilot for YOUth study questionnaires, using metasyn
questionnaire-survey synthetic-data youth-data
Last synced: 14 Oct 2025
https://github.com/sodascience/empathy-viz
An application to be used in a clinical setting to score dynamics in empathy
data-visualization empathy r shiny-apps survey
Last synced: 27 Oct 2025
https://github.com/sodascience/workshop_efficient_microdata
Workshop for efficiently doing projects with CBS microdata.
computing microdata project-organisation statistics
Last synced: 19 Mar 2026
https://github.com/sodascience/diaper_project
Population-based study of poverty and perinatal maternal and child health in the Netherlands
mediation-analysis perinatal population-based-study
Last synced: 02 Mar 2026
https://github.com/sodascience/night_globe
Correcting inferences in volunteer data using geospatial covariates
geospatial-data inference statistics volunteer-data
Last synced: 28 Apr 2026
https://github.com/sodascience/lichess_db
Lichess game header data as parquet files
database lichess lichess-database
Last synced: 02 May 2026
https://github.com/sodascience/biodiversityasset
LLM-powered analysis of biodiversity-related investment activities in financial reports
biodiversity economics llm nlp
Last synced: 09 Jun 2026
https://github.com/sodascience/research-project-boilerplate
Boilerplate project with the recommended structure for a data processing research project. It can be used as a template for your own project
Last synced: 16 Jun 2026
https://github.com/sodascience/sicss_llm_workshop
Materials and notebooks for the SICSS LLM workshop.
Last synced: 16 Jun 2026
https://github.com/sodascience/word_norms_survey
Survey to derive word norms in Dutch for various associations
dutch-language survey word-associations word-embeddings
Last synced: 20 Mar 2025