{"id":21425294,"url":"https://github.com/D-score/childdevdata","last_synced_at":"2025-07-14T08:32:21.318Z","repository":{"id":56935061,"uuid":"353137452","full_name":"D-score/childdevdata","owner":"D-score","description":"Child Development Data","archived":false,"fork":false,"pushed_at":"2023-04-18T14:03:10.000Z","size":1683,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-07-13T10:10:56.263Z","etag":null,"topics":["child-development","d-score","dataset"],"latest_commit_sha":null,"homepage":"https://d-score.org/childdevdata","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc-by-4.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/D-score.png","metadata":{"files":{"readme":"README.Rmd","changelog":"NEWS.md","contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-03-30T20:48:32.000Z","updated_at":"2024-10-30T08:03:26.000Z","dependencies_parsed_at":"2024-11-22T21:40:29.017Z","dependency_job_id":null,"html_url":"https://github.com/D-score/childdevdata","commit_stats":{"total_commits":47,"total_committers":1,"mean_commits":47.0,"dds":0.0,"last_synced_commit":"3f255d50c619d95cd1cc133fdea83b43fe2e0c29"},"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/D-score/childdevdata","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/D-score%2Fchilddevdata","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/D-score%2Fchilddevdata/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/D-score%2Fchilddevdata/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/D-score%2Fchilddevdata/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/D-score","download_url":"https://codeload.github.com/D-score/childdevdata/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/D-score%2Fchilddevdata/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265123786,"owners_count":23714808,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["child-development","d-score","dataset"],"created_at":"2024-11-22T21:27:39.456Z","updated_at":"2025-07-14T08:32:16.471Z","avatar_url":"https://github.com/D-score.png","language":"R","funding_links":[],"categories":["Projects"],"sub_categories":["Research data"],"readme":"---\noutput: github_document\nalways_allow_html: yes\nbibliography: [references.bib]\nbiblio-style: apalike\nlink-citations: yes\n---\n\n\u003c!-- README.md is generated from README.Rmd. Please edit that file --\u003e\n\n```{r, include = FALSE}\nknitr::opts_chunk$set(\n  collapse = TRUE,\n  comment = \"#\u003e\",\n  fig.path = \"man/figures/README-\",\n  out.width = \"100%\"\n)\n```\n\n# childdevdata\n\n\u003c!-- badges: start --\u003e\n[![Lifecycle: stable](https://img.shields.io/badge/lifecycle-stable-brightgreen.svg)](https://lifecycle.r-lib.org/articles/stages.html#stable)\n[![CRAN status](https://www.r-pkg.org/badges/version/childdevdata)](https://CRAN.R-project.org/package=childdevdata)\n[![DOI](https://zenodo.org/badge/353137452.svg)](https://zenodo.org/badge/latestdoi/353137452)\n\u003c!-- badges: end --\u003e\n\nThe goal of `childdevdata` is to support innovation in child development. The package \n\n1. Makes anonymous microdata available to the research community;\n2. Adopts a simple naming schema for developmental milestones;\n3. Supports multiple measurement instruments;\n4. Eases joint analyses of the data.\n\nThe current version bundles milestone data from ten studies, containing 1,116,061 assessments \nmade on 10,831 unique children during 28,465 visits, covering 21 different instruments.\n\n## Installation\n\nYou can install the released version of childdevdata from [CRAN](https://CRAN.R-project.org) with\n\n```{r eval=FALSE}\ninstall.packages(\"childdevdata\")\n```\n\nYou can install the development version of `childdevdata` from [GitHub](https://github.com) with\n\n```{r eval=FALSE}\ninstall.packages(\"remotes\")\nremotes::install_github(\"d-score/childdevdata\")\n```\n\n## Example\n\nThe following example visualises how the proportion of toddlers that are able to walk increases with age.\n\n```{r example, fig.retina=2}\nlibrary(childdevdata)\nlibrary(ggplot2)\n\n# we use the Dutch SMOCC data\ndata \u003c- with(gcdg_nld_smocc, \n             data.frame(age = round(agedays/365.25, 4),\n                        walk = ddigmd068))\nggplot(na.omit(data), aes(age, walk)) +\n  geom_point(cex = 0.5) +\n  geom_smooth(method = \"gam\", formula = y ~ s(x, bs = \"cs\"), \n              se = FALSE, lwd = 0.5) +\n  theme_bw()\n```\n\n## Overview of available dataset and documentation\n\nThe package contains multiple datasets. Obtain the list of datasets by\n\n```{r datalist}\ndata(package = \"childdevdata\")$results[, \"Item\"]\n```\n\nThe documentation of the data can be found by typing into the console:\n\n```{r}\n?gcdg_col_lt42m\n```\n\nThe size of the data is \n\n```{r}\ndim(gcdg_col_lt42m)\n```\n\nThe first six rows and first nine columns are\n\n```{r}\nhead(gcdg_col_lt42m[, 1:9])\n```\n\nThe first seven columns are administrative and background variables. Column numbers eight and up hold the milestone scores. \n\n\n## Combining data\n\nConcatenating two or more data is straightforward using `dplyr`. The following code concatenates all publicly available GCDG datasets.\n\n```{r concatenate}\nlibrary(dplyr)\nalldata \u003c- bind_rows(gcdg_chl_1, gcdg_chn, gcdg_col_lt42m, gcdg_col_lt45m, gcdg_ecu, \n                     gcdg_jam_lbw, gcdg_jam_stunted, gcdg_mdg, gcdg_nld_smocc, gcdg_zaf)\ndim(alldata)\n```\n\nBoth the number of rows and the number of columns have increased. Milestones not appearing in a particular data obtain all missing (`NA`) scores.\n\nThe number of records per cohort by sex is\n\n```{r}\ntable(alldata$cohort, alldata$sex)\n```\n\n## Calculating D-score and DAZ\n\nThe [`dscore` package](https://d-score.org/dscore/) calculates the *D-score* [@vanbuuren2014] and the *D-score adjusted for age Z-score* (DAZ) for all cases:\n\n```{r}\nlibrary(dscore)\nalldata$age \u003c- round(alldata$agedays/365.25, 4)\nd \u003c- dscore(alldata)\nhead(d)\ndim(d)\n```\n\nWe visualise the D-score distribution by age per cohort as\n\n```{r fig.retina=2}\nalldata \u003c- bind_cols(alldata, d)\nggplot(alldata, aes(age, d, group = cohort)) +\n  geom_point(cex = 0.3) +\n  facet_wrap(~ cohort) +\n  ylab(\"D-score\") + xlab(\"Age (years)\") +\n  theme_bw()\n```\n\n## Why this package?\n\nWe all want our children to grow and prosper. While there is no shortage of apps and instruments to track child development, it is often unclear which data went into the construction of these tools. In order to improve measurement and norm setting of child development, we need child-level response data per milestone and age. However, no such public dataset seem to exist. The `childdevdata` package fills that void.\n\nThe package grew out of a project in which we collected milestone data from 16 cohorts. See @weber2019 and \u003chttp://d-score.org/dbook2/\u003e for results. Ten cohort owners graciously decided to make their data available for third parties. We are grateful to them.\n\n## How to use the data?\n\nTremendous effort has gone into the collection and harmonisation of the data. You can use the data in this package under the [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) license. Basically, this means that you may share and adapt the data, on the condition that you give appropriate credit and clearly indicate any changes you've made. See the license text for details.\n\nWe expect that you will properly cite the source data when you use the data in your own product or publication, as follows:\n\n- If you use one dataset, please cite the publication(s) given in the documentation of that dataset. \n- If you use two or more datasets, cite the publication(s) for each dataset *and* cite the `childdevdata` package.\n\nThe citation of the `childevdata` data package is\n\n```\n@software{stef_van_buuren_2021_4700229,\n  author       = {Stef van Buuren and\n                  Iris Eekhout and\n                  Marta Rubio Codina and\n                  Orazio Attanasio and\n                  Costas Meghir and\n                  Emla Fitzsimons and\n                  Sally Grantham-McGregor and\n                  Maria Caridad Araujo and\n                  Susan Walker and\n                  Susan Chang and\n                  Christine Powell and\n                  Ann Weber and\n                  Lia Fernald and\n                  Paul Verkerk and\n                  Linda Richter and\n                  Betsy Lozoff},\n  title        = {D-score/childdevdata: childdevdata 1.1.0},\n  month        = apr,\n  year         = 2021,\n  publisher    = {Zenodo},\n  version      = {v1.1.0},\n  doi          = {10.5281/zenodo.4700229},\n  url          = {https://doi.org/10.5281/zenodo.4700229}\n}\n```\n\n## Want to contribute?\n\nDo you have similar data and want to help others to advance the field? Please let us know. We hope that the `childdevdata` package may continue to grow into a valuable resource for developers and researchers worldwide.\n\n## Acknowledgement\n\nThis study was supported by the Bill \u0026 Melinda Gates Foundation. The contents are the sole responsibility of the authors and may not necessarily represent the official views of the Bill \u0026 Melinda Gates Foundation or other agencies that may have supported the primary data studies used in the present study. \n\n## References\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FD-score%2Fchilddevdata","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FD-score%2Fchilddevdata","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FD-score%2Fchilddevdata/lists"}