{"id":18365296,"url":"https://github.com/mit-lcp/sccm-datathon","last_synced_at":"2025-04-06T16:31:16.605Z","repository":{"id":40971054,"uuid":"238712384","full_name":"MIT-LCP/sccm-datathon","owner":"MIT-LCP","description":"Resources for the Society of Critical Care Medicine (SCCM) Datathon","archived":false,"fork":false,"pushed_at":"2024-09-24T13:21:58.000Z","size":258,"stargazers_count":23,"open_issues_count":2,"forks_count":9,"subscribers_count":8,"default_branch":"main","last_synced_at":"2025-03-22T03:23:32.938Z","etag":null,"topics":["critical-care","datathon","eicu","icu","mimic","sccm"],"latest_commit_sha":null,"homepage":"https://www.sccm.org/Education-Center/Annual-Congress/Program/Data-Science-Datathon","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/MIT-LCP.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-02-06T14:52:04.000Z","updated_at":"2025-01-28T19:02:57.000Z","dependencies_parsed_at":"2022-09-11T22:02:14.636Z","dependency_job_id":null,"html_url":"https://github.com/MIT-LCP/sccm-datathon","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MIT-LCP%2Fsccm-datathon","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MIT-LCP%2Fsccm-datathon/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MIT-LCP%2Fsccm-datathon/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MIT-LCP%2Fsccm-datathon/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/MIT-LCP",
"download_url":"https://codeload.github.com/MIT-LCP/sccm-datathon/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247512570,"owners_count":20950883,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["critical-care","datathon","eicu","icu","mimic","sccm"],"created_at":"2024-11-05T23:13:07.455Z","updated_at":"2025-04-06T16:31:16.300Z","avatar_url":"https://github.com/MIT-LCP.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Society of Critical Care Medicine (SCCM) Datathon (14-15 February)\n\nThis repository contains resources for the [Society of Critical Care Medicine (SCCM) Datathon 2020](https://www.sccm.org/Education-Center/Annual-Congress/Program/Data-Science-Datathon).\n\n## Contents\n\n1. Getting started\n2. Documentation\n3. Databases on BigQuery\n4. Analysing data with Google Colab\n5. Python notebooks that we prepared earlier\n6. An example in R\n7. Sample projects\n\n## 1. Getting started\n\nThe datasets are hosted on Google Cloud, which requires a Gmail account to manage permissions.\n\n1. Create a [Gmail account](https://www.google.com/gmail/about/), if you don't already have one. It will be used to manage your access to the resources.\n2. Give your gmail address to the session hosts.\n\n## 2. 
Documentation\n\nWe will be working with two critical care datasets during the event: [MIMIC-III](https://mimic.physionet.org/) and the [eICU Collaborative Research Database](https://eicu-crd.mit.edu/).\n\n- MIMIC-III Clinical Database: https://mimic.physionet.org/\n- eICU Collaborative Research Database: https://eicu-crd.mit.edu/\n\n## 3. Databases on BigQuery\n\nBigQuery is a database system that makes it easy to explore data with Structured Query Language (\"SQL\"). There are several datasets on BigQuery available for you to explore, including `eicu_crd` (the eICU Collaborative Research Database) and `mimiciii_clinical` (the MIMIC-III Clinical Database).\n\nYou will also find \"derived\" databases, which include tables derived from the original data using the code in the [eICU](https://github.com/MIT-LCP/eicu-code) and [MIMIC](https://github.com/MIT-LCP/mimic-code) code repositories. These are helpful if you are looking for something like a sepsis cohort or first-day vital signs.\n\n1. [Open BigQuery](https://console.cloud.google.com/bigquery?project=sccm-datathon).\n2. At the top of the console, select `sccm-datathon` as the project. This indicates the account used for billing.\n3. \"Pin\" a project to the Resources menu to view available datasets. In the Resources menu on the left, click \"Add data\", \"Pin a project\", then add the following project names: `physionet-data` and `sccm-datathon`.\n4. You should be able to preview the data available on these projects using the graphical interface.\n5. Now try running a query. For example, try counting the number of rows in the demo eICU patient table:\n\n   ```SQL\n   SELECT count(*)\n   FROM `physionet-data.eicu_crd_demo.patient`\n   ```\n\n## 4. Analysing data with Google Colab\n\nPython is an increasingly popular programming language for analysing data. We will explore the data using Python notebooks, which allow code and text to be combined into executable documents. 
First, try opening a blank document using the link below:\n\n- [https://colab.research.google.com/](https://colab.research.google.com/)\n\n## 5. Python notebooks that we prepared earlier\n\nSeveral tutorials are provided below. Requirements for these notebooks are: (1) you have a Gmail account and (2) your Gmail address has been added to the appropriate Google Group by the workshop hosts.\n\nNotebook 1 (eICU): Exploring the patient table. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/01_explore_patients.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 2 (eICU): Severity of illness. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/02_severity_of_illness.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 3 (eICU): Summary statistics. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/03_summary_statistics.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 4 (eICU): Timeseries. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/04_timeseries.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 5 (eICU): Mortality prediction. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/05_mortality_prediction.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 6 (eICU): Acute kidney injury. 
\u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/06_aki_project.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 7 (eICU): Project work. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/07_project_work.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\nNotebook 8 (MIMIC): Weekend effect on mortality. \u003ca href=\"https://colab.research.google.com/github/MIT-LCP/sccm-datathon/blob/master/mimic-weekend-effect.ipynb\" target=\"_parent\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/\u003e\u003c/a\u003e\n\n## 6. An example in R\n\nIf you prefer working in R, then you can connect to Google Cloud from your code in a similar way:\n\n- https://github.com/MIT-LCP/sccm-datathon/blob/master/mimic-iii-los.rmd\n\n## 7. Sample projects\n\nThese papers and repositories may be helpful for reference. They are definitely **not** perfect! Code may be untidy, poorly documented, buggy, outdated etc. Think about how they can be improved, adapted, etc. For example, you could:\n\n- replicate the study on a different dataset (e.g. MIMIC vs eICU)\n- improve the methodology\n\n1. 
The association between mortality among patients admitted to the intensive care unit on a weekend compared to a weekday\n\n- Python Notebook: https://github.com/MIT-LCP/bhi-bsn-challenge/blob/master/challenge-demo.ipynb\n- R Markdown Notebook: https://github.com/MIT-LCP/bhi-bsn-challenge/blob/master/rmarkdown_example_notebook.Rmd\n- More reading: https://physionet.org/content/bhi-2018-challenge/1.0/\n\nTBC...","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmit-lcp%2Fsccm-datathon","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmit-lcp%2Fsccm-datathon","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmit-lcp%2Fsccm-datathon/lists"}