{"id":45253376,"url":"https://github.com/alext234/coronavirus-stats","last_synced_at":"2026-02-20T23:07:08.960Z","repository":{"id":41938034,"uuid":"237413504","full_name":"alext234/coronavirus-stats","owner":"alext234","description":"Automatically scrape data and statistics on Coronavirus to make them easily accessible in CSV format","archived":false,"fork":false,"pushed_at":"2023-07-23T04:14:49.000Z","size":811756,"stargazers_count":47,"open_issues_count":9,"forks_count":19,"subscribers_count":4,"default_branch":"master","last_synced_at":"2023-12-16T15:54:26.528Z","etag":null,"topics":["australia","cdc","china","coronavirus","covid-19","data-science","europe","health","italy","jupyter-notebook","pipeline","scraping-data","singapore","south-korea","stats","usa","wuhan-virus"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/alext234.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2020-01-31T11:07:03.000Z","updated_at":"2023-08-07T19:45:54.000Z","dependencies_parsed_at":"2022-09-21T08:23:47.871Z","dependency_job_id":"4c32ea85-374d-4656-9eb1-37eacd7a7d16","html_url":"https://github.com/alext234/coronavirus-stats","commit_stats":{"total_commits":7928,"total_committers":13,"mean_commits":609.8461538461538,"dds":"0.023335015136226045","last_synced_commit":"ccab13809cbdb1711cded89539e44f3ba2db76b9"},"previous_names":[],"tags_count":4,"template":null,"template_full_name":null,"purl":"pkg:github/alext234/coronavirus-stats","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alext234%2Fcoronavirus-stats","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alext234%2Fcoronavirus-stats/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alext234%2Fcoronavirus-stats/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alext234%2Fcoronavirus-stats/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/alext234","download_url":"https://codeload.github.com/alext234/coronavirus-stats/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alext234%2Fcoronavirus-stats/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29667139,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-20T19:49:36.704Z","status":"ssl_error","status_checked_at":"2026-02-20T19:44:05.372Z","response_time":59,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["australia","cdc","china","coronavirus","covid-19","data-science","europe","health","italy","jupyter-notebook","pipeline","scraping-data","singapore","south-korea","stats","usa","wuhan-virus"],"created_at":"2026-02-20T23:07:08.522Z","updated_at":"2026-02-20T23:07:08.954Z","avatar_url":"https://github.com/alext234.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\n![Top 10](https://img.shields.io/endpoint?color=blue\u0026style=flat-square\u0026url=https%3A%2F%2Fraw.githubusercontent.com%2Falext234%2Fcoronavirus-stats%2Fmaster%2Fdata%2Ftop10.json)\n\n\n### CSV Data on Coronavirus (COVID-19)\nThis repository contains data in (CSV format) which are scraped from reliable sources (e.g. World Health Organisation).\n\n* Data are scraped a few times daily and pushed back to this repository together with generated charts (.PNG files).\n\n* Look for those **CSV direct link** below to get the scraped historical data.\n\n### Aggregate sites\n\n\n#### [BNO News](https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/)\n\n\nNOTE: paused on 1 Jan 2021\n\nBelow are international stats, excluding China.\n\n[CSV direct link](data/bnonews-international.csv?raw=true)\n\nBar chart of the latest snapshot.\n\n![](images/bnonews-international.png?raw=true)\n\n\n\n### WHO \u0026 Government sites \n\n#### From WHO (World Health Organisation) Situation reports\n\nNOTE: paused on 16 Jun 2020 due to format changes from WHO.\n\nData are scraped from [these reports](https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/) which are in PDF formats. New reports are released daily.\n\n\n#### Globally confirmed cases\n\n[CSV direct link](data/who-global-cases.csv?raw=true)\n\n![](images/who-global-cases.png?raw=true)\n\n\n### Stats from Australia\nNOTE: paused on 10 July 2020\n\nData is pulled from Department of Health [website](https://www.health.gov.au/news/coronavirus-update-at-a-glance).\n\n#### Cases in Australia\n\n[CSV direct link](data/australia-cases.csv?raw=true)\n\n![](images/australia-cases.png?raw=true)\n\n\n### Stats from Singapore\n\nData are scraped from the MOH (Ministry of Health) local situation [web page](https://www.moh.gov.sg/2019-ncov-wuhan).\n\n#### Cases in Singapore\n\nNOTE: paused on 27 Jun 2020.\n\n[CSV direct link](data/singapore-cases.csv?raw=true)\n\n\n\n\n### From US CDC (Centers for Disease Control and Prevention)\n\nNOTE: paused on 27 Jun 2020 due to format changes from CDC.\n\n#### Cases in the US (data are scraped from [here](https://www.cdc.gov/coronavirus/2019-ncov/cases-in-us.html))\n\n* Till 18 Apr 2020:\n\n[CSV direct link](data/cdc-us-cases.csv?raw=true) \n\n\n* From 18 Apr 2020, the data format CDC website has been changed to include races and age groups. \n\n[CSV direct link](data/cdc-us-cases-by-races-and-age-group.csv?raw=true) \n\n*  From 7 May 2020, \n\n[CSV direct link](data/us-cdc-total-cases-deaths.csv?raw=true) \n\n\n### Stats from China\n[This page](https://ncov.dxy.cn/ncovh5/view/pneumonia) has the realtime stats from China. Data are pulled several times a day by the pipeline.\n\n#### All cases in China\n\n[CSV direct link](data/china-summary-cases.csv?raw=true)\n\n![](images/china-summary-cases.png?raw=true)\n\n\n\n### How it works\n\n* Jupyter notebooks are used for scraping data and output to CSV files\n* These notebooks are executed on a schedule by Github Actions pipeline to scrape new data\n* This pipeline also commits back new data to this repository\n\n\n### Development \n\n\n* Tools: Python3, Jupyter, Pandas, BeautifulSoup and related stuff (e.g. Selenium for web-scraping). \nIt is recommended to start the development environment with this docker image, which is also used for the Github Actions build pipeline.\n\n```\ndocker run  -p 8888:8888 -it -v $PWD:/stats -w /stats alext234/datascience:latest  bash \n```\n\n\n* [requirements.txt](requirements.txt) contains Python dependencies\n\n```\npip install -r requirements.txt\n```\n\n* Start Jupyter notebook from inside the container and then visit the browser at `http://localhost:8888`\n\n```\njupyter notebook --allow-root --ip=0.0.0.0\n\n```\n\n### Contributions\n\n* Feel free to create new issues for any potential data source worth scraping.\n* Pull requests are welcomed!\n\n\n### Repo status and stats\n* Stargazers\n\n\n![GitHub stars](https://img.shields.io/github/stars/alext234/coronavirus-stats?style=social)\n\n* Last update from pipeline\n\n![Last update](https://img.shields.io/endpoint?color=blue\u0026style=flat-square\u0026url=https%3A%2F%2Fraw.githubusercontent.com%2Falext234%2Fcoronavirus-stats%2Fmaster%2Fdata%2Flast_update.json)\n\n* Pipeline status\n\n\n![Run notebooks and commit back data/charts](https://github.com/alext234/coronavirus-stats/workflows/Run%20notebooks%20and%20commit%20back%20data/charts/badge.svg?branch=master) \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falext234%2Fcoronavirus-stats","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Falext234%2Fcoronavirus-stats","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falext234%2Fcoronavirus-stats/lists"}