{"id":26898412,"url":"https://github.com/seif-elkateb/dataset-analysis-r","last_synced_at":"2025-04-01T05:47:01.814Z","repository":{"id":281136404,"uuid":"944322259","full_name":"Seif-Elkateb/Dataset-Analysis-R","owner":"Seif-Elkateb","description":null,"archived":false,"fork":false,"pushed_at":"2025-03-07T07:34:22.000Z","size":8810,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-04-01T05:46:56.977Z","etag":null,"topics":["cu-boulder","data","data-analysis","datamodeling","datascience","ms-ds","msds434","r"],"latest_commit_sha":null,"homepage":"","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Seif-Elkateb.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-03-07T06:32:48.000Z","updated_at":"2025-03-13T00:35:38.000Z","dependencies_parsed_at":null,"dependency_job_id":"8df77dd2-bd73-4f9b-a1c6-d6d27972b1aa","html_url":"https://github.com/Seif-Elkateb/Dataset-Analysis-R","commit_stats":null,"previous_names":["elkateb-seif/covid19-dataset-analysis-r","seif-elkateb/dataset-analysis-r"],"tags_count":null,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Seif-Elkateb%2FDataset-Analysis-R","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Seif-Elkateb%2FDataset-Analysis-R/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Seif-Elkateb%2FDataset-Analysis-R/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Seif-Elkateb%2FDataset-Analysis-R/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Seif-Elkateb","download_url":"https://codeload.github.com/Seif-Elkateb/Dataset-Analysis-R/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246591806,"owners_count":20801985,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cu-boulder","data","data-analysis","datamodeling","datascience","ms-ds","msds434","r"],"created_at":"2025-04-01T05:46:59.182Z","updated_at":"2025-04-01T05:47:01.797Z","avatar_url":"https://github.com/Seif-Elkateb.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n\n# Data Analysis Projects\n\nThis repository contains two data analysis projects: one focusing on NYPD data and the other on COVID-19 data. Both projects utilize R for data manipulation, analysis, and visualization.\n\n## Projects Overview\n\n### 1. NYPD Data Analysis\n\nThis project analyzes NYPD data to uncover trends and insights related to crime in New York City.\n\n#### Libraries Used\n\n- Tidyverse\n- Lubridate\n\n#### Data Import\n\nThe following CSV files are imported for analysis:\n\n- `NYPD_Shooting_Incident_Data__Historic_csv`\n#### Data Tidying and Transformation\n\n- Removed unnecessary columns.\n- Created New Columns such as Day/Night\n- New columns for Day / Month / Year\n- fixed the object type of some variables such as occur_time and occur_data by using the lubridate mutate function\n#### Data Summary\n\n- Filtered out irrelevant observations.\n- Summarized the data to include only relevant observations.\n\n#### Data Visualization\n\n- Visualized crime trends over time.\n- Analyzed crime distribution by borough and precinct.\n\n### 2. COVID-19 Data Analysis\n\nThis project analyzes COVID-19 data to understand the spread and impact of the pandemic globally and in the US.\n\n#### Libraries Used\n\n- Tidyverse\n- Lubridate\n\n#### Data Import\n\nThe following CSV files are imported for analysis:\n\n- `time_series_covid19_confirmed_US.csv`\n- `time_series_covid19_confirmed_global.csv`\n- `time_series_covid19_deaths_US.csv`\n- `time_series_covid19_deaths_global.csv`\n- `UID_ISO_FIPS_LookUp_Table.csv`\n\n#### Data Tidying and Transformation\n\n- Removed unnecessary columns.\n- Transformed date columns into a single `date` column.\n- Created new columns for analysis, such as `cases` and `deaths`.\n\n#### Data Summary\n\n- Filtered out observations with zero cases.\n- Summarized the data to include only relevant observations.\n\n#### Data Visualization\n\n- Visualized cases and deaths over time.\n- Summarized total cases and deaths by state and country.\n\n## How to Run\n\n1. Clone the repository.\n2. Install the required libraries.\n3. Run the R scripts to reproduce the analysis and visualizations for both projects.\n\n## Summary Statistics\n\n### NYPD Data\n\n- Key insights and trends related to crime in NYC.\n\n### COVID-19 Data\n\n- Last date in the dataset: `2023-03-09`\n- Maximum number of cases: `103,802,702`\n- Maximum number of deaths: `1,123,836`\n\n---\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseif-elkateb%2Fdataset-analysis-r","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fseif-elkateb%2Fdataset-analysis-r","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fseif-elkateb%2Fdataset-analysis-r/lists"}