{"id":19328853,"url":"https://github.com/bishopce16/pyber_analysis","last_synced_at":"2025-07-08T09:41:09.464Z","repository":{"id":44475518,"uuid":"511686531","full_name":"bishopce16/PyBer_Analysis","owner":"bishopce16","description":"The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.","archived":false,"fork":false,"pushed_at":"2022-07-14T23:26:25.000Z","size":2031,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-06T07:47:38.167Z","etag":null,"topics":["data-analysis","data-visualization","jupyter-notebook","matplotlib","pandas","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/bishopce16.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-07-07T22:04:58.000Z","updated_at":"2022-09-09T00:44:28.000Z","dependencies_parsed_at":"2022-09-17T06:50:36.686Z","dependency_job_id":null,"html_url":"https://github.com/bishopce16/PyBer_Analysis","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bishopce16%2FPyBer_Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bishopce16%2FPyBer_Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bishopce16%2FPyBer_Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bishopce16%2FPyBer_Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/bishopce16","download_url":"https://codeload.github.com/bishopce16/PyBer_Analysis/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240434193,"owners_count":19800548,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-visualization","jupyter-notebook","matplotlib","pandas","python"],"created_at":"2024-11-10T02:24:59.688Z","updated_at":"2025-02-24T06:42:00.683Z","avatar_url":"https://github.com/bishopce16.png","language":"Jupyter Notebook","readme":"# PyBer_Analysis\n## Overview of the analysis:\n\nThe purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer. Using the provided CSV files located in the resources folder ride_date.csv and city_data.csv, to help the company increase access to ride sharing services and determine affordability for underserved areas. The two datasets were merged using left join based on the “city” column to create one DataFrame named pyber_data_df. The city types being compared are urban, suburban, and rural. \n\n---\n## Resources:\nData Sources: ride_data.csv, city_data.csv\n\nSoftware: Anaconda 4.13.0, Jupyter Notebook 6.4.11, Python 3.8.9\n\nLibraries: Pandas 1.4.2, Matplotlib-inline 0.1.3, Matplotlib 3.5.2\n\n---\n## Results:\n\nUsing the merged dataset to create a summary DataFrame by city type, show that rural cities have fewer drivers and riders with higher average fares prices than urban cities. The total rides for rural cities is thirteen times less than in urban cities. The average fare per ride 1.4 times more in rural cities compared to urban cities. Urban cities total fares were two times higher than suburban cities and nine times higher than in rural cities. \n\n![PyBer Summary DataFrame](analysis/pyber_summary_df.png)\n\n---\n\nThe Percentage of Total Rides by City Type: \n\nUrban cities made up more than 2/3 of the percentage of total rides in 2019. The suburban cities made up over 26% of the of total rides and rural cities had the smallest percentage with just over 5% of total rides in 2019.\n\n![The Percentage of Total Rides by City Type pie chart](analysis/Fig6.png)\n\n---\n\nThe Percentage of Total Drivers by City Type: \n\nDrivers in the urban cities are in the majority with just over 80% of total drivers in the 2019 data. With suburban cities making up only 16.5% and rural cities 2.6%. \n\n![The Percentage of Total Drivers by City Type pie chart](analysis/Fig7.png)\n\n---\n\nThe Percentage of Total Fares by City Type: \n\nThis pie chart shows that over 60% of total fares by city type, were in urban cities. Suburban cities counted for 30.5% of total fares and rural cities 6.8% in 2019.\n\n![The Percentage of Total Fares by City Type pie chart](analysis/Fig5.png)\n\n---\n\nAverage Fare per Ride and Driver by City Type:\n\nThe bubble chart “PyBer Ride-Sharing Data (2019)” shows the relationship between the total number of drivers and the average fare price. As the total number of rides per city increases, the average fare tends to decrease. This points to a negative relationship. The circle size correlates with the driver count per city. Rural cities have less drivers and rides per city than urban and suburban cities however rural city fare range from middle to high prices.\n\n![Average Fare per Ride and Driver by City Type bubble chart](analysis/Fig1.png)\n\n---\n\nTotal Fare by City Type:\n\nThe multiple-line chart “Total Fares by City Type” shows the total weekly fares for each city type from January to April 2019. The chart shows that total fares in urban cities is nine times higher than in rural cities and two times higher than suburban cities. \n\n![Total Fare by City Type line chart](analysis/Fig8.png)\n\n---\n## Summary:\n\nThe PyBer summary DataFrame indicates there is room for further analysis. Urban cities total fares were two times higher than suburban cities and nine times higher than in rural cities. Because of the general trend is higher number of drivers and rides correlates with low to medium fares, the fares in urban cities should increase if the number of drivers in urban cities is reduced. As well as increasing the number of drivers in rural cities to ensure there are the necessary drivers needed to meet the ride demand. To determine additional contributing factors (on all three city types), analysis such as geographic size, travel distance, and population vs the average number of drivers should be conducted.\n\n---\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbishopce16%2Fpyber_analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbishopce16%2Fpyber_analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbishopce16%2Fpyber_analysis/lists"}