Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/LisaATaylor/Rideshare

Jupyter notebooks for Capstone 1
https://github.com/LisaATaylor/Rideshare

Last synced: about 1 month ago
JSON representation

Jupyter notebooks for Capstone 1

Awesome Lists containing this project

README

        

# Machine Learning with the Chicago Rideshare "Trips" Dataset
## Lisa Taylor
### Springboard Data Science Bootcamp - Capstone Project 1

Since November 2018, rideshare providers (Lyft, Uber, etc) operating in Chicago have been required by ordinance to submit periodic data reports reports with basic rideshare information to the City. The main Trips dataset includes information about individual rides, including starting and ending times and locations, fares and tips, and whether the rides were pooled. The Trips data are anonymized, so trip starting and ending locations are generalized to the nearest census tract, and drivers cannot be linked to particular rides they provided. The portion of the dataset used for this exercise covers November 2018 through March 2019.

To prepare for the analyses and visualizations in this notebook, the Trips data was cleaned, aggregated, and enriched with climate and census data. Inferential statistical techniques were applied to test hypotheses regarding the data. Supervised learning with random forest regression was applied to develop a simple rideshare demand prediction model. Finally, a clustering analysis was used to perform customer segmentation and gain insight into rideshare usage patterns in Chicago.

### Data preparation, enrichment and visualization
[Data Wrangling and Visualization](Chicago%20Rideshare%20-%20Wrangling+Visualization.ipynb)

[Chicago Rideshare Data Story](Chicago%20Rideshare%20Data%20Story.ipynb)

[Python Data Prep Functions](RideshareDataPrep.py)

### Statistical analysis

[Application of Inferential Statistics](Chicago%20Rideshare%20Dataset-%20Statistical%20Inference.ipynb)

### Machine Learning
[Rideshare Usage Prediction with Random Forest Regression](https://nbviewer.jupyter.org/github/LisaATaylor/Rideshare/blob/master/Chicago%20Rideshare%20Dataset%20-%20Usage%20Prediction.ipynb)

[Rideshare Customer Segmentation with KMeans Clustering](https://nbviewer.jupyter.org/github/LisaATaylor/Rideshare/blob/master/Chicago%20Rideshare%20Dataset%20-%20Clustering%20Analysis.ipynb)

### Reporting
[Final Report](Capstone%201%20Reports/Capstone%201%20-%20Final%20Report.pdf)

[Final Presentation](Capstone%201%20Reports/Capstone%201%20-%20Chicago%20Rideshare%20Walkthrough.pdf)