https://github.com/mrsaeeddev/nyc-taxi-dataset-analysis
Analysis of NYC Taxi Dataset
- Host: GitHub
- URL: https://github.com/mrsaeeddev/nyc-taxi-dataset-analysis
- Owner: mrsaeeddev
- Created: 2019-03-03T17:27:07.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-05-18T04:05:20.000Z (about 7 years ago)
- Last Synced: 2025-05-30T06:06:31.147Z (about 1 year ago)
- Language: HTML
- Size: 1.23 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# NYC Taxi Dataset Analysis
## Analysis of NYC taxi data to uncover patterns through heat maps and to predict trip duration
## Dataset :
The dataset is released by the New York City Taxi and Limousine Commission (NYCTLC). It contains trip data for the two types of taxi service operating in the New York City area, i.e. green and yellow taxi cabs, and covers both types from 2016-01-01 to 2016-06-30.
## Approach :
In this project, I used general exploratory data analysis (EDA) to produce heat maps, graphs and bar graphs of different parameters of the data.
I also used the folium package for Python to identify the areas with a higher concentration of pick-ups and drop-offs.
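
The concentration step can be sketched roughly as follows. This is a minimal illustration, not the notebook code from this repository: the function names (`bin_points`, `top_cells`, `render_heatmap`), the grid size, the map centre, the output filename and the sample coordinates are all assumptions made for the example. Only `folium.Map`, `folium.plugins.HeatMap` and `Map.save` are actual folium calls, and folium is imported lazily so the counting part runs without it.

```python
from collections import Counter

# Hypothetical sample of (latitude, longitude) pick-up points.
SAMPLE_PICKUPS = [
    (40.7580, -73.9855),
    (40.7582, -73.9853),
    (40.7581, -73.9856),
    (40.6413, -73.7781),
]


def bin_points(points, cell_deg=0.005):
    """Count points per grid cell; keys are integer (row, col) cell indices."""
    counts = Counter()
    for lat, lon in points:
        cell = (round(lat / cell_deg), round(lon / cell_deg))
        counts[cell] += 1
    return counts


def top_cells(counts, cell_deg=0.005, n=3):
    """Return the n busiest cells as ((lat, lon) of cell centre, count)."""
    return [
        ((row * cell_deg, col * cell_deg), count)
        for (row, col), count in counts.most_common(n)
    ]


def render_heatmap(points, out_path="pickups_heatmap.html", cell_deg=0.005):
    """Write an interactive heat map of the binned points to an HTML file."""
    # Imported here so the counting helpers work even if folium is missing.
    import folium
    from folium.plugins import HeatMap

    counts = bin_points(points, cell_deg)
    # HeatMap accepts rows of [lat, lon, weight]; the weight is the cell count.
    rows = [
        [row * cell_deg, col * cell_deg, count]
        for (row, col), count in counts.items()
    ]
    # Map centre and zoom level are assumed values roughly covering NYC.
    city_map = folium.Map(location=[40.75, -73.97], zoom_start=11)
    HeatMap(rows).add_to(city_map)
    city_map.save(out_path)
    return out_path


pickup_counts = bin_points(SAMPLE_PICKUPS)
busiest = top_cells(pickup_counts, n=1)
```

Calling `render_heatmap(SAMPLE_PICKUPS)` would write an HTML file that can be opened in a browser. Binning before plotting keeps the map responsive when the input has millions of rows.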
## Results :
This approach can be used by ride-sharing companies to identify areas with a high number of rides and target customers there accordingly. It can also be used to predict the duration and time of rides, as well as fare prices.
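
As a rough illustration of the duration-prediction idea, the sketch below fits a one-variable least-squares line of trip duration against straight-line trip distance. This is an assumed baseline for illustration only; this README does not state which model was used. The function names (`haversine_km`, `fit_line`, `predict_duration`) and the toy numbers are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (
        math.sin(d_phi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    )
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_duration(distance_km, slope, intercept):
    """Predicted trip duration (same unit as the training durations)."""
    return slope * distance_km + intercept


# Toy training data (made up): distances in km, durations in minutes.
toy_distances = [1.0, 2.0, 3.0, 4.0]
toy_durations = [5.0, 7.0, 9.0, 11.0]
toy_slope, toy_intercept = fit_line(toy_distances, toy_durations)
```

In practice the distance would be computed with `haversine_km` from each trip's pick-up and drop-off coordinates, and features such as hour of day would likely matter as much as distance.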