https://github.com/zuzann18/uber-demand-analysis
Uber Demand Analysis in NYC | Time Series Analysis, Data Visualization, Correlation Analysis, Feature Engineering | Exploring the impact of weather, holidays, and location using Python (Pandas, NumPy, Matplotlib, Seaborn), Jupyter Notebook.
https://github.com/zuzann18/uber-demand-analysis
demand-forecasting time-series-analysis
Last synced: about 1 year ago
JSON representation
Uber Demand Analysis in NYC | Time Series Analysis, Data Visualization, Correlation Analysis, Feature Engineering | Exploring the impact of weather, holidays, and location using Python (Pandas, NumPy, Matplotlib, Seaborn), Jupyter Notebook.
- Host: GitHub
- URL: https://github.com/zuzann18/uber-demand-analysis
- Owner: zuzann18
- Created: 2025-02-02T09:43:02.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-02T13:43:18.000Z (over 1 year ago)
- Last Synced: 2025-02-10T16:17:11.640Z (over 1 year ago)
- Topics: demand-forecasting, time-series-analysis
- Language: Jupyter Notebook
- Homepage:
- Size: 2.47 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Uber Demand Analysis
## Project Overview
This project analyzes Uber ride demand in New York City, identifying key factors influencing ride pickups. The goal is to extract actionable insights that can help Uber optimize its operations based on weather conditions, holidays, and location-based trends.
## Objectives
- Determine the most significant factors affecting demand.
- Provide recommendations for Uber management to capitalize on demand fluctuations.
## Dataset Description
The main dataset initially contained over 10 million observations aggregated per hour and borough. After merging with additional datasets, it resulted in **29,101 observations across 13 variables**, including:
- **pickup_dt**: Date and time of the pick-up.
- **borough**: NYC borough.
- **pickups**: Number of pickups in a given period.
- **spd**: Wind speed (mph).
- **vsb**: Visibility (miles).
- **temp**: Temperature (°F).
- **dewp**: Dew point (°F).
- **slp**: Sea level pressure.
- **pcp01**: 1-hour liquid precipitation.
- **pcp06**: 6-hour liquid precipitation.
- **pcp24**: 24-hour liquid precipitation.
- **sd**: Snow depth (inches).
- **hday**: Holiday indicator (Y/N).
---
## 📊 Key Findings
### 1️⃣ Influence of Weather
- **Temperature** has a significant positive correlation with ride demand. **Higher temperatures lead to more Uber rides.**
- **Precipitation (rain & snow)** negatively impacts pickups. **Bad weather reduces demand due to reduced mobility.**
- **Visibility** is also a factor—low visibility conditions reduce demand.

### 2️⃣ Effect of Holidays & Time of Day
- **Holidays show a clear increase in Uber usage**, likely due to increased travel and reduced availability of public transport.
- **Peak demand is observed in the evening hours**, especially between 5 PM - 10 PM.
- **Weekends** see a significant **increase in demand** compared to weekdays.

### 3️⃣ Borough-Wise Demand
- **Manhattan has the highest Uber pickup volume.** This is expected due to its high population density and business activity.
- **Brooklyn and Queens show moderate demand**, but it fluctuates more with weather conditions.
- **The Bronx and Staten Island have lower overall demand.**

In the context of the data used in the Uber analysis, **borough** refers to the five main districts of New York City:
- **Manhattan**
- **Brooklyn**
- **Queens**
- **The Bronx**
- **Staten Island**
- **EWR (Newark Liberty International Airport)**
These boroughs and the Newark airport are crucial for analyzing Uber ride demand, as each has distinct travel patterns, traffic conditions, and transportation needs.
For example:
- **Manhattan** has the highest number of rides due to high business and tourist activity.
- **Brooklyn and Queens** are significant for commuters traveling to work.
- **The Bronx and Staten Island** have lower ride volumes but may experience demand spikes during specific hours or weather conditions.
- **EWR (Newark Liberty International Airport)** is a major travel hub, with high demand for rides to and from the airport, especially during peak flight hours.
When analyzing the data, it is essential to consider how ride volumes fluctuate across different boroughs and EWR based on weather factors, peak hours, and weekdays.

### Seasonal Decomposition

### Monthly Uber Pickups by Day Type
- **Working Days:**
- There's a general upward trend in total pickups from January to June.
- This indicates increasing demand throughout the period, possibly due to seasonal factors, increased Uber adoption, or other external influences.
- **Non-Working Days:**
- The trend is similar to working days but with generally lower total pickups.
- This is expected as there are fewer non-working days in a month.
### Key Insights from Time Series Analysis
- **Weekly Seasonality**: Uber demand fluctuates based on weekly cycles, with higher demand on weekends.
- **Trend Growth**: There is a clear upward trend in ride demand over time.
- **Weather Effects**: Rain and snow events cause temporary dips in demand.
- **Holiday Spikes**: Demand significantly increases during holidays.
---
## 📌 Recommendations
Based on these findings, we propose the following business strategies:
### 🚀 Dynamic Pricing & Driver Allocation
- Implement **surge pricing based on real-time weather and holiday data** to optimize revenue.
- Increase **driver availability in high-demand areas during peak times** (evenings, weekends, and holidays).
- Reduce driver allocation in **low-visibility conditions or heavy rainfall**, as demand tends to drop.
### 📍 Location-Based Adjustments
- Focus **marketing efforts on outer boroughs (Brooklyn, Queens)** to improve demand in those regions.
- Increase **Manhattan coverage** during peak evening hours to maximize trip efficiency.
### 📈 Service Enhancements
- Offer **incentives to drivers** for working in high-demand periods.
- **Introduce promotions for rides booked in bad weather**, encouraging more usage even during low-mobility conditions.
---
## 📜 Conclusion
This analysis reveals that **weather, holidays, and borough location significantly impact Uber ride demand.** By leveraging these insights, Uber can implement strategic pricing, optimize driver availability, and enhance user experience.
Applying **data-driven strategies** can lead to increased efficiency, better customer satisfaction, and higher revenue.
---
## 🛠️ Technologies Used
- **Python** (Pandas, NumPy, Matplotlib, Seaborn)
- **Jupyter Notebook** for analysis and visualization
---
## Data Source
This dataset originates from **Kaggle**: [NYC Uber Pickups with Weather and Holidays](https://www.kaggle.com/datasets/yannisp/uber-pickups-enriched?utm_source=chatgpt.com).
## How to Use the Project
### Clone the Repository
```sh
git clone https://github.com/your-username/uber-demand-analysis.git
cd uber-demand-analysis
```
### 🛠 Install Dependencies
```sh
pip install pandas numpy matplotlib seaborn jupyter
```
### ▶️ Run the Notebook
```sh
jupyter notebook Uber_Case_Study-1.ipynb
```
---