An open API service indexing awesome lists of open source software.

https://github.com/ericdataplus/kaggle-airbnb-nyc

NYC Airbnb Market Analysis: Multi-source from 2 Kaggle datasets (151K listings)
https://github.com/ericdataplus/kaggle-airbnb-nyc

airbnb data-analysis kaggle nyc python visualization

Last synced: about 2 months ago
JSON representation

NYC Airbnb Market Analysis: Multi-source from 2 Kaggle datasets (151K listings)

Awesome Lists containing this project

README

          

# πŸ™οΈ NYC Airbnb Market Analysis

> πŸ“Š **Multi-Source Analysis** combining data from:
> - [New York City Airbnb Open Data](https://www.kaggle.com/datasets/dgomonov/new-york-city-airbnb-open-data) (48,895 listings)
> - [Airbnb Open Data](https://www.kaggle.com/datasets/arianazmoudeh/airbnbopendata) (102,599 listings)
>
> Comprehensive geographic and pricing analysis of NYC's short-term rental market.

πŸ”— **[View Live Dashboard](https://ericdataplus.github.io/kaggle-airbnb-nyc/)**

![Summary Dashboard](graphs/07_summary_dashboard.png)

## πŸ“Š Key Statistics

| Metric | Value |
|--------|-------|
| Total Listings | **151,494** (combined) |
| Boroughs Covered | 5 |
| Neighborhoods | 221 |
| Unique Hosts | 37,457 |
| Avg Price (NYC) | $153/night |
| Most Expensive | Manhattan ($197/night) |

## πŸ—ΊοΈ Visualizations

### Geographic Analysis
- **NYC Listing Map** β€” All listings color-coded by borough
- **Price Heatmap** β€” Geographic distribution of prices

### Price Analysis
- **Price by Borough** β€” Manhattan leads at $197/night avg
- **Top 15 Most Expensive Neighborhoods**
- **Room Type Pricing** β€” Entire home vs Private room vs Shared

### Market Insights
- **Host Analysis** β€” Multi-listing hosts dominate supply
- **Neighborhood Rankings** β€” Most listed vs most expensive
- **Multi-Dataset Comparison** β€” Side-by-side source analysis

## πŸ“ Project Structure

```
kaggle-airbnb-nyc/
β”œβ”€β”€ index.html # Interactive Dashboard
β”œβ”€β”€ graphs/ # Visualizations
β”‚ β”œβ”€β”€ 01_price_by_borough.png
β”‚ β”œβ”€β”€ 02_room_type_analysis.png
β”‚ β”œβ”€β”€ 03_nyc_map.png
β”‚ β”œβ”€β”€ 04_top_neighborhoods.png
β”‚ β”œβ”€β”€ 05_dataset_comparison.png
β”‚ β”œβ”€β”€ 06_host_analysis.png
β”‚ └── 07_summary_dashboard.png
β”œβ”€β”€ scripts/ # Analysis scripts
β”‚ β”œβ”€β”€ 01_price_by_borough.py
β”‚ β”œβ”€β”€ 02_room_type_analysis.py
β”‚ β”œβ”€β”€ 03_nyc_map.py
β”‚ β”œβ”€β”€ 04_top_neighborhoods.py
β”‚ β”œβ”€β”€ 05_dataset_comparison.py
β”‚ β”œβ”€β”€ 06_host_analysis.py
β”‚ β”œβ”€β”€ 07_summary_dashboard.py
β”‚ └── run_all.py
└── README.md
```

## πŸ› οΈ Tech Stack

- **Python** β€” Core language
- **Pandas** β€” Data manipulation
- **Matplotlib** β€” Visualizations
- **NumPy** β€” Numerical computing

## πŸ“¦ Data Sources

| Dataset | Source | Records |
|---------|--------|---------|
| NYC Airbnb 2019 | [dgomonov/new-york-city-airbnb-open-data](https://www.kaggle.com/datasets/dgomonov/new-york-city-airbnb-open-data) | 48,895 |
| Airbnb Open Data | [arianazmoudeh/airbnbopendata](https://www.kaggle.com/datasets/arianazmoudeh/airbnbopendata) | 102,599 |
| **Combined** | Multi-source analysis | **151,494** |

## πŸ” Key Findings

1. **Manhattan dominates pricing** β€” $197/night average, 72% premium over outer boroughs
2. **Entire homes are 2x private rooms** β€” $212 vs $89 average
3. **Williamsburg most listed** β€” 3,920 listings in one neighborhood
4. **Professional hosts** β€” Top 10 hosts control 1,000+ listings
5. **Tribeca most expensive** β€” $490/night average

## πŸš€ Quick Start

```bash
# Clone repo
git clone https://github.com/Ericdataplus/kaggle-airbnb-nyc.git
cd kaggle-airbnb-nyc

# Download data from Kaggle (links above)
# Place CSV files in project root

# Run analysis
python scripts/run_all.py
```

---

Made with πŸ™οΈ by [Ericdataplus](https://github.com/Ericdataplus) | December 2024