Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/dharmendradiwaker/exploratory-data-analysis-of-los-angeles-crime-data

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.
https://github.com/dharmendradiwaker/exploratory-data-analysis-of-los-angeles-crime-data

eda exploratory-data-analysis matplotlib plotly seaborn

Last synced: about 1 month ago
JSON representation

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.

Awesome Lists containing this project

README

        

# Exploratory Data Analysis of Los Angeles Crime Data πŸš”πŸ“Š

## Overview
This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more. The insights are visualized using various tools, including **Pandas**, **Seaborn**, **Plotly**, and **Geo Heatmap**.

## Key Findings πŸ”
- Over **50% of crime victims** are aged between **21-40 years**. πŸ‘ΆπŸ‘¨β€πŸ¦±
- More than **50% of victims** are **male**. 🚹
- Over **20% of crimes** were committed using **deadly weapons**. πŸ”ͺ🧨

## Dataset πŸ“
The dataset contains more than **802,000** records, each representing a unique crime. It includes 28 columns such as:
- **Crime Type** πŸ”
- **Victim Demographics** πŸ§‘β€πŸ¦³
- **Crime Location** πŸ“
- **Time of Crime** ⏰
- **Weapon Used** πŸ”«

## Requirements πŸ“‹
- Python (version 3.6 or higher recommended)
- Required Python libraries:
- Pandas (for data manipulation) πŸ“Š
- Seaborn (for data visualization) 🎨
- Plotly (for interactive visualizations) πŸ“‰
- GeoPandas (for geographical data visualizations) 🌍
- Matplotlib (for additional plotting support) πŸ–ΌοΈ

You can install these dependencies using pip:
```bash
pip install pandas seaborn plotly geopandas matplotlib
```

## Analysis and Visualizations πŸ“Š
- **Data Cleaning**: Missing values handled, and data was preprocessed for analysis.
- **Visualization Techniques**:
- **Scatter Plots**: To understand correlations and trends.
- **Pie Charts**: For visualizing categorical distributions, like the gender of victims.
- **Geo Heatmap**: To visualize crime hotspots across Los Angeles on a map.
- **Histograms**: To understand the distribution of victims' ages and crime types.

## Setup βš™οΈ
1. Clone this repository to your local machine:
```bash
git clone https://github.com/dharmendradiwaker/Exploratory-Data-Analysis-of-Los-Angeles-Crime-Data-.git
```

2. Navigate to the project folder:
```bash
cd exploratory-data-analysis-los-angeles-crime
```

3. Install the required libraries:
```bash
pip install -r requirements.txt
```

4. Run the Jupyter Notebook or Python script:
```bash
jupyter notebook Crime_EDA.ipynb
```

## How to Use πŸ§‘β€πŸ’»
1. Load the dataset from the provided CSV file in the `data/` folder.
2. Follow the notebook to perform EDA and visualize the results.
3. Explore the plots and heatmaps to gain insights into crime trends and victim demographics.

## Conclusion πŸ“Œ
The analysis provides valuable insights into crime patterns in Los Angeles, especially concerning the age and gender of victims, weapon usage, and the geographic distribution of crimes. This project can help inform policy and improve crime prevention strategies.

## Contributors πŸ™‹β€β™‚οΈ
- @Dharmendradiwaker12

---