An open API service indexing awesome lists of open source software.

https://github.com/devoloper-1/los-angeles-crimes-analysis

EDA for crimes trends in LOS ANGELES from 2019 to 2024
https://github.com/devoloper-1/los-angeles-crimes-analysis

jupyter-notebook kaggle-dataset streamlit-dashboard

Last synced: 9 days ago
JSON representation

EDA for crimes trends in LOS ANGELES from 2019 to 2024

Awesome Lists containing this project

README

          

# Crimes in Los Angeles - Exploratory Data Analysis (EDA) πŸ•΅οΈβ€β™‚οΈ

## Overview πŸ“Š
Welcome to the repository for the exploratory data analysis (EDA) of the **Crimes in Los Angeles** dataset, available on [Kaggle](https://www.kaggle.com/datasets/sudhanvahg/crimes-in-las-angeles/code). This project involves data scraping, cleaning, and visualization to uncover insights about crime patterns in Los Angeles.

## Project Steps πŸš€
### 1. Data Scraping πŸ“₯
The dataset was retrieved using the Kaggle API token, ensuring an automated and up-to-date data extraction process.

### 2. Data Cleaning 🧼
- **Pandas**: Used for initial data manipulation and cleaning.
- **PyJanitor**: Assisted in further cleaning tasks, such as removing null values, renaming columns, and filtering data.

### 3. Initial EDA with LangChain API 🧠
Attempted to use the LangChain API to perform initial EDA, but it did not yield satisfactory results.

### 4. Advanced Analysis with Pandas Sketch πŸ”
Leveraged Pandas Sketch to analyze the dataframe and receive recommendations based on specific queries, leading to more insightful EDA.

### 5. Data Refinement ✨
The dataset was large and required additional cleaning. New dataframes were generated with specific re-cleansed columns, applying custom conditionals to ensure data accuracy and relevance.

### 6. Visualization with Streamlit πŸ“ˆ
The refined data was visualized using a Streamlit dashboard, providing an interactive and user-friendly interface to explore the analysis results.

## Results πŸ†
The EDA revealed significant insights into crime patterns in Los Angeles, such as:
- Trends over time ⏳
- Geographic hotspots πŸ—ΊοΈ
- Crime types and frequencies πŸ“

## Technologies Used πŸ’»
- **Kaggle API**: For data scraping.
- **Pandas**: Data manipulation and cleaning.
- **PyJanitor**: Advanced data cleaning.
- **LangChain API**: Initial EDA attempts.
- **Pandas Sketch**: Dataframe analysis and recommendations.
- **Streamlit**: Dashboard creation and data visualization.

## How to Run πŸƒβ€β™€οΈ
1. **Clone the repository**:
```bash
git clone https://github.com/DEVOLOPER-1/Los-Angeles-Crimes-Analysis.git
```
2. **Install the required packages (Easy copy and paste into ur cmd)**:
```bash
pip install -r requirements.txt
```
3. **Run the Streamlit dashboard**:
```bash
streamlit run main.py
```
4. **A Quick Overview ON the DashBoard**:

https://github.com/DEVOLOPER-1/Los-Angeles-Crimes-Analysis/assets/153318445/8e87321e-9829-4108-87cc-e27deed4bc02

## Conclusion 🎯
This project showcases a comprehensive approach to EDA, from data scraping to visualization. The use of various tools and libraries ensures thorough data cleaning and insightful analysis, making it a valuable resource for understanding crime dynamics in Los Angeles.

## License πŸ“œ
This project is licensed under the MIT License.

---
## Stay in Touch πŸ“¬
Thank you for using NeuroImg2PNG! If you have any questions or need any more help, please feel free to reach out.

https://shorturl.at/nQqEd