https://github.com/londheshubham153/disaster-response-pipelines

A machine learning pipeline to categorize emergency messages based on the needs communicated by the sender.
https://github.com/londheshubham153/disaster-response-pipelines

Last synced: 9 days ago
JSON representation

A machine learning pipeline to categorize emergency messages based on the needs communicated by the sender.

Host: GitHub
URL: https://github.com/londheshubham153/disaster-response-pipelines
Owner: LondheShubham153
Created: 2020-05-13T03:49:22.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2020-05-14T08:05:38.000Z (almost 5 years ago)
Last Synced: 2025-04-01T19:17:42.002Z (29 days ago)
Language: Jupyter Notebook
Size: 2.39 MB
Stars: 3
Watchers: 2
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Disaster-Response-Pipelines

## Required libraries
- nltk 3.3.0
- numpy 1.15.2
- pandas 0.23.4
- scikit-learn 0.20.0
- sqlalchemy 1.2.12

## Motivation

In this project, It will provide disaster responses to analyze data from [Figure Eight](https://www.figure-eight.com/) to build a model for an API that classifies disaster messages.

This project will include a web app where an emergency worker can input a new message and get classification results in several categories. The web app will also display visualizations of the data.

Below are a few screenshots of the web app.

disaster response project web app

## Files

- ETL Pipeline Preparation.ipynb: Description for workspace/data/process_data.py
- ML Pipeline Preparation.ipynb: Description for workspace/model/train_classifier.py
- workspace/data/process_data.py: A data cleaning pipeline that:
- Loads the messages and categories datasets
- Merges the two datasets
- Cleans the data
- Stores it in a SQLite database
- workspace/model/train_classifier.py: A machine learning pipeline that:
- Loads data from the SQLite database
- Splits the dataset into training and test sets
- Builds a text processing and machine learning pipeline
- Trains and tunes a model using GridSearchCV
- Outputs results on the test set
- Exports the final model as a pickle file

## Acknowledgements

I wish to thank [Figure Eight](https://www.figure-eight.com/) for dataset, and thank [Udacity](https://www.udacity.com/) for advice and review.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/londheshubham153/disaster-response-pipelines

Awesome Lists containing this project

README