An open API service indexing awesome lists of open source software.

https://github.com/jonad/quora_data_analysis

Exploratory data analysis and feature engineering on Quora dataset.
https://github.com/jonad/quora_data_analysis

exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset

Last synced: 3 months ago
JSON representation

Exploratory data analysis and feature engineering on Quora dataset.

Awesome Lists containing this project

README

          

## Project: Exploratory Data Analysis (EDA) and feature engineering on quora dataset.
## Project Overview
In this project, we will do some exploratory data analysis and feature engineering on [quora dataset](https://www.kaggle.com/c/quora-question-pairs/data)

## Software and Libraries
This project requires **Python 3.6** and the following Python libraries installed:
- [NumPy](http://www.numpy.org/)
- [Pandas](http://pandas.pydata.org)
- [matplotlib](http://matplotlib.org/)

## Run
In a terminal or command window, navigate to the top-level project directory `quora_data_analysis/` (that contains this README) and run one of the following commands:

```bash
ipython notebook quora_data_analysis.ipynb
```
or
```bash
jupyter notebook quora_data_analysis.ipynb
```

This will open the Jupyter Notebook software and project file in your browser.