https://github.com/lmizner/codecademy_auto_evaluation

Summarized categorical variables in Python using numerical summary statistics.

categorical-data frequency jupyter-notebook median numpy pandas proportion python


# codecademy_auto_evaluation

### Summarizing Automobile Evaluation Data

In this project you’ll use what you’ve learned about summarizing categorical data to analyze a sample from a popular open-source dataset. The dataset contains information on the cost and physical attributes of several thousand cars. It was originally used to train a classification model that assigned an acceptability score/category to each car based on these attributes.

The car evaluation dataset is sourced from the UCI Machine Learning Repository and has been slightly modified for this project: one additional field, `manufacturer_country`, has been simulated for illustrative purposes. You can read more about the details, features, and original uses of this dataset on the UCI data description page.
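
As a rough illustration of the kind of summary this project works toward, the sketch below computes a frequency table, proportions, and the mode for a nominal variable, and a median category for an ordinal one. It is not the notebook's actual code. The rows are made up, and the `buying_cost` column name and its level ordering are assumptions chosen for the example. Only `manufacturer_country` is named in the description above.

```python
import numpy as np
import pandas as pd

# Hypothetical sample rows; the real project loads the modified UCI data instead.
cars = pd.DataFrame({
    "buying_cost": ["low", "med", "high", "vhigh", "med", "low", "med"],
    "manufacturer_country": ["Japan", "USA", "Japan", "Germany", "USA", "Japan", "Japan"],
})

# Nominal variable: frequency table, proportions, and modal category.
country_counts = cars["manufacturer_country"].value_counts()
country_props = cars["manufacturer_country"].value_counts(normalize=True)
modal_country = country_counts.idxmax()

# Ordinal variable: declare the category order (assumed here), then take the
# median of the integer codes and map it back to a label.
cost_levels = ["low", "med", "high", "vhigh"]
cost = pd.Categorical(cars["buying_cost"], categories=cost_levels, ordered=True)
median_code = int(np.median(cost.codes))
median_cost = cost_levels[median_code]

print(country_counts)
print(country_props.round(3))
print("Modal country:", modal_country)
print("Median buying cost:", median_cost)
```

The median is taken on the category codes rather than the labels because a median only makes sense once the categories have an explicit order. With an even number of rows the median of the codes can fall between two categories, so `int(...)` would round down in that case.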