Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/prakharchoudhary/tale-o-regression

Understanding regression.
https://github.com/prakharchoudhary/tale-o-regression

boston-housing-dataset machine-learning python regression regression-analysis

Last synced: 2 days ago
JSON representation

Understanding regression.

Awesome Lists containing this project

README

        

# Tale-O-Regression
Understanding regression analysis.

## Introduction
Regression models are used to predict target variables on a continuous scale, which makes them attractive for addressing many questions in science as well as
applications in industry, such as understanding relationships between variables, valuating trends, or making forecasts.

## Our Approach
• Exploring and visualizing datasets

• Looking at different approaches to implement linear regression models

• Training regression models that are robust to outliers

• Evaluating regression models and diagnosing common problems

• Fitting regression models to nonlinear data

## Dataset Used
We will use the __Housing Dataset__, which contains information about houses in the suburbs of Boston collected by D. Harrison and D.L. Rubinfeld in 1978. The Housing Dataset has been made freely available and can be downloaded from the UCI machine
learning repository at https://archive.ics.uci.edu/ml/machine-learning-databases/housing/housing.data.

### Features
• __CRIM__: This is the per capita crime rate by town

• __ZN__: This is the proportion of residential land zoned for lots larger than
25,000 sq.ft.

• __INDUS__: This is the proportion of non-retail business acres per town

• __CHAS__: This is the Charles River dummy variable (this is equal to 1 if tract
bounds river; 0 otherwise)

• __NOX__: This is the nitric oxides concentration (parts per 10 million)

• __RM__: This is the average number of rooms per dwelling

• __AGE__: This is the proportion of owner-occupied units built prior to 1940

• __DIS__: This is the weighted distances to five Boston employment centers

• __RAD__: This is the index of accessibility to radial highways

• __TAX__: This is the full-value property-tax rate per $10,000

• __PTRATIO__: This is the pupil-teacher ratio by town

• __B__: This is calculated as 1000(Bk - 0.63)^2, where Bk is the proportion of
people of African American descent by town

• __LSTAT__: This is the percentage lower status of the population

• __MEDV__: This is the median value of owner-occupied homes in $1000s