https://github.com/uragirii/machine-learning-practice

This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.
https://github.com/uragirii/machine-learning-practice

breast-cancer-wisconsin datasets housing-prices iris knn logistic-regression machine-learning titanic-kaggle uci-machine-learning

Last synced: 13 days ago
JSON representation

This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.

Host: GitHub
URL: https://github.com/uragirii/machine-learning-practice
Owner: uragirii
Created: 2019-07-13T09:58:45.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2019-07-13T10:11:48.000Z (almost 7 years ago)
Last Synced: 2025-02-24T12:16:02.448Z (over 1 year ago)
Topics: breast-cancer-wisconsin, datasets, housing-prices, iris, knn, logistic-regression, machine-learning, titanic-kaggle, uci-machine-learning
Language: Python
Size: 12.7 KB
Stars: 2
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Machine-Learning-Practice
This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.

## Datasets:

I've downloaded datasets from [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php) and [Kaggle](https://www.kaggle.com). Datasets belong to respective owners. Further Information about each can be found on their website.
I've worked on :
- [Adult Salary Dataset](https://archive.ics.uci.edu/ml/datasets/Adult)
- [Air Quality Index](https://archive.ics.uci.edu/ml/datasets/Air+quality)
- [Bank Marketing Dataset](https://archive.ics.uci.edu/ml/datasets/Bank+Marketing)
- [Breast Cancer Dataset](https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29)
- [Boston Housing Dataset](https://www.kaggle.com/c/boston-housing)
- [Titanic Surivor Dataset](https://www.kaggle.com/c/titanic)

## Additional Information:
The dataset must be downloaded and saved in the directory `Datasets`. Each Dataset has a different file and sometimes different algorithms are applied on same datasets for checking the various accuracy percentage. Please Note that these are simple and very popular datasets and preprocessing is either done by the code or they are loaded preprocessed from `scikit-learn` library.

## Requirements:
Depending on the files, the requirements may change but common requirements are:
- `Pandas` (for DataFrame and data storage)
- `NumPy` (for calculations)
- `Matplotlib` (for visualizing the output of algorithms)

_I recommend opening files in Spyder IDE as many files may not work as Scripts_

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/uragirii/machine-learning-practice

Awesome Lists containing this project

README