https://github.com/uragirii/machine-learning-practice
This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.
https://github.com/uragirii/machine-learning-practice
breast-cancer-wisconsin datasets housing-prices iris knn logistic-regression machine-learning titanic-kaggle uci-machine-learning
Last synced: 13 days ago
JSON representation
This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.
- Host: GitHub
- URL: https://github.com/uragirii/machine-learning-practice
- Owner: uragirii
- Created: 2019-07-13T09:58:45.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-07-13T10:11:48.000Z (almost 7 years ago)
- Last Synced: 2025-02-24T12:16:02.448Z (over 1 year ago)
- Topics: breast-cancer-wisconsin, datasets, housing-prices, iris, knn, logistic-regression, machine-learning, titanic-kaggle, uci-machine-learning
- Language: Python
- Size: 12.7 KB
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Machine-Learning-Practice
This repository consists of all different algorithms I applied on the various Datasets. This repository consists of simple python code for working on common datasets.
## Datasets:
I've downloaded datasets from [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php) and [Kaggle](https://www.kaggle.com). Datasets belong to respective owners. Further Information about each can be found on their website.
I've worked on :
- [Adult Salary Dataset](https://archive.ics.uci.edu/ml/datasets/Adult)
- [Air Quality Index](https://archive.ics.uci.edu/ml/datasets/Air+quality)
- [Bank Marketing Dataset](https://archive.ics.uci.edu/ml/datasets/Bank+Marketing)
- [Breast Cancer Dataset](https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29)
- [Boston Housing Dataset](https://www.kaggle.com/c/boston-housing)
- [Titanic Surivor Dataset](https://www.kaggle.com/c/titanic)
## Additional Information:
The dataset must be downloaded and saved in the directory `Datasets`. Each Dataset has a different file and sometimes different algorithms are applied on same datasets for checking the various accuracy percentage. Please Note that these are simple and very popular datasets and preprocessing is either done by the code or they are loaded preprocessed from `scikit-learn` library.
## Requirements:
Depending on the files, the requirements may change but common requirements are:
- `Pandas` (for DataFrame and data storage)
- `NumPy` (for calculations)
- `Matplotlib` (for visualizing the output of algorithms)
_I recommend opening files in Spyder IDE as many files may not work as Scripts_