https://github.com/codlocker/cse-572-data-mining
Homework Solution for CSE 572
https://github.com/codlocker/cse-572-data-mining
Last synced: 7 days ago
JSON representation
Homework Solution for CSE 572
- Host: GitHub
- URL: https://github.com/codlocker/cse-572-data-mining
- Owner: codlocker
- Created: 2023-09-11T12:15:28.000Z (almost 2 years ago)
- Default Branch: master
- Last Pushed: 2023-11-20T18:01:04.000Z (over 1 year ago)
- Last Synced: 2024-10-08T02:50:44.874Z (9 months ago)
- Language: Jupyter Notebook
- Size: 4.86 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: Readme.md
Awesome Lists containing this project
README
## HW - 2
For Task 1, the solutions for problems (1), (2), (3) and (4) are shared as follows:
- (1) Here is the [Code](./Titanic_Dataset_HW2.ipynb)
- (2) The decision tree looks as follows

- (3) Here is the [Code](./Titanic_Dataset_HW2.ipynb)
- (4) Here is the [Code](./Titanic_Dataset_HW2.ipynb)## HW - 3
#### TASK 1
Code + Answers are in the following notebook : [Code](https://github.com/codlocker/CSE-572-Data-Mining/blob/master/HW3-Task1.ipynb)
I have attached the Notebook as a PDF below, but Jupyter does not seem to do proper word wrapping for the PDF so kindly refer to the Github link for clarity.
For executing the code, here are the instructions.
- Install Python 3 and create a virtual environment [venv](https://docs.python.org/3/library/venv.html)
- Install Jupyter notebook [Project Jupyter | Installing Jupyter](https://jupyter.org/install)
- Ensure the following packages are installed (or you can install jupyter, numpy, scikit, surprise and matplotlib directly using conda or pip)
- scikit-image==0.19.2
- scikit-learn==1.0.2
- scikit-learn-intelex ==2021.6.0
- pandas ==1.4.4
- numpy==1.21.5
- numpy-base==1.21.5
- numpy==1.4.0
- matplotlib==3.5.2
- matplotlib-base==3.5.2
- matplotlib-inline==0.1.6
- The Dataset should the located in this relative path : __datasets/kmeans_data/data.csv__ and __datasets/kmeans_data/label.csv__#### TASK 2
Code + Answers are in the following notebook : [Code](https://github.com/codlocker/CSE-572-Data-Mining/blob/master/HW3-Task2.ipynb)
I have attached the Notebook as a PDF below, but Jupyter does not seem to do proper word wrapping for the PDF so kindly refer to the Github link for clarity.
For executing the code, here are the instructions.
- Install Python 3 and create a virtual environment - [venv](https://docs.python.org/3/library/venv.html)
- Install Jupyter notebook [Project Jupyter | Installing Jupyter](https://jupyter.org/install)
- Ensure the following packages are installed (or you can install jupyter, numpy, scikit, surprise and matplotlib directly using conda or pip)
- scikit-image==0.19.2
- scikit-learn==1.0.2
- scikit-learn-intelex ==2021.6.0
- scikit-surprise==1.1.3
- surprise==0.1
- pandas ==1.4.4
- numpy==1.21.5
- numpy-base==1.21.5
- numpy==1.4.0
- matplotlib==3.5.2
- matplotlib-base==3.5.2
- matplotlib-inline==0.1.6
- The Dataset should the located in this relative path : __datasets/MovieRating__.