https://github.com/codlocker/cse-572-data-mining

Homework Solution for CSE 572
https://github.com/codlocker/cse-572-data-mining

Last synced: 7 days ago
JSON representation

Homework Solution for CSE 572

Host: GitHub
URL: https://github.com/codlocker/cse-572-data-mining
Owner: codlocker
Created: 2023-09-11T12:15:28.000Z (almost 2 years ago)
Default Branch: master
Last Pushed: 2023-11-20T18:01:04.000Z (over 1 year ago)
Last Synced: 2024-10-08T02:50:44.874Z (9 months ago)
Language: Jupyter Notebook
Size: 4.86 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: Readme.md

Awesome Lists containing this project

README

        ## HW - 2

For Task 1, the solutions for problems (1), (2), (3) and (4) are shared as follows:

- (1) Here is the [Code](./Titanic_Dataset_HW2.ipynb)

- (2) The decision tree looks as follows

      

  ![Tree image](./HW2-Q1b.png)

- (3) Here is the [Code](./Titanic_Dataset_HW2.ipynb)

- (4) Here is the [Code](./Titanic_Dataset_HW2.ipynb)

## HW - 3

#### TASK 1

Code + Answers are in the following notebook : [Code](https://github.com/codlocker/CSE-572-Data-Mining/blob/master/HW3-Task1.ipynb)

I have attached the Notebook as a PDF below, but Jupyter does not seem to do proper word wrapping for the PDF so kindly refer to the Github link for clarity.

For executing the code, here are the instructions.

- Install Python 3 and create a virtual environment [venv](https://docs.python.org/3/library/venv.html)

- Install Jupyter notebook [Project Jupyter | Installing Jupyter](https://jupyter.org/install)

- Ensure the following packages are installed (or you can install jupyter, numpy, scikit, surprise and matplotlib directly using conda or pip)

  - scikit-image==0.19.2

  - scikit-learn==1.0.2

  - scikit-learn-intelex ==2021.6.0

  - pandas ==1.4.4

  - numpy==1.21.5

  - numpy-base==1.21.5

  - numpy==1.4.0

  - matplotlib==3.5.2

  - matplotlib-base==3.5.2

  - matplotlib-inline==0.1.6

- The Dataset should the located in this relative path : __datasets/kmeans_data/data.csv__ and __datasets/kmeans_data/label.csv__

#### TASK 2

Code + Answers are in the following notebook : [Code](https://github.com/codlocker/CSE-572-Data-Mining/blob/master/HW3-Task2.ipynb)

I have attached the Notebook as a PDF below, but Jupyter does not seem to do proper word wrapping for the PDF so kindly refer to the Github link for clarity.

For executing the code, here are the instructions.

- Install Python 3 and create a virtual environment - [venv](https://docs.python.org/3/library/venv.html)

- Install Jupyter notebook [Project Jupyter | Installing Jupyter](https://jupyter.org/install)

- Ensure the following packages are installed (or you can install jupyter, numpy, scikit, surprise and matplotlib directly using conda or pip)

  - scikit-image==0.19.2

  - scikit-learn==1.0.2

  - scikit-learn-intelex ==2021.6.0

  - scikit-surprise==1.1.3

  - surprise==0.1

  - pandas ==1.4.4

  - numpy==1.21.5

  - numpy-base==1.21.5

  - numpy==1.4.0

  - matplotlib==3.5.2

  - matplotlib-base==3.5.2

  - matplotlib-inline==0.1.6

- The Dataset should the located in this relative path : __datasets/MovieRating__.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/codlocker/cse-572-data-mining

Awesome Lists containing this project

README