https://github.com/danielatomas/fcup-dm1
Project developed for Data Mining I class @FCUP
- Host: GitHub
- URL: https://github.com/danielatomas/fcup-dm1
- Owner: DanielaTomas
- Created: 2023-11-02T16:17:22.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-16T10:39:38.000Z (over 1 year ago)
- Last Synced: 2025-03-02T16:36:35.100Z (3 months ago)
- Topics: data-mining, fcup
- Language: Jupyter Notebook
- Size: 23.9 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# DM1 - FCUP
A bank is aiming to enhance its customer service quality. One particular challenge it faces is the difficulty of distinguishing good clients (those eligible for additional services) from bad clients (those requiring close monitoring to mitigate potential losses). The bank stores data about its clients, including information on their accounts (comprising transactions spanning several months), previously granted loans, and issued credit cards.
This project aims to use data mining techniques on this data set to help bank managers better understand their clients and identify which type of client makes a good candidate for a credit card.
## Task 1: Data Understanding and Preparation
This task involves summarizing and visualizing the data to provide valuable insights. Consider questions that could be interesting to check with the available data and provide answers using textual summaries or data visualization. Based on this analysis, you should also check if it is necessary to carry out any data clean-up and pre-processing steps.
## Task 2: Descriptive Modelling
This task aims to apply a clustering algorithm on a set of variables that you find helpful to provide some description of the type of clients.
## Task 3: Predictive Modelling
From the available data, you should define the data set used for the classification task at hand. Different models should be considered, and the choice of the final model should be justified.
## Task 4: Kaggle Competition
Additionally, you should submit your solution for the data set available in the Kaggle Competition - open on November 7th. Your private rank will be accounted for in the final grade.
### Dataset
