https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
https://github.com/naliferopoulos/datamining
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 5 months ago
JSON representation
Bring your own pickaxe.
- Host: GitHub
- URL: https://github.com/naliferopoulos/datamining
- Owner: naliferopoulos
- License: gpl-2.0
- Created: 2018-12-27T18:26:25.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-01-20T11:58:22.000Z (over 7 years ago)
- Last Synced: 2024-12-30T07:24:52.833Z (over 1 year ago)
- Topics: aueb, aueb-students, data, data-mining, machine-learning, machine-learning-algorithms, mining, random-forest
- Language: Python
- Size: 18.6 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# DataMining
### Outline
This repository contains a data mining solution to a problem assigned to students of an Athens University of Economics & Bussiness. The project assignment unfortunatelly is restricted to students-only, but an outline can be found on the [Kaggle Competition](https://www.kaggle.com/c/11851/) used for tracking progress of student teams.
The team members were the following:
* [Nick Aliferopoulos](https://github.com/naliferopoulos)
* [Athanassios Thalassinos](https://github.com/ch3ckm8)
* [Georgina Karakasilioti](https://github.com/baby-pink)
The project was fullfilled using Random Forests for classification, which were programmatically optimized and tweaked to achieve sweetspot results on the classification. More meaningful features were also extracted from the dataset. Finally, using the Kaggle API, an auto-submitter was implemented.
### License
Any code you find in this repository is under the [GPL](https://en.wikipedia.org/wiki/GNU_General_Public_License) License. Please do not use our hard work without attribution. Also, if you are a class student, we advise you to implement your own solution to the problem. We also feel the need to highlight that this project was research-oriented, meaning we did not pick the best/most accurate Machine Learning algorithm, we picked the one we thought our knowledge would benefit the most from.
### Footnote
Special thanks to the [Thinking Cup](https://www.thinkingcup.gr) which provided us with a steady coffee flow throughout the project implementation.