https://github.com/michaeldorner/decisiontrees
Seminar work "Decision Trees - An Introduction" with presentation, seminar paper, and Python implementation
https://github.com/michaeldorner/decisiontrees
decision-trees paper tex theory
Last synced: 19 days ago
JSON representation
Seminar work "Decision Trees - An Introduction" with presentation, seminar paper, and Python implementation
- Host: GitHub
- URL: https://github.com/michaeldorner/decisiontrees
- Owner: michaeldorner
- License: gpl-3.0
- Created: 2014-03-30T09:55:09.000Z (about 11 years ago)
- Default Branch: master
- Last Pushed: 2016-11-23T12:18:00.000Z (over 8 years ago)
- Last Synced: 2025-03-24T15:13:08.247Z (about 1 month ago)
- Topics: decision-trees, paper, tex, theory
- Language: TeX
- Size: 24.7 MB
- Stars: 129
- Watchers: 3
- Forks: 56
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Decision Trees - An Introduction
=============
Abstract
--------------------
This project work emerges in the context of the course *Artificial Intelligence* in the winter semester 2013/2014 at [Friedrich-Alexander-University, Erlangen](http://www.fau.eu). Beside this seminar paper, an introductory presentation was conducted and an implementation for decision tree was developed. The presentation is available only in German.In the scope of this seminar paper, a small introduction to theory and application of decision trees shall be given.
After this short introduction a theoretical consideration shall guide to a practical part, which shall clarify the theoretical part by examples. The last part shall summarize and compare the introduced algorithm and shall give a small outlook to not tackled research fields of decision trees.
On the contrary to the presentation during the seminar, this seminar paper expects a basic knowledge about graph theory, complexity, and machine learning. Instead of an introduction to these underlaying topics, a deeper look inside four decision tree algorithm families shall be given: CHAID, CART, ID3, and C4.5.
The focus of all Python implementation is on classification. This limitation is not owed to the insufficient importance of regression calculating, but a wider look would push boundaries of this seminar paper.
Table of Content
--------------------
* Introduction
- What is a decision tree?
- Taxonomy
- About this paper
* Theory of Decision Trees
- Definitions
- Decision Tree Learning
- Splitting Criterion
- Stopping Criterion
- Tree Pruning
- Selected Algorithms
- Chi-squared Automatic Interaction Detector (CHAID)
- IterativeDichotomiser 3 (ID3)
- Classification And Regression Tree (CART)
- C4.5
- Discussion
- Advantages
- Disadvantages
- Outlook
- Complexity
- Missing Attributes
- Random Forests
* Summary & Conclusion
- Applications
- Programming Example
- SummaryQuicklinks
--------------------
* [Seminar Paper .pdf](https://raw.githubusercontent.com/michaeldorner/DecisionTrees/master/01_Seminar%20Paper/seminarpaper.pdf)
* [Presentation .pdf (German)](https://raw.githubusercontent.com/michaeldorner/DecisionTrees/master/02_Praesentation/praesentation.pdf)
* [Python source code](https://github.com/michaeldorner/DecisionTrees/tree/master/03_Python%20Code)