https://github.com/davidgasquez/letter-recognition

:information_source: Application of Machine Learning techniques to identify randomly distorted capital letters in the English alphabet.
https://github.com/davidgasquez/letter-recognition

letter-recognition machine-learning

Last synced: about 1 month ago
JSON representation

:information_source: Application of Machine Learning techniques to identify randomly distorted capital letters in the English alphabet.

Host: GitHub
URL: https://github.com/davidgasquez/letter-recognition
Owner: davidgasquez
License: gpl-2.0
Created: 2014-12-11T20:39:02.000Z (over 10 years ago)
Default Branch: master
Last Pushed: 2015-06-30T14:53:09.000Z (almost 10 years ago)
Last Synced: 2025-04-12T03:53:02.730Z (about 1 month ago)
Topics: letter-recognition, machine-learning
Language: R
Homepage:
Size: 426 KB
Stars: 13
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        Letter Recognition

==================

The objective is to identify each of a large number of black-and-white

rectangular pixel displays as one of the 26 capital letters in the English

alphabet. The character images were based on 20 different fonts and each

letter within these 20 fonts was randomly distorted to produce a file of

20.000 unique stimuli. Each stimulus was converted into 16 primitive numerical

attributes (statistical moments and edge counts) which were then scaled to fit

into a range of integer values from 0 through 15.

Data Set Information

--------------------

 - [Letter Recognition Data Set](https://archive.ics.uci.edu/ml/machine-learning-databases/letter-recognition/letter-recognition.names)

 - [Data Folder](https://archive.ics.uci.edu/ml/machine-learning-databases/letter-recognition/)

  

    Data Set Characteristics

    Multivariate

  

  

    Attribute Characteristics

    Integer

  

  

    Number of Attributes

    16

  

  

    Number of Instances

    20.000

  

  

    Associated Tasks

    Classification

  

Results

-------

We are going to measure the accuracy rate into the *test*

subset(4.000 instances)

| Technique      | Test Rate |

|----------------|-----------|

|            LDA |  0.8955   |

|            QDA |  0.9497   |

|            KNN |  0.9641   |

|   Tree(simple) |  0.4799   |

|        Bagging |  0.9454   |

| Random Forests |  0.9915   |

|       Boosting |  0.5805   |

|            SVM |  0.9487   |

Source

------

David J. Slate

Odesta Corporation;

1890 Maple Ave; Suite 115;

Evanston, IL 60201

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/davidgasquez/letter-recognition

Awesome Lists containing this project

README