An open API service indexing awesome lists of open source software.

https://github.com/s/bigdataanalysis

A project to analyze Enron Email Dataset with MapReduce pattern
https://github.com/s/bigdataanalysis

Last synced: 12 months ago
JSON representation

A project to analyze Enron Email Dataset with MapReduce pattern

Awesome Lists containing this project

README

          

# BigDataAnalysis
This project is done for the course Big Data Analysis in [Computer Engineering Department](http://ce.yildiz.edu.tr) of [YTU](http://yildiz.edu.tr).

##Functions
- Finding each person's mail count
- Finding word count in each person's mails
- Finding most used 30 words for each person
- Finding word count in all mails
- Clustering persons with k-Means clustering algorithm using their most used 30 words as features.