https://github.com/s/bigdataanalysis
A project to analyze Enron Email Dataset with MapReduce pattern
- Host: GitHub
- URL: https://github.com/s/bigdataanalysis
- Owner: s
- Created: 2015-12-12T22:57:10.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2015-12-19T08:05:03.000Z (over 10 years ago)
- Last Synced: 2025-04-02T22:27:50.741Z (about 1 year ago)
- Language: Java
- Homepage:
- Size: 17.6 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# BigDataAnalysis
This project was developed for the Big Data Analysis course in the [Computer Engineering Department](http://ce.yildiz.edu.tr) of [YTU](http://yildiz.edu.tr). It analyzes the Enron Email Dataset using the MapReduce pattern.
## Functions
- Counting the mails sent by each person
- Counting the words in each person's mails
- Finding the 30 most used words for each person
- Counting the words across all mails
- Clustering people with the k-means clustering algorithm, using each person's 30 most used words as features
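
The counting functions above all follow the same map/reduce shape: emit a key per record, then sum per key. The sketch below illustrates that shape in plain Java, without Hadoop, and is not the code from this repository. The class `MailStatsSketch`, the `Mail` type, every method name, and the tokenization rule (lowercase runs of ASCII letters) are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Illustrative sketch only: all names here are hypothetical.
class MailStatsSketch {

    // One parsed message: who sent it and its body text.
    static final class Mail {
        final String sender;
        final String body;
        Mail(String sender, String body) { this.sender = sender; this.body = body; }
    }

    // Map step helper: split a body into lowercase words.
    // Assumed tokenization: runs of ASCII letters, everything else is a separator.
    static List<String> tokenize(String body) {
        List<String> words = new ArrayList<>();
        for (String w : body.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (!w.isEmpty()) {
                words.add(w);
            }
        }
        return words;
    }

    // Mail count per person: key = sender, value = 1, reduce = sum.
    static Map<String, Integer> mailCounts(List<Mail> mails) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Mail m : mails) {
            counts.merge(m.sender, 1, Integer::sum);
        }
        return counts;
    }

    // Word count per person: key = (sender, word), value = 1, reduce = sum.
    static Map<String, Map<String, Integer>> wordCountsPerPerson(List<Mail> mails) {
        Map<String, Map<String, Integer>> counts = new TreeMap<>();
        for (Mail m : mails) {
            Map<String, Integer> perPerson =
                    counts.computeIfAbsent(m.sender, k -> new TreeMap<>());
            for (String w : tokenize(m.body)) {
                perPerson.merge(w, 1, Integer::sum);
            }
        }
        return counts;
    }

    // The n most used words, highest count first; ties are broken alphabetically.
    static List<String> topWords(Map<String, Integer> wordCounts, int n) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(wordCounts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        List<String> top = new ArrayList<>();
        for (int i = 0; i < Math.min(n, entries.size()); i++) {
            top.add(entries.get(i).getKey());
        }
        return top;
    }

    public static void main(String[] args) {
        // Made-up sample data, not taken from the Enron dataset.
        List<Mail> mails = Arrays.asList(
                new Mail("alice", "Hello world, hello Enron!"),
                new Mail("bob", "World of data"),
                new Mail("alice", "Data, data, data."));
        System.out.println(mailCounts(mails));
        wordCountsPerPerson(mails).forEach((person, counts) ->
                System.out.println(person + " -> " + topWords(counts, 30)));
    }
}
```

Calling `topWords(counts, 30)` on each person's map yields the kind of word list that the clustering step uses as features. The word count across all mails can be obtained the same way by using the word alone as the key.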