https://github.com/fbukevin/hadoop-cooccurrence
This is a hadoop project implementing words co-occurrence algorithm.
https://github.com/fbukevin/hadoop-cooccurrence
Last synced: about 1 year ago
JSON representation
This is a hadoop project implementing words co-occurrence algorithm.
- Host: GitHub
- URL: https://github.com/fbukevin/hadoop-cooccurrence
- Owner: fbukevin
- Created: 2014-09-09T10:52:20.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2014-09-10T07:38:05.000Z (almost 12 years ago)
- Last Synced: 2025-04-04T20:22:12.947Z (about 1 year ago)
- Language: Java
- Size: 2.04 MB
- Stars: 4
- Watchers: 1
- Forks: 5
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Introduction
===
This is a project that implements co-occurrence of text words algorithm with Hadoop. Here we follow the tutorial of book implementing Pairs and Stripes algorithm.
Usage
===
`$ yarn jar .jar [pairs | stripes] `
tutorial
===
* Book - [Data-Intensive Processing with MapReduce](http://www.amazon.com/Data-Intensive-Processing-MapReduce-Synthesis-Technologies/dp/1608453421)
* Original Blog - [Calculating a Co-Occurrence Matrix With Hadoop](http://codingjunkie.net/cooccurrence/)
* Adapter to this repo - [以 Hadoop 計算共現矩陣](http://veck.logdown.com/posts/231257-hadoop-co-occurrence-matrix-calculation)