https://github.com/fbukevin/hadoop-cooccurrence

A Hadoop project implementing a word co-occurrence algorithm.

Introduction
===
This project computes word co-occurrence counts over text with Hadoop. It follows the book tutorial listed below and implements both the Pairs and the Stripes algorithms.
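
The difference between the two algorithms can be sketched in memory, without Hadoop. The snippet below is only an illustration and is not code from this repo: the function names, the whitespace tokenizer, and the neighbor window of 2 are all assumptions made for the example. In the Pairs approach each mapper output is one `((word, neighbor), 1)` record; in the Stripes approach each mapper output is one `(word, {neighbor: count})` map, and the reducer merges the maps element-wise. Both should yield the same co-occurrence matrix.

```python
from collections import Counter, defaultdict


def neighbors(tokens, i, window):
    """Words within `window` positions of tokens[i], excluding tokens[i] itself."""
    lo = max(0, i - window)
    hi = min(len(tokens), i + window + 1)
    return tokens[lo:i] + tokens[i + 1:hi]


def pairs_counts(lines, window=2):
    """Pairs: one ((word, neighbor), 1) record per co-occurrence, summed per key."""
    counts = Counter()
    for line in lines:
        tokens = line.split()  # assumed tokenizer: split on whitespace
        for i, word in enumerate(tokens):
            for other in neighbors(tokens, i, window):
                # Stands in for the mapper's emit plus the reducer's sum.
                counts[(word, other)] += 1
    return counts


def stripes_counts(lines, window=2):
    """Stripes: one {neighbor: count} map per word occurrence, merged per word."""
    stripes = defaultdict(Counter)
    for line in lines:
        tokens = line.split()
        for i, word in enumerate(tokens):
            stripe = Counter(neighbors(tokens, i, window))  # mapper output value
            stripes[word].update(stripe)  # reducer: element-wise sum of stripes
    return stripes


def flatten(stripes):
    """Convert stripes into pair counts so the two results can be compared."""
    return Counter({(w, n): c for w, s in stripes.items() for n, c in s.items()})


if __name__ == "__main__":
    sample = ["a b c a", "b a"]
    # Both approaches should describe the same co-occurrence matrix.
    assert flatten(stripes_counts(sample)) == pairs_counts(sample)
    print(pairs_counts(sample))
```

Pairs produces many small records, while Stripes produces fewer but larger ones; that trade-off is what the book and blog post listed below discuss.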

Usage
===
`$ yarn jar .jar [pairs | stripes] `

Tutorial
===
* Book - [Data-Intensive Text Processing with MapReduce](http://www.amazon.com/Data-Intensive-Processing-MapReduce-Synthesis-Technologies/dp/1608453421)
* Original blog post - [Calculating a Co-Occurrence Matrix With Hadoop](http://codingjunkie.net/cooccurrence/)
* Adaptation for this repo - [以 Hadoop 計算共現矩陣 (Calculating a Co-Occurrence Matrix with Hadoop, in Chinese)](http://veck.logdown.com/posts/231257-hadoop-co-occurrence-matrix-calculation)