https://github.com/blend/social-modeling-starter

social-modeling-starter
=======================

Facebook social data modeling with Scala, HBase, and HPaste.

Includes accessing the Facebook Graph API in Scala via
[Dispatch](http://dispatch.databinder.net/Dispatch.html) and [Lift
JSON](https://github.com/lift/framework/tree/master/core/json), and
using [HPaste](https://github.com/GravityLabs/HPaste) to access HBase
and define MapReduce jobs.

Slides at http://blendlabsinc.com/blend-labs-tech-talk-stanford-may-2012.pdf.

Setup
-----------------------

Visit https://developers.facebook.com/tools/explorer?method=GET&path=me
and click "Get Access Token". Select the following permissions:

* User Data Permissions
    * user_groups
    * user_likes
* Friends Data Permissions
    * friends_groups
    * friends_likes

Click "Get Access Token", then "Allow". Copy the access token and, in
your console, export it by pasting it after the equals sign:

    export FACEBOOK_ACCESS_TOKEN=
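
Since the later steps presumably read the token from the environment, it can help to confirm that the variable is actually set in the current shell before going further. The following is a minimal sketch; `check_facebook_token` is a hypothetical helper and is not part of this repository.

```shell
# Hypothetical helper (not shipped with this project): fail early when the
# FACEBOOK_ACCESS_TOKEN variable is missing or empty.
check_facebook_token() {
  if [ -z "${FACEBOOK_ACCESS_TOKEN:-}" ]; then
    echo "FACEBOOK_ACCESS_TOKEN is not set; export it first" >&2
    return 1
  fi
  # Print only the length so the token itself never lands in the terminal log.
  echo "FACEBOOK_ACCESS_TOKEN is set (${#FACEBOOK_ACCESS_TOKEN} characters)"
}
```

Running `check_facebook_token` after the export only confirms the variable is non-empty; it does not verify that Facebook still accepts the token.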

Download Cloudera's HBase distribution (CDH4 Beta 2) at
http://archive.cloudera.com/cdh4/cdh/4/hbase-0.92.1-cdh4.0.0b2.tar.gz.

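Unpack the archive and, from the extracted directory, start HBase and
create the person table with the info and like column families:

    bin/start-hbase.sh
    sleep 5
    bin/hbase shell
    hbase> create 'person', 'info', 'like'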
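
The fixed `sleep 5` above assumes HBase finishes starting within five seconds. On a slower machine, polling may be more reliable. The sketch below is a generic retry loop; `wait_for` is a hypothetical helper, not part of this repository, and the readiness probe passed to it is up to you.

```shell
# Hypothetical helper: run a command repeatedly until it succeeds.
# Usage: wait_for ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
wait_for() {
  attempts=$1
  delay=$2
  shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    # Discard the probe's output; only its exit status matters.
    if "$@" >/dev/null 2>&1; then
      return 0
    fi
    sleep "$delay"
    i=$((i + 1))
  done
  return 1
}
```

For example, `wait_for 30 1 curl -sf http://localhost:60010/` could replace the `sleep 5`, assuming the HBase master web UI is reachable on port 60010 (the usual default for HBase releases of this era; adjust it if your configuration differs).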

To load your Facebook data, run:

    ./bin/load.sh

Two MapReduce calculations are included: "friend likes" and "top
likes". To execute these, run:

    ./bin/topLikes.sh
    ./bin/friendLikes.sh
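
This README does not spell out what each job computes. As a rough illustration only, and assuming "top likes" means ranking liked pages by how many people like them, the same counting can be sketched on a small local file. The input format (tab-separated person and like) and the `top_likes` function are hypothetical; this is not the project's MapReduce code.

```shell
# Hypothetical local sketch: read "person<TAB>like" lines on stdin and print
# "count<TAB>like", most-liked first (ties broken alphabetically by like).
top_likes() {
  awk -F '\t' '
    { count[$2]++ }                                        # tally each like
    END { for (like in count) print count[like] "\t" like }
  ' | LC_ALL=C sort -t "$(printf '\t')" -k1,1nr -k2,2
}
```

Piping a small sample through it, for instance `printf 'alice\tScala\nbob\tScala\nbob\tHBase\n' | top_likes`, lists Scala (2) ahead of HBase (1).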