https://github.com/cbuntain/replaytweetstream

# replayTweetStream

Given a gzipped file of tweets, replay them as though you were streaming them from Twitter's data stream.

Currently, it supports only a single gzipped file and replays at per-second granularity, with the gaps between messages determined by their `created_at` timestamps.
The code assumes tweet files are temporally ordered.
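
To make the timing behavior concrete, here is a minimal Python sketch of the idea. It is an illustration, not this project's code. It assumes one JSON tweet per line and the classic `created_at` layout (for example `Wed Oct 10 20:19:24 +0000 2018`). The names `replay`, `emit`, and `sleep` are hypothetical.

```python
import gzip
import json
import time
from datetime import datetime

# Assumed timestamp layout, e.g. "Wed Oct 10 20:19:24 +0000 2018".
CREATED_AT_FORMAT = "%a %b %d %H:%M:%S %z %Y"


def replay(path, emit, sleep=time.sleep):
    """Replay a gzipped file of tweets (assumed: one JSON object per line).

    Calls emit(line) for every non-empty line, sleeping between tweets for
    the gap between consecutive created_at values. `emit` and `sleep` are
    hypothetical hooks introduced for this sketch.
    """
    previous = None
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            created_at = json.loads(line).get("created_at")
            if created_at is not None:
                current = datetime.strptime(created_at, CREATED_AT_FORMAT)
                if previous is not None:
                    gap = (current - previous).total_seconds()
                    # The file is assumed to be temporally ordered, so a
                    # non-positive gap simply means "send immediately".
                    if gap > 0:
                        sleep(gap)
                previous = current
            emit(line)
```

For example, `replay("tweets.json.gz", print)` would print each tweet to the console with the original pacing (the file name is a placeholder). Lines without a `created_at` field are emitted without any delay.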

The code was recently updated to support configurable senders. To send tweets to a new destination, implement the `Sender` interface. I've included two implementations, one that sends to the console and one that writes to a server socket, and both seem to work.
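
The sender idea can be sketched as follows. This is a hypothetical Python rendering, not the project's actual `Sender` interface, whose method names and signatures are not shown here. The class names, the `send`/`close` methods, and the newline-delimited framing are all assumptions made for illustration.

```python
import socket
import sys
from abc import ABC, abstractmethod


class Sender(ABC):
    """Hypothetical sender contract: deliver one message at a time."""

    @abstractmethod
    def send(self, message: str) -> None:
        ...

    def close(self) -> None:
        """Release any resources; the default is a no-op."""


class ConsoleSender(Sender):
    """Writes each message as one line to a text stream (stdout by default)."""

    def __init__(self, stream=None):
        self.stream = stream if stream is not None else sys.stdout

    def send(self, message: str) -> None:
        self.stream.write(message + "\n")
        self.stream.flush()


class ServerSocketSender(Sender):
    """Listens on a TCP port and writes newline-delimited messages to the
    first client that connects."""

    def __init__(self, host="127.0.0.1", port=0):
        # port=0 asks the OS for a free port; the chosen one is in self.port.
        self.server = socket.create_server((host, port))
        self.port = self.server.getsockname()[1]
        self.client = None

    def send(self, message: str) -> None:
        if self.client is None:
            # Blocks until a client has connected.
            self.client, _ = self.server.accept()
        self.client.sendall((message + "\n").encode("utf-8"))

    def close(self) -> None:
        if self.client is not None:
            self.client.close()
        self.server.close()
```

Because every sender exposes the same `send` method, the replay loop does not need to know where the tweets go. A sender for a new destination only has to subclass `Sender`.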

I've also tested this streaming capability with Spark Streaming over a socket and achieved good results.

## Todo Items

- Add support for multiple files
- Add support for streaming files from HDFS