An open API service indexing awesome lists of open source software.

https://github.com/RIPE-NCC/hadoop-pcap

Hadoop library to read packet capture (PCAP) files
https://github.com/RIPE-NCC/hadoop-pcap

Last synced: 7 months ago
JSON representation

Hadoop library to read packet capture (PCAP) files

Awesome Lists containing this project

README

          

Hadoop PCAP library
===================

License
-------
This library is distributed under the LGPL.
See: https://raw.github.com/RIPE-NCC/hadoop-pcap/master/LICENSE

Repository
----------

UPDATE: Since Bintray has been discontinued, the latest releases of hadoop-pcap are not available there, and you have to build them from source.



hadoop-pcap
http://dl.bintray.com/hadoop-pcap/hadoop-pcap

Screencast
----------

We have created a screencast showing the use of the Hadoop PCAP SerDe in Hive using [Amazon Elastic MapReduce](http://aws.amazon.com/elasticmapreduce/).
You can find the video on YouTube: http://www.youtube.com/watch?v=FLxeQciax-Q

Components
----------

This project consists of two components:

### Library

Bundles the code used to read PCAPs. Can be used within MapReduce jobs to natively read PCAP files.
See: https://github.com/RIPE-NCC/hadoop-pcap/tree/master/hadoop-pcap-lib

### SerDe

Implements a Hive Serializer/Deserializer (SerDe) to query PCAPs using SQL like commands.
See: https://github.com/RIPE-NCC/hadoop-pcap/tree/master/hadoop-pcap-serde