https://github.com/tomwhite/hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
- Host: GitHub
- URL: https://github.com/tomwhite/hadoop-book
- Owner: tomwhite
- Created: 2009-07-13T10:13:48.000Z (over 15 years ago)
- Default Branch: master
- Last Pushed: 2020-03-17T05:11:30.000Z (over 4 years ago)
- Last Synced: 2024-10-01T11:04:15.225Z (about 1 month ago)
- Topics: book, hadoop, o-reilly
- Language: Makefile
- Homepage: http://www.hadoopbook.com/
- Size: 2.54 MB
- Stars: 3,502
- Watchers: 449
- Forks: 2,569
- Open Issues: 18
Metadata Files:
- Readme: README.md
README
# Hadoop Book Example Code
This repository contains the example code for [Hadoop: The Definitive Guide, Fourth Edition](http://shop.oreilly.com/product/0636920033448.do)
by Tom White (O'Reilly, 2014).

Code for the [First], [Second], and [Third] Editions is also available.

Note that the chapter names and numbering have changed between editions; see
[Chapter Numbers By Edition](https://github.com/tomwhite/hadoop-book/wiki/Chapter-Numbers-By-Edition).

[First]: http://github.com/tomwhite/hadoop-book/tree/1e
[Second]: http://github.com/tomwhite/hadoop-book/tree/2e
[Third]: http://github.com/tomwhite/hadoop-book/tree/3e

## Building and Running
To build the code, first install Maven and Java. Then type:
```bash
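# Optional pre-flight check (an assumption, not part of the book's instructions):
# confirm that Maven and Java are installed and on the PATH before building.
% mvn --version
% java -version
# Then run the build: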
% mvn package -DskipTests
```

This will do a full build and create example JAR files in the top-level directory (e.g.
`hadoop-examples.jar`).

To run the examples from a particular chapter, first install the component
needed for the chapter (e.g. Hadoop, Pig, Hive, etc.), then run the command lines shown
in the chapter.

Sample datasets are provided in the [input](input) directory, but the full weather dataset
is not contained there due to size restrictions. You can find information about how to obtain
the full weather dataset on the book's website at [http://www.hadoopbook.com/](http://www.hadoopbook.com/).

## Hadoop Component Versions
This edition of the book works with Hadoop 2. It has not been tested extensively with
Hadoop 1, although most of it should work.

For the precise versions of each component that the code has been tested with, see
[book/pom.xml](book/pom.xml).

## Copyright
Copyright (C) 2014 Tom White