https://github.com/infochimps-labs/big_data_for_chimps
A Seriously Fun guide to Big Data Analytics in Practice
https://github.com/infochimps-labs/big_data_for_chimps
Last synced: 6 months ago
JSON representation
A Seriously Fun guide to Big Data Analytics in Practice
- Host: GitHub
- URL: https://github.com/infochimps-labs/big_data_for_chimps
- Owner: infochimps-labs
- Created: 2012-03-17T01:30:02.000Z (about 14 years ago)
- Default Branch: master
- Last Pushed: 2015-06-15T17:17:02.000Z (about 11 years ago)
- Last Synced: 2024-05-13T14:31:29.012Z (about 2 years ago)
- Language: Ruby
- Homepage: http://infochimps.com/labs
- Size: 313 MB
- Stars: 168
- Watchers: 50
- Forks: 66
- Open Issues: 10
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-machine-master - Big Data For Chimps
- awesome-machine-learning - Big Data For Chimps
- awesome-machine-learning - Big Data For Chimps
- awesome-machine-learning - Big Data For Chimps
- fucking-awesome-machine-learning - Big Data For Chimps
- awesome-machine-learning - Big Data For Chimps
- awesome-machine-learning - Big Data For Chimps
- awesome-machine-learning-cn - 官网
- awesome-machine-learning - Big Data For Chimps
- awesome-advanced-metering-infrastructure - Big Data For Chimps
README
## Big Data for Chimps: A Seriously Fun guide to Terabyte-scale data processing
This is the work-in-progress version of the upcoming O'Reilly book, _Big Data for Chimps: A Seriously Fun guide to Hadoop and Terabyte-scale data processing_.
Our intent is to provide the best guide for _exploratory_ data analytics using Hadoop -- for data science in practice. We use high-level languages (Pig and Ruby) that make Hadoop a tool, not a framework, allowing re-use and rapid development. We'll cover enough Hadoop internals to save you from diving into the source code, and enough tuning advice to let you know where to drill deep.
In all cases, the focus is on maximizing your time and creativity -- on helping you uncover what question to ask and the right way to ask it.
O'Reilly has courageouly agreed to release the book under an http://creativecommons.org/licenses/by-nc-sa/3.0/[CC-BY-NC-SA]. To buy a physical copy of the book, or a Kindle (`.mobi`) or iOS/Nook (`.epub`), visite the early release http://shop.oreilly.com[O'Reilly bookstore] (TODO: link to early release page). Buy it now, and you'll get frequently-updated access and the final version once available.
### License
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.
Code is Apache licensed unless specifically labeled otherwise.