Jump to ratings and reviews
Rate this book

Parallel R: Data Analysis in the Distributed World

Rate this book
It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product―unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You’ll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don’t. With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

126 pages, Paperback

First published January 1, 2011

4 people are currently reading
9 people want to read

About the author

Q. Ethan McCallum

5 books4 followers

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
2 (9%)
4 stars
8 (36%)
3 stars
8 (36%)
2 stars
3 (13%)
1 star
1 (4%)
Displaying 1 - 2 of 2 reviews
4 reviews
Read
March 19, 2020
Read snow multi core and parallel packages
Profile Image for liyang.
3 reviews1 follower
January 19, 2012
A good big picture of parallel R by introducing multiple packages for parallel and distributed computing, including snow, multicore, parallel and a little about R+Hadoop, RHIPE etc. However, it's just a big picture. Besides, it's more like a simple how-to manual, and lack of details.
Displaying 1 - 2 of 2 reviews

Can't find what you're looking for?

Get help and learn more about the design.