Spark in Action by Petar Zecevic

Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework well suited to analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By including machine learning and graph processing capabilities, it makes many specialized data processing platforms obsolete. Spark's unified framework and programming model significantly lower the initial infrastructure investment, and Spark's core abstractions are intuitive for most Scala, Java, and Python developers.

Spark in Action teaches you to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem, followed by a taste of Spark's command-line interface. You then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book also introduces you to writing Spark applications using the core APIs. Next, you learn about different Spark components: how to work with structured data using Spark SQL, how to process near-real-time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, and how to apply graph algorithms on graph-shaped data using Spark GraphX. The book also gives a clear introduction to Spark clustering.

Unknown Binding

First published November 1, 2016

10 people are currently reading
71 people want to read

Ratings & Reviews

Community Reviews

5 stars: 9 (37%)
4 stars: 6 (25%)
3 stars: 8 (33%)
2 stars: 1 (4%)
1 star: 0 (0%)
Displaying 1 - 5 of 5 reviews
Lauri Savolainen
11 reviews
February 3, 2020
The author’s sense of humor and weird analogies make this a bit of a burden to read. Real-life datasets and examples are nice, though. Probably worth looking into if you are getting started with Spark with a regular developer background.
Christophe Addinquy
390 reviews, 18 followers
July 1, 2018
The book is pretty well written, with even a drop of (unexpected) humor. This, together with the well-done illustrations, makes the book an honorable member of the "in Action" series. However, I finished it with a feeling of frustration, because the examples are mostly fragments written through the CLI. I have no idea what a full-fledged "hello world" would look like! And they are only fragments because the subject is indeed very large. I think the author would have done better to leave the advanced subjects for another book, because this one is really compact!
My reading notes, in French, are here
Alex
10 reviews, 16 followers
October 19, 2016
One of the best books on Apache Spark I've read.