Rate this book

Data Algorithms: Recipes for Scaling Up with Hadoop and Spark

Name: Data Algorithms: Recipes for Scaling Up with Hadoop and Spark
Rating: 3.52 (2 reviews)
ISBN: 9781491906187

Mahmoud Parsian

Rate this book

Learn the algorithms and tools you need to build MapReduce applications with Hadoop for processing gigabyte, terabyte, or petabyte-sized datasets on clusters of commodity hardware. With this practical book, Author Mahmoud Parsian, head of the big data team at Illumina, takes you step-by-step through the design of machine-learning algorithms, such as Naive Bayes and Markov Chain, and shows you how apply them to clinical and biological datasets, using MapReduce design patterns.Apply MapReduce algorithms to clinical and biological data, such as DNA-Seq and RNA-SeqUse the most relevant regression/analytical algorithms used for different biological data types

GenresComputersComputer Science

778 pages, Paperback

First published August 1, 2014

11 people are currently reading

82 people want to read

About the author

Mahmoud Parsian

7 books1 follower

What do you think?

Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars

5 (18%)

4 stars

10 (37%)

3 stars

8 (29%)

2 stars

2 (7%)

1 star

2 (7%)

Displaying 1 - 2 of 2 reviews

Xianshun Chen

90 reviews4 followers

February 3, 2021

Very nice book which teaches how to implement mechine learning and data mining techniques such as NBC, recommender, clustering, etc. Implemented in java, the book provides codes in both hadoop mapreduce and apache spark in simple-to-understand and clean manner. Have re-coded most of the algorithms in the book except for chapters dealing with some of the bio stuff which i am not particularly interested at the moment.