Jump to ratings and reviews
Rate this book

Designing Cloud Data Platforms

Rate this book
In Designing Cloud Data Platforms , Danil Zburivsky and Lynda Partner reveal a six-layer approach that increases flexibility and reduces costs. Discover patterns for ingesting data from a variety of sources, then learn to harness pre-built services provided by cloud vendors.

Summary
Centralized data warehouses, the long-time defacto standard for housing data for analytics, are rapidly giving way to multi-faceted cloud data platforms. Companies that embrace modern cloud data platforms benefit from an integrated view of their business using all of their data and can take advantage of advanced analytic practices to drive predictions and as yet unimagined data services. Designing Cloud Data Platforms is a hands-on guide to envisioning and designing a modern scalable data platform that takes full advantage of the flexibility of the cloud. As you read, you’ll learn the core components of a cloud data platform design, along with the role of key technologies like Spark and Kafka Streams. You’ll also explore setting up processes to manage cloud-based data, keep it secure, and using advanced analytic and BI tools to analyze it.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology
Well-designed pipelines, storage systems, and APIs eliminate the complicated scaling and maintenance required with on-prem data centers. Once you learn the patterns for designing cloud data platforms, you’ll maximize performance no matter which cloud vendor you use.

About the book
In Designing Cloud Data Platforms , Danil Zburivsky and Lynda Partner reveal a six-layer approach that increases flexibility and reduces costs. Discover patterns for ingesting data from a variety of sources, then learn to harness pre-built services provided by cloud vendors.

What's inside
Best practices for structured and unstructured data sets
Cloud-ready machine learning tools
Metadata and real-time analytics
Defensive architecture, access, and security

About the reader
For data professionals familiar with the basics of cloud computing, and Hadoop or Spark.

About the author
Danil Zburivsky has over 10 years of experience designing and supporting large-scale data infrastructure for enterprises across the globe. Lynda Partner is the VP of Analytics-as-a-Service at Pythian, and has been on the business side of data for over 20 years.

Table of Contents
1 Introducing the data platform
2 Why a data platform and not just a data warehouse
3 Getting bigger and leveraging the Big 3: Amazon, Microsoft Azure, and Google
4 Getting data into the platform
5 Organizing and processing data
6 Real-time data processing and analytics
7 Metadata layer architecture
8 Schema management
9 Data access and security
10 Fueling business value with data platforms

336 pages, Paperback

Published April 20, 2021

26 people are currently reading
115 people want to read

About the author

Danil Zburivsky

3 books1 follower

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
40 (59%)
4 stars
16 (23%)
3 stars
7 (10%)
2 stars
4 (5%)
1 star
0 (0%)
Displaying 1 - 12 of 12 reviews
Profile Image for Quynh Anh Phuong.
76 reviews6 followers
December 16, 2021
Good books to know why need to used cloud data platform? Compare with on-prem. Using AWS, Azure or Google Cloud to build. Detail every step and what to be aware of it. Gôd for beginner like me.
Profile Image for Gualtiero Testa.
5 reviews1 follower
April 25, 2025
**Designing Cloud Data Platforms** by Manning offers a useful overview for data engineers, software architects and experienced developers working with cloud data solutions.

The book effectively describes various architectural approaches, providing a balance of theoretical explanation and practical considerations. The inclusion of pros and cons for each architecture is a helpful element.

The identification and description of relevant product offerings from AWS, Azure, and Google Cloud are also beneficial, given the extensive range of services available.

The book addresses practical challenges such as data deduplication and file organization for batch processing with a pragmatic perspective.

The authors' experience in the field is apparent and contributes to a knowledgeable presentation of the material.

In conclusion, "Designing Cloud Data Platforms" is a warmly recommended read for all architects and developers looking to build robust and scalable data solutions in the cloud. It offers a strong blend of theory, practical guidance, and real-world insights that make it an invaluable addition to any technical library.
2 reviews2 followers
February 27, 2021
If you are interested on how cloud data platforms work today, this book definitely for you. The author deep dive into how the data is ingested or collected to our system, go through different pipeline and procedure, and produced interesting data in the end.
Profile Image for Wojtek Gawroński.
123 reviews45 followers
August 26, 2020
I know it's still in MEAP program from Manning, but it's already a great book.

A lot of ground covered in a very detailed way, looking forward to a full version.
Profile Image for Maximiliano Curia.
41 reviews1 follower
August 15, 2022
I like that it actually has an architectural view that can be applied using a broad number of technologies and uses, ant it's refreshing to have a book that has little amount of buzz. While reading the book I had a couple of aha moments about my experiences, things that could be improved, etc.

One problem with this book is that it mentions a lot of technologies and sometimes it goes into the details of them, also modifying the architectures to take into account specifics of a particular technology, and while there is value in having the implementation connection of the presented contents, it makes a large part of the contents specific for the current time frame.
Profile Image for Roman Mkrtchian.
44 reviews2 followers
September 18, 2023
The book is thoroughly detailed. It showcases a modern, loosely-coupled architecture that the authors developed after their experience with numerous data platforms.

Every layer of the architecture is explained comprehensively, ensuring readers can get started while sidestepping common pitfalls. The authors delve into potential tools for each layer, discussing options from the three leading cloud vendors as well as insights into implementing certain components in-house.
Profile Image for Łukasz Słonina.
124 reviews25 followers
April 17, 2022
A comprahensive, clear, theoretical guide. It provides a solid introduction to architectures and tools across the entire spectrum of cloud data solutions. Authors present batch and real-time processing on Azure, Google and Amazon cloud platforms. Book is complete when it comes to designing data platform. More for beginners, but still well written and allowes to organize your knowledge.
Profile Image for Mikhail Filatov.
381 reviews18 followers
September 11, 2023
This book can be a reasonable introduction for readers not aware of Data Lake architecture.
But if you know basics already, I don’t think you will learn a lot of new stuff. Their “architecture diagram” , were they tried to put all batch and real time components together is completely unusable (for me). Not enough practical examples and e2e scenarios.
Profile Image for Vlad Bezden.
240 reviews13 followers
January 3, 2022
Excellent book. There is so much helpful information that I will reread. This book is very beneficial for software architects and developers who will build cloud data-driven applications.
Displaying 1 - 12 of 12 reviews

Can't find what you're looking for?

Get help and learn more about the design.