Big Data Books

Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are (Hardcover)
by (shelved 51 times as big-data)
avg rating 3.91 — 42,268 ratings — published 2017
Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy (Hardcover)
by (shelved 50 times as big-data)
avg rating 3.87 — 29,613 ratings — published 2016
Big Data: A Revolution That Will Transform How We Live, Work, and Think (Hardcover)
by (shelved 48 times as big-data)
avg rating 3.69 — 8,666 ratings — published 2013
Hadoop: The Definitive Guide (Paperback)
by (shelved 34 times as big-data)
avg rating 3.93 — 1,012 ratings — published 2009
Designing Data-Intensive Applications (ebook)
by (shelved 30 times as big-data)
avg rating 4.70 — 10,210 ratings — published 2015
Big Data: Principles and best practices of scalable realtime data systems (Paperback)
by (shelved 27 times as big-data)
avg rating 3.82 — 490 ratings — published 2012
Learning Spark: Lightning-Fast Big Data Analysis (Kindle Edition)
by (shelved 25 times as big-data)
avg rating 3.91 — 566 ratings — published 2013
Dataclysm: Who We Are (When We Think No One's Looking)
by (shelved 20 times as big-data)
avg rating 3.73 — 12,421 ratings — published 2014
The Signal and the Noise: Why So Many Predictions Fail—But Some Don't (Hardcover)
by (shelved 17 times as big-data)
avg rating 3.97 — 51,612 ratings — published 2012
Spark: The Definitive Guide: Big Data Processing Made Simple (Kindle Edition)
by (shelved 15 times as big-data)
avg rating 4.15 — 280 ratings — published
Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking (Paperback)
by (shelved 15 times as big-data)
avg rating 4.13 — 2,613 ratings — published 2013
MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems (Paperback)
by (shelved 15 times as big-data)
avg rating 3.84 — 94 ratings — published 2012
Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale (Paperback)
by (shelved 14 times as big-data)
avg rating 4.15 — 718 ratings — published
Big Data @ Work: Dispelling the Myths, Uncovering the Opportunities (Hardcover)
by (shelved 12 times as big-data)
avg rating 3.58 — 437 ratings — published 2014
Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die (Paperback)
by (shelved 12 times as big-data)
avg rating 3.66 — 2,110 ratings — published 2013
Invisible Women: Data Bias in a World Designed for Men (Hardcover)
by (shelved 11 times as big-data)
avg rating 4.34 — 161,922 ratings — published 2019
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark (Paperback)
by (shelved 11 times as big-data)
avg rating 3.98 — 128 ratings — published
Advanced Analytics with Spark (Paperback)
by (shelved 11 times as big-data)
avg rating 3.99 — 133 ratings — published 2015
The Human Face of Big Data (Hardcover)
by (shelved 11 times as big-data)
avg rating 4.03 — 214 ratings — published 2012
Mining of Massive Datasets (Hardcover)
by (shelved 11 times as big-data)
avg rating 4.35 — 247 ratings — published 2011
The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power (Hardcover)
by (shelved 10 times as big-data)
avg rating 4.06 — 13,010 ratings — published 2018
Data Smart: Using Data Science to Transform Information into Insight (Paperback)
by (shelved 9 times as big-data)
avg rating 4.12 — 1,014 ratings — published 2013
Graph Databases (Paperback)
by (shelved 8 times as big-data)
avg rating 3.63 — 436 ratings — published 2013
Big Data Now (Kindle Edition)
by (shelved 8 times as big-data)
avg rating 3.34 — 262 ratings — published 2012
Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists (Paperback)
by (shelved 8 times as big-data)
avg rating 4.07 — 312 ratings — published 2010
Bad Data Handbook (Paperback)
by (shelved 8 times as big-data)
avg rating 3.56 — 117 ratings — published 2012
Big Data Now: Current Perspectives from O'Reilly Radar (Kindle Edition)
by (shelved 8 times as big-data)
avg rating 3.35 — 271 ratings — published 2011
Data Mesh: Delivering Data-Driven Value at Scale (Paperback)
by (shelved 7 times as big-data)
avg rating 3.77 — 357 ratings — published 2022
Streaming Systems (Paperback)
by (shelved 7 times as big-data)
avg rating 3.89 — 169 ratings — published
Programming Hive: Data Warehouse and Query Language for Hadoop (Paperback)
by (shelved 7 times as big-data)
avg rating 3.68 — 93 ratings — published 2012
Big Data and Analytics (Paperback)
by (shelved 7 times as big-data)
avg rating 4.10 — 129 ratings — published
Planning for Big Data (Kindle Edition)
by (shelved 7 times as big-data)
avg rating 3.44 — 165 ratings — published 2004
Data and Goliath: The Hidden Battles to Collect Your Data and Control Your World (Paperback)
by (shelved 7 times as big-data)
avg rating 4.00 — 3,857 ratings — published 2015
I Heart Logs: Event Data, Stream Processing, and Data Integration (Paperback)
by (shelved 7 times as big-data)
avg rating 3.85 — 390 ratings — published 2014
Elasticsearch: The Definitive Guide: A Distributed Real-Time Search and Analytics Engine (Paperback)
by (shelved 7 times as big-data)
avg rating 4.26 — 272 ratings — published 2014
Uncharted: Big Data as a Lens on Human Culture (Hardcover)
by (shelved 7 times as big-data)
avg rating 3.72 — 673 ratings — published 2013
Doing Data Science: Straight Talk from the Frontline (Paperback)
by (shelved 7 times as big-data)
avg rating 3.74 — 570 ratings — published 2013
Homo Deus: A History of Tomorrow (ebook)
by (shelved 6 times as big-data)
avg rating 4.19 — 282,735 ratings — published 2015
Cassandra: The Definitive Guide (Paperback)
by (shelved 6 times as big-data)
avg rating 3.77 — 265 ratings — published 2010
Making Sense of Stream Processing (ebook)
by (shelved 6 times as big-data)
avg rating 4.25 — 189 ratings — published
Data Science from Scratch: First Principles with Python (ebook)
by (shelved 6 times as big-data)
avg rating 3.91 — 1,133 ratings — published 2015
Real-Time Big Data Analytics: Emerging Architecture (Kindle Edition)
by (shelved 6 times as big-data)
avg rating 3.55 — 181 ratings — published 2013
Big Data For Dummies (Paperback)
by (shelved 6 times as big-data)
avg rating 3.32 — 185 ratings — published 2013
Fundamentals of Data Engineering: Plan and Build Robust Data Systems (Paperback)
by (shelved 5 times as big-data)
avg rating 4.19 — 844 ratings — published 2022
Hadoop Application Architectures (Paperback)
by (shelved 5 times as big-data)
avg rating 4.09 — 81 ratings — published 2015
Algorithms to Live By: The Computer Science of Human Decisions (Hardcover)
by (shelved 5 times as big-data)
avg rating 4.13 — 34,451 ratings — published 2016
Data Strategy: How to Profit from a World of Big Data, Analytics and the Internet of Things (Paperback)
by (shelved 5 times as big-data)
avg rating 3.78 — 410 ratings — published
Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor (Hardcover)
by (shelved 5 times as big-data)
avg rating 4.01 — 2,766 ratings — published 2018


“Search engine query data is not the product of a designed statistical experiment and finding a way to meaningfully analyse such data and extract useful knowledge is a new and challenging field that would benefit from collaboration. For the 2012–13 flu season, Google made significant changes to its algorithms and started to use a relatively new mathematical technique called Elasticnet, which provides a rigorous means of selecting and reducing the number of predictors required. In 2011, Google launched a similar program for tracking Dengue fever, but they are no longer publishing predictions and, in 2015, Google Flu Trends was withdrawn. They are, however, now sharing their data with academic researchers...

Google Flu Trends, one of the earlier attempts at using big data for epidemic prediction, provided useful insights to researchers who came after them...

The Delphi Research Group at Carnegie Mellon University won the CDC’s challenge to ‘Predict the Flu’ in both 2014–15 and 2015–16 for the most accurate forecasters. The group successfully used data from Google, Twitter, and Wikipedia for monitoring flu outbreaks.”
Dawn E. Holmes, Big Data: A Very Short Introduction

“To which I might add that questions about the psychic, political and social effects of information are as applicable to the computer as to television. Although I believe the computer to be a vastly overrated technology, I mention it here because, clearly, Americans have accorded it their customary mindless inattention; which means they will use it as they are told, without a whimper. Thus, a central thesis of computer technology—that the principal difficulty we have in solving problems stems from insufficient data—will go unexamined. Until, years from now, when it will be noticed that the massive collection and speed-of-light retrieval of data have been of great value to large-scale organizations but have solved very little of importance to most people and have created at least as many problems for them as they may have solved.”
Neil Postman, Amusing Ourselves to Death: Public Discourse in the Age of Show Business
