“Big Data, unlike any other trend at the moment, will affect everyone and everything we do”, writes Marr at the start of Big Data, concluding that “companies who ignore Big Data [will] be overtaken by those who don’t.”
Across each of 45 brief (~6 page) case studies, Marr sets out: the background; what problem big data is helping to solve; how big data is use in practice; what the results were; what data was used; some (very high level) technical details; the challenges the organisation needed to overcome; and their key learning points.
There are three major items missing from this. One is any real insight into customer benefits. Another significant limitation is the absence of any real discussion on the security, privacy and ethical implications from organisations collection and use of data. The third is the uncritical analysis by Marr in each case study.
Because each chapter is brief and uncritical, they end up being shallow and bland. In the chapter on how big data is used in banks Marr fails to cover the many ways banks are already using data to enhance customer experience by understanding consumer preferences; for credit decision making; in fraud identification; to improve operational effectiveness. In lauding LinkedIn’s use of data, Marr fails to provide any insight on the benefit to members that flows from LinkedIn’s data analysis – I certainly haven’t experienced any.
In the chapter on Etsy Marr notes the choice of cloud and in-house data storage, but does not provide any analysis of what contingencies would influence an organisation’s choice between these alternatives.
Each chapters’ list of technology used, becomes a mere list of names and acronyms: Apache, Apixio, Azure, BigQuery, Cassandra, Cloudera, Conjecture, Datameer, DeepQZ, DMX, EC2, EMC, Flume, Fusion, Hadoop, HANA, HDFS, Hive, Java, Lambda, Mahoot, MapR, MK:Smart, Mongo, Oozie, Predix, Presto, Python, R, Redshift, Solr, Spark , Splunk, SQL MemSQL MySQL and MySQLSSD, Sqoop, TeraData, V-Block, Vertica, Voldemort, Yellowfin. By time you get to Espresso and Pinot you can start to get confused about whether you’re reading about data technology or beverages and wanting one of the latter.
Marr notes that he has been in communication with the organisations about their use of data. But when he writes in the chapter on Microsoft that “data and analytics have existed for a long time and we’ve always combined them”, you can’t help but wonder whether this was written by Marr or by Microsoft. The book also has a liberal sprinkling of factual and grammatical errors (just what is a ‘compressive’ overview (p293) – perhaps Marr wanted to accurately represent the brevity of his analysis?) reflecting the regrettable decline in editorial standards given the changing economics of publishing.
The end result is that Marr has merely summarised (albeit effectively) a very high level overview of how organisations are using data, information that is generally available online. But his uncritical approach means there are no real insights for readers who have anything more than a cursory interest in this important topic.