If you're a developer looking to supplement your own data tools and services, this concise ebook covers the most useful sources of public data available today. You'll find useful information on APIs that offer broad coverage, tie their data to the outside world, and are either accessible online or feature downloadable bulk data. You'll also find code and helpful links.This guide organizes APIs by the subjects they cover--such as websites, people, or places--so you can quickly locate the best resources for augmenting the data you handle in your own service. Categories tools such as WHOIS, bit.ly, and CompeteServices that use email addresses as search terms, including GithubFinding information from just a name, with APIs such as WhitePagesServices, such as Klout, for locating people with Facebook and Twitter accountsSearch APIs, including BOSS and WikipediaGeographical data sources, including SimpleGeo and U.S. CensusCompany information APIs, such as CrunchBase and ZoomInfoAPIs that list IP addresses, such as MaxMindServices that list books, films, music, and products
While this bills itself as covering 'the most useful sources of public data available today" and that "You’ll find useful information on APIs" what this really is a sampling of a small number (57) sources and an example of its use. The information given is terse, and you will find yourself having to Google the documentation to get good use of the source, and probably use Google to find more sources.
I was interested in the Data Source Handbook because many of my models require a range of setting specific data as well as data that is available publicly, if I can get it an put it into a useable form. And looking at it, it provides many examples of available data sources. But these are all fairly specific as presented, so I'd probably end up having to search to find out if what I want is actually available.
The limitations of the presentation are more apparent if I look at one source I have actually used before, the Google Maps API. Warden shows an example of submitting an address and getting a result, and also mentions that you can do reverse geocoding, but he does not describe what information is in the geocoding results (only displays a small portion of them), or the fact that there are many other result sets (such as directions, distances, elevation) that can be returned using the Maps API.
The other way that someone can have a problem with this is that it makes an assumption about the reader without stating it, specifically that the reader is a proficient at programming for the web (i.e. someone whose is primarily a computer programmer as opposed to someone who programs because they need to get something else done.) It comes up because it assumes that you recognize a JSON result set as well as some internet utilities (e.g. curl) which are presented without explanation. It could have been greatly improved by having a one page 'how to use this book' section in the introduction that gives pointers on what is assumed you know before presenting the material.
In the end, if you don't know what you are doing with programming for the web, this book is only useful as a guide so you have an idea of what can be found. Even if you had this book, you probably are going to end up using Google to understand the capabilities of the sites that it does cover, and it cannot be that comprehensive because it is so small.