Data Source Handbook: A Guide to Public Data

If you're a developer looking to supplement your own data tools and services, this concise ebook covers the most useful sources of public data available today. You'll find useful information on APIs that offer broad coverage, tie their data to the outside world, and are either accessible online or feature downloadable bulk data. You'll also find code and helpful links.

This guide organizes APIs by the subjects they cover, such as websites, people, or places, so you can quickly locate the best resources for augmenting the data you handle in your own service. Categories include:

Tools such as WHOIS, bit.ly, and Compete
Services that use email addresses as search terms, including Github
Finding information from just a name, with APIs such as WhitePages
Services, such as Klout, for locating people with Facebook and Twitter accounts
Search APIs, including BOSS and Wikipedia
Geographical data sources, including SimpleGeo and U.S. Census
Company information APIs, such as CrunchBase and ZoomInfo
APIs that list IP addresses, such as MaxMind
Services that list books, films, music, and products

46 pages, Kindle Edition

First published January 1, 2011

About the author

Pete Warden

9 books, 3 followers

Ratings & Reviews

Community Reviews

5 stars: 1 (7%)
4 stars: 5 (35%)
3 stars: 5 (35%)
2 stars: 2 (14%)
1 star: 1 (7%)
Displaying 1 - 2 of 2 reviews
Louis
227 reviews, 31 followers
March 19, 2011
While this bills itself as covering "the most useful sources of public data available today" and promises that "You'll find useful information on APIs," what it really offers is a sampling of a small number (57) of sources, each with an example of its use. The information given is terse, and you will find yourself having to Google the documentation to make good use of a source, and probably using Google to find more sources.

I was interested in the Data Source Handbook because many of my models require a range of setting-specific data as well as publicly available data, if I can get it and put it into a usable form. Looking through it, the book does provide many examples of available data sources. But these are all fairly specific as presented, so I'd probably end up having to search to find out whether what I want is actually available.

The limitations of the presentation are more apparent when I look at one source I have actually used before, the Google Maps API. Warden shows an example of submitting an address and getting a result, and also mentions that you can do reverse geocoding, but he does not describe what information is in the geocoding results (he displays only a small portion of them), nor does he mention that the Maps API can return many other result sets (such as directions, distances, and elevation).

The other way that someone can have a problem with this book is that it makes an assumption about the reader without stating it: specifically, that the reader is proficient at programming for the web (i.e., someone who is primarily a computer programmer, as opposed to someone who programs because they need to get something else done). This comes up because the book assumes that you recognize a JSON result set as well as some internet utilities (e.g., curl), which are presented without explanation. It could have been greatly improved by a one-page "how to use this book" section in the introduction that gives pointers on what you are assumed to know before the material is presented.

In the end, if you don't know what you are doing with programming for the web, this book is only useful as a guide so you have an idea of what can be found. Even if you had this book, you probably are going to end up using Google to understand the capabilities of the sites that it does cover, and it cannot be that comprehensive because it is so small.

More information on this book can be found on the book's website, Data Source Handbook at Oreilly.com.

I received this as a free ebook as part of the Oreilly Press Blogger Review Program.
Dgg32
146 reviews, 6 followers
September 29, 2014
Nice collection of APIs, but it lacks breadth. For example, data for the environment, biology, and other fields is missing.