Learning Apache Drill: Query and Analyze Distributed Data Sources with SQL

Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3, and it works with Hive metastores, with distributed databases such as HBase and MongoDB, and with relational databases. Drill works on your laptop or in your largest cluster. In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you’ll learn how Drill helps you analyze data more effectively to drive down time to insight.

329 pages, Paperback

Published December 18, 2018

Community Reviews

5 stars: 1 (20%)
4 stars: 3 (60%)
3 stars: 1 (20%)
2 stars: 0 (0%)
1 star: 0 (0%)
Displaying 1 of 1 review
Suraj
December 30, 2019
Good book if you are new to Apache Drill.

It delivers what it promises. It does not go far into the internals of Apache Drill, but it's good if you are just starting out with it.