Instant Apache Sqoop

In Detail

Data volumes are growing rapidly, and people want to run analytics that combine different sources of data (RDBMS, text, and so on). Using Hadoop for analytics requires you to load data from an RDBMS into Hadoop, perform analytics on that data, and then load the processed data back into the RDBMS to generate business reports.

Instant Apache Sqoop is a practical, hands-on guide that provides you with a number of clear, step-by-step exercises that will help you to take advantage of the real power of Apache Sqoop and give you a good grounding in the knowledge required to transfer data between RDBMS and the Hadoop ecosystem.

Instant Apache Sqoop looks at the import/export process required in data transfer and discusses examples of each process. It will also give you an overview of HBase and Hive table structures and how you can populate HBase and Hive tables. The book will finish by taking you through a number of third-party Sqoop connectors.

You will also learn about the various import and export arguments and how to use them to move data between RDBMS and the Hadoop ecosystem. The book explains the architecture of the import and export processes, looks at some Sqoop connectors, and discusses examples of each connector. If you want to move data between RDBMS and the Hadoop ecosystem, then this is the book for you.
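
To give a rough idea of what import and export arguments look like, here is a minimal sketch (not taken from the book) that assembles basic `sqoop import` and `sqoop export` command lines in Python. The helper names, JDBC URL, table names, HDFS paths, and user name are all hypothetical; only the Sqoop flags themselves (`--connect`, `--username`, `--table`, `--target-dir`, `--num-mappers`, `--export-dir`) are standard Sqoop 1 options. The script only prints the commands and does not run Sqoop.

```python
import shlex


def sqoop_import(jdbc_url, table, target_dir, username, mappers=1):
    """Build the argument list for a basic RDBMS-to-HDFS import."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,            # JDBC URL of the source database
        "--username", username,
        "--table", table,                 # source table to read
        "--target-dir", target_dir,       # HDFS directory to write into
        "--num-mappers", str(mappers),    # number of parallel map tasks
    ]


def sqoop_export(jdbc_url, table, export_dir, username):
    """Build the argument list for a basic HDFS-to-RDBMS export."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,            # JDBC URL of the target database
        "--username", username,
        "--table", table,                 # target table to populate
        "--export-dir", export_dir,       # HDFS directory holding the data
    ]


if __name__ == "__main__":
    # Hypothetical connection details, for illustration only.
    url = "jdbc:mysql://db.example.com/sales"
    print(shlex.join(sqoop_import(url, "orders", "/data/orders", "etl_user", mappers=4)))
    print(shlex.join(sqoop_export(url, "order_summary", "/data/order_summary", "etl_user")))
```

Building the command as a list rather than a single string keeps each flag paired with its value and avoids shell-quoting mistakes if the list is later passed to a process launcher.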

You will learn everything that you need to know to transfer data between RDBMS and the Hadoop ecosystem as well as how you can add new connectors into Sqoop.

Approach

Instant Apache Sqoop is filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks, along with examples and challenges to test and improve your knowledge.

Who this book is for

This book is great for developers who are looking to get a good grounding in how to effectively and efficiently move data between RDBMS and the Hadoop ecosystem. It’s assumed that you will have some experience in Hadoop already as well as some familiarity with HBase and Hive.

58 pages, Kindle Edition

First published January 1, 2013

About the author

Ankit Jain

33 books

Community Reviews

5 stars: 6 (75%)
4 stars: 1 (12%)
3 stars: 1 (12%)
2 stars: 0 (0%)
1 star: 0 (0%)
Displaying 1 - 3 of 3 reviews
Hanish Bansal
2 reviews
November 22, 2013
The best way to learn any technology is to look at examples. Instant Apache Sqoop is full of practical examples of moving data between the Hadoop ecosystem and RDBMS using Sqoop. A must-read for Sqoop learners.
1 review
December 12, 2013
Up till now, we have all depended on RDBMSs for our database needs. As the world advances rapidly into the age of Big Data, it’s essential to capture data from multiple sources. Apache Sqoop is a platform that offers great help with the uphill, time-consuming task of transferring data from RDBMSs to the Hadoop ecosystem.
This book covers topics such as importing data from RDBMSs into the Hadoop ecosystem and exporting it back, along with Sqoop connectors for different RDBMSs. In short, it is a quick, to-the-point guide for getting started with Apache Sqoop, which clearly justifies the book’s name, “Instant Apache Sqoop”.
The book is an excellent guide for beginners, as it covers different use cases with practical examples and detailed explanations of each step in the process. A must-read for anyone who wants to learn Apache Sqoop.
1 review
November 26, 2013
A great book to get started with Sqoop. I would recommend it to everyone from beginners to experts. This book covers the following key features:

1. Import RDBMS data into Hadoop, HBase, Hive.
2. Export Hadoop, Hive data back to RDBMS.
3. Incremental import.
4. HBase/Hive basic commands.
5. Sqoop Connectors.