Jump to ratings and reviews
Rate this book

Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition

Rate this book
This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive.

Key FeaturesGrasp the skills needed to write efficient Hive queries to analyze the Big Data Discover how Hive can coexist and work with other tools within the Hadoop ecosystemUses practical, example-oriented scenarios to cover all the newly released features of Apache Hive 2.3.3Book DescriptionIn this book, we prepare you for your journey into big data by frstly introducing you to backgrounds in the big data domain, alongwith the process of setting up and getting familiar with your Hive working environment.

Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an effcient manner. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.

By the end of the book, you will be familiar with Hive and able to work effeciently to find solutions to big data problems

What you will learnCreate and set up the Hive environmentDiscover how to use Hive's definition language to describe dataDiscover interesting data by joining and filtering datasets in HiveTransform data by using Hive sorting, ordering, and functionsAggregate and sample data in different waysBoost Hive query performance and enhance data security in HiveCustomize Hive to your needs by using user-defined functions and integrate it with other toolsWho this book is forIf you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze Big Data in Hadoop, this is the book for you. Since Hive is an SQL-like language, some previous experience with SQL will be useful to get the most out of this book.

Table of ContentsOVERVIEW OF BIG DATA AND HIVESETTING UP THE HIVE ENVIRONMENTDATA DEFINITION AND DESCRIPTIONData Correlation and ScopeDATA MANIPULATION DATA AGGREGATION AND SAMPLINGExtensibility ConsiderationsWorking with Other ToolsPerformance ConsiderationsSecurity Considerations

212 pages, Kindle Edition

Published June 30, 2018

3 people are currently reading
6 people want to read

About the author

Dayong Du

3 books

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
5 (50%)
4 stars
3 (30%)
3 stars
1 (10%)
2 stars
1 (10%)
1 star
0 (0%)
Displaying 1 - 4 of 4 reviews
Profile Image for LIUF.
30 reviews2 followers
November 3, 2019
A entry level book for basic hive operations. The code examples are concise and clear.

If your main programming language is not hive-sql and only use hive as database, this book is enough. I use this book frequently as a reference book.
Profile Image for Łukasz Słonina.
124 reviews27 followers
March 24, 2019
Reference book for Hive, high level overview of Hive features, I'm missing real life examples and application of selected features.
Displaying 1 - 4 of 4 reviews

Can't find what you're looking for?

Get help and learn more about the design.