Apache Hive is the new member in database family that works within the Hadoop ecosystem. It provides all great features like data summarization, ad-hoc query, and analysis of large datasets. If you are not a good programmer, then this edition will teach you how to use hive queries without writing complex codes.
Most users face the problem of not getting a dedicated course on Hive. The goal of this e-book is to cater everything about Hive and only Hive with minimum jargons. The notes, lessons and hands-on examples in this small e-book are simplified and tactfully presented to solve all your Hive queries. Instead of writing long code for MapReduce or Java, the e-book shows tips on writing the same program with a minimum code snippet.
Beginners as well as peers will thoroughly enjoy this book. They will discover and learn more hive patterns for data processing and data integrations. Unlike other e-book, where they skip basic detail thinking users having prior subject knowledge. This edition has given complete attention to each and every small aspect of the hive like “how to set up and configure Hive in your environment”.
This e-book is also helpful for those who just want to explore Hive and don’t want to spend big bucks for short courses. You will quickly learn, apply and share your Hive knowledge with this e-book.
Table of content
Chapter 1: Introduction
- What is Hive?
- Hive Architecture
- Different modes of Hive
- What is Hive Server2 (HS2)?
- Hive vs Map Reduce
Chapter 2: Installation and Configuration
- Installation of Hive
- Hive shell commands
- Install and configure MYSQL database
Chapter 3: Data operations
- Data types in Hive
- Creation and dropping of Database in Hive
- Create, Drop and altering of tables in Hive
- Table types and its Usage
- Partitions
- Buckets
Chapter 4: Queries and Implementation
- Order by query
- Group by query
- Sort by
- Cluster By
- Distribute By
- Join queries
- Different type of joins
- Sub queries
- Embedding custom scripts
- UDFs (User Define Functions)
Chapter 5: Query Language, Built-in Operators and Functions
- Hive Query Language (HQL)
- Built-in operators
- Built-in functions
Chapter 6: Data Extraction
- Working with Structured Data using Hive
- Working with Semi structured data using Hive (XML, JSON)
- Hive in Real time projects – When and Where to Use
Share This eBook: