Hadoop 3 with Hive 3

  • Course Code:
  • Course Dates: Contact us to schedule.
  • Course Category: DevOps
  • Duration: 2 Days
  • Audience: Business Analysts, Software Developers, Managers

Hive is the de facto standard SQL interface to Big Data. Today, on Hadoop 3, it offers ACID tables, storage requirements reduced by a factor of 2 through erasure coding, HBase integration with Phoenix, and much more.

However, to use it efficiently, one must know the best practices of the HQL language and be able to compare the different tools for looking at data, whether Hive, HBase with Phoenix, or plain Excel.

This course will explain the capabilities of Hive, HQL dialects, and best practices.
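
The factor-of-2 storage reduction mentioned above can be checked with a little arithmetic. The sketch below is only an illustration and assumes a Reed-Solomon layout with 6 data units and 3 parity units (the RS-6-3 policy in Hadoop 3) compared against 3x replication; the function names and the 1 TB figure are made up for this example.

```python
def raw_bytes_replication(logical_bytes: int, replicas: int = 3) -> float:
    """Raw disk space used when every block is stored `replicas` times."""
    return logical_bytes * replicas


def raw_bytes_erasure_coded(logical_bytes: int,
                            data_units: int = 6,
                            parity_units: int = 3) -> float:
    """Raw disk space used when each stripe of `data_units` data cells
    is stored together with `parity_units` parity cells (assumed RS-6-3)."""
    return logical_bytes * (data_units + parity_units) / data_units


# Hypothetical data set: 1 TB of logical data.
logical = 1_000_000_000_000

replicated = raw_bytes_replication(logical)   # 3.0x the logical size
coded = raw_bytes_erasure_coded(logical)      # 1.5x the logical size
reduction_factor = replicated / coded

print(f"3x replication: {replicated / logical:.1f}x raw storage")
print(f"Erasure coding: {coded / logical:.1f}x raw storage")
print(f"Reduction factor: {reduction_factor:.1f}")
```

Under these assumptions the raw footprint drops from 3 times the logical size to 1.5 times, which is where the factor of 2 comes from. Other erasure coding policies give different ratios.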

Prerequisites:

  • Exposure to SQL
  • Ability to navigate the Linux command line

Course Outline

  • Why Hadoop?
    • The motivation for Hadoop
    • Use cases and case studies about Hadoop
  • The Hadoop platform
    • MapReduce, HDFS, YARN
    • New in Hadoop 3
      • Erasure Coding vs 3x replication
  • Hive Basics
    • Defining Hive Tables
    • SQL Queries over Structured Data
    • Filtering / Search
    • Aggregations / Ordering
    • Partitions
    • Joins
    • Text Analytics (Semi Structured Data)
  • New in Hive 3
    • ACID tables
    • Hive Query Language (HQL)
      • How to run a good query?
      • How to troubleshoot queries?
  • HBase
    • Basics
    • HBase tables – design and use
    • Phoenix driver for HBase tables
  • Sqoop
    • Tool
    • Architecture
    • Use
  • Spark
    • Overview
    • Spark SQL
  • The big picture
    • How Hadoop fits into your architecture
    • Hive vs HBase with Phoenix vs Excel