DEV 325 - Apache HBase Schema Design

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns.

Processing...
Processing...

About this Course

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns. The course offers an in-depth look at designing row keys, avoiding hot-spotting and designing column families. It discusses how to transition from a relational model to an HBase model. You will learn the differences between tall tables and wide tables. Concepts are conveyed through lectures, hands-on labs and analysis of scenarios.

This on-demand course is designed to be flexible to fit your schedule. Each lesson and quiz takes approximately 30 to 45 minutes to complete.

  • Option 1: Complete the course in one session, approximately 90 to 120 minutes
  • Option 2: Complete the course over a few days, 3 days of 30-45min/day

Lab activities take additional time and vary based on your system.

Certification

This course is part of the preparation for the MapR Certified HBase Developer (MCHDH) certification.

Syllabus

Lesson 4 - Basic Schema Design

  • List the elements of schema design
  • Design row keys for data access patterns
  • Design table shape & column families for data access patterns
  • Define column family properties
  • Design schema for given scenario

Lesson 5 - Design Schemas for Complex Data Structures

  • Transition from relational model to HBase
  • Use intelligent keys
  • Use secondary indexes or Lookup tables
  • Design for other complex data structures
  • Evolve schemas over time

Lesson 6 Using Hive to Query HBase

  • Use Hive to query HBase/MapR tables

Prerequisites for Success in the Course

Review the following prerequisites carefully and decide if you are ready to succeed in this programming-oriented course. The Instructor will move forward with lab exercises, assuming that you have mastered the skills listed below.

Required:
  • Basic Linux knowledge, including familiarity with basic command-line options such a mv, cp, cd, ls, ssh, and scp
  • Access to, and the ability to use, a laptop with a terminal program installed (such as terminal on the Mac, or PuTTY and WinSCP Windows)
  • Beginner-to-intermediate fluency with Java or object-oriented programming in an IDE such as Eclipse
  • Completion of DEV 320 – Apache HBase Data Model and Architecture
Recommended:

 

  • Completion of the on-demand course HDE 100 - Hadoop Essentials
  • Completion of the on-demand course HDE 110 - MapR Distribution Essentials
  • Optional: Basic Hadoop and database knowledge

Curriculum

  • Get Started
  • Lesson 4 - Apache HBase Basic Schema Design
  • Quiz 4
  • Lesson 5 - Design Schemas for Complex Data Structures
  • Quiz 5
  • Lesson 6 - Use Hive to Query Apache HBase
  • Quiz 6
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Lab Files
  • Lab Environment Connection Guide
  • Join course discussions in the MapR Academy Community

About this Course

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns. The course offers an in-depth look at designing row keys, avoiding hot-spotting and designing column families. It discusses how to transition from a relational model to an HBase model. You will learn the differences between tall tables and wide tables. Concepts are conveyed through lectures, hands-on labs and analysis of scenarios.

This on-demand course is designed to be flexible to fit your schedule. Each lesson and quiz takes approximately 30 to 45 minutes to complete.

  • Option 1: Complete the course in one session, approximately 90 to 120 minutes
  • Option 2: Complete the course over a few days, 3 days of 30-45min/day

Lab activities take additional time and vary based on your system.

Certification

This course is part of the preparation for the MapR Certified HBase Developer (MCHDH) certification.

Syllabus

Lesson 4 - Basic Schema Design

  • List the elements of schema design
  • Design row keys for data access patterns
  • Design table shape & column families for data access patterns
  • Define column family properties
  • Design schema for given scenario

Lesson 5 - Design Schemas for Complex Data Structures

  • Transition from relational model to HBase
  • Use intelligent keys
  • Use secondary indexes or Lookup tables
  • Design for other complex data structures
  • Evolve schemas over time

Lesson 6 Using Hive to Query HBase

  • Use Hive to query HBase/MapR tables

Prerequisites for Success in the Course

Review the following prerequisites carefully and decide if you are ready to succeed in this programming-oriented course. The Instructor will move forward with lab exercises, assuming that you have mastered the skills listed below.

Required:
  • Basic Linux knowledge, including familiarity with basic command-line options such a mv, cp, cd, ls, ssh, and scp
  • Access to, and the ability to use, a laptop with a terminal program installed (such as terminal on the Mac, or PuTTY and WinSCP Windows)
  • Beginner-to-intermediate fluency with Java or object-oriented programming in an IDE such as Eclipse
  • Completion of DEV 320 – Apache HBase Data Model and Architecture
Recommended:

 

  • Completion of the on-demand course HDE 100 - Hadoop Essentials
  • Completion of the on-demand course HDE 110 - MapR Distribution Essentials
  • Optional: Basic Hadoop and database knowledge

Curriculum

  • Get Started
  • Lesson 4 - Apache HBase Basic Schema Design
  • Quiz 4
  • Lesson 5 - Design Schemas for Complex Data Structures
  • Quiz 5
  • Lesson 6 - Use Hive to Query Apache HBase
  • Quiz 6
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Lab Files
  • Lab Environment Connection Guide
  • Join course discussions in the MapR Academy Community