DEV 325 - Apache HBase Schema Design

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns.

Processing...
Processing...

About this Course

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns. The course offers an in-depth look at designing row keys, avoiding hot-spotting and designing column families. It discusses how to transition from a relational model to an HBase model. You will learn the differences between tall tables and wide tables. Concepts are conveyed through lectures, hands-on labs and analysis of scenarios.

This on-demand course is designed to be flexible to fit your schedule. Each lesson and quiz takes approximately 30 to 45 minutes to complete. Lab activities take additional time and vary based on your system.

Prerequisites

  • Basic Linux knowledge, including familiarity with basic command-line options such a mv, cp, cd, ls, ssh, and scp
  • Access to, and the ability to use, a laptop with a terminal program installed (such as terminal on the Mac, or PuTTY and WinSCP Windows)
  • Beginner-to-intermediate fluency with Java or object-oriented programming in an IDE such as Eclipse
  • Completion of the on-demand course ESS 100 - Big Data Essentials
  • Completion of the on-demand course ESS 101 - Apache Hadoop Essentials
  • Completion of the on-demand course ESS 102 - MapR Converged Data Platform Essentials
  • Completion of DEV 320 – Apache HBase Data Model and Architecture
  • Basic Hadoop and database knowledge

Certification

This course is part of the preparation for the MapR Certified HBase Developer (MCHBD) certification.

Syllabus

Lesson 4 - Basic Schema Design

  • List the elements of schema design
  • Design row keys for data access patterns
  • Lab 4.2: Import data with different row key designs
  • Design table shape and column families for data access patterns
  • Lab 4.3: Populate and examine trades tall and flat tables
  • Define column family properties
  • Design schema for given scenario

Lesson 5 - Design Schemas for Complex Data Structures

  • Transition from relational model to HBase
  • Use intelligent keys
  • Use secondary indexes or Lookup tables
  • Design for other complex data structures
  • Lab 5.4a: Model person-relatives schema
  • Lab 5.4b: Model movie rental online store schema
  • Lab 5.4c: Model customer click event or action
  • Evolve schemas over time

Lesson 6 Using Hive to Query HBase

  • Use Hive to query HBase/MapR tables
  • Lab 6.1a: Use Hive with the airlines HBase table
  • Lab 6.1b: Use Hive to query the trades table

Curriculum

  • Lesson 4: Apache HBase Basic Schema Design
  • Quiz 4
  • Lesson 5: Design Schemas for Complex Data Structures
  • Quiz 5
  • Lesson 6: Use Hive to Query Apache HBase
  • Quiz 6
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Join MapR Community Discussions

About this Course

Targeted towards data analysts, data architects and application developers, the goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns. The course offers an in-depth look at designing row keys, avoiding hot-spotting and designing column families. It discusses how to transition from a relational model to an HBase model. You will learn the differences between tall tables and wide tables. Concepts are conveyed through lectures, hands-on labs and analysis of scenarios.

This on-demand course is designed to be flexible to fit your schedule. Each lesson and quiz takes approximately 30 to 45 minutes to complete. Lab activities take additional time and vary based on your system.

Prerequisites

  • Basic Linux knowledge, including familiarity with basic command-line options such a mv, cp, cd, ls, ssh, and scp
  • Access to, and the ability to use, a laptop with a terminal program installed (such as terminal on the Mac, or PuTTY and WinSCP Windows)
  • Beginner-to-intermediate fluency with Java or object-oriented programming in an IDE such as Eclipse
  • Completion of the on-demand course ESS 100 - Big Data Essentials
  • Completion of the on-demand course ESS 101 - Apache Hadoop Essentials
  • Completion of the on-demand course ESS 102 - MapR Converged Data Platform Essentials
  • Completion of DEV 320 – Apache HBase Data Model and Architecture
  • Basic Hadoop and database knowledge

Certification

This course is part of the preparation for the MapR Certified HBase Developer (MCHBD) certification.

Syllabus

Lesson 4 - Basic Schema Design

  • List the elements of schema design
  • Design row keys for data access patterns
  • Lab 4.2: Import data with different row key designs
  • Design table shape and column families for data access patterns
  • Lab 4.3: Populate and examine trades tall and flat tables
  • Define column family properties
  • Design schema for given scenario

Lesson 5 - Design Schemas for Complex Data Structures

  • Transition from relational model to HBase
  • Use intelligent keys
  • Use secondary indexes or Lookup tables
  • Design for other complex data structures
  • Lab 5.4a: Model person-relatives schema
  • Lab 5.4b: Model movie rental online store schema
  • Lab 5.4c: Model customer click event or action
  • Evolve schemas over time

Lesson 6 Using Hive to Query HBase

  • Use Hive to query HBase/MapR tables
  • Lab 6.1a: Use Hive with the airlines HBase table
  • Lab 6.1b: Use Hive to query the trades table

Curriculum

  • Lesson 4: Apache HBase Basic Schema Design
  • Quiz 4
  • Lesson 5: Design Schemas for Complex Data Structures
  • Quiz 5
  • Lesson 6: Use Hive to Query Apache HBase
  • Quiz 6
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Join MapR Community Discussions