DEV 324 - Apache HBase Bulk Loading, Performance, and Security

Not currently available

About this Course

Targeted towards data architects and application developers, the goal of this course is to learn more about Architecting your HBase Applications for Performance and Security. This course covers how to bulk load data into HBase, Performance Considerations and Tips for designing your HBase application, Benchmarking and Monitoring your HBase Application, and MapR-DB Security. Concepts are conveyed through lectures, hands-on labs, and analysis of scenario.

Prerequisites

  • Completion of ESS 100, ESS 101ESS 320, DEV 320, DEV 321, DEV 322, DEV 323
  • Basic Hadoop knowledge and intermediate linux knowledge
  • Experience using a text editor such as vi
  • Terminal program installed; familiarity with command-line options such as mv, cp, ssh, grep, cd, and useradd
  • Beginner-to-intermediate fluency with Java in an IDE

Syllabus

Lesson 12  Bulk Loading of Data

  • Describe the Bulk Loading Data Process
  • Use the ImportTsv Bulk Load Tool
  • Lab 12.2: Use ImportTsv and CopyTable
  • Bulk Load with MapReduce
  • Lab 12.3: Use a Custom MapReduce Program to Bulk Load Data
  • Pre-Split Tables

Lesson 13  Performance

  • Define Performance Priorities Based on Data Access Patterns
  • Define Guidelines for Schema Design Based on Data Access Patterns and Performance Priorities
  • Apply Java API Performance Tips
  • Define Region, Row Sizes, and Performance Guidelines
  • Apply Configuration Performance Tips
  • Benchmark HBase Application Performance
  • Lab 13: Install and Use YCSB

Lesson 14  Securing MapR-DB Tables

  • Describe Security Fundamentals
  • Understand MapR-DB Access Control
  • Lab 14.2: Create MapR-DB Tables and Set Permissions

Curriculum

  • Lesson 12: Bulk Load Data
  • Quiz 12
  • Lesson 13: Performance
  • Quiz 13
  • Lesson 14: Security
  • Quiz 14

About this Course

Targeted towards data architects and application developers, the goal of this course is to learn more about Architecting your HBase Applications for Performance and Security. This course covers how to bulk load data into HBase, Performance Considerations and Tips for designing your HBase application, Benchmarking and Monitoring your HBase Application, and MapR-DB Security. Concepts are conveyed through lectures, hands-on labs, and analysis of scenario.

Prerequisites

  • Completion of ESS 100, ESS 101ESS 320, DEV 320, DEV 321, DEV 322, DEV 323
  • Basic Hadoop knowledge and intermediate linux knowledge
  • Experience using a text editor such as vi
  • Terminal program installed; familiarity with command-line options such as mv, cp, ssh, grep, cd, and useradd
  • Beginner-to-intermediate fluency with Java in an IDE

Syllabus

Lesson 12  Bulk Loading of Data

  • Describe the Bulk Loading Data Process
  • Use the ImportTsv Bulk Load Tool
  • Lab 12.2: Use ImportTsv and CopyTable
  • Bulk Load with MapReduce
  • Lab 12.3: Use a Custom MapReduce Program to Bulk Load Data
  • Pre-Split Tables

Lesson 13  Performance

  • Define Performance Priorities Based on Data Access Patterns
  • Define Guidelines for Schema Design Based on Data Access Patterns and Performance Priorities
  • Apply Java API Performance Tips
  • Define Region, Row Sizes, and Performance Guidelines
  • Apply Configuration Performance Tips
  • Benchmark HBase Application Performance
  • Lab 13: Install and Use YCSB

Lesson 14  Securing MapR-DB Tables

  • Describe Security Fundamentals
  • Understand MapR-DB Access Control
  • Lab 14.2: Create MapR-DB Tables and Set Permissions

Curriculum

  • Lesson 12: Bulk Load Data
  • Quiz 12
  • Lesson 13: Performance
  • Quiz 13
  • Lesson 14: Security
  • Quiz 14