Cluster Performance Tuning

Cluster Performance Tuning

About this Course

This course teaches you to tune MapR-XD and YARN parameters to optimize cluster performance. After setting up your cluster to run batch YARN jobs, the next steps are to run complex real-time queries on data. You will learn how to re-evaluate cluster resources and make modifications to ensure that both types of jobs - realtime and batch - can run effectively and efficiently.

What's Covered

Course Lessons Lab Activities

1: Prepare a High-Performing Cluster

Prepare Nodes
Prepare the OS

 

No labs

2: Perform Initial Tuning

Measure Performance
Plan the Allocation of Job Resources

 

Initial Tuning
Measure Performance

3: Tune MapR

Tune MapR-XD Parameters
Tune warden.conf Service

 

Tune MapR-XD Parameters
Tune warden.conf Service

4: Tune Cluster Applications

Tune yarn-site.xml
Tune mapred-site.xml for MapReduce Applications

 

Tune yarn-site.xml
Tune mapred-site.xml

5: Tuning After Changes

Make Changes After Adding Ecosystem Components
Add Disks or Nodes
Troubleshoot Performance Problems

 

Install Drill and Tune Parameters

Prerequisites

  • Completion of ESS 100 - 102 and ADM 200 - 204 v6 series
  • Experience administering a MapR cluster, including using the command line and MCS
  • Basic to intermediate Linux skills including familiarity with command-line options such as ls, cd, cp, and su
  • Ability to use the Linux vi editor to create files or make simple edits

Curriculum

  • Lesson 1 - Prepare a High-Performing Cluster
  • Quiz 1
  • Lesson 2 - Initial Tuning
  • Quiz 2
  • Lesson 3 - Tune MapR
  • Quiz 3
  • Lesson 4 - Tune Cluster Applications
  • Quiz 4
  • Lesson 5 - Tuning After Changes
  • Quiz 5
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide

About this Course

This course teaches you to tune MapR-XD and YARN parameters to optimize cluster performance. After setting up your cluster to run batch YARN jobs, the next steps are to run complex real-time queries on data. You will learn how to re-evaluate cluster resources and make modifications to ensure that both types of jobs - realtime and batch - can run effectively and efficiently.

What's Covered

Course Lessons Lab Activities

1: Prepare a High-Performing Cluster

Prepare Nodes
Prepare the OS

 

No labs

2: Perform Initial Tuning

Measure Performance
Plan the Allocation of Job Resources

 

Initial Tuning
Measure Performance

3: Tune MapR

Tune MapR-XD Parameters
Tune warden.conf Service

 

Tune MapR-XD Parameters
Tune warden.conf Service

4: Tune Cluster Applications

Tune yarn-site.xml
Tune mapred-site.xml for MapReduce Applications

 

Tune yarn-site.xml
Tune mapred-site.xml

5: Tuning After Changes

Make Changes After Adding Ecosystem Components
Add Disks or Nodes
Troubleshoot Performance Problems

 

Install Drill and Tune Parameters

Prerequisites

  • Completion of ESS 100 - 102 and ADM 200 - 204 v6 series
  • Experience administering a MapR cluster, including using the command line and MCS
  • Basic to intermediate Linux skills including familiarity with command-line options such as ls, cd, cp, and su
  • Ability to use the Linux vi editor to create files or make simple edits

Curriculum

  • Lesson 1 - Prepare a High-Performing Cluster
  • Quiz 1
  • Lesson 2 - Initial Tuning
  • Quiz 2
  • Lesson 3 - Tune MapR
  • Quiz 3
  • Lesson 4 - Tune Cluster Applications
  • Quiz 4
  • Lesson 5 - Tuning After Changes
  • Quiz 5
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide