ESS 101 – Apache Hadoop Essentials

This course introduces students to the basics of Apache Hadoop. It covers the Hadoop Distributed File System (HDFS), MapReduce, the Apache Hadoop ecosystem, and real-world use cases for Hadoop.

About this course

This on-demand course is designed to fit your schedule. Each lesson and its quiz take approximately 30 to 45 minutes to complete.

  • Option 1: Complete the course in one session, approximately 90 to 120 minutes
  • Option 2: Complete the course over three days, approximately 30 to 45 minutes per day

Syllabus

  • Lesson 3 – Core Elements of Apache Hadoop
    • Compare and contrast local and distributed file systems
    • Explain data management in the Hadoop file system
    • Summarize the MapReduce algorithm
  • Lesson 4 – The Apache Hadoop Ecosystem
    • Define the following ecosystem components:
      • Administration: ZooKeeper, YARN
      • Ingestion: Flume, Oozie, Sqoop
      • Processing: Spark, HBase, Pig
      • Analysis: Hive, Drill, Mahout
  • Lesson 5 – Solving Big Data Problems with Apache Hadoop
    • Summarize the following use cases:
      • Data Warehouse Optimization
      • Recommendation Engine
      • Large-Scale Log Analysis

Prerequisites

  • Access to, and the ability to use, a laptop with an internet connection
  • Completion of ESS 100 – Introduction to Big Data

Curriculum

  • Lesson 3: Core Elements of Apache Hadoop
  • Quiz 3
  • Lesson 4: The Apache Hadoop Ecosystem
  • Quiz 4
  • Lesson 5: Solving Big Data Problems with Apache Hadoop
  • Quiz 5
  • Course Materials
  • Slide Guide (Transcript)
  • Glossary
  • Join course discussions in the MapR Academy Community
