ESS 101 - Apache Hadoop Essentials

About this Course

This course introduces you to the basics of Apache Hadoop. It begins with a brief introduction to the Hadoop Distributed File System and MapReduce, then covers several open-source ecosystem tools, such as Apache Spark, Apache Drill, and Apache Flume. Finally, these tools are applied to real-world use cases. The course is ideal for business managers, students, developers, administrators, analysts, or anyone interested in learning the fundamentals of transitioning from traditional data models to big data models.

What's Covered

Course Lessons

3: Core Elements of Apache Hadoop

Local and distributed file systems
Data management in the Hadoop file system
Review of the MapReduce algorithm

4: The Apache Hadoop Ecosystem

Overview of the Apache ecosystem
Administration: ZooKeeper, YARN
Ingestion: Flume, Oozie, Sqoop
Processing: Spark, HBase, Pig
Analysis: Hive, Drill, Mahout

5: Solving Big Data Problems with Apache Hadoop

Data Warehouse Optimization
Recommendation Engine
Large-Scale Log Analysis

Prerequisites

  • Completion of ESS 100

Curriculum

  • Lesson 3: Core Elements of Apache Hadoop
  • Quiz 3
  • Lesson 4: The Apache Hadoop Ecosystem
  • Quiz 4
  • Lesson 5: Solving Big Data Problems with Apache Hadoop
  • Quiz 5
  • Course Materials
  • Slide Guide (Transcript)
  • Glossary
