DA 410 - Apache Drill Essentials

This introductory Apache Drill course, targeted at Data Analysts, Scientists and SQL programmers, covers how to use Drill to explore known or unknown data without writing code.

Not currently available
Processing...
Not currently available
Processing...

About this Course

This introductory Apache Drill course, targeted at Data Analysts, Scientists and SQL programmers, covers how to use Drill to explore known or unknown data without writing code. You will write SQL queries on a variety of data types including structured data in a Hive table, semi-structured data in HBase or MapR-DB, and complex data file types, such as Parquet and JSON.

Prerequisites

  • Basic Hadoop knowledge
Completion, or the equivalent knowledge, of:
  • Completion of the on-demand course ESS 100 - Big Data Essentials
  • Completion of the on-demand course ESS 101 - Apache Hadoop Essentials
  • Completion of the on-demand course ESS 102 - MapR Converged Data Platform Essentials

Certification

The courses in this curriculum prepare you for the MapR Certified Data Analyst (MCDA) certification exam.

Lab activities take additional time and vary based on your system.

Syllabus

Lesson 1 - SQL Queries

  • Perform familiar SQL queries with Drill on structured content
  • Perform familiar SQL queries on semi structured content
  • Join structured and semi structured content into a single query
  • Explore unknown data with drill explorer

Lesson 2 - Self Describing Data

  • Define self describing data
  • Determine how Drill discovers schema of data
  • Use drill explorer to explore unknown data and determine its structure to perform queries
  • Create a view and visualize the view with BI tools

Lab Exercises

  • Familiar SQL queries on structured Hive data
  • Familiar SQL queries on complex data
  • Query Parquet data
  • Query JSON data
  • A single query that joins Hive, HBase and JSON
  • Explore Multiple Data Sources with the Drill Explorer
  • Drill Explorer Interface
  • Data sources
  • Discover data schema
  • Preview data
  • Save a view

Curriculum

  • Get Started
  • Lesson 1 - SQL Queries
  • Lesson 2 - Query Self Describing Data
  • DA 410 Quiz
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Lab Files
  • Apache Drill Sandbox
  • Lab Environment Connection Guide
  • Join course discussions in the MapR Academy Community

About this Course

This introductory Apache Drill course, targeted at Data Analysts, Scientists and SQL programmers, covers how to use Drill to explore known or unknown data without writing code. You will write SQL queries on a variety of data types including structured data in a Hive table, semi-structured data in HBase or MapR-DB, and complex data file types, such as Parquet and JSON.

Prerequisites

  • Basic Hadoop knowledge
Completion, or the equivalent knowledge, of:
  • Completion of the on-demand course ESS 100 - Big Data Essentials
  • Completion of the on-demand course ESS 101 - Apache Hadoop Essentials
  • Completion of the on-demand course ESS 102 - MapR Converged Data Platform Essentials

Certification

The courses in this curriculum prepare you for the MapR Certified Data Analyst (MCDA) certification exam.

Lab activities take additional time and vary based on your system.

Syllabus

Lesson 1 - SQL Queries

  • Perform familiar SQL queries with Drill on structured content
  • Perform familiar SQL queries on semi structured content
  • Join structured and semi structured content into a single query
  • Explore unknown data with drill explorer

Lesson 2 - Self Describing Data

  • Define self describing data
  • Determine how Drill discovers schema of data
  • Use drill explorer to explore unknown data and determine its structure to perform queries
  • Create a view and visualize the view with BI tools

Lab Exercises

  • Familiar SQL queries on structured Hive data
  • Familiar SQL queries on complex data
  • Query Parquet data
  • Query JSON data
  • A single query that joins Hive, HBase and JSON
  • Explore Multiple Data Sources with the Drill Explorer
  • Drill Explorer Interface
  • Data sources
  • Discover data schema
  • Preview data
  • Save a view

Curriculum

  • Get Started
  • Lesson 1 - SQL Queries
  • Lesson 2 - Query Self Describing Data
  • DA 410 Quiz
  • Course Materials
  • Slide Guide (Transcript)
  • Lab Guide
  • Lab Files
  • Apache Drill Sandbox
  • Lab Environment Connection Guide
  • Join course discussions in the MapR Academy Community