HDP Operations: Administration Foundations

Our classes are always live and instructor led from our Exton, PA or EPIC Partner locations. Springhouse AnywhereLive options require Internet Access. Select classes are Guaranteed to Run (GTR). View our complete schedule policies.

 

 

 

 

6cc4fa7b-f57e-e811-a840-00155d0a14062018-11-26T10:00:00Z2800.000000000001410:00 AM6:00 PMAnywhereLive6cc4fa7b-f57e-e811-a840-00155d0a1406
f2626943-b0bd-e811-81e9-00155d0a14062019-02-19T10:00:00Z2800.000000000001410:00 AM6:00 PMAnywhereLivef2626943-b0bd-e811-81e9-00155d0a1406

Overview

​This course is intended for systems administrators who will be responsible for the design, installation, configuration, and management of the Hortonworks Data Platform (HDP). The course provides in-depth knowledge and experience in using Apache Ambari as the operational management platform for HDP. This course presumes no prior knowledge or experience with Hadoop. 

Intended Audience

​Linux administrators and system operators responsible for installing, configuring and managing an HDP cluster.


At Completion

​Day 1: Introduction to Big Data, Hadoop and the Hortonworks Data Platform
Day 2: Managing HDFS Storage, Rack Awareness, HDFS Snapshots and HDFS Centralized Cache
Day 3: Introduction to YARN
Day 4: High Availability with HDP, Deploying HDP with Blueprints, and the HDP Upgrade Process


Prerequisites

​Students must have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts. Basic knowledge of SQL statements is recommended, but not a requirement. In addition, it is recommended for students to have some operational experience in data center practices, such as change management, release management, incident management, and problem management.


Exams & Certifications


Materials


Course Outline

​DAY 1 OBJECTIVES

  • Describe Apache Hadoop
  • Summarize the Purpose of the Hortonworks Data Platform Software Frameworks
  • List Hadoop Cluster Management Choices
  • Describe Apache Ambari
  • Identify Hadoop Cluster Deployment Options
  • Plan for a Hadoop Cluster Deployment
  • Perform an Interactive HDP Installation using Apache Ambari
  • Install Apache Ambari
  • Describe the Differences Between Hadoop Users, Hadoop Service Owners, and Apache Ambari Users
  • Manage Users, Groups and Permissions
  • Identify Hadoop Configuration Files
  • Summarize Operations of the Web UI Tool
  • Manage Hadoop Service Configuration Properties Using the Apache Ambari Web UI
  • Describe the Hadoop Distributed File System (HDFS)
  • Perform HDFS Shell Operations
  • Use WebHDFS
  • Protect Data Using HDFS Access Control Lists (ACLs)

 

DAY 1 LABS

  • Setting Up the Environment
  • Installing HDP
  • Managing Ambari Users and Groups
  • Managing Hadoop Services
  • Using HDFS Storage
  • Using WebHDFS
  • Using HDFS Access Control Lists

 

DAY 2 OBJECTIVES

  • Describe HDFS Architecture and Operation
  • Manage HDFS using Ambari Web, NameNode and DataNode UIs
  • Manage HDFS using Command-line Tools
  • Summarize the Purpose and Benefits of Rack Awareness
  • Configure Rack Awareness
  • Summarize Hadoop Backup Considerations
  • Enable and Manage HDFS Snapshots
  • Copy Data Using DistCP
  • Use Snapshots and DistCP Together
  • Identify the Purpose and Operation of Heterogeneous HDFS Storage
  • Summarize the Purpose and Operation of HDFS Centralized Caching
  • Configure HDFS Centralized Cache
  • Define and Manage Cache Pools and Cache Directives
  • Identify HDFS NFS Gateway Use Cases
  • Recall HDFS NFS Gateway Architecture and Operation
  • Install and Configure an HDFS NFS Gateway
  • Configure an HDFS NFS Gateway Client

 

DAY 2 LABS

  • Managing HDFS Storage
  • Managing HDFS Quotas
  • Configuring Rack Awareness
  • Managing HDFS Snapshots
  • Using DistCP
  • Configuring HDFS Storage Policies
  • Configuring HDFS Centralized Cache
  • Configuring an NFS Gateway

 

DAY 3 OBJECTIVES

  • Describe YARN Resource Management
  • Summarize YARN Architecture and Operation
  • Identify and Use YARN Management Options
  • Summarize YARN Response to Component Failure
  • Understand the Basics of Running Simple YARN Applications
  • Summarize the Purpose and Operation of the YARN Capacity Scheduler
  • Configure and Manage YARN Queues
  • Control Access to YARN Queues
  • Summarize the Purpose and Operation of YARN Node Labels
  • Describe the Process used to Create Node Labels
  • Describe the Process Used to Add, Modify and Remove Node Labels
  • Configure Queues to Access Node Label Resources
  • Run Test Jobs to Confirm Node Label Behavior

 

DAY 3 LABS

  • Managing YARN Using Ambari
  • Managing YARN Using CLI
  • Running Sample YARN Applications
  • Setting Up for Capacity Scheduler
  • Managing YARN Containers and Queues
  • Managing YARN ACLs and User Limits
  • Working with YARN Node Labels

 

DAY 4 OBJECTIVES

  • Summarize the Purpose of NameNode HA
  • Configure NameNode HA Using Ambari
  • Summarize the Purpose of ResourceManager HA
  • Configure ResourceManager HA using Apache Ambari
  • Identify Reasons to Add, Replace and Delete Worker Nodes
  • Demonstrate How to Add a Worker Node
  • Configure and Run the HDFS Balancer
  • Decommission and Re-commission a Worker Node
  • Describe the Process of Moving a Master Component
  • Summarize the Purpose and Operation of Apache Ambari Metrics
  • Describe the Features and Benefits of the Apache Ambari Dashboard
  • Summarize the Purpose and Benefits of Apache Ambari Blueprints
  • Recall the Process Used to Deploy a Cluster Using Ambari Blueprints
  • Recall the Definition of an HDP Stack and Interpret its Version Number
  • View the Current Stack and Identify Compatible Apache Ambari Software Versions
  • Recall the Types of Methods and Upgrades Available in HDP
  • Describe the Upgrade Process, Restrictions and Pre-upgrade Checklist
  • Perform an Upgrade Using the Apache Ambari Web UI


DAY 4 LABS

  • Configuring NameNode HA
  • Configuring Resource Manager HA
  • Adding, Decommissioning and Re-commissioning a Worker Node
  • Configuring Ambari Alerts
  • Deploying an HDP Cluster Using Ambari Blueprints
  • Performing an HDP Upgrade - Express

 

 

HDP Operations: Administration Foundationshttp://springhouse.com/course-catalog/HW HDP OPS FNDHDP Operations: Administration Foundations

Get More Information
Name:

Phone:  

Email:  

Comments:

Help us prove you're not a robot:
 

 ‭(Hidden)‬ Catalog-Item Reuse

Microsoft Gold Partner

PMI R.E.P.

AXELOS Limited

The Microsoft Gold CPLS logo is a mark of Microsoft, Inc.

The PMI R.E.P. logo is a mark of the Project Management Institute, Inc.

ITIL® is a registered trade mark of AXELOS Limited.
IT Infrastructure Library® is a registered trade mark of AXELOS Limited
The Swirl logo™ is a registered trade mark of AXELOS Limited
Accredited course material is property of ITSM Academy.

Connect with us

Springhouse Education & Consulting Services

Corporate HQ:Eagleview Corporate Park
707 Eagleview Boulevard
Suite 207
Exton, PA 19341

610-321-3500 - info@springhouse.com