32 hours of Classroom training
100% Money Back Guarantee*
20 hours of self-paced video
Includes 4 real industry-based projects
Prepares for Cloudera CCAH ‘CCA-500’ certification exam
Includes 3 simulation exams aligned to ‘CCA-500’ certification exam
Educera's Big Data Hadoop Administrator Training provides an in-depth understanding of Hadoop framework, HDFS, and Hadoop cluster including Sqoop, Flume, Pig, Hive, and Impala. You will learn about cluster management solutions, core Hadoop distribution, and Cloudera Manager.
Big Data Hadoop Administrator Training is best suited for:
Systems administrators and IT managers
IT administrators and operators
IT Systems Engineer
Data Engineer and database administrators
Data Analytics Administrator
Cloud Systems Administrator
Web Engineer
After completing this Big Data Hadoop Administrator Training, you will be able to:
Understand the fundamentals of Big Data and its characteristics, various scalability options to help organizations manage Big Data.
Master the concepts of the Hadoop framework; its architecture, working of Hadoop distributed file system and deployment of Hadoop cluster using core or vendor-specific distributions.
Learn about cluster management solutions such as Cloudera manager and its capabilities for setup, deploying, maintenance & monitoring of Hadoop Clusters.
Learn Hadoop Administration activities
Learn about computational frameworks for processing Big Data
Learn about Hadoop clients, nodes for clients and web interfaces like HUE to work with Hadoop Cluster
Learn about Cluster planning and tools for data ingestion into Hadoop clusters
Learn about Hadoop components within Hadoop ecosystem like Hive, HBase, Spark, and Kafka
Understand security implementation to secure data and clusters.
Learn about Hadoop cluster monitoring activities
Lesson 1: Big Data & Hadoop Introduction
Big Data Hadoop Administrator Training course you will learn about Big Data characteristics need for a framework such as Hadoop & its ecosystem. You will also be introduced to important daemons that support functioning of a Hadoop cluster. Topics covered are:
Data & Existing Solutions
Welcome to the world of Big Data—What, Why & Where
Case studies
Hadoop & its Ecosystem
Hadoop Core components
Hadoop & its capabilities
Lesson 2: HDFS – Hadoop Distributed File System & Hadoop’s Distributions
In this lesson, you will learn about Hadoop Distributed file System, its architecture, working & internals, Hadoop different distributions and about their similarities & differences. Topics covered are:
Gain knowledge on HDFS its internals, working & features
Learn about possibilities without HDFS
Differentiate or find similarities in different distributions of Hadoop.
Identify the requirements to set up a Hadoop cluster
Lesson 3: Hadoop Cluster Setup & Working with Hadoop Cluster
In this lesson, you will learn about steps to setup Apache Hadoop (core distribution) & Cloudera Distribution of Hadoop (vendor specific), cluster management solutions and their benefits and nut & bolts of Cloudera Distribution of Hadoop. You will also learn how to verify your cluster. Topics covered are:
The need for Cluster Management Solution
Choice of Installation methods—Automated/ Manual
Linux machines setup—Virtualization & Cloud
Hadoop Cluster Setup—Apache Hadoop V2 & Cloudera Distribution of Hadoop (CDH)
Cloudera manager features and capabilities
Working with Hadoop cluster, HDFS & data
Working with management console/ UI ( user interfaces) & Linux terminals
Understand administration scenarios
Lesson 4: Hadoop Configurations & Daemon Logs
In this lesson, you will learn about configuration files, ports & properties that relate to the functioning of Hadoop cluster. You will also learn about Hadoop daemons logs and how they help in problem scenarios for diagnosing & gathering information. Topics covered are:
List and describe the files that control Hadoop configuration
Explain how to manage Hadoop configuration with Cloudera Manager
Locate configuration files and make changes
Explain how to deal with stale configurations
Explain the properties of addresses and ports of RPC and HTTP servers run by Hadoop Daemons
Locate log files generated on hosts
Filter information in log files
Explain how to get diagnostic information from log files
Lesson 5: Hadoop Cluster Maintenance & Administration
In this lesson, you will learn Hadoop cluster maintenance and administration activities. You will also learn the shortcomings of Hadoop v1 and how they are fulfilled by Hadoop v2 features. Topics covered are:
Explain how to add and remove nodes in an ad-hoc way
Explain how to add and remove nodes in a systematic way, otherwise known as commissioning and decommissioning of nodes
Explain how to balance a cluster
List the steps for managing services including adding, deleting, starting, stopping and checking the status of services
Explain the procedure to enable rack awareness
List the steps to add, remove and move role instances and hosts
Cite the challenges faced with the first version of Hadoop
Explain the features in the second version that help overcome the challenges faced in the first version
Lesson 6: Hadoop Computational Frameworks
In this lesson, you will learn about different types of computational frameworks, MapReduce & YARN concepts & configurations and how YARN manages applications. Topics covered are:
Describe the role of computational frameworks
Explain MapReduce concepts
Describe MRv2 on YARN
Explain configuring and understanding of YARN
Describe YARN applications
Describe YARN memory and CPU settings
Lesson 7: Scheduling—Managing resources via Schedulers
In this lesson, you will learn cluster scheduling concepts, managing resources in your YARN cluster by usage of schedulers & queue management to manage jobs/applications. Topics covered are:
Describe the scheduling concepts
Indentify the Schedulers
Explain the ways to manage resources using Schedulers
Describe FIFO, Fair Scheduler, and Capacity Scheduler
Explain how to configure Schedulers
Explain queue management
Lesson 8: Hadoop Cluster Planning
In this lesson you will learn about how to plan your Hadoop cluster, considerations for cluster sizing & workload patterns in Hadoop cluster, making choices pertaining to variables such as hardware, software & different cluster deployment options. Topics covered are:
Planning Hadoop Cluster
General Planning considerations
Workload and cluster sizing
Making Choices—Hardware, Software & Network
Making Choices—Master/Slave considerations
News from the world—Existing Setups
Lesson 9: Hadoop Clients & HUE interface
In this lesson you will learn about Hadoop clients, nodes that support Hadoop clients and web interface such as HUE which can be used to work with Hadoop cluster and its components. Topics covered are:
Explain the concepts of Hadoop client, edge nodes, and gateway nodes
Install and configure Hadoop clients
Explain how Hue works
Install and configure Hue
Describe how authentication and authorization is managed in Hue
Lesson 10: Data Ingestion in Hadoop Cluster
In this lesson you will learn about data ingestion types & tools. You will learn more about tools such as Flume, Sqoop that can be used for data import/export. Topics covered are:
Understand Data Ingestion & its types
Knowing about various data ingestion tools & their capabilities
Understanding how Flume works
Understanding how sqoop works
In this lesson you will learn about open-source components (also known as services in CDH) that work within Hadoop ecosystem such as Hive, Hbase, kafka & Spark. Topics covered are:
List some of the services and open-source components that work within the Hadoop ecosystem
List the advantages and key features of Hive
Describe briefly about the components of Hive
Explain how to configure Hive in different modes
Explain the architecture of HBase and cite the advantages of using HBase
Explain the working of Apache Kafka
Describe the architecture of Apache Spark
Lesson 12: Hadoop Security—Securing Hadoop Cluster
In this lesson you will learn about security aspects and security implementation in a Hadoop cluster to secure data & cluster. Topics covered are:
Describe the different ways to avoid risks and secure data
Identify the different threat categories
Describe the security aspects for different nodes
Describe operating system security
Describe Kerberos and how it works
Describe Service Level Authorization
In this lesson you will learn about basics of cluster monitoring, choosing right monitoring solutions, Hadoop metrics categories & types and Cloudera manager’s features and capabilities that can be used for monitoring your Hadoop cluster. Topics covered are:
Describe cluster monitoring
Describe the ways to choose the right monitoring solutions
List the features and considerations of Cloudera manager for monitoring
Describe the different categories of Hadoop Metrics
List the different types of Hadoop Metrics
List the steps to monitor a cluster by using Cloudera Manager
Why Educera?
Educera’s training is the best and value for time & money invested. We stand out because our customers:
Get trained at the best price compared to other training providers.
Get trained by the best trainer in the industry.
Get access to course specific learning videos.
Get 100% Money back guarantee.
Training Fee:
Early Bird: Booking at least one month prior to the class start date
Training Venue:
Venue will be confirmed to the classroom participants one week prior to the workshop start date and online participants will get the session attendance link before 4- 5 days of the training start date.
Training Courses: CAPM | PMP | LSSGB | ITIL | CSM | CEH | PMI-ACP | CCBA | CBAP | CEH
Please contact us for more details