Class schedule All Batches
  • Apr 22 - May 14 Weekend classes 09:00 - 13:00 CDT 8 sessions
    • Apr
    • Sat 22
    • Sun 23
    • Sat 29
    • Sun 30
    • May
    • Sat 06
    • Sun 07
    • Sat 13
    • Sun 14
  • May 27 - Jun 18 Weekend classes 09:00 - 13:00 CDT 8 sessions
    • May
    • Sat 27
    • Sun 28
    • Jun
    • Sat 03
    • Sun 04
    • Sat 10
    • Sun 11
    • Sat 17
    • Sun 18

Key features

MONEY BACK GUARANTEE

How this works:

At Simplilearn, we greatly value the trust of our patrons. Our courses were designed to deliver an effective learning experience, and have helped over half a million find their professional calling. But if you feel your course is not to your liking, we offer a 7-day money-back guarantee. Just send us a refund request within 7 days of purchase, and we will refund 100% of your payment, no questions asked!

For Self-Paced Learning:

Raise a refund request within 7 days of purchasing the course. The money-back guarantee is void if the participant has accessed more than 25% of the content.

For Instructor-Led Training:

Raise a refund request within 7 days of the start of the first batch you are eligible to attend. The money-back guarantee is void if the participant has accessed more than 25% of the content of an e-learning course or has attended Online Classrooms for more than 1 day.

  • 32 hours of instructor-led training
  • 20 hours of self-paced video
  • Includes 4 real industry-based projects
  • Prepares you for the Cloudera CCAH ‘CCA-500’ certification exam
  • Includes 3 simulation exams aligned to the ‘CCA-500’ certification exam

Course description

  • What is the focus of this course?

    The Simplilearn Big Data and Hadoop Administrator course will prepare you for Cloudera’s CCAH ‘CCA-500’ certification and equip you with the skills for your next Big Data admin assignment. This course covers the core Hadoop distribution, Apache Hadoop, and a vendor-specific distribution, CDH (Cloudera Distribution of Hadoop).

    You will learn why cluster management solutions are needed and what Cloudera Manager can do. The course teaches you how to set up a Hadoop cluster and its components, such as Sqoop, Flume, Pig, Hive, and Impala, with basic or advanced configurations. It also explains what the Hadoop Distributed File System (HDFS) and Hadoop’s processing and computation frameworks are, and how to plan, secure, safeguard, and monitor a cluster.

    This course will help you understand the basic and advanced concepts of Big Data and the technologies and components that make up the Hadoop stack and ecosystem.

  • What learning outcomes can be expected?

    After completing this course, you will be able to:
    • Understand the fundamentals of Big Data, its characteristics, and the scalability options that help organizations manage it
    • Master the concepts of the Hadoop framework: its architecture, the working of the Hadoop Distributed File System, and the deployment of a Hadoop cluster using core or vendor-specific distributions
    • Learn about cluster management solutions such as Cloudera Manager and its capabilities for setting up, deploying, maintaining, and monitoring Hadoop clusters
    • Learn Hadoop administration activities
    • Learn about computational frameworks for processing Big Data
    • Learn about Hadoop clients, client nodes, and web interfaces such as Hue for working with a Hadoop cluster
    • Learn about cluster planning and tools for data ingestion into Hadoop clusters
    • Learn about components within the Hadoop ecosystem such as Hive, HBase, Spark, and Kafka
    • Understand how security is implemented to protect data and clusters
    • Learn about Hadoop cluster monitoring activities

  • Who should do this course?

    Big Data career opportunities are on the rise, and Hadoop is quickly becoming a must-know technology for the following professionals:
    • Systems administrators and IT managers
    • IT administrators and operators
    • IT systems engineers
    • Data engineers and database administrators
    • Data analytics administrators
    • Cloud systems administrators
    • Web engineers

  • What projects are included in this course?

    Successful evaluation of one of the following 2 projects is part of the certification eligibility criteria.

    Project 1
    Scalability: Deploying Multiple Clusters
    Your company wants to set up a new cluster and has procured new machines; however, setting up clusters on the new machines will take time. Meanwhile, your company wants you to set up a new cluster on the existing set of machines and start testing how the new cluster and its applications work.

    Project 2
    Working with Cluster
    Demonstrate your understanding of the following tasks (give the steps):
    • Enabling and disabling HA for the NameNode and ResourceManager in CDH
    • Removing the Hue service from your cluster, which has other services such as Hive, HBase, HDFS, and YARN set up
    • Adding a user and granting read access to your Cloudera cluster
    • Changing the replication factor and block size of your cluster
    • Adding Hue as a service, logging in as user HUE, and downloading examples for Hive, Pig, Job Designer, etc.
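
To get a feel for what changing the replication factor and block size means for a cluster, the arithmetic can be sketched in a few lines of Python. This is an illustration only and not part of the course material: the function name `hdfs_footprint` is hypothetical, and the example values (128 MB blocks and 3 replicas, the usual defaults in Hadoop 2) are assumptions about the cluster.

```python
def hdfs_footprint(file_size_bytes, block_size_bytes=128 * 1024**2, replication=3):
    """Return (block_count, raw_bytes_stored) for a single file in HDFS.

    A file is split into blocks of at most block_size_bytes; the last block
    only occupies as much disk space as the data it holds. Each block is
    then stored `replication` times across the cluster.
    """
    if file_size_bytes < 0 or block_size_bytes <= 0 or replication < 1:
        raise ValueError("sizes must be non-negative and replication at least 1")
    # Integer ceiling division: number of blocks needed to hold the file.
    block_count = (file_size_bytes + block_size_bytes - 1) // block_size_bytes
    # Raw disk usage depends only on file size and replication factor.
    raw_bytes = file_size_bytes * replication
    return block_count, raw_bytes


one_gib = 1024**3
print(hdfs_footprint(one_gib))                                   # assumed defaults
print(hdfs_footprint(one_gib, block_size_bytes=256 * 1024**2))   # larger blocks
print(hdfs_footprint(one_gib, replication=2))                    # fewer replicas
```

Under these assumptions, doubling the block size halves the number of blocks the NameNode has to track, while the replication factor alone determines how much raw disk space the file consumes.
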
    For further practice, there are 2 more projects to help you start your Hadoop administrator journey.

    Project 3
    Data Ingestion and Usage
    Ingesting data from external structured databases into HDFS.

    Working with data on HDFS by loading it into a data warehouse package such as Hive; using HiveQL to query and analyze the data and to load it into another set of tables for further use.

    Your organization already has a large amount of data in an RDBMS and has now set up a Big Data practice. It is interested in moving data from the RDBMS into HDFS so that it can perform data analysis using software packages such as Apache Hive. The organization would like to take advantage of HDFS features such as automatic replication and fault tolerance.

    Project 4
    Securing Data and Cluster
    Protecting data stored in your Hadoop cluster by safeguarding it and backing it up.

    Your organization has multiple Hadoop clusters and would like to safeguard its data on multiple clusters. The aim is to prevent data loss from accidental deletes and to make critical data available to users/applications even if one or more of these clusters are down.

Course preview

    • Lesson 00 - Course Introduction 05:41
      • Course Introduction 05:41
    • Lesson 01 - Big Data and Hadoop - Introduction 18:55
      • 1.1 Big Data and Hadoop - Introduction 18:00
      • 1.2 Quiz
      • 1.3 Key Takeaways 00:55
    • Lesson 02 - HDFS Hadoop Distributed File System 28:44
      • 2.1 Introduction to HDFS 14:54
      • 2.2 Internal Architecture and HDFS Workflow 12:36
      • 2.3 Quiz
      • 2.4 Key Takeaways 01:14
    • Lesson 03 - Hadoop Cluster Setup and Working 2:48:56
      • 3.1 Hadoop Cluster Setup and Working 01:32
      • 3.2 Demo1: Getting Virtualization software and Linux disc images 03:21
      • 3.3 Demo2: Adding Machines to your VMBox 02:58
      • 3.4 Demo3: Installing Linux into Machines 14:07
      • 3.5 Demo4: Preparing your Linux machines to Install Hadoop (CentOS 6) 24:32
      • 3.6 Demo5: Preparing your Linux machine (CentOS 6) 10:32
      • 3.7 Demo6: Preparing your Linux machines (CentOS 7) 07:27
      • 3.8 Cluster Management Solution 13:13
      • 3.9 Demo7: Setting Apache Hadoop Cluster 27:05
      • 3.10 Demo8: Writing data to cluster and checking replication status 13:29
      • 3.11 Demo9: Setting up Linux machines in AWS EC2 to set up a Cloudera Cluster 30:05
      • 3.12 Demo10: Setting Cloudera Cluster on your machines in AWS EC2 19:43
      • 3.13 Quiz
      • 3.14 Key Takeaways 00:52
    • Lesson 04 - Hadoop Configurations and Daemon Logs 46:14
      • 4.1 Hadoop Configurations and Daemon Logs 27:02
      • 4.2 Hadoop Daemons or Roles 17:22
      • 4.3 Quiz
      • 4.4 Key Takeaways 01:50
    • Lesson 05 - Hadoop Cluster Maintenance and Administration 2:02:45
      • 5.1 Introduction 25:55
      • 5.2 Demo - Commissioning and Decommissioning of DataNodes in Cloudera Cluster 07:08
      • 5.3 Demo - Decommissioning and commissioning nodes in Apache Hadoop Cluster 16:54
      • 5.4 Balancing a Cluster 06:59
      • 5.5 Managing Services 12:49
      • 5.6 Managing Software Packages with Apache Hadoop 19:31
      • 5.7 Managing Role Instances 11:14
      • 5.8 Improvements in Hadoop Version 2 19:59
      • 5.9 Quiz
      • 5.10 Key Takeaways 02:16
    • Lesson 06 - Hadoop Computational Frameworks 49:35
      • 6.1 Computation Framework 09:28
      • 6.2 MapReduce 25:17
      • 6.3 YARN 13:23
      • 6.4 Quiz
      • 6.5 Key Takeaways 01:27
    • Lesson 07 - Scheduling: Managing Resources 47:56
      • 7.1 Scheduling: Managing Resources 19:03
      • 7.2 Capacity Scheduler 27:51
      • 7.3 Quiz
      • 7.4 Key Takeaways 01:02
    • Lesson 08 - Hadoop Cluster Planning 21:12
      • 8.1 Hadoop Cluster Planning 11:30
      • 8.2 Cluster Setup Options 08:24
      • 8.3 Quiz
      • 8.4 Key Takeaways 01:18
    • Lesson 09 - Hadoop Clients and Hue Interface 50:20
      • 9.1 Hadoop Clients and Hue Interface 14:16
      • 9.2 Overview of Hadoop User Experience (Hue) 03:18
      • 9.3 Hue Application Interfaces 13:49
      • 9.4 Demo Working with Hue 17:12
      • 9.5 Quiz
      • 9.6 Key Takeaways 01:45
    • Lesson 10 - Data Ingestion in Hadoop Cluster 47:17
      • 10.1 Data Ingestion in Hadoop Cluster 16:31
      • 10.2 Structured Data Ingestion with Apache Sqoop 08:08
      • 10.3 Demo Using Sqoop to Import Data into HDFS 21:35
      • 10.4 Quiz
      • 10.5 Key Takeaways 01:03
    • Lesson 11 - Hadoop Ecosystem Components/Services 1:21:50
      • 11.1 Hadoop Ecosystem Components/Services 20:52
      • 11.2 Demo Setting up Hive in Different Modes in Apache Hadoop Cluster 18:54
      • 11.3 HBase 21:32
      • 11.4 Apache Kafka 19:32
      • 11.5 Quiz
      • 11.6 Key Takeaways 01:00
    • Lesson 12 - Hadoop Security 1:12:02
      • 12.1 Introduction 10:38
      • 12.2 Implementation 34:06
      • 12.3 Service Level Authorization 09:54
      • 12.4 Demo Using Quotas to Control Amount of Data Written in HDFS 15:23
      • 12.5 Quiz
      • 12.6 Key Takeaways 02:01
    • Lesson 13 - Hadoop Cluster Monitoring 49:07
      • 13.1 Hadoop Cluster Monitoring 09:51
      • 13.2 Hadoop Cluster Monitoring Metrics 38:07
      • 13.3 Quiz
      • 13.4 Key Takeaways 01:09
    • Course Feedback
      • Course Feedback

Exam & certification

  • What do I need to do to unlock my Simplilearn certificate?

    Online Classroom:
    • Complete 1 project and 1 simulation test with a minimum score of 80%.
    Online Self-Learning:
    • Complete 1 project and 1 simulation test with a minimum score of 80%.

Reviews

The course provided an extensive lab and lecture.

Hadoop Admin course hits the spot with the content. I am a sys admin and wanted to move to Big Data Industry. Course immensely helped me to bridge the skills gap and become a Hadoop admin.

Excellent course! It was a good learning process.

Excellent certification course for building job relevant skillsets as a Hadoop Administrator. Course material is very relevant, demos provided are understandable, great examples and can be easily related.

A great value addition for anyone who is interested in becoming a Big data Hadoop Administrator.

Trainer has great insights into the subject. Brilliant way of explaining the concepts which could directly fit into your brain. Blessed to get trained by such a Hadoop Admin Trainer and thanks to Simplilearn for creating this platform.

It's a very good class. The material is designed very well... I like your presentation and the style of teaching...

FAQs

  • What are the System Requirements?

    To run Hadoop, your system needs to fulfil the following requirements:
    • 64-bit Operating System
    • 4GB RAM
    We will help you to set up a Virtual Machine with local access.
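
As a quick self-check against the two requirements above, a short Python sketch like the following can report whether a machine appears to qualify. It is not an official Simplilearn tool: the function names are hypothetical, the 64-bit test is a heuristic based on the CPU architecture string, and RAM detection via `os.sysconf` only works on Unix-like systems.

```python
import os
import platform

MIN_RAM_BYTES = 4 * 1024**3  # 4 GB, as listed in the requirements


def meets_requirements(is_64bit, ram_bytes, min_ram_bytes=MIN_RAM_BYTES):
    """Return True if the machine is 64-bit and has at least the minimum RAM."""
    return bool(is_64bit) and ram_bytes >= min_ram_bytes


def detect_ram_bytes():
    """Best-effort physical RAM in bytes; None where os.sysconf is unavailable."""
    try:
        return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    except (AttributeError, ValueError, OSError):
        return None


# Heuristic: common names reported for 64-bit CPU architectures.
is_64bit = platform.machine().lower() in {"x86_64", "amd64", "aarch64", "arm64"}
ram = detect_ram_bytes()
if ram is None:
    print("Could not detect RAM on this platform; please check manually.")
else:
    print("64-bit:", is_64bit,
          "| RAM (GiB):", round(ram / 1024**3, 1),
          "| meets requirements:", meets_requirements(is_64bit, ram))
```

Operating systems often report slightly less memory than the nominal installed size, so a result just under 4 GiB is worth checking by hand rather than treating as a hard failure.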

  • Who are the trainers?

    The trainings are delivered by highly qualified and certified instructors with relevant industry experience.

  • What are the modes of training offered for this course?

    We offer this training in the following modes:
    • Online Self-Learning: In this mode, you receive the lecture videos and can go through the course at your convenience.
    • Live Virtual Classroom or Online Classroom: In online classroom training, you attend the course remotely from your desktop via video conferencing. This format limits disruption to your work and reduces the time you spend away from work or home.

  • Can I cancel my enrolment? Do I get a refund?

    Yes, you can cancel your enrolment. We provide a complete refund after deducting the administration fee. To know more, please go through our Refund Policy.

  • What are the payment options?

    Payments can be made using any of the following options, and a receipt will be sent to you automatically by email.
    • Visa debit/credit card
    • American Express and Diners Club cards
    • Mastercard
    • PayPal

  • I want to know more about the training program. Whom do I contact?

    Please join our Live Chat for instant support, call us, or Request a Call Back to have your query resolved.

  • Who are our faculty and how are they selected?

    All our trainers are working professionals and industry experts with at least 10 years of relevant teaching experience.

    Each of them has gone through a rigorous selection process that includes profile screening, technical evaluation, and a training demo before being certified to train for us.

    We also ensure that only those trainers with a high alumni rating continue to train for us.

  • What is Global Teaching Assistance?

    Our teaching assistants are here to help you get certified on your first attempt.

    They are a dedicated team of subject matter experts who help you at every step and enrich your learning experience, from class onboarding to project mentoring and job assistance.

    They engage with the students proactively to ensure the course path is followed.

    Teaching Assistance is available during business hours.

  • What is covered under the 24/7 Support promise?

    We offer 24/7 support through email, chat, and calls.

    We also have a dedicated team that provides on-demand assistance through our community forum. What’s more, you will have lifetime access to the community forum, even after you complete your course with us.

Contact Us

+1-844-532-7688

(Toll Free)
