Course description

  • What are the objectives of this course?

    The Simplilearn Big Data and Hadoop Administrator course will equip you with all the skills you’ll need for your next Big Data admin assignment. You will learn to work with Hadoop’s Distributed File System, its processing and computation frameworks, core Hadoop distributions, and vendor-specific distributions such as Cloudera. You will also learn why cluster management solutions are needed and how to set up, secure, safeguard and monitor clusters and their components, such as Sqoop, Flume, Pig, Hive and Impala.

    This Hadoop Admin training course will help you understand the basic and advanced concepts of Big Data and all of the technologies related to the Hadoop stack and components of the Hadoop Ecosystem.

  • What skills will you learn?

    After completing this Hadoop Admin course, you will be able to:
    • Understand the fundamentals and characteristics of Big Data and various scalability options available to help organizations manage Big Data
    • Master the concepts of the Hadoop framework, including its architecture, the Hadoop Distributed File System and the deployment of Hadoop clusters using core or vendor-specific distributions
    • Use Cloudera Manager for setup, deployment, maintenance and monitoring of Hadoop clusters
    • Understand Hadoop Administration activities and computational frameworks for processing Big Data
    • Work with Hadoop clients, client nodes and web interfaces such as Hue to interact with a Hadoop cluster
    • Plan clusters, use tools for data ingestion into Hadoop clusters, and carry out cluster monitoring activities
    • Use Hadoop ecosystem components such as Hive, HBase, Spark and Kafka
    • Understand security implementation to secure data and clusters

  • Who should take this course?

    Big Data career opportunities are on the rise, and Hadoop is quickly becoming a must-know technology for the following professionals:
    • Systems administrators and IT managers
    • IT administrators and operators
    • IT Systems Engineers
    • Data Engineers and database administrators
    • Data Analytics Administrators
    • Cloud Systems Administrators
    • Web Engineers
    • Individuals who intend to design, deploy and maintain Hadoop clusters

  • What projects are included in this course?

    Successful evaluation of one of the following two projects is part of the Hadoop Admin certification eligibility criteria:

    Project 1
    Scalability: Deploying Multiple Clusters
    Your company wants to set up a new cluster and has procured new machines; however, setting up clusters on new machines will take time. In the meantime, your company wants you to set up a new cluster on the same set of machines and start testing how the new cluster and its applications work.

    Project 2
    Working with Clusters
    Demonstrate your understanding of the following tasks (give the steps):
    • Enabling and disabling HA for the NameNode and ResourceManager in CDH
    • Removing the Hue service from your cluster, which has other services such as Hive, HBase, HDFS, and YARN set up
    • Adding a user and granting read access to your Cloudera cluster
    • Changing the replication factor and block size of your cluster
    • Adding Hue as a service, logging in as user HUE, and downloading examples for Hive, Pig, Job Designer, and others
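
    To make the replication and block size task above more concrete, here is a minimal sketch of the arithmetic behind those two settings. It assumes the common Hadoop 2+ defaults of a 128 MB block size (`dfs.blocksize`) and a replication factor of 3 (`dfs.replication`); your cluster may be configured differently. The function name `hdfs_footprint` is hypothetical and is not part of Hadoop or the course material, and the estimate ignores NameNode metadata and checksum overhead.

```python
def hdfs_footprint(file_size_bytes, block_size_bytes=128 * 1024**2, replication=3):
    """Estimate how a single file is laid out in HDFS.

    Hypothetical helper for illustration only (not a Hadoop API).
    Returns a tuple: (blocks, block_replicas, raw_bytes).
    """
    if file_size_bytes < 0 or block_size_bytes <= 0 or replication < 1:
        raise ValueError("sizes must be non-negative and replication at least 1")

    # Integer ceiling division: the file is split into fixed-size blocks,
    # and the final block may be only partly full.
    blocks = (file_size_bytes + block_size_bytes - 1) // block_size_bytes

    # Each block is stored `replication` times across the DataNodes.
    block_replicas = blocks * replication

    # A partly full block only occupies its actual length on disk,
    # so raw usage is the file size times the replication factor.
    raw_bytes = file_size_bytes * replication

    return blocks, block_replicas, raw_bytes


if __name__ == "__main__":
    one_gib = 1024**3
    # Assumed defaults: 128 MB blocks, replication factor 3.
    print(hdfs_footprint(one_gib))
    # Larger blocks and lower replication mean fewer block replicas to track.
    print(hdfs_footprint(one_gib, block_size_bytes=256 * 1024**2, replication=2))
```

    Under these assumptions, a 1 GiB file is stored as 8 blocks and 24 block replicas and uses 3 GiB of raw disk space. Doubling the block size halves the number of blocks the NameNode has to track, while lowering the replication factor reduces raw storage at the cost of fault tolerance.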

    For additional practice, we offer two more projects to help you start your Hadoop administrator journey:

    Project 3
    Data Ingestion and Usage
    Ingesting data from external structured databases into HDFS, working on data on HDFS by loading it into a data warehouse package like Hive, and using HiveQL for querying, analyzing, and loading data in another set of tables for further usage.

    Your organization already has a large amount of data in an RDBMS and has now set up a Big Data practice. It is interested in moving data from the RDBMS into HDFS so that it can perform data analysis by using software packages such as Apache Hive. The organization would like to leverage the benefits of HDFS and features such as auto replication and fault tolerance that HDFS offers.

    Project 4
    Securing Data and Cluster
    Protecting data stored in your Hadoop cluster by safeguarding it and backing it up.

    Your organization would like to safeguard its data on multiple Hadoop clusters. The aim is to prevent data loss from accidental deletes and to make critical data available to users/applications even if one or more of these clusters is down.

  • Why become a Big Data Hadoop Administrator?

    The world is becoming increasingly digital, which means big data is here to stay. The importance of big data and data analytics will continue to grow in the coming years, and a career in this field might be just the role you have been looking for to meet your career expectations.

    Professionals working in this field can expect an impressive salary: the median salary for data scientists is $116,000. Even entry-level roles pay well, with average earnings of $92,000.

Course preview

    • Lesson 00 - Course Introduction 05:41
      • Course Introduction 05:41
    • Lesson 01 - Big Data and Hadoop - Introduction 18:55
      • 1.1 Big Data and Hadoop - Introduction 18:00
      • 1.2 Quiz
      • 1.3 Key Takeaways 00:55
    • Lesson 02 - HDFS Hadoop Distributed File System 28:44
      • 2.1 Introduction to HDFS 14:54
      • 2.2 Internal Architecture and HDFS Workflow 12:36
      • 2.3 Quiz
      • 2.4 Key Takeaways 01:14
    • Lesson 03 - Hadoop Cluster Setup and Working 2:48:56
      • 3.1 Hadoop Cluster Setup and Working 01:32
      • 3.2 Demo1: Getting Virtualization software and Linux disc images 03:21
      • 3.3 Demo2: Adding Machines to your VMBox 02:58
      • 3.4 Demo3: Installing Linux into Machines 14:07
      • 3.5 Demo4: Preparing your Linux machines to install Hadoop (CentOS 6) 24:32
      • 3.6 Demo5: Preparing your Linux machine (CentOS 6) 10:32
      • 3.7 Demo6: Preparing your Linux machines (CentOS 7) 07:27
      • 3.8 Cluster Management Solution 13:13
      • 3.9 Demo7: Setting Apache Hadoop Cluster 27:05
      • 3.10 Demo8: Writing data to cluster and checking replication status 13:29
      • 3.11 Demo9: Setting up Linux machines in AWS EC2 to set up a Cloudera Cluster 30:05
      • 3.12 Demo10: Setting Cloudera Cluster on your machines in AWS EC2 19:43
      • 3.13 Quiz
      • 3.14 Key Takeaways 00:52
    • Lesson 04 - Hadoop Configurations and Daemon Logs 46:14
      • 4.1 Hadoop Configurations and Daemon Logs 27:02
      • 4.2 Hadoop Daemons or Roles 17:22
      • 4.3 Quiz
      • 4.4 Key Takeaways 01:50
    • Lesson 05 - Hadoop Cluster Maintenance and Administration 2:02:45
      • 5.1 Introduction 25:55
      • 5.2 Demo - Commissioning and Decommissioning of DataNodes in Cloudera Cluster 07:08
      • 5.3 Demo - Decommissioning and commissioning nodes in Apache Hadoop Cluster 16:54
      • 5.4 Balancing a Cluster 06:59
      • 5.5 Managing Services 12:49
      • 5.6 Managing Software Packages with Apache Hadoop 19:31
      • 5.7 Managing Role Instances 11:14
      • 5.8 Improvements in Hadoop Version 2 19:59
      • 5.9 Quiz
      • 5.10 Key Takeaways 02:16
    • Lesson 06 - Hadoop Computational Frameworks 49:35
      • 6.1 Computation Framework 09:28
      • 6.2 MapReduce 25:17
      • 6.3 YARN 13:23
      • 6.4 Quiz
      • 6.5 Key Takeaways 01:27
    • Lesson 07 - Scheduling: Managing Resources 47:56
      • 7.1 Scheduling: Managing Resources 19:03
      • 7.2 Capacity Scheduler 27:51
      • 7.3 Quiz
      • 7.4 Key Takeaways 01:02
    • Lesson 08 - Hadoop Cluster Planning 21:12
      • 8.1 Hadoop Cluster Planning 11:30
      • 8.2 Cluster Setup Options 08:24
      • 8.3 Quiz
      • 8.4 Key Takeaways 01:18
    • Lesson 09 - Hadoop Clients and Hue Interface 50:20
      • 9.1 Hadoop Clients and Hue Interface 14:16
      • 9.2 Overview of Hadoop User Experience (Hue) 03:18
      • 9.3 Hue Application Interfaces 13:49
      • 9.4 Demo Working with Hue 17:12
      • 9.5 Quiz
      • 9.6 Key Takeaways 01:45
    • Lesson 10 - Data Ingestion in Hadoop Cluster 47:17
      • 10.1 Data Ingestion in Hadoop Cluster 16:31
      • 10.2 Structured Data Ingestion with Apache Sqoop 08:08
      • 10.3 Demo Using Sqoop to Import Data into HDFS 21:35
      • 10.4 Quiz
      • 10.5 Key Takeaways 01:03
    • Lesson 11 - Hadoop Ecosystem Components/Services 1:21:50
      • 11.1 Hadoop Ecosystem Components/Services 20:52
      • 11.2 Demo Setting up Hive in Different Modes in Apache Hadoop Cluster 18:54
      • 11.3 HBase 21:32
      • 11.4 Apache Kafka 19:32
      • 11.5 Quiz
      • 11.6 Key Takeaways 01:00
    • Lesson 12 - Hadoop Security 1:12:02
      • 12.1 Introduction 10:38
      • 12.2 Implementation 34:06
      • 12.3 Service Level Authorization 09:54
      • 12.4 Demo Using Quotas to Control Amount of Data Written in HDFS 15:23
      • 12.5 Quiz
      • 12.6 Key Takeaways 02:01
    • Lesson 13 - Hadoop Cluster Monitoring 49:07
      • 13.1 Hadoop Cluster Monitoring 09:51
      • 13.2 Hadoop Cluster Monitoring Metrics 38:07
      • 13.3 Quiz
      • 13.4 Key Takeaways 01:09
    • Course Feedback
      • Course Feedback

Exam & certification

  • What do I need to do to unlock my Simplilearn certificate?

    Online Classroom:
    • Attend one complete batch.
    • Complete one project and one simulation test with a minimum score of 80%.
    Online Self-Learning:
    • Complete 85% of the course.
    • Complete one project and one simulation test with a minimum score of 80%.

Course advisor

Ronald van Loon, Top 10 Big Data & Data Science Influencer, Director - Adversitement

Named by Onalytica as one of the three most influential people in Big Data, Ronald is also an author for a number of leading Big Data and Data Science websites, including Datafloq, Data Science Central, and The Guardian. He also regularly speaks at renowned events.


Olga Barrett, Career Advisor @ CV Wizard of OZ, Perth

It's a very good class. The material is designed very well... I like your presentation and the style of teaching...

Peter Dao, Senior Technical Analyst at Sutter Health, Sacramento

The course provided an extensive lab and lecture.

Amit Goyal, Houston

Hadoop Admin course hits the spot with the content. I am a sys admin and wanted to move to Big Data Industry. Course immensely helped me to bridge the skills gap and become a Hadoop admin.

Sanket Gujar, Senior Associate - Projects (Teradata DBA) at Cognizant, Pune

I had enrolled in SimpliLearn's Big data certification. The course content is very much aligned with market demands. The trainer and the customer support are very co-operative. All the online training materials and sessions are very helpful. Thanks for providing such a wonderful platform for learning.

Manoj Nirale, Programmer Analyst at Cognizant, Hyderabad

Trainer has great insights into the subject. Brilliant way of explaining the concepts which could directly fit into your brain. Blessed to get trained by such a Hadoop Admin Trainer and thanks to Simplilearn for creating this platform.

Suresh Chitithoti, Princ. DBA at Symantec, San Francisco

Excellent course! It was a good learning process.

Amit Kumar, Houston

Excellent certification course for building job relevant skillsets as a Hadoop Administrator. Course material is very relevant, demos provided are understandable, great examples and can be easily related.

Steve Jacobs, Houston

A great value addition for anyone who is interested in becoming a Big data Hadoop Administrator.


  • What are the System Requirements?

    To run Hadoop, your system must fulfill the following requirements:
    • 64-bit Operating System
    • 8GB RAM
    We will help you to set up a Virtual Machine with local access.

  • What are the modes of training offered for this course?

    We offer a flexible set of options:
    • Live Virtual Classroom or Online Classroom: Attend the course remotely from your desktop via video conferencing for better productivity and to reduce the time spent away from work or home.
    • Online Self-learning: Watch lecture videos online at your own pace.

  • Can I cancel my enrollment? Will I get a refund?

    Yes, you can cancel your enrollment if necessary. We will refund the course price after deducting an administration fee. To learn more, you can view our Refund Policy.

  • What payment options are available?

    Payments can be made using any of the following options. You will be emailed a receipt after the payment is made.
    • Visa Credit or Debit Card
    • MasterCard
    • American Express
    • Diners Club
    • PayPal

  • I’d like to learn more about this training program. Whom should I contact?

    Contact us using the form on the right of any page on the Simplilearn website, or select the Live Chat link. Our customer service representatives can provide you with more details.

  • Who are our instructors and how are they selected?

    All of our highly qualified trainers are industry experts with at least 10 years of relevant teaching experience. Each of them has gone through a rigorous selection process that includes profile screening, technical evaluation, and a training demo before they are certified to train for us. We also ensure that only those trainers with a high alumni rating remain on our faculty for the Big Data Hadoop Administration training program.

  • What is Global Teaching Assistance?

    Our teaching assistants are a dedicated team of subject matter experts here to help you get certified on your first attempt. They engage students proactively to ensure the course path is being followed and help you enrich your learning experience, from class onboarding to project mentoring and job assistance. Teaching Assistance is available during business hours.

  • What is covered under the 24/7 Support promise?

    We offer 24/7 support through email, chat, and calls. We also have a dedicated team that provides on-demand assistance through our community forum. What’s more, you will have lifetime access to the community forum, even after completion of your course with us.

  • What if I miss a class?

    • Simplilearn's Flexi-pass lets you fit classes around your busy schedule. It combines the best of online classroom training and self-paced learning, so you can still be trained by world-class faculty with decades of industry experience.
    • With Flexi-pass, Simplilearn gives you access to as many as 15 sessions for 90 days.

  • Are there any group discounts for classroom training programs?

    Yes, we have group discount options for our training programs. Contact us using the form on the right of any page on the Simplilearn website, or select the Live Chat link. Our customer service representatives will give you more details.

    Our Melbourne Correspondence / Mailing address

    Simplilearn Americas, Inc, Level 28, 303 Collins St, Melbourne, 3000, Australia. Call us at: 1-800-982-536
