Successful evaluation of one of the following two projects is part of the Hadoop Admin certification eligibility criteria:
Project 1
Scalability: Deploying Multiple Clusters
Your company wants to set up a new cluster and has procured new machines; however, setting up clusters on new machines will take time. Meanwhile, your company wants you to set up a new cluster on the same set of machines and start testing the new cluster’s working and applications.
Project 2
Working with Clusters
Demonstrate your understanding of the following tasks (give the steps):
- Enabling and disabling HA for namenode and resourcemanager in CDH
- Removing Hue service from your cluster, which has other services such as Hive, Hbase, HDFS, and YARN setup
- Adding a user and granting read access to your Cloudera cluster
- Changing replication and blocksize of your cluster
- Adding Hue as a service, logging in as user HUE, and downloading examples for Hive, Pig, job designer, and others
For additional practice we offer two more projects to help you start your Hadoop administrator journey:
Project 3
Data Ingestion and Usage
Ingesting data from external structured databases into HDFS, working on data on HDFS by loading it into a data warehouse package like Hive, and using HiveQL for querying, analyzing, and loading data in another set of tables for further usage.
Your organization already has a large amount of data in an RDBMS and has now set up a Big Data practice. It is interested in moving data from the RDBMS into HDFS so that it can perform data analysis by using software packages such as Apache Hive. The organization would like to leverage the benefits of HDFS and features such as auto replication and fault tolerance that HDFS offers.
Project 4
Securing Data and Cluster
Protecting data stored in your Hadoop cluster by safeguarding it and backing it up.
Your organization would like to safeguard its data on multiple Hadoop clusters. The aim is to prevent data loss from accidental deletes and to make critical data available to users/applications even if one or more of these clusters is down.