Given:

You want to clean up this list by removing jobs where the state is KILLED. What command you enter?
A. Yarn application kill application_1374638600275_0109
B. Yarn rmadmin refreshQueue
C. Yarn application refreshJobHistory
D. Yarn rmadmin kill application_1374638600275_0109
During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper place the intermediate data each Map task?
A. The Mapper stores the intermediate data on the mode running the job's ApplicationMaster so that is available to YARN's ShuffleService before the data is presented to the Reducer
B. The Mapper stores the intermediate data in HDFS on the node where the MAP tasks ran in the HDFS / usercache/and[user]sppcache/application_and(appid) directory for the user who ran the job
C. YARN holds the intermediate data in the NodeManager's memory (a container) until it is transferred to the Reducers
D. The Mapper stores the intermediate data on the underlying filesystem of the local disk in the directories yarn.nodemanager.local-dirs
E. The Mapper transfers the intermediate data immediately to the Reducers as it generated by the Map task
You are configuring your cluster to run HDFS and MapReduce v2 (MRv2) on YARN. Which daemons need to be installed on your clusters master nodes? (Choose Two)
A. ResourceManager
B. DataNode
C. NameNode
D. JobTracker
E. TaskTracker
F. HMaster
You are upgrading a Hadoop cluster from HDFS and MapReduce version 1 (MRv1) to one running HDFS and MapReduce version 2 (MRv2) on YARN. You want to set and enforce a block of 128MB for all new files written to the cluster after the upgrade. What should you do?
A. Set dfs.block.size to 128M on all the worker nodes, on all client machines, and on the NameNode, and set the parameter to final.
B. Set dfs.block.size to 134217728 on all the worker nodes, on all client machines, and on the NameNode, and set the parameter to final.
C. Set dfs.block.size to 134217728 on all the worker nodes and client machines, and set the parameter to final. You do need to set this value on the NameNode.
D. Set dfs.block.size to 128M on all the worker nodes and client machines, and set the parameter to final. You do need to set this value on the NameNode.
E. You cannot enforce this, since client code can always override this value.
Which two are Features of Hadoop's rack topology?
A. Configuration of rack awareness is accomplished using a configuration file. You cannot use a rack topology script.
B. Even for small clusters on a single rack, configuring rack awareness will improve performance.
C. Rack location is considered in the HDFS block placement policy
D. HDFS is rack aware but MapReduce daemons are not
E. Hadoop gives preference to Intra rack data transfer in order to conserve bandwidth
You are configuring a cluster running HDFS, MapReduce version 2 (MRv2) on YARN running Linux. How must you format the underlying filesystem of each DataNode?
A. They must not formatted - - HDFS will format the filesystem automatically
B. They may be formatted in any Linux filesystem
C. They must be formatted as HDFS
D. They must be formatted as either ext3 or ext4
You want a node to only swap Hadoop daemon data from RAM to disk when absolutely necessary. What should you do?
A. Delete the /swapfile file on the node
B. Set vm.swappiness to o in /etc/sysctl.conf
C. Set the ram.swap parameter to o in core-site.xml
D. Delete the /etc/swap file on the node
E. Delete the /dev/vmswap file on the node
You are working on a project where you need to chain together MapReduce, Pig jobs. You also needs the ability to use forks, decision, and path joins. Which ecosystem project should you use to perform these actions?
A. Oozie
B. Zookeeper
C. HBase
D. Sqoop
E. HUE
Your cluster is running MapReduce vserion 2 (MRv2) on YARN. Your ResourceManager is configured to use the FairScheduler. Now you want to configure your scheduler such that a new user on the cluster can submit jobs into their own queue application submission. Which configuration should you set?
A. You can specify new queue name when user submits a job and new queue can be created dynamically if yarn.scheduler.fair.user-as-default-queue = false
B. Yarn.scheduler.fair.user-as-default-queue = false and yarn.scheduler.fair.allow-undeclared- people = true
C. You can specify new queue name per application in allocation.fair.allow-undeclared-people = true automatically assigned to the application queue
D. You can specify new queue name when user submits a job and new queue can be created dynamically if the property yarn.scheduler.fair.allow-undecleared-pools = true
In CDH4 and later, which file contains a serialized form of all the directory and files inodes in the filesystem, giving the NameNode a persistent checkpoint of the filesystem metadata?
A. fstime
B. VERSION
C. Fsimage_N (Where N reflects all transactions up to transaction ID N)
D. Edits_N-M (Where N-M specifies transactions between transactions ID N and transaction ID N)