Cloudera CCA-500 - Cloudera Certified Administrator for Apache Hadoop (CCAH)

Cloudera CCA-500 Premium Access Download Demo

Page: 2 / 2
Total 60 questions

You observed that the number of spilled records from Map tasks far exceeds the number of map output records. Your child heap size is 1GB and your io.sort.mb value is set to 1000MB. How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?

For a 1GB child heap size an io.sort.mb of 128 MB will always maximize memory to disk I/O

Increase the io.sort.mb to 1GB

Decrease the io.sort.mb value to 0

Tune the io.sort.mb value until you observe that the number of spilled records equals (or is as close to equals) the number of map output records.

Question # 12

You want to node to only swap Hadoop daemon data from RAM to disk when absolutely necessary. What should you do?

Delete the /dev/vmswap file on the node

Delete the /etc/swap file on the node

Set the ram.swap parameter to 0 in core-site.xml

Set vm.swapfile file on the node

Delete the /swapfile file on the node

Question # 13

Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?

SampleJar.Jar is sent to the ApplicationMaster which allocates a container for SampleJar.Jar

Sample.jar is placed in a temporary directory in HDFS

SampleJar.jar is sent directly to the ResourceManager

SampleJar.jar is serialized into an XML file which is submitted to the ApplicatoionMaster

Question # 14

On a cluster running MapReduce v2 (MRv2) on YARN, a MapReduce job is given a directory of 10 plain text files as its input directory. Each file is made up of 3 HDFS blocks. How many Mappers will run?

We cannot say; the number of Mappers is determined by the ResourceManager

We cannot say; the number of Mappers is determined by the developer

We cannot say; the number of mappers is determined by the ApplicationMaster

Question # 15

Identify two features/issues that YARN is designated to address: (Choose two)

Standardize on a single MapReduce API

Single point of failure in the NameNode

Reduce complexity of the MapReduce APIs

Resource pressure on the JobTracker

Ability to run framework other than MapReduce, such as MPI

HDFS latency

Question # 16

Assuming youâ€™re not running HDFS Federation, what is the maximum number of NameNode daemons you should run on your cluster in order to avoid a â€œsplit-brainâ€ scenario with your NameNode when running HDFS High Availability (HA) using Quorum-based storage?

Two active NameNodes and two Standby NameNodes

One active NameNode and one Standby NameNode

Two active NameNodes and on Standby NameNode

Unlimited. HDFS High Availability (HA) is designed to overcome limitations on the number of NameNodes you can deploy

Question # 17

Assume you have a file named foo.txt in your local directory. You issue the following three commands:

Hadoop fs â€“mkdir input

Hadoop fs â€“put foo.txt input/foo.txt

Hadoop fs â€“put foo.txt input

What happens when you issue the third command?

The write succeeds, overwriting foo.txt in HDFS with no warning

The file is uploaded and stored as a plain file named input

You get a warning that foo.txt is being overwritten

You get an error message telling you that foo.txt already exists, and asking you if you would like to overwrite it.

You get a error message telling you that foo.txt already exists. The file is not written to HDFS

You get an error message telling you that input is not a directory

The write silently fails

Question # 18

You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You consistently see that MapReduce map tasks on your cluster are running slowly because of excessive garbage collection of JVM, how do you increase JVM heap size property to 3GB to optimize performance?

yarn.application.child.java.opts=-Xsx3072m

yarn.application.child.java.opts=-Xmx3072m

mapreduce.map.java.opts=-Xms3072m

mapreduce.map.java.opts=-Xmx3072m

Summer Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: ecus65

Cloudera CCA-500 - Cloudera Certified Administrator for Apache Hadoop (CCAH)

The Answer Is:

The Answer Is:

The Answer Is:

The Answer Is:

The Answer Is:

Explanation:

The Answer Is:

The Answer Is:

The Answer Is:

Explanation: