CCD-333 Exam Questions
57 real CCD-333 exam questions with expert-verified answers and explanations.
- Question #1
What is the difference between a failed task attempt and a killed task attempt?
- Question #2
What is a SequenceFile?
- Question #3
Can you use MapReduce to perform a relational join on two large tables sharing a key? Assume that the two tables are formatted as comma-separated files in HDFS.
- Question #4
To process input key-value pairs, your mapper needs to load a 512 MB data file in memory. What is the best way to accomplish this?
- Question #5
What types of algorithms are difficult to express in MapReduce?
- Question #7
Your cluster's HDFS block size is 64 MB. You have a directory containing 100 plain text files, each of which is 100 MB in size. The InputFormat for your job is TextInputFormat. How ma...
- Question #8
Does the MapReduce programming model provide a way for reducers to communicate with each other?
- Question #9
Which of the following best describes the map method input and output?
- Question #10
You have written a Mapper which invokes the following five calls to the OutputCollector.collect method: How many times will the Reducer's reduce method be invoked?
- Question #11
In a MapReduce job with 500 map tasks, how many map task attempts will there be?
- Question #12
The Hadoop framework provides a mechanism for coping with machine issues such as faulty configuration or impending hardware failure. MapReduce detects that one or a number of machi...
- Question #15
Which MapReduce daemon runs on each slave node and participates in job execution?
- Question #16
What is the standard configuration of slave nodes in a Hadoop cluster?
- Question #17
What happens if the NameNode crashes?
- Question #18
You are running a job that will process a single InputSplit on a cluster which has no other jobs currently running. Each node has an equal number of open Map slots. On which node w...
- Question #19
If you run the word count MapReduce program with m mappers and r reducers, how many output files will you get at the end of the job? And how many key-value pairs will there be in e...
- Question #20
You have a large dataset of key-value pairs, where the keys are strings, and the values are integers. For each unique key, you want to identify the largest integer. In writing a Ma...
- Question #21
What happens in a MapReduce job when you set the number of reducers to zero?
- Question #22
You are developing a combiner that takes as input Text keys, IntWritable values, and emits Text keys, IntWritable values. Which interface should your class implement?
- Question #23
Combiners increase the efficiency of a MapReduce program because:
- Question #24
In a large MapReduce job with m mappers and r reducers, how many distinct copy operations will there be in the sort/shuffle phase?
- Question #25
When is the reduce method first called in a MapReduce job?
- Question #26
What happens in a MapReduce job when you set the number of reducers to one?
- Question #27
In the standard word count MapReduce algorithm, why might using a combiner reduce the overall job running time?
- Question #28
Which two of the following are valid statements? (Choose two)
- Question #30
Which of the following describes how a client reads a file from HDFS?
- Question #31
You need to create a job that does frequency analysis on input data. You will do this by writing a Mapper that uses TextInputFormat and splits each value (a line of text from an inp...
- Question #32
Which of the following statements best describes how a large (100 GB) file is stored in HDFS?
- Question #33
Your cluster has 10 DataNodes, each with a single 1 TB hard drive. You utilize all your disk capacity for HDFS, reserving none for MapReduce. You implement default replication sett...
- Question #34
You use the hadoop fs -put command to write a 300 MB file using an HDFS block size of 64 MB. Just after this command has finished writing 200 MB of this file, what would another use...
- Question #35
What is a SequenceFile?
- Question #36
Given a directory of files with the following structure: line number, tab character, string: 1. abialkjfjkaoasdfjksdlkjhqweroij 2. kadf jhuwqounahagtnbvaswslmnbfgy 3. kjfteiomndscx...
- Question #37
In a MapReduce job, you want each of your input files processed by a single map task. How do you configure a MapReduce job so that a single map task processes each input file regard...
- Question #38
Which of the following best describes the workings of TextInputFormat?
- Question #39
Which of the following statements most accurately describes the relationship between MapReduce and Pig?
- Question #41
You have an employee who is a Data Analyst and is very comfortable with SQL. He would like to run ad-hoc analysis on data in your HDFS cluster. Which of the following is a data ware...
- Question #42
Workflows expressed in Oozie can contain:
- Question #43
You need a distributed, scalable data store that allows you random, real-time read/write access to hundreds of terabytes of data. Which of the following would you use?
- Question #44
Which of the following utilities allows you to create and run MapReduce jobs with any executable or script as the mapper and/or the reducer?
- Question #45
What is the preferred way to pass a small number of configuration parameters to a mapper or reducer?
- Question #47
How does the NameNode detect that a DataNode has failed?
- Question #48
The NameNode uses RAM for the following purpose:
- Question #49
In the reducer, the MapReduce API provides you with an iterator over Writable values. Calling the next() method:
- Question #50
What is a Writable?
- Question #51
In a MapReduce job, the reducer receives all values associated with the same key. Which statement is most accurate about the ordering of these values?
- Question #52
All keys used for intermediate output from mappers must do which of the following:
- Question #53
You have the following key value pairs as output from your Map task: (the, 1) (fox, 1) (runs, 1) (faster, 1) (than, 1) (the, 1) (dog, 1) How many keys will be passed to the reducer...
- Question #54
You write a MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses TextInputFormat and the IdentityReducer: the mapper applies a regular expression over input va...
- Question #55
For each intermediate key, each reducer task can emit:
- Question #56
For each input key-value pair, mappers can emit: