nerdexam
Cloudera

DS-200 · Question #21

DS-200 Question #21: Real Exam Question with Answer & Explanation

Sign in or unlock DS-200 to reveal the answer and full explanation for question #21. The question stem and answer options stay visible for context.

Question

You have acquired a new data source of millions of customer records, and you've this data into HDFS. Prior to analysis, you want to change all customer registration to the same date format, make all addresses uppercase, and remove all customer names (for anonymization). Which process will accomplish all three objectives?

Options

  • AAdapt the data cleansing module in Mahout to your data, and invoke the Mahout library when you
  • BPull this data into an RDBMS using sqoop and scrub records using stored procedures
  • CWrite a script that receives records on stdin, corrects them, and then writes them to stdout.
  • DWrite a MapReduce job with a mapper to change words to uppercase and to reduce different

Unlock DS-200 to see the answer

You've previewed enough free DS-200 questions. Unlock DS-200 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Full DS-200 Practice