nerdexam
GoogleGoogle

PROFESSIONAL-DATA-ENGINEER · Question #141

PROFESSIONAL-DATA-ENGINEER Question #141: Real Exam Question with Answer & Explanation

Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #141. The question stem and answer options stay visible for context.

Submitted by tyler.j· Mar 30, 2026

Question

You architect a system to analyze seismic data. Your extract, transform, and load (ETL) process runs as a series of MapReduce jobs on an Apache Hadoop cluster. The ETL process takes days to process a data set because some steps are computationally expensive. Then you discover that a sensor calibration step has been omitted. How should you change your ETL process to carry out sensor calibration systematically in the future?

Options

  • AModify the transformMapReduce jobs to apply sensor calibration before they do anything else.
  • BIntroduce a new MapReduce job to apply sensor calibration to raw data, and ensure all other MapReduce jobs are chained after this.
  • CAdd sensor calibration data to the output of the ETL process, and document that all users need to apply sensor calibration themselves.
  • DDevelop an algorithm through simulation to predict variance of data output from the last MapReduce job based on calibration factors, and apply the correction

Unlock PROFESSIONAL-DATA-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Full PROFESSIONAL-DATA-ENGINEER PracticeBrowse All PROFESSIONAL-DATA-ENGINEER Questions