An ML engineer is building an ML pipeline. The pipeline must process a dataset in two ways by using Amazon Athena. The pipeline must use batch processing to perform large-scale data transformations an

Sign in or unlock MLA-C01 to reveal the answer and full explanation for question #169. The question stem and answer options stay visible for context.

Data Preparation for Machine Learning

Question

An ML engineer is building an ML pipeline. The pipeline must process a dataset in two ways by using Amazon Athena. The pipeline must use batch processing to perform large-scale data transformations and for model training. The pipeline must also use near real-time processing to perform low-latency queries for inference and analytics. Which file format will provide the LEAST latency for both types of processing?

Options

ACSV
BApache Parquet
CNested JSON
DDeserialized JSON

Unlock MLA-C01 to see the answer

You've previewed enough free MLA-C01 questions. Unlock MLA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock MLA-C01 - $49.99 / 30 days Sign in

Topics

#Data Formats#Amazon Athena#Latency Optimization#ML Data Processing

Full MLA-C01 Practice