PROFESSIONAL-DATA-ENGINEER · Question #281
Question
You are loading CSV files from Cloud Storage to BigQuery. The files have known data quality issues, including mismatched data types, such as STRINGs and INT64s in the same column, and inconsistent formatting of values such as phone numbers or addresses. You need to create the data pipeline to maintain data quality and perform the required cleansing and transformation. What should you do?
Options
- A. Use Data Fusion to transform the data before loading it into BigQuery.
- B. Use Data Fusion to convert the CSV files to a self-describing data format, such as AVRO, before loading the data to BigQuery.
- C. Load the CSV files into a staging table with the desired schema, perform the transformations with SQL, and then write the results to the final destination table.
- D. Create a table with the desired schema, load the CSV files into the table, and perform the transformations in place using SQL.
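To make the data quality issues in the question stem concrete, here is a minimal Python sketch of the two kinds of cleansing it mentions: a column that mixes integers with strings, and phone numbers written in inconsistent formats. It is illustrative only and does not represent any of the options above. The sample rows, column names, and helper functions (`to_int_or_none`, `normalize_phone`, `clean_rows`) are assumptions made up for this example. The safe-cast helper is similar in spirit to BigQuery's `SAFE_CAST`, which returns NULL instead of raising an error.

```python
import csv
import io
import re
from typing import Dict, List, Optional


def to_int_or_none(raw: str) -> Optional[int]:
    """Return an int when the value is a clean integer, otherwise None."""
    text = raw.strip()
    return int(text) if re.fullmatch(r"[+-]?\d+", text) else None


def normalize_phone(raw: str) -> str:
    """Keep digits only so differently formatted numbers compare equal."""
    return re.sub(r"\D", "", raw)


# Hypothetical sample data; the columns are illustrative, not from the question.
SAMPLE_CSV = """id,quantity,phone
1,10,(555) 010-1234
2,ten,555.010.1234
3, 7 ,555 010 1234
"""


def clean_rows(csv_text: str) -> List[Dict[str, object]]:
    """Parse CSV text and apply both cleansing helpers to every row."""
    cleaned = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        cleaned.append(
            {
                "id": row["id"],
                # Values that are not integers become None instead of failing.
                "quantity": to_int_or_none(row["quantity"]),
                "phone": normalize_phone(row["phone"]),
            }
        )
    return cleaned


if __name__ == "__main__":
    for cleaned_row in clean_rows(SAMPLE_CSV):
        print(cleaned_row)
```

In this sketch, a value such as `ten` in the numeric column becomes `None` rather than aborting the whole load, and the three phone formats collapse to the same digit string.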