Your company's data platform ingests CSV file dumps of booking and user profile data from upstream sources into Cloud Storage. The data analyst team wants to join these datasets on the email field ava

Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #332. The question stem and answer options stay visible for context.

Submitted by lucia.co· Mar 30, 2026Building and operationalizing data processing systems

Question

Your company's data platform ingests CSV file dumps of booking and user profile data from upstream sources into Cloud Storage. The data analyst team wants to join these datasets on the email field available in both the datasets to perform analysis. However, personally identifiable information (PII) should not be accessible to the analysts. You need to de-identify the email field in both the datasets before loading them into BigQuery for analysts. What should you do?

Options

A
1. Create a pipeline to de-identify the email field by using recordTransformations in Cloud Data Loss Prevention (Cloud DLP) with masking as the de-
B
1. Create a pipeline to de-identify the email field by using recordTransformations in Cloud DLP with format-preserving encryption with FFX as the de-
C
1. Load the CSV files from Cloud Storage into a BigQuery table, and enable dynamic data masking.
D
1. Load the CSV files from Cloud Storage into a BigQuery table, and enable dynamic data masking.

Unlock PROFESSIONAL-DATA-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock PROFESSIONAL-DATA-ENGINEER - $49.99 / 30 days Sign in

Topics

#Dynamic Data Masking#BigQuery#Data De-identification#PII Protection

Full PROFESSIONAL-DATA-ENGINEER Practice