You are working on a linear regression model on BigQuery ML to predict a customer's likelihood of purchasing your company's products. Your model uses a city name variable as a key predictive component

Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #243. The question stem and answer options stay visible for context.

Submitted by valeria.br· Mar 30, 2026Building and operationalizing data processing systems

Question

You are working on a linear regression model on BigQuery ML to predict a customer's likelihood of purchasing your company's products. Your model uses a city name variable as a key predictive component. In order to train and serve the model, your data must be organized in columns. You want to prepare your data using the least amount of coding while maintaining the predictable variables. What should you do?

Options

ACreate a new view with BigQuery that does not include a column with city information.
BUse SQL in BigQuery to transform the state column using a one-hot encoding method, and make each city a column with binary values.
CUse TensorFlow to create a categorical variable with a vocabulary list.
DUse Cloud Data Fusion to assign each city to a region that is labeled as 1, 2, 3, 4, or 5, and then use that number to represent the city in the model.

Unlock PROFESSIONAL-DATA-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock PROFESSIONAL-DATA-ENGINEER - $49.99 / 30 days Sign in

Topics

#Cloud Data Fusion#Feature Engineering#Categorical Data#BigQuery ML

Full PROFESSIONAL-DATA-ENGINEER Practice