A data engineer at a bank is evaluating a new tabular dataset that includes customer data. The data engineer will use the customer data to create a new model to predict customer behavior. After creati

Sign in or unlock MLS-C01 to reveal the answer and full explanation for question #194. The question stem and answer options stay visible for context.

Modeling

Question

A data engineer at a bank is evaluating a new tabular dataset that includes customer data. The data engineer will use the customer data to create a new model to predict customer behavior. After creating a correlation matrix for the variables, the data engineer notices that many of the 100 features are highly correlated with each other. Which steps should the data engineer take to address this issue? (Choose two.)

Options

AUse a linear-based algorithm to train the model.
BApply principal component analysis (PCA).
CRemove a portion of highly correlated features from the dataset.
DApply min-max feature scaling to the dataset.
EApply one-hot encoding category-based variables.

Unlock MLS-C01 to see the answer

You've previewed enough free MLS-C01 questions. Unlock MLS-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock MLS-C01 - $49.99 / 30 days Sign in

Topics

#Feature Preprocessing#Multicollinearity#Dimensionality Reduction#Principal Component Analysis

Full MLS-C01 Practice