PROFESSIONAL-MACHINE-LEARNING-ENGINEER Exam Questions
349 real PROFESSIONAL-MACHINE-LEARNING-ENGINEER exam questions with expert-verified answers and explanations. Page 4 of 7.
- Question #151Data processing and feature engineering
You work for a retailer that sells clothes to customers around the world. You have been tasked with ensuring that ML models are built in a secure manner. Specifically, you need to...
Data SecurityPrivacy-Preserving MLData AnonymizationData Preprocessing - Question #152ML model development
You work for a magazine publisher and have been tasked with predicting whether customers will cancel their annual subscription. In your exploratory data analysis, you find that 90%...
Class ImbalanceModel EvaluationAccuracy MetricsMinority Class Prediction - Question #153ML pipeline operationalization
You have built a model that is trained on data stored in Parquet files. You access the data through a Hive table hosted on Google Cloud. You preprocessed these data with PySpark an...
Kubeflow PipelinesDataprocPySparkData Transformation - Question #154ML pipeline operationalization
You have developed an ML model to detect the sentiment of users' posts on your company's social media page to identify outages or bugs. You are using Dataflow to provide real-time...
ML Model DeploymentTraffic ManagementVertex AIReal-time Prediction - Question #155ML pipeline operationalization
You are developing an image recognition model using PyTorch based on ResNet50 architecture. Your code is working fine on your local laptop on a small subsample. Your full dataset h...
ML TrainingVertex AI Custom TrainingGPU AccelerationCloud Resource Optimization - Question #156Monitoring, optimizing, and maintaining ML solutions
You have trained a DNN regressor with TensorFlow to predict housing prices using a set of predictive features. Your default precision is tf.float64, and you use a standard TensorFl...
Model optimizationQuantizationInference latencyTensorFlow serving - Question #157Data processing and feature engineering
You work on the data science team at a manufacturing company. You are reviewing the company's historical sales data, which has hundreds of millions of records. For your exploratory...
Data processingStatistical analysisBigQueryVertex AI Workbench - Question #158ML pipeline operationalization
Your data science team needs to rapidly experiment with various features, model architectures, and hyperparameters. They need to track the accuracy metrics for various experiments...
Experiment TrackingVertex AI PipelinesMetadata ManagementMLOps - Question #159Data processing and feature engineering
You are training an ML model using data stored in BigQuery that contains several values that are considered Personally Identifiable Information (PII). You need to reduce the sensit...
Data De-identificationCloud DLPDataflowBigQuery - Question #160ML model development
You recently deployed an ML model. Three months after deployment, you notice that your model is underperforming on certain subgroups, thus potentially leading to biased results. Yo...
Class ImbalanceBias MitigationModel TrainingData Resampling - Question #161ML model development
You are working on a binary classification ML algorithm that detects whether an image of a classified scanned document contains a company's logo. In the dataset, 96% of examples do...
Evaluation MetricsImbalanced DataF1 ScoreBinary Classification - Question #162ML pipeline operationalization
While running a model training pipeline on Vertex Al, you discover that the evaluation step is failing because of an out-of-memory error. You are currently using TensorFlow Model A...
TFXVertex AIDataflowPipeline Scaling - Question #163Data processing and feature engineering
You are developing an ML model using a dataset with categorical input variables. You have randomly split half of the data into training and test sets. After applying one-hot encodi...
Data preprocessingCategorical variablesOne-hot encodingTraining-test consistency - Question #164Data processing and feature engineering
You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent. Which data...
Class ImbalanceOversamplingData PreprocessingRandom Forest - Question #165ML model development
You are developing a classification model to support predictions for your company's various products. The dataset you were given for model development has class imbalance You need...
Evaluation MetricsClass ImbalanceF1 ScoreClassification - Question #166ML model development
You are training an object detection machine learning model on a dataset that consists of three million X-ray images, each roughly 2 GB in size. You are using Vertex AI Training to...
Distributed TrainingTraining OptimizationScalable MLGPU Acceleration - Question #167ML model development
You need to build classification workflows over several structured datasets currently stored in BigQuery. Because you will be performing the classification several times, you want...
Vertex AutoMLNo-code MLStructured dataClassification - Question #168ML model development
You recently developed a deep learning model. To test your new model, you trained it for a few epochs on a large dataset. You observe that the training and validation losses barely...
Deep learning debuggingModel training diagnosticsSanity checksLoss function analysis - Question #169ML pipeline operationalization
You are a data scientist at an industrial equipment manufacturing company. You are developing a regression model to estimate the power consumption in the company's manufacturing pl...
BigQuery MLScalable MLRegression ModelsMLOps Automation - Question #170Monitoring, optimizing, and maintaining ML solutions
Your organization manages an online message board. A few months ago, you discovered an increase in toxic language and bullying on the message board. You deployed an automated text...
Model MonitoringBias MitigationClassification ThresholdsMLOps - Question #171Monitoring, optimizing, and maintaining ML solutions
You work for a magazine distributor and need to build a model that predicts which customers will renew their subscriptions for the upcoming year. Using your company's historical da...
Vertex Explainable AIFeature attributionModel explainabilityGoogle Cloud ML - Question #172ML model development
You are an ML engineer at a manufacturing company. You are creating a classification model for a predictive maintenance use case. You need to predict whether a crucial machine will...
Model EvaluationPrecision and RecallClassification MetricsPredictive Maintenance - Question #173ML pipeline operationalization
You built a custom ML model using scikit-learn. Training time is taking longer than expected. You decide to migrate your model to Vertex AI Training, and you want to improve the mo...
Vertex AI Trainingscikit-learntraining optimizationcompute resource selection - Question #174Monitoring, optimizing, and maintaining ML solutions
You are an ML engineer at a retail company. You have built a model that predicts a coupon to offer an ecommerce customer at checkout based on the items in their cart. When a custom...
ML serving latencyReal-time predictionFeature servingLow-latency data store - Question #175Monitoring, optimizing, and maintaining ML solutions
You work for a small company that has deployed an ML model with autoscaling on Vertex AI to serve online predictions in a production environment. The current model receives about 2...
Canary testingModel performanceOnline prediction servingTroubleshooting - Question #176Data processing and feature engineering
You want to train an AutoML model to predict house prices by using a small public dataset stored in BigQuery. You need to prepare the data and want to use the simplest, most effici...
BigQueryData PreprocessingAutoML Data PreparationGCP Data Services - Question #177ML pipeline operationalization
You developed a Vertex AI ML pipeline that consists of preprocessing and training steps and each set of steps runs on a separate custom Docker image. Your organization uses GitHub...
CI/CDML PipelinesDockerGitHub Actions - Question #178Data processing and feature engineering
You are working with a dataset that contains customer transactions. You need to build an ML model to predict customer purchase behavior. You plan to develop the model in BigQuery M...
BigQuery MLFeature EngineeringCategorical FeaturesOnline Prediction - Question #179ML model development
You need to develop an image classification model by using a large dataset that contains labeled images in a Cloud Storage bucket. What should you do?
Vertex AI AutoMLImage ClassificationManaged DatasetsModel Training - Question #180ML model development
You are developing a model to detect fraudulent credit card transactions. You need to prioritize detection, because missing even one fraudulent transaction could severely impact th...
Class imbalanceDecision thresholdModel recallFraud detection - Question #181ML pipeline operationalization
You need to deploy a scikit-leam classification model to production. The model must be able to serve requests 24/7, and you expect millions of requests per second to the production...
Model deploymentVertex AIOnline predictionAuto-scaling - Question #182ML model development
You work with a team of researchers to develop state-of-the-art algorithms for financial analysis. Your team develops and debugs complex models in TensorFlow. You want to maintain...
ML InfrastructureTensorFlowGPU AccelerationModel Debugging - Question #183Monitoring, optimizing, and maintaining ML solutions
You created an ML pipeline with multiple input parameters. You want to investigate the tradeoffs between different parameter combinations. The parameter options are: - Input datase...
Experiment trackingHyperparameter tuningMLOpsVertex AI - Question #184Monitoring, optimizing, and maintaining ML solutions
You received a training-serving skew alert from a Vertex AI Model Monitoring job running in production. You retrained the model with more recent training data, and deployed it back...
Model MonitoringTraining-Serving SkewVertex AIData Drift - Question #185Monitoring, optimizing, and maintaining ML solutions
You developed a custom model by using Vertex AI to forecast the sales of your company's products based on historical transactional data. You anticipate changes in the feature distr...
Vertex AI Model MonitoringDrift DetectionCost OptimizationPrediction Sampling - Question #186ML pipeline operationalization
You have recently trained a scikit-learn model that you plan to deploy on Vertex AI. This model will support both online and batch prediction. You need to preprocess input data for...
Vertex AI DeploymentCustom Prediction RoutineModel PreprocessingScikit-learn - Question #187Data processing and feature engineering
You work for a food product company. Your company's historical sales data is stored in BigQuery.You need to use Vertex AI's custom training service to train multiple TensorFlow mod...
BigQueryData preprocessingFeature engineeringCost optimization - Question #188ML pipeline operationalization
You have created a Vertex AI pipeline that includes two steps. The first step preprocesses 10 TB data completes in about 1 hour, and saves the result in a Cloud Storage bucket. The...
Vertex AI PipelinesPipeline CachingML Workflow OptimizationCost Optimization - Question #189ML pipeline operationalization
You work for a bank. You have created a custom model to predict whether a loan application should be flagged for human review. The input features are stored in a BigQuery table. Th...
Explainable AI (XAI)Custom Model DeploymentVertex AIFeature Attribution - Question #190ML pipeline operationalization
You recently used XGBoost to train a model in Python that will be used for online serving. Your model prediction service will be called by a backend service implemented in Golang r...
Model DeploymentVertex AIPre/Post ProcessingManaged Services - Question #191ML pipeline operationalization
You recently deployed a pipeline in Vertex AI Pipelines that trains and pushes a model to a Vertex AI endpoint to serve real-time traffic. You need to continue experimenting and it...
CI/CDMLOpsVertex AI PipelinesStaged Deployment - Question #192ML pipeline operationalization
You work for a bank with strict data governance requirements. You recently implemented a custom model to detect fraudulent transactions. You want your training code to download int...
Data SecurityNetwork SecurityIAMML Pipeline Security - Question #193ML pipeline operationalization
You are deploying a new version of a model to a production Vertex Al endpoint that is serving traffic. You plan to direct all user traffic to the new model. You need to deploy the...
Vertex AIModel DeploymentModel VersioningMinimal Disruption - Question #194ML model development
You are training an ML model on a large dataset. You are using a TPU to accelerate the training process. You notice that the training process is taking longer than expected. You di...
TPU optimizationBatch sizeML training performanceHardware utilization - Question #195Data processing and feature engineering
You work for a retail company. You have a managed tabular dataset in Vertex AI that contains sales data from three different stores. The dataset includes several features, such as...
Data SplittingTime Series DataData Leakage PreventionVertex AI Tabular - Question #196Monitoring, optimizing, and maintaining ML solutions
You have developed a BigQuery ML model that predicts customer chum, and deployed the model to Vertex AI Endpoints. You want to automate the retraining of your model by using minima...
Vertex AI Model MonitoringPrediction DriftModel RetrainingData Drift - Question #197ML pipeline operationalization
You have been tasked with deploying prototype code to production. The feature engineering code is in PySpark and runs on Dataproc Serverless. The model training is executed by usin...
ML PipelinesWorkflow OrchestrationKubeflow Pipelines SDKProduction Deployment - Question #198Monitoring, optimizing, and maintaining ML solutions
You recently deployed a scikit-learn model to a Vertex AI endpoint. You are now testing the model on live production traffic. While monitoring the endpoint, you discover twice as m...
Vertex AI EndpointsAutoscaling ConfigurationPerformance OptimizationResource Efficiency - Question #199ML pipeline operationalization
You work at a bank. You have a custom tabular ML model that was provided by the bank's vendor. The training data is not available due to its sensitivity. The model is packaged as a...
Vertex AI Model DeploymentVertex AI Model MonitoringCustom Prediction ContainersFeature Drift - Question #200ML pipeline operationalization
You are implementing a batch inference ML pipeline in Google Cloud. The model was developed using TensorFlow and is stored in SavedModel format in Cloud Storage. You need to apply...
Vertex AIBatch PredictionBigQueryML Pipeline