nerdexam
GoogleGoogle

PROFESSIONAL-MACHINE-LEARNING-ENGINEER · Question #198

PROFESSIONAL-MACHINE-LEARNING-ENGINEER Question #198: Real Exam Question with Answer & Explanation

Sign in or unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER to reveal the answer and full explanation for question #198. The question stem and answer options stay visible for context.

Submitted by akirajp· Apr 18, 2026Monitoring, optimizing, and maintaining ML solutions

Question

You recently deployed a scikit-learn model to a Vertex AI endpoint. You are now testing the model on live production traffic. While monitoring the endpoint, you discover twice as many requests per hour than expected throughout the day. You want the endpoint to efficiently scale when the demand increases in the future to prevent users from experiencing high latency. What should you do?

Options

  • ADeploy two models to the same endpoint, and distribute requests among them evenly
  • BConfigure an appropriate minReplicaCount value based on expected baseline traffic
  • CSet the target utilization percentage in the autoscailngMetricSpecs configuration to a higher value
  • DChange the model's machine type to one that utilizes GPUs

Unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-MACHINE-LEARNING-ENGINEER questions. Unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Vertex AI Endpoints#Autoscaling Configuration#Performance Optimization#Resource Efficiency
Full PROFESSIONAL-MACHINE-LEARNING-ENGINEER PracticeBrowse All PROFESSIONAL-MACHINE-LEARNING-ENGINEER Questions