PROFESSIONAL-MACHINE-LEARNING-ENGINEER · Question #175
Question
You work for a small company that has deployed an ML model with autoscaling on Vertex AI to serve online predictions in a production environment. The current model receives about 20 prediction requests per hour with an average response time of one second. You have retrained the same model on a new batch of data, and now you are canary testing it, sending ~10% of production traffic to the new model. During this canary test, you notice that prediction requests for your new model are taking between 30 and 180 seconds to complete. What should you do?
Options
- A. Submit a request to raise your project quota to ensure that multiple prediction services can run
- B. Turn off auto-scaling for the online prediction service of your new model. Use manual scaling with
- C. Remove your new model from the production environment. Compare the new model and existing
- D. Remove your new model from the production environment. For a short trial period, send all
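To make the scenario's numbers concrete, here is a minimal sketch of the arithmetic implied by the question stem. It assumes the "~10%" canary share is exactly 10% and that requests arrive evenly over the hour; neither is stated in the question, and the variable names are illustrative only.

```python
# Figures taken from the question stem.
requests_per_hour = 20          # total production traffic
canary_fraction = 0.10          # "~10%" in the stem, assumed to be exactly 10%
baseline_latency_s = 1          # average response time of the current model
canary_latency_range_s = (30, 180)  # observed range for the new model

# Traffic that reaches the canary model under the stated split.
canary_requests_per_hour = requests_per_hour * canary_fraction

# Average gap between canary requests, assuming evenly spaced arrivals
# (an assumption made for illustration, not a fact from the question).
minutes_between_canary_requests = 60 / canary_requests_per_hour

# How much slower the canary is than the baseline, at both ends of the range.
slowdown = tuple(t / baseline_latency_s for t in canary_latency_range_s)

print(f"Canary requests per hour: {canary_requests_per_hour:g}")
print(f"Average minutes between canary requests: {minutes_between_canary_requests:g}")
print(f"Slowdown versus baseline: {slowdown[0]:g}x to {slowdown[1]:g}x")
```

Under these assumptions the new model sees only about two requests per hour, while its response times are 30 to 180 times the baseline's one second.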