You work for a large bank that serves customers through an application hosted in Google Cloud that is running in the US and Singapore. You have developed a PyTorch model to classify transactions as po

Sign in or unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER to reveal the answer and full explanation for question #288. The question stem and answer options stay visible for context.

Submitted by tom_us· Apr 18, 2026Monitoring, optimizing, and maintaining ML solutions

Question

You work for a large bank that serves customers through an application hosted in Google Cloud that is running in the US and Singapore. You have developed a PyTorch model to classify transactions as potentially fraudulent or not. The model is a three-layer perceptron that uses both numerical and categorical features as input, and hashing happens within the model. You deployed the model to the us-central1 region on nl-highcpu-16 machines, and predictions are served in real time. The model's current median response latency is 40 ms. You want to reduce latency, especially in Singapore, where some customers are experiencing the longest delays. What should you do?

Options

AAttach an NVIDIA T4 GPU to the machines being used for online inference.
BChange the machines being used for online inference to nl-highcpu-32.
CDeploy the model to Vertex AI private endpoints in the us-central1 and asia-southeast1 regions,
DCreate another Vertex AI endpoint in the asia-southeast1 region, and allow the application to

Unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-MACHINE-LEARNING-ENGINEER questions. Unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock PROFESSIONAL-MACHINE-LEARNING-ENGINEER - $49.99 / 30 days Sign in

Topics

#Latency reduction#Multi-region deployment#Vertex AI Endpoints#Network latency

Full PROFESSIONAL-MACHINE-LEARNING-ENGINEER Practice