AIP-C01 Question #76: Real Exam Question with Answer & Explanation

Sign in or unlock AIP-C01 to reveal the answer and full explanation for question #76. The question stem and answer options stay visible for context.

Deployment, Operations, and Optimization

Question

When designing a large-scale, low-latency inference architecture for a generative AI model, which AWS service would you use to automatically scale the inference workload based on the number of incoming requests?

Options

AAmazon EC2 Auto Scaling
BAmazon SageMaker Hosting Endpoints
CAWS Lambda
DAmazon API Gateway

Unlock AIP-C01 to see the answer

You've previewed enough free AIP-C01 questions. Unlock AIP-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock AIP-C01 - $49.99 / 30 days Sign in

Topics

#Generative AI#Inference Scaling#SageMaker Endpoints#Auto Scaling

Full AIP-C01 Practice Browse All AIP-C01 Questions