AIP-C01 Question #76: Real Exam Question with Answer & Explanation
Deployment, Operations, and Optimization
Question
When designing a large-scale, low-latency inference architecture for a generative AI model, which AWS service would you use to automatically scale the inference workload based on the number of incoming requests?
Options
- A. Amazon EC2 Auto Scaling
- B. Amazon SageMaker Hosting Endpoints
- C. AWS Lambda
- D. Amazon API Gateway
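As background on the request-driven scaling the question describes, here is a minimal sketch of how one of the listed options, a SageMaker hosting endpoint, could be registered with Application Auto Scaling so that its instance count tracks invocations per instance. The endpoint name `genai-endpoint`, the variant name `AllTraffic`, the capacity bounds, the cooldowns, and the target of 100 invocations per instance are illustrative assumptions, not values taken from the question. The helpers only build the request arguments; the boto3 calls that would send them appear as comments.

```python
from typing import Any, Dict


def _resource_id(endpoint_name: str, variant_name: str) -> str:
    # Application Auto Scaling addresses a SageMaker production variant this way.
    return f"endpoint/{endpoint_name}/variant/{variant_name}"


def scalable_target_kwargs(endpoint_name: str, variant_name: str,
                           min_instances: int, max_instances: int) -> Dict[str, Any]:
    """Arguments for register_scalable_target (instance-count bounds)."""
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")
    return {
        "ServiceNamespace": "sagemaker",
        "ResourceId": _resource_id(endpoint_name, variant_name),
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
        "MinCapacity": min_instances,
        "MaxCapacity": max_instances,
    }


def target_tracking_policy_kwargs(endpoint_name: str, variant_name: str,
                                  invocations_per_instance: float) -> Dict[str, Any]:
    """Arguments for put_scaling_policy (scale on requests per instance)."""
    return {
        "PolicyName": f"{endpoint_name}-invocations-tracking",  # hypothetical name
        "ServiceNamespace": "sagemaker",
        "ResourceId": _resource_id(endpoint_name, variant_name),
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": invocations_per_instance,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance",
            },
            # Cooldowns in seconds; assumed values chosen for illustration.
            "ScaleOutCooldown": 60,
            "ScaleInCooldown": 300,
        },
    }


if __name__ == "__main__":
    target = scalable_target_kwargs("genai-endpoint", "AllTraffic", 1, 8)
    policy = target_tracking_policy_kwargs("genai-endpoint", "AllTraffic", 100.0)
    # With AWS credentials configured, these could be sent as:
    #   import boto3
    #   client = boto3.client("application-autoscaling")
    #   client.register_scalable_target(**target)
    #   client.put_scaling_policy(**policy)
    print(target["ResourceId"])
    print(policy["PolicyType"])
```

A target-tracking policy is used in this sketch because it ties capacity directly to request volume: as invocations per instance rise above the target, instances are added, and they are removed again after the scale-in cooldown once traffic drops.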
Topics
#Generative AI, #Inference Scaling, #SageMaker Endpoints, #Auto Scaling