AIP-C01 Question #48: Real Exam Question with Answer & Explanation

Deployment, Operations, and Optimization

Question

A company is using Amazon Bedrock and Anthropic Claude 3 Haiku to develop an AI assistant. The AI assistant normally processes 10,000 requests each hour but experiences surges of up to 30,000 requests each hour during peak usage periods. The AI assistant must respond within 2 seconds while operating across multiple AWS Regions. The company observes that during peak usage periods, the AI assistant experiences throughput bottlenecks that cause increased latency and occasional request timeouts. The company must resolve the performance issues. Which solution will meet this requirement?

Options

A. Purchase provisioned throughput and sufficient model units (MUs) in a single Region. Configure
B. Implement token batching to reduce API overhead. Use cross-Region inference profiles to
C. Set up auto scaling AWS Lambda functions in each Region. Implement client-side round-robin
D. Implement batch inference for all requests by using Amazon S3 buckets across multiple Regions.
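
To compare the options, it can help to restate the hourly figures from the question stem as per-second rates. The sketch below does only that arithmetic. It assumes requests arrive evenly within each hour, which real traffic rarely does, and it uses Little's law with the full 2-second budget as an upper bound on time per request. The function and constant names are illustrative and do not come from the exam or from any AWS API.

```python
# Load figures taken directly from the question stem.
BASELINE_PER_HOUR = 10_000
PEAK_PER_HOUR = 30_000
LATENCY_BUDGET_S = 2.0  # required response time in seconds

SECONDS_PER_HOUR = 3600


def per_second(requests_per_hour: int) -> float:
    """Average arrival rate, assuming requests are spread evenly over the hour."""
    return requests_per_hour / SECONDS_PER_HOUR


def in_flight(requests_per_hour: int, latency_s: float) -> float:
    """Little's law: average concurrency = arrival rate * time in system.

    Using the full latency budget gives an upper-bound estimate.
    """
    return per_second(requests_per_hour) * latency_s


baseline_rps = per_second(BASELINE_PER_HOUR)
peak_rps = per_second(PEAK_PER_HOUR)
surge_factor = PEAK_PER_HOUR / BASELINE_PER_HOUR
peak_in_flight = in_flight(PEAK_PER_HOUR, LATENCY_BUDGET_S)

print(f"baseline: {baseline_rps:.2f} requests/s")
print(f"peak:     {peak_rps:.2f} requests/s ({surge_factor:.0f}x baseline)")
print(f"peak concurrency at the {LATENCY_BUDGET_S:.0f} s budget: {peak_in_flight:.2f} requests")
```

Under these assumptions the baseline is roughly 2.8 requests per second and the peak roughly 8.3, a threefold surge. The figures are averages, so short bursts inside a peak hour could be considerably higher.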

Topics

Generative AI optimization, Amazon Bedrock, Throughput and Latency, Multi-Region Architecture
