A company runs Amazon SageMaker ML models that use accelerated instances. The models require real-time responses. Each model has different scaling requirements. The company must not allow a cold start

Sign in or unlock MLA-C01 to reveal the answer and full explanation for question #112. The question stem and answer options stay visible for context.

Deployment and Orchestration of ML Workflows

Question

A company runs Amazon SageMaker ML models that use accelerated instances. The models require real-time responses. Each model has different scaling requirements. The company must not allow a cold start for the models. Which solution will meet these requirements?

Options

ACreate a SageMaker Serverless Inference endpoint for each model. Use provisioned concurrency
BCreate a SageMaker Asynchronous Inference endpoint for each model. Create an auto scaling
CCreate a SageMaker endpoint. Create an inference component for each model. In the inference
DCreate an Amazon S3 bucket. Store all the model artifacts in the S3 bucket. Create a SageMaker

Unlock MLA-C01 to see the answer

You've previewed enough free MLA-C01 questions. Unlock MLA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock MLA-C01 - $49.99 / 30 days Sign in

Topics

#SageMaker Inference Components#Real-time Inference#No Cold Start#ML Model Deployment

Full MLA-C01 Practice