MLS-C01 · Question #258
MLS-C01 Question #258: Real Exam Question with Answer & Explanation
The correct answer is A: Use Amazon Elastic Inference on the SageMaker hosted endpoint.. Use Amazon Elastic Inference on the SageMaker hosted endpoint would be the most cost- effective solution for increasing throughput and decreasing latency. Amazon Elastic Inference is a service that allows you to attach GPU-powered inference acceleration to Amazon SageMaker hosted
Question
An analytics company has an Amazon SageMaker hosted endpoint for an image classification model. The model is a custom-built convolutional neural network (CNN) and uses the PyTorch deep learning framework. The company wants to increase throughput and decrease latency for customers that use the model. Which solution will meet these requirements MOST cost-effectively?
Options
- AUse Amazon Elastic Inference on the SageMaker hosted endpoint.
- BRetrain the CNN with more layers and a larger dataset.
- CRetrain the CNN with more layers and a smaller dataset.
- DChoose a SageMaker instance type that has multiple GPUs.
Explanation
Use Amazon Elastic Inference on the SageMaker hosted endpoint would be the most cost- effective solution for increasing throughput and decreasing latency. Amazon Elastic Inference is a service that allows you to attach GPU-powered inference acceleration to Amazon SageMaker hosted endpoints and EC2 instances. By attaching an Elastic Inference accelerator to the SageMaker endpoint, you can achieve better performance with lower costs than using a larger, more expensive instance type.
Topics
Community Discussion
No community discussion yet for this question.