

ML Implementation and Operations

Question

A manufacturer has deployed an array of 50000 sensors throughout its plant to predict failures in components. Data Scientists have built a long short-term memory (LSTM) model in Gluon and are training it using the Amazon SageMaker API. The Data Scientists are training the model using a time series with 10 million examples. Training is currently taking 100 hours and Data Scientists are attempting to speed it up by using multiple GPUs. However, when they modified the code to use 8 GPUs, it is running slightly slower than on 1 GPU. The current hyperparameter settings are:

Hyperparameter: Value
Batch size: 128
Clip gradient: 10
Autoregressive window: 160
Learning rate: 0.01
Epochs: 80

Which of the following changes together are recommended to speed up training on 8 GPUs while maintaining test accuracy? (Select TWO.)

Options

A. Increase the batch size by a factor of 8
B. Increase the clip gradient by 8
C. Increase the autoregressive window by a factor of 8
D. Increase the learning rate by a factor of 8
E. Decrease the number of epochs by 20
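
To make the numbers in the stem concrete, here is a minimal sketch of the arithmetic behind the batch-size setting. It assumes synchronous data-parallel training in which each global batch is split evenly across the GPUs, and it treats the 10 million examples as the number of training samples seen per epoch; neither assumption is stated in the question. The helper names `per_device_batch` and `steps_per_epoch` are hypothetical and are not part of Gluon or the SageMaker API.

```python
import math

# Values taken from the question stem.
NUM_EXAMPLES = 10_000_000
BATCH_SIZE = 128
NUM_GPUS = 8


def per_device_batch(global_batch: int, num_devices: int) -> int:
    """Examples each device processes per step when a global batch is split evenly (assumed)."""
    if global_batch % num_devices != 0:
        raise ValueError("global batch must divide evenly across devices")
    return global_batch // num_devices


def steps_per_epoch(num_examples: int, global_batch: int) -> int:
    """Parameter updates per epoch, counting a final partial batch."""
    return math.ceil(num_examples / global_batch)


# Compare the current setting with the batch size described in option A.
for label, batch in [("current", BATCH_SIZE), ("option A", BATCH_SIZE * NUM_GPUS)]:
    print(label, per_device_batch(batch, NUM_GPUS), steps_per_epoch(NUM_EXAMPLES, batch))
```

Under these assumptions, the current batch size of 128 leaves each of the 8 GPUs with 16 examples per step, while the batch size in option A gives each GPU 128 examples and reduces the number of updates per epoch.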


Topics

#Distributed Training #Hyperparameter Optimization #Model Training Performance #Batch Size Scaling
