A company is creating an ML model to identify defects in a product. The company has gathered a dataset and has stored the dataset in TIFF format in Amazon S3. The dataset contains 200 images in which the most common defects are visible. The dataset also contains 1,800 images in which there is no defect visible. An ML engineer trains the model and notices poor performance in some classes. The ML engineer identifies a class imbalance problem in the dataset. What should the ML engineer do to solve this problem?

Question

Accepted Answer

C. Oversample the 200 images in which the most common defects are visible.

Answer

A. Use a few hundred images and Amazon Rekognition Custom Labels to train a new model.

Answer

B. Undersample the 200 images in which the most common defects are visible.

Answer

D. Use all 2,000 images and Amazon Rekognition Custom Labels to train a new model.
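
The oversampling approach in option C can be sketched as follows. This is a minimal illustration in plain Python, not the method of any particular AWS service: the function name oversample_minority, the file keys, and the label strings are all hypothetical, and the sketch assumes random duplication of minority-class samples until every class matches the size of the largest class.

```python
import random
from collections import Counter


def oversample_minority(samples, labels, seed=0):
    """Randomly duplicate samples of smaller classes until every class
    has as many samples as the largest class (hypothetical helper)."""
    rng = random.Random(seed)

    # Group samples by their class label.
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_class.values())

    out_samples, out_labels = [], []
    for label, items in by_class.items():
        # Draw extra copies with replacement from the existing samples.
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for sample in items + extra:
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


# Hypothetical object keys standing in for the TIFF images in S3.
samples = [f"defect/{i}.tiff" for i in range(200)]
samples += [f"no_defect/{i}.tiff" for i in range(1800)]
labels = ["defect"] * 200 + ["no_defect"] * 1800

balanced_samples, balanced_labels = oversample_minority(samples, labels)
print(Counter(balanced_labels))
```

After balancing, both classes contain 1,800 entries, with each defect image repeated several times. In practice, oversampling is usually applied only to the training split and is often combined with image augmentation so that the repeated defect images are not exact copies.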
