MLS-C01 Question #132: Real Exam Question with Answer & Explanation
The correct answer is B: Create a FindMatches machine learning transform in AWS Glue.
Question
A data scientist needs to identify fraudulent user accounts for a company's ecommerce platform. The company wants the ability to determine if a newly created account is associated with a previously known fraudulent user. The data scientist is using AWS Glue to cleanse the company's application logs during ingestion. Which strategy will allow the data scientist to identify fraudulent accounts?
Options
- A. Execute the built-in FindDuplicates Amazon Athena query.
- B. Create a FindMatches machine learning transform in AWS Glue.
- C. Create an AWS Glue crawler to infer duplicate accounts in the source data.
- D. Search for duplicate accounts in the AWS Glue Data Catalog.
Explanation
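The FindMatches transform in AWS Glue finds records in the source data that refer to the same entity, even when they share no unique identifier and their fields do not match exactly. That lets the data scientist link a newly created account to a previously known fraudulent user during ingestion. A labeling file, either generated by Glue or provided by you, teaches the transform which records count as matches. The other options do not fit: Athena has no built-in FindDuplicates query, a Glue crawler infers schemas rather than duplicate records, and the Glue Data Catalog holds table metadata rather than the account records themselves. Reference: https://aws.amazon.com/blogs/big-data/integrate-and-deduplicate-datasets-using-aws-lake-formation-findmatches/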
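As a rough illustration of option B, the sketch below assembles a request for the Glue `CreateMLTransform` API and shows how it might be sent with boto3. The transform name, database, table, role ARN, primary key column, and the 0.9 tradeoff value are all hypothetical placeholders, not values from the question. The boto3 call is wrapped in a function that is never invoked here, so the request can be inspected without AWS credentials.

```python
def build_find_matches_request(database, table, role_arn, primary_key):
    """Assemble keyword arguments for Glue's CreateMLTransform API."""
    return {
        "Name": "fraud-account-matcher",  # hypothetical transform name
        "InputRecordTables": [
            {"DatabaseName": database, "TableName": table}
        ],
        "Parameters": {
            "TransformType": "FIND_MATCHES",
            "FindMatchesParameters": {
                # Column that uniquely identifies each source record.
                "PrimaryKeyColumnName": primary_key,
                # Values nearer 1.0 favor precision (fewer false matches);
                # 0.9 is an assumed starting point, not a recommendation.
                "PrecisionRecallTradeoff": 0.9,
            },
        },
        "Role": role_arn,
    }


def create_transform(request):
    """Send the request to AWS Glue; needs boto3 and valid credentials."""
    import boto3  # imported lazily so the builder works without boto3

    glue = boto3.client("glue")
    return glue.create_ml_transform(**request)["TransformId"]


# Hypothetical catalog names and IAM role, for illustration only.
request = build_find_matches_request(
    database="ecommerce_logs",
    table="user_accounts",
    role_arn="arn:aws:iam::123456789012:role/GlueFindMatchesRole",
    primary_key="account_id",
)
```

After the transform exists, it still has to be taught with labeled examples before its match quality is useful, which is the labeling-file step the explanation mentions.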