DEA-C01 · Question #134
DEA-C01 Question #134: Real Exam Question with Answer & Explanation
The correct answer is A: Ensure that the object format, compression type, and schema are the same for each object.. To ensure an AWS Glue crawler creates a single table from S3 objects, the objects must have a consistent schema, format, and compression, along with a consistent S3 prefix structure.
Question
A finance company receives data from third-party data providers and stores the data as objects in an Amazon S3 bucket. The company ran an AWS Glue crawler on the objects to create a data catalog. The AWS Glue crawler created multiple tables. However, the company expected that the crawler would create only one table. The company needs a solution that will ensure the AVS Glue crawler creates only one table. Which combination of solutions will meet this requirement? (Choose two.)
Options
- AEnsure that the object format, compression type, and schema are the same for each object.
- BEnsure that the object format and schema are the same for each object. Do not enforce
- CEnsure that the schema is the same for each object. Do not enforce consistency for the file format
- DEnsure that the structure of the prefix for each S3 object name is consistent.
- EEnsure that all S3 object names follow a similar pattern.
Explanation
To ensure an AWS Glue crawler creates a single table from S3 objects, the objects must have a consistent schema, format, and compression, along with a consistent S3 prefix structure.
Common mistakes.
- B. While consistent format and schema are crucial, compression type also influences how crawlers group data, and omitting it as a consistency factor might still lead to multiple tables.
- C. Inconsistent file formats will cause the Glue crawler to create separate tables, as it assumes different formats represent different underlying data structures.
- E. A 'similar pattern' is less precise than a 'consistent prefix structure,' which explicitly defines the logical grouping for the crawler based on how it identifies table boundaries.
Concept tested. AWS Glue crawler table grouping logic
Reference. https://docs.aws.amazon.com/glue/latest/dg/how-it-works-crawlers.html
Topics
Community Discussion
No community discussion yet for this question.