DEA-C01 Exam Questions
308 real DEA-C01 exam questions with expert-verified answers and explanations. Page 4 of 7.
- Question #151Data Operations and Support
A data engineer uses Amazon Managed Workflows for Apache Airflow (Amazon MWAA) to run data pipelines in an AWS account. A workflow recently failed to run. The data engineer needs t...
MWAA LoggingApache Airflow LogsWorkflow TroubleshootingTask Logs - Question #152Data Security and Governance
A finance company uses Amazon Redshift as a data warehouse. The company stores the data in a shared Amazon S3 bucket. The company uses Amazon Redshift Spectrum to access the data t...
Redshift SecurityVPC NetworkingRedshift SpectrumData Privacy - Question #153Data Ingestion and Transformation
Files from multiple data sources arrive in an Amazon S3 bucket on a regular basis. A data engineer wants to ingest new files into Amazon Redshift in near real time when the new fil...
S3 Event NotificationsAWS LambdaRedshift IngestionNear Real-time Data Processing - Question #154Data Ingestion and Transformation
A technology company currently uses Amazon Kinesis Data Streams to collect log data in real time. The company wants to use Amazon Redshift for downstream real-time queries and to e...
Kinesis Data StreamsAmazon RedshiftStreaming IngestionReal-time Data - Question #155Data Ingestion and Transformation
A company maintains a data warehouse in an on-premises Oracle database. The company wants to build a data lake on AWS. The company wants to load data warehouse tables into Amazon S...
Data IngestionChange Data CaptureAWS DMSOperational Efficiency - Question #156Data Ingestion and Transformation
A company is building a data lake for a new analytics team. The company is using Amazon S3 for storage and Amazon Athena for query analysis. All data that is in Amazon S3 is in Apa...
Data IngestionAWS DMSOn-premises to CloudChange Data Capture - Question #157Data Ingestion and Transformation
A transportation company wants to track vehicle movements by capturing geolocation records. The records are 10 bytes in size. The company receives up to 10.000 records every second...
Kinesis Data StreamsData IngestionThroughput OptimizationKinesis Producer Library (KPL) - Question #158Data Ingestion and Transformation
An investment company needs to manage and extract insights from a volume of semi-structured data that grows continuously. A data engineer needs to deduplicate the semi-structured d...
Data DeduplicationFuzzy MatchingAWS GlueData Transformation - Question #159Data Ingestion and Transformation
A company is building an inventory management system and an inventory reordering system to automatically reorder products. Both systems use Amazon Kinesis Data Streams. The invento...
Kinesis Data StreamsKinesis Client Library (KCL)Data DuplicationAt-least-once Delivery - Question #160Data Ingestion and Transformation
An ecommerce company operates a complex order fulfilment process that spans several operational systems hosted in AWS. Each of the operational systems has a Java Database Connectiv...
Data IngestionETLData WarehousingAWS Glue - Question #161Data Store Management
A data engineer needs to use Amazon Neptune to develop graph applications. Which programming languages should the engineer use to develop the graph applications? (Choose two.)
Amazon NeptuneGraph DatabasesGremlinSPARQL - Question #162Data Ingestion and Transformation
A mobile gaming company wants to capture data from its gaming app. The company wants to make the data available to three internal consumers of the data. The data records are approx...
Kinesis Data StreamsData IngestionThroughput OptimizationReal-time Data Processing - Question #163Data Store Management
A retail company uses an Amazon Redshift data warehouse and an Amazon S3 bucket. The company ingests retail order data into the S3 bucket every day. The company stores all order da...
Redshift SpectrumData PartitioningColumnar StorageQuery Optimization - Question #164Data Security and Governance
A company stores customer records in Amazon S3. The company must not delete or modify the customer record data for 7 years after each record is created. The root user also must not...
S3 Object LockData RetentionCompliance ModeData Security - Question #165Data Store Management
A data engineer needs to create a new empty table in Amazon Athena that has the same schema as an existing table named old_table. Which SQL statement should the data engineer use t...
AthenaSQLDDLTable Creation - Question #166Data Ingestion and Transformation
A data engineer needs to create an Amazon Athena table based on a subset of data from an existing Athena table named cities_world. The cities_world table contains cities that are l...
SQL DMLAmazon AthenaData TransformationSubsetting Data - Question #167Data Security and Governance
A company implements a data mesh that has a central governance account. The company needs to catalog all data in the governance account. The governance account uses AWS Lake Format...
Redshift Data SharingLake Formation IntegrationColumn-Level AccessData Governance - Question #168Data Ingestion and Transformation
A company has a data lake in Amazon S3. The company uses AWS Glue to catalog data and AWS Glue Studio to implement data extract, transform, and load (ETL) pipelines. The company ne...
AWS Glue Data QualityETL PipelinesData Quality ManagementAWS Glue Transforms - Question #169Data Operations and Support
A company has an application that uses a microservice architecture. The company hosts the application on an Amazon Elastic Kubernetes Services (Amazon EKS) cluster. The company wan...
EKS MonitoringLog & Trace ManagementObservabilityOpenSearch Service - Question #170Data Ingestion and Transformation
A company has a gaming application that stores data in Amazon DynamoDB tables. A data engineer needs to ingest the game data into an Amazon OpenSearch Service cluster. Data updates...
DynamoDB StreamsAWS LambdaChange Data Capture (CDC)Near Real-time Ingestion - Question #171Data Store Management
A company uses Amazon Redshift as its data warehouse service. A data engineer needs to design a physical data model. The data engineer encounters a de-normalized table that is grow...
Redshift Data DistributionAUTO Distribution StylePhysical Data ModelingData Warehouse Performance - Question #172Data Ingestion and Transformation
A retail company is expanding its operations globally. The company needs to use Amazon QuickSight to accurately calculate currency exchange rates for financial reports. The company...
QuickSightCalculated FieldsSPICEData Transformation - Question #173Data Ingestion and Transformation
A company has three subsidiaries. Each subsidiary uses a different data warehousing solution. The first subsidiary hosts its data warehouse in Amazon Redshift. The second subsidiar...
Data LakeFederated QueryETL PipelineApache Iceberg - Question #174Data Security and Governance
A company is building a data stream processing application. The application runs in an Amazon Elastic Kubernetes Service (Amazon EKS) cluster. The application stores processed data...
IAM RolesEKS SecurityDynamoDB AccessCredential Management - Question #175Data Security and Governance
A data engineer needs to onboard a new data producer into AWS. The data producer needs to migrate data products to AWS. The data producer maintains many data pipelines that support...
Hybrid ConnectivitySecrets ManagementIAMSecure Data Transfer - Question #176Data Store Management
A data engineer configured an AWS Glue Data Catalog for data that is stored in Amazon S3 buckets. The data engineer needs to configure the Data Catalog to receive incremental updat...
Glue Data CatalogS3 EventsServerless ArchitectureIncremental Updates - Question #177Data Ingestion and Transformation
A company uses AWS Glue Data Catalog to index data that is uploaded to an Amazon S3 bucket every day. The company uses a daily batch processes in an extract, transform, and load (E...
AWS Glue Data QualityData ValidationETL MonitoringSNS Notifications - Question #178Data Security and Governance
A company stores customer data that contains personally identifiable information (PII) in an Amazon Redshift cluster. The company's marketing, claims, and analytics teams need to b...
Redshift securityData maskingRole-based access controlPII data protection - Question #179Data Store Management
A financial company recently added more features to its mobile app. The new features required the company to create a new topic in an existing Amazon Managed Streaming for Apache K...
Amazon MSKStorage ManagementCloudWatch MonitoringTroubleshooting - Question #180Data Security and Governance
A data engineer needs to build an enterprise data catalog based on the company's Amazon S3 buckets and Amazon RDS databases. The data catalog must include storage format metadata f...
AWS GlueData CatalogCrawlersClassifiers - Question #181Data Security and Governance
A company analyzes data in a data lake every quarter to perform inventory assessments. A data engineer uses AWS Glue DataBrew to detect any personally identifiable formation (PII)...
AWS Glue DataBrewData QualityPII DetectionOperational Overhead - Question #182Data Ingestion and Transformation
A company receives a data file from a partner each day in an Amazon S3 bucket. The company uses a daily AWS Glue extract, transform, and load (ETL) pipeline to clean and transform...
AWS GlueData QualityETLS3 - Question #183Data Store Management
A marketing company uses Amazon S3 to store marketing data. The company uses versioning in some buckets. The company runs several jobs to read and load data into the buckets. To he...
S3 Storage ManagementCost OptimizationS3 Storage LensStorage Analytics - Question #184Data Ingestion and Transformation
A gaming company uses Amazon Kinesis Data Streams to collect clickstream data. The company uses Amazon Data Firehose delivery streams to store the data in JSON format in Amazon S3....
Kinesis FirehoseData FormatsAthena Cost OptimizationS3 Partitioning - Question #185Data Store Management
A company needs a solution to manage costs for an existing Amazon DynamoDB table. The company also needs to control the size of the table. The solution must not disrupt any ongoing...
DynamoDB TTLData Lifecycle ManagementCost OptimizationAutomated Deletion - Question #186Data Security and Governance
A company uses Amazon S3 to store data and Amazon QuickSight to create visualizations, The company has an S3 bucket in an AWS account named Hub-Account. The S3 bucket is encrypted...
Cross-account accessKMS key policiesS3 encryptionQuickSight integration - Question #187Data Ingestion and Transformation
A car sales company maintains data about cars that are listed for sale in an area. The company receives data about new car listings from vendors who upload the data daily as compre...
Data IngestionETLServerless ArchitectureWorkflow Orchestration - Question #188Data Ingestion and Transformation
A company has AWS resources in multiple AWS Regions. The company has an Amazon EFS file system in each Region where the company operates. The company's data science team operates w...
AWS DataSyncAmazon EFSCross-Region Data TransferServerless Data Processing - Question #189Data Security and Governance
A company hosts its applications on Amazon EC2 instances. The company must use SSL/TLS connections that encrypt data in transit to communicate securely with AWS infrastructure that...
AWS Certificate ManagerSSL/TLSCertificate Lifecycle ManagementData Security - Question #190Data Security and Governance
A company saves customer data to an Amazon S3 bucket. The company uses server-side encryption with AWS KMS keys (SSE-KMS) to encrypt the bucket. The dataset includes personally ide...
PII MaskingData TransformationAccess ControlData Security - Question #191Data Security and Governance
A data engineer is launching an Amazon EMR cluster. The data that the data engineer needs to load into the new cluster is currently in an Amazon S3 bucket. The data engineer needs...
Amazon EMRData EncryptionAWS KMSSecurity Configuration - Question #192Data Ingestion and Transformation
A retail company is using an Amazon Redshift cluster to support real-time inventory management. The company has deployed an ML model on a real- time endpoint in Amazon SageMaker. T...
Redshift MLSageMaker IntegrationReal-time PredictionsData Warehousing - Question #193Data Ingestion and Transformation
A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket. The process needs to...
AWS Glue DataBrewData TransformationETLLow-Code Data Prep - Question #194Data Store Management
A company uses Amazon Redshift as its data warehouse. Data encoding is applied to the existing tables of the data warehouse. A data engineer discovers that the compression encoding...
Amazon RedshiftData CompressionData OptimizationANALYZE COMPRESSION - Question #195Data Store Management
The company stores a large volume of customer records in Amazon S3. To comply with regulations, the company must be able to access new customer records immediately for the first 30...
S3 Storage ClassesS3 Lifecycle PoliciesCost OptimizationData Tiering - Question #196Data Ingestion and Transformation
A data engineer is using Amazon QuickSight to build a dashboard to report a company's revenue in multiple AWS Regions. The data engineer wants the dashboard to display the total re...
Amazon QuickSightLevel-Aware CalculationsData AggregationBusiness Intelligence - Question #197Data Security and Governance
A retail company stores customer data in an Amazon S3 bucket. Some of the customer data contains personally identifiable information (PII) about customers. The company must not sha...
Amazon MaciePII detectionData securityAutomated discovery - Question #198Data Store Management
A data engineer needs to create an empty copy of an existing table in Amazon Athena to perform data processing tasks. The existing table in Athena contains 1,000 rows. Which query...
Athena SQLCREATE TABLE AS SELECT (CTAS)Schema Only CopyEmpty Table Creation - Question #199Data Operations and Support
A company has a data lake in Amazon S3. The company collects AWS CloudTrail logs for multiple applications. The company stores the logs in the data lake, catalogs the logs in AWS G...
Amazon AthenaAWS Glue Data CatalogData PartitioningTroubleshooting - Question #200Data Operations and Support
A data engineer wants to orchestrate a set of extract, transform, and load (ETL) jobs that run on AWS. The ETL jobs contain tasks that must run Apache Spark jobs on Amazon EMR, mak...
ETL OrchestrationWorkflow ManagementApache AirflowData Pipelines