DEA-C01 Exam Questions
308 real DEA-C01 exam questions with expert-verified answers and explanations. Page 5 of 7.
- Question #201: Data Operations and Support
A data engineer maintains custom Python scripts that perform a data formatting process that many AWS Lambda functions use. When the data engineer needs to modify the Python scripts...
AWS Lambda, Lambda Layers, Code Reusability, Dependency Management
- Question #202: Data Security and Governance
A company stores customer data in an Amazon S3 bucket. Multiple teams in the company want to use the customer data for downstream analysis. The company needs to ensure that the tea...
S3 Object Lambda, PII Redaction, Data Security, On-demand Transformation
- Question #203: Data Security and Governance
A company stores its processed data in an S3 bucket. The company has a strict data access policy. The company uses IAM roles to grant teams within the company different levels of a...
CloudTrail, S3, Access Control, Security Monitoring
- Question #204: Data Ingestion and Transformation
A company needs to load customer data that comes from a third party into an Amazon Redshift data warehouse. The company stores order data and product data in the same data warehous...
Amazon Redshift, SUPER data type, JSON data, Data ingestion
- Question #205: Data Ingestion and Transformation
A company wants to analyze sales records that the company stores in a MySQL database. The company wants to correlate the records with sales opportunities identified by Salesforce....
Data Ingestion, ETL, Serverless Architecture, Data Lake
- Question #206: Data Store Management
A company stores server logs in an Amazon S3 bucket. The company needs to keep the logs for 1 year. The logs are not required after 1 year. A data engineer needs a solution to auto...
S3 Lifecycle, Data Retention, Operational Overhead, Amazon S3
- Question #207: Data Ingestion and Transformation
A company is designing a serverless data processing workflow in AWS Step Functions that involves multiple steps. The processing workflow ingests data from an external API, transfor...
AWS Step Functions, Conditional Logic, Workflow Orchestration, Serverless Data Processing
- Question #208: Data Operations and Support
A data engineer created a table named cloudtrail_logs in Amazon Athena to query AWS CloudTrail logs and prepare data for audits. The data engineer needs to write a query to display...
SQL Querying, Amazon Athena, AWS CloudTrail, Log Analysis
- Question #209: Data Security and Governance
An online retailer uses multiple delivery partners to deliver products to customers. The delivery partners send order summaries to the retailer. The retailer stores the order summa...
PII Detection, Amazon Macie, Data Security, Sensitive Data Discovery
- Question #210: Data Security and Governance
A company has an Amazon Redshift data warehouse that users access by using a variety of IAM roles. More than 100 users access the data warehouse every day. The company wants to con...
Amazon Redshift, Access Control, Role-Based Access Control, Data Security
- Question #211: Data Security and Governance
A company uses Amazon DataZone as a data governance and business catalog solution. The company stores data in an Amazon S3 data lake. The company uses AWS Glue with an AWS Glue Dat...
AWS Glue Data Quality, Amazon DataZone, Data Governance, AWS Glue Data Catalog
- Question #212: Data Security and Governance
A company has a data warehouse in Amazon Redshift. To comply with security regulations, the company needs to log and store all user activities and connection activities for the dat...
Redshift logging, Audit trails, Amazon S3, Data security
- Question #213: Data Ingestion and Transformation
A company wants to migrate a data warehouse from Teradata to Amazon Redshift. Which solution will meet this requirement with the LEAST operational effort?
Data Migration, AWS SCT, AWS DMS, Amazon Redshift
- Question #214: Data Store Management
A company uses a variety of AWS and third-party data stores. The company wants to consolidate all the data into a central data warehouse to perform analytics. Users need fast respo...
Data Warehousing, Amazon Redshift Serverless, Operational Overhead, Analytics Performance
- Question #215: Data Ingestion and Transformation
A data engineer uses Amazon Kinesis Data Streams to ingest and process records that contain user behavior data from an application every day. The data engineer notices that the dat...
Kinesis Data Streams, Throttling, Partition Key, Sharding
- Question #216: Data Operations and Support
A company has a data processing pipeline that includes several dozen steps. The data processing pipeline needs to send alerts in real time when a step fails or succeeds. The data p...
Event-driven architecture, Pipeline monitoring, AWS Step Functions, Amazon EventBridge
- Question #217: Data Operations and Support
A company has an application that uses an Amazon API Gateway REST API and an AWS Lambda function to retrieve data from an Amazon DynamoDB instance. Users recently reported intermit...
AWS Lambda, Lambda Concurrency, Throttling, Performance Tuning
- Question #218: Data Security and Governance
A company has a JSON file that contains personally identifiable information (PII) data and non-PII data. The company needs to make the data available for querying and analysis. T...
Data Security, Data Governance, AWS Lake Formation, PII
- Question #219: Data Store Management
A company uses AWS Key Management Service (AWS KMS) to encrypt an Amazon Redshift cluster. The company wants to configure a cross-Region snapshot of the Redshift cluster as part of...
Redshift Snapshot Management, Cross-Region Disaster Recovery, KMS Key Management, Data Store Resiliency
- Question #220: Data Ingestion and Transformation
A company is using Amazon S3 to build a data lake. The company needs to replicate records from multiple source databases into Apache Parquet format. Most of the source databases ar...
AWS DMS, Change Data Capture, Data Ingestion, Hybrid Data Replication
- Question #221: Data Ingestion and Transformation
A data engineer needs to optimize the performance of a data pipeline that handles retail orders. Data about the orders is ingested daily into an Amazon S3 bucket. The data engineer...
Data Partitioning, Amazon Athena, S3 Data Lake, Cost Optimization
- Question #222: Data Ingestion and Transformation
A data engineer has two datasets that contain sales information for multiple cities and states. One dataset is named reference, and the other dataset is named primary. The data eng...
AWS Glue Data Quality, DQDL, Referential Integrity, Data Validation
- Question #223: Data Ingestion and Transformation
A company has an on-premises PostgreSQL database that contains customer data. The company wants to migrate the customer data to an Amazon Redshift data warehouse. The company has e...
Database Migration, Change Data Capture, AWS DMS, Amazon Redshift
- Question #224: Data Store Management
A company has several new datasets in CSV and JSON formats. A data engineer needs to make the data available to a team of data analysts who will analyze the data by using SQL queri...
Data Lake, AWS Glue Data Catalog, Amazon S3, Serverless SQL Analytics
- Question #225: Data Ingestion and Transformation
A retail company stores order information in an Amazon Aurora table named Orders. The company needs to create operational reports from the Orders table with minimal latency. The Or...
Aurora zero-ETL integration, Amazon Redshift, Data replication, Analytical reporting
- Question #226: Data Ingestion and Transformation
A company is building a new application that ingests CSV files into Amazon Redshift. The company has developed the frontend for the application. The files are stored in an Amazon S...
S3 Event Notifications, Amazon EventBridge, AWS Lambda, Data Ingestion
- Question #227: Data Security and Governance
A company stores sensitive data in an Amazon Redshift table. The company needs to give specific users the ability to access the sensitive data. The company must not create duplicat...
Amazon Redshift, Dynamic Data Masking, Data Security, IAM Roles
- Question #228: Data Security and Governance
A data engineer uses AWS Lake Formation to manage access to data that is stored in an Amazon S3 bucket. The data engineer configures an AWS Glue crawler to discover data at a speci...
AWS Lake Formation, AWS Glue Crawler, S3, Data Governance
- Question #229: Data Security and Governance
A company built a data lake and a data warehouse on AWS. The company wants to implement a data catalog to enhance the current data storage solutions. The company wants to have the...
Data Catalog, Business Metadata, Amazon DataZone, Operational Overhead
- Question #230: Data Ingestion and Transformation
A data engineer is using an AWS Glue ETL job to remove outdated customer records from a table that contains customer account information. The data engineer is using the following S...
AWS Glue, ETL, SQL, Data Transformation
- Question #231: Data Ingestion and Transformation
A company receives marketing campaign data from a vendor. The company ingests the data into an Amazon S3 bucket every 40 to 60 minutes. The data is in CSV format. File sizes are be...
AWS Lambda, Amazon Redshift, S3 Event Notifications, Data Ingestion
- Question #232: Data Ingestion and Transformation
A company wants to build a dimension table in an Amazon S3 bucket. The bucket contains historical data that includes 10 million records. The historical data is 1 TB in size. A data...
Data Lake Updates, Apache Spark, Amazon EMR, S3 Data Processing
- Question #233: Data Ingestion and Transformation
A data engineer develops an AWS Glue Apache Spark ETL job to perform transformations on a dataset. When the data engineer runs the job, the job returns an error that reads, "No spa...
AWS Glue, Apache Spark, Performance Tuning, Data Skew, Troubleshooting
- Question #234: Data Security and Governance
A company has a data pipeline that uses an Amazon RDS instance, AWS Glue jobs, and an Amazon S3 bucket. The RDS instance and AWS Glue jobs run in a private subnet of a VPC and in t...
Security Groups, VPC Networking, AWS Glue, Amazon RDS
- Question #235: Data Ingestion and Transformation
A company builds a new data pipeline to process data for business intelligence reports. Users have noticed that data is missing from the reports. A data engineer needs to add a dat...
Data Quality, AWS Glue Data Quality, ETL Pipeline, Operational Overhead
- Question #236: Data Ingestion and Transformation
A company is setting up a data pipeline in AWS. The pipeline extracts client data from Amazon S3 buckets, performs quality checks, and transforms the data. The pipeline stores the...
AWS Glue ETL, Data Pipelines, Data Transformation, Cost Optimization
- Question #237: Data Ingestion and Transformation
A company uses Amazon Redshift as a data warehouse solution. One of the datasets that the company stores in Amazon Redshift contains data for a vendor. Recently, the vendor asked t...
AWS Glue, ETL, Data Transfer, Redshift to S3
- Question #238: Data Security and Governance
A company uses an Amazon Redshift cluster as a data warehouse that is shared across two departments. To comply with a security policy, each department must have unique access permi...
Redshift, Data Access Control, Schema Design, Least Privilege
- Question #239: Data Ingestion and Transformation
A company wants to ingest streaming data into an Amazon Redshift data warehouse from an Amazon Managed Streaming for Apache Kafka (Amazon MSK) cluster. A data engineer needs to dev...
AWS Glue Streaming ETL, Amazon MSK, Amazon Redshift, Streaming Data Ingestion
- Question #240: Data Ingestion and Transformation
A sales company uses AWS Glue ETL to collect, process, and ingest data into an Amazon S3 bucket. The AWS Glue pipeline creates a new file in the S3 bucket every hour. File sizes va...
AWS Glue ETL, Performance Optimization, Small Files Problem, DynamicFrame
- Question #241: Data Ingestion and Transformation
A company wants to combine data from multiple software as a service (SaaS) applications for analysis. A data engineering team needs to use Amazon QuickSight to perform the analysis...
SaaS data integration, Amazon AppFlow, Data ingestion, Operational efficiency
- Question #242: Data Ingestion and Transformation
A company runs multiple applications on AWS. The company configured each application to output logs. The company wants to query and visualize the application logs in near real time...
Log Analytics, OpenSearch Service, Kinesis Data Firehose, Real-time Data Processing
- Question #243: Data Ingestion and Transformation
An ecommerce company processes millions of orders each day. The company uses AWS Glue ETL to collect data from multiple sources, clean the data, and store the data in an Amazon S3...
Data Lake Optimization, Apache Parquet, AWS Glue ETL, Storage Cost Optimization
- Question #244: Data Store Management
A data engineer is optimizing query performance in Amazon Athena notebooks that use Apache Spark to analyze large datasets that are stored in Amazon S3. The data is partitioned. An...
Athena Query Optimization, Data Partitioning, S3 Data Lakes, Apache Spark
- Question #245: Data Store Management
A company manages an Amazon Redshift data warehouse. The data warehouse is in a public subnet inside a custom VPC. A security group allows only traffic from within itself. An ACL i...
Amazon Redshift, AWS QuickSight, Security Groups, VPC Networking
- Question #246: Data Ingestion and Transformation
A data engineer is building a data pipeline. A large data file is uploaded to an Amazon S3 bucket once each day at unpredictable times. An AWS Glue workflow uses hundreds of worker...
S3 Event Notifications, AWS Glue Workflows, Event-Driven Architecture, Data Pipeline Orchestration
- Question #247: Data Ingestion and Transformation
A data engineer needs to run a data transformation job whenever a user adds a file to an Amazon S3 bucket. The job will run for less than 1 minute. The job must send the output thr...
AWS Lambda, S3 Event Notifications, Serverless, Data Transformation
- Question #248: Data Ingestion and Transformation
A company uses Amazon S3 and AWS Glue Data Catalog to manage a data lake that contains contact information for customers. The company uses PySpark and AWS Glue jobs with a DynamicF...
Data Quality, Data Transformation, AWS Glue, PySpark Schema
- Question #249: Data Operations and Support
A data engineer is troubleshooting an AWS Glue workflow that occasionally fails. The engineer determines that the failures are a result of data quality issues. A business reporting...
Amazon SNS, Email Notifications, Workflow Monitoring, AWS Messaging
- Question #250: Data Operations and Support
A company uses AWS Glue jobs to implement several data pipelines. The pipelines are critical to the company. The company needs to implement a monitoring mechanism that will alert s...
AWS Glue, EventBridge, SNS, Monitoring and Alerting