DEA-C01 Exam Questions
308 real DEA-C01 exam questions with expert-verified answers and explanations. Page 3 of 7.
- Question #101: Data Security and Governance
A company has a data lake on AWS. The data lake ingests sources of data from business units. The company uses Amazon Athena for queries. The storage layer is Amazon S3 with an AWS...
AWS Lake Formation, Data Lake Security, Column-level Access Control, Amazon Athena
- Question #102: Data Ingestion and Transformation
A company has developed several AWS Glue extract, transform, and load (ETL) jobs to validate and transform data from Amazon S3. The ETL jobs load the data into Amazon RDS for MySQL...
AWS Glue, ETL Jobs, Job Bookmarks, Incremental Processing
- Question #103: Data Ingestion and Transformation
An online retail company has an application that runs on Amazon EC2 instances that are in a VPC. The company wants to collect flow logs for the VPC and analyze network traffic. Whi...
VPC Flow Logs, Amazon S3, Amazon Athena, Cost Optimization
- Question #104: Data Store Management
A retail company stores transactions, store locations, and customer information tables in four reserved ra3.4xlarge Amazon Redshift cluster nodes. All three tables use even table d...
Redshift, Table Distribution, Query Optimization, Cost Optimization
- Question #105: Data Store Management
A company has a data warehouse that contains a table that is named Sales. The company stores the table in Amazon Redshift. The table includes a column that is named city_name. The...
Amazon Redshift, SQL, Regular Expressions, Data Querying
- Question #106: Data Operations and Support
A company needs to send customer call data from its on-premises PostgreSQL database to AWS to generate near real-time insights. The solution must capture and load updates from oper...
AWS DMS, Change Data Capture, CloudWatch Metrics, Troubleshooting
- Question #107: Data Ingestion and Transformation
A lab uses IoT sensors to monitor humidity, temperature, and pressure for a project. The sensors send 100 KB of data every 10 seconds. A downstream process will read the data from...
Real-time Data Ingestion, Kinesis Data Streams, Apache Flink, Low-latency Data Delivery
- Question #108: Data Ingestion and Transformation
A company wants to use machine learning (ML) to perform analytics on data that is in an Amazon S3 data lake. The company has two data transformation requirements that will give con...
Data Transformation, AWS Glue, Amazon EMR, Data Lake Architecture
- Question #109: Data Ingestion and Transformation
A retail company uses AWS Glue for extract, transform, and load (ETL) operations on a dataset that contains information about customer orders. The company wants to implement specif...
AWS Glue, Data Quality, ETL, Data Validation
- Question #110: Data Store Management
An insurance company stores transaction data that the company compressed with gzip. The company needs to query the transaction data for occasional audits. Which solution will meet...
S3 Glacier, S3 Glacier Select, Cost Optimization, Data Archiving
- Question #111: Data Operations and Support
A data engineer finished testing an Amazon Redshift stored procedure that processes and inserts data into a table that is not mission critical. The engineer wants to automatically...
Amazon Redshift, Stored Procedures, Scheduling, Cost Optimization
- Question #112: Data Store Management
A marketing company collects clickstream data. The company sends the clickstream data to Amazon Kinesis Data Firehose and stores the clickstream data in Amazon S3. The company want...
Amazon Athena, Amazon QuickSight, Serverless Analytics, Cost Optimization
- Question #113: Data Operations and Support
A data engineer is building a data orchestration workflow. The data engineer plans to use a hybrid model that includes some on-premises resources and some resources that are in the...
Data Orchestration, Hybrid Cloud, Apache Airflow, Open Source
- Question #114: Data Store Management
A gaming company uses a NoSQL database to store customer information. The company is planning to migrate to AWS. The company needs a fully managed AWS solution that will handle hig...
Amazon DynamoDB, NoSQL Databases, Fully Managed Services, Global High Availability
- Question #115: Data Operations and Support
A data engineer creates an AWS Lambda function that an Amazon EventBridge event will invoke. When the data engineer tries to invoke the Lambda function by using an EventBridge even...
AWS Lambda, Amazon EventBridge, IAM Permissions, Resource-based Policy
- Question #116: Data Security and Governance
A company uses a data lake that is based on an Amazon S3 bucket. To comply with regulations, the company must apply two layers of server-side encryption to files that are uploaded...
S3 Encryption, DSSE-KMS, Data Security, Compliance
- Question #117: Data Operations and Support
A data engineer notices that Amazon Athena queries are held in a queue before the queries run. How can the data engineer prevent the queries from queueing?
Amazon Athena, Query Performance, Provisioned Capacity, Workgroups
- Question #118: Data Ingestion and Transformation
A data engineer needs to debug an AWS Glue job that reads from Amazon S3 and writes to Amazon Redshift. The data engineer enabled the bookmark feature for the AWS Glue job. The dat...
Glue Bookmarks, Glue Job Configuration, Data Reprocessing, Debugging Glue Jobs
- Question #119: Data Ingestion and Transformation
An ecommerce company wants to use AWS to migrate data pipelines from an on-premises environment into the AWS Cloud. The company currently uses a third-party tool in the on-premise...
Data Orchestration, Managed Services, Apache Airflow, Data Migration
- Question #120: Data Ingestion and Transformation
A retail company stores data from a product lifecycle management (PLM) application in an on-premises MySQL database. The PLM application frequently updates the database when trans...
Database Migration Service (DMS), Change Data Capture (CDC), On-premises Integration, Redshift
- Question #121: Data Store Management
A marketing company uses Amazon S3 to store clickstream data. The company queries the data at the end of each day by using a SQL JOIN clause on S3 objects that are stored in separa...
Amazon Athena, Serverless Querying, Data Lake Analytics, ACID Transactions
- Question #122: Data Ingestion and Transformation
A company wants to migrate data from an Amazon RDS for PostgreSQL DB instance in the eu-east-1 Region of an AWS account named Account_A. The company will migrate the data to an Am...
AWS DMS, Cross-account migration, Cross-region migration, Replication instance placement
- Question #123: Data Ingestion and Transformation
A company uses Amazon S3 as a data lake. The company sets up a data warehouse by using a multi-node Amazon Redshift cluster. The company organizes the data files in the data lake b...
Amazon Redshift, Data Ingestion, Performance Optimization, Manifest File
- Question #124: Data Ingestion and Transformation
A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format....
Kinesis Data Firehose, Data Transformation, Apache Parquet, Managed Services
- Question #125: Data Security and Governance
A company is using an AWS Transfer Family server to migrate data from an on-premises environment to AWS. Company policy mandates the use of TLS 1.2 or above to encrypt the data in...
AWS Transfer Family, TLS encryption, Security policy, Data in transit
- Question #126: Data Ingestion and Transformation
A company wants to migrate an application and an on-premises Apache Kafka server to AWS. The application processes incremental updates that an on-premises Oracle database sends to...
Apache Kafka, MSK Serverless, Migration Strategies, Managed Streaming
- Question #127: Data Ingestion and Transformation
A data engineer is building an automated extract, transform, and load (ETL) ingestion pipeline by using AWS Glue. The pipeline ingests compressed files that are in an Amazon S3 buc...
AWS Glue, ETL, Incremental processing, Job bookmarks
- Question #128: Data Ingestion and Transformation
A banking company uses an application to collect large volumes of transactional data. The company uses Amazon Kinesis Data Streams for real-time analytics. The company's applicatio...
Exactly-once delivery, Kinesis Data Streams, Data Deduplication, Streaming Data Pipelines
- Question #129: Data Store Management
A company stores logs in an Amazon S3 bucket. When a data engineer attempts to access several log files, the data engineer discovers that some files have been unintentionally delet...
S3 Versioning, Data Protection, Unintentional Deletion, Amazon S3
- Question #130: Data Ingestion and Transformation
A telecommunications company collects network usage data throughout each day at a rate of several thousand data points each second. The company runs an application to process the u...
Real-time data processing, Streaming analytics, Low-latency detection, Kinesis & Lambda
- Question #131: Data Ingestion and Transformation
A data engineer is processing and analyzing multiple terabytes of raw data that is in Amazon S3. The data engineer needs to clean and prepare the data. Then the data engineer needs...
Data preparation, ETL, Serverless data processing, Redshift data loading
- Question #132: Data Ingestion and Transformation
A company uses an AWS Lambda function to transfer files from a legacy SFTP environment to Amazon S3 buckets. The Lambda function is VPC enabled to ensure that all communications be...
AWS Lambda, VPC Endpoints, Amazon S3, Network Connectivity
- Question #133: Data Ingestion and Transformation
A company reads data from customer databases that run on Amazon RDS. The databases contain many inconsistent fields. For example, a customer record field that is named place_id in o...
Data Integration, ETL, Data Matching, AWS Glue
- Question #134: Data Ingestion and Transformation
A finance company receives data from third-party data providers and stores the data as objects in an Amazon S3 bucket. The company ran an AWS Glue crawler on the objects to create...
AWS Glue Crawler, S3 Data Lake, Data Cataloging, Schema Inference
- Question #135: Data Store Management
An application consumes messages from an Amazon Simple Queue Service (Amazon SQS) queue. The application experiences occasional downtime. As a result of the downtime, messages with...
SQS, Message Retention, Dead-Letter Queue, Data Durability
- Question #136: Data Operations and Support
A company is creating near real-time dashboards to visualize time series data. The company ingests data into Amazon Managed Streaming for Apache Kafka (Amazon MSK). A customized da...
Near real-time analytics, Data visualization, OpenSearch Service, Low latency
- Question #137: Data Store Management
A data engineer maintains a materialized view that is based on an Amazon Redshift database. The view has a column named load_date that stores the date when each row was loaded. The...
Redshift, Materialized Views, Storage Management, SQL Commands
- Question #138: Data Ingestion and Transformation
A media company wants to use Amazon OpenSearch Service to analyze real-time data about popular musical artists and songs. The company expects to ingest millions of new data events e...
Kinesis Data Firehose, AWS Lambda, Data Ingestion, Real-time Data
- Question #139: Data Security and Governance
A company stores customer data tables that include customer addresses in an AWS Lake Formation data lake. To comply with new regulations, the company must ensure that users cannot...
AWS Lake Formation, Row-Level Security, Data Governance, Access Control
- Question #140: Data Security and Governance
A company has implemented a lake house architecture in Amazon Redshift. The company needs to give users the ability to authenticate into Redshift query editor by using a third-part...
Redshift authentication, Federated identity, Identity Provider (IdP), Security configuration
- Question #141: Data Ingestion and Transformation
A company currently uses a provisioned Amazon EMR cluster that includes general purpose Amazon EC2 instances. The EMR cluster uses EMR managed scaling between one and five task node...
EMR Cost Optimization, EC2 Instance Types, Apache Spark ETL, Resource Optimization
- Question #142: Data Ingestion and Transformation
A company uploads .csv files to an Amazon S3 bucket. The company's data platform team has set up an AWS Glue crawler to perform data discovery and to create the tables and schemas....
AWS Glue, Amazon Redshift, Upsert, Data Deduplication
- Question #143: Data Ingestion and Transformation
A company is using Amazon Redshift to build a data warehouse solution. The company is loading hundreds of files into a fact table that is in a Redshift cluster. The company wants t...
Amazon Redshift, Data Loading, COPY Command, Performance Optimization
- Question #144: Data Ingestion and Transformation
A company ingests data from multiple data sources and stores the data in an Amazon S3 bucket. An AWS Glue extract, transform, and load (ETL) job transforms the data and writes the...
AWS Glue, ETL, Record Linkage, Data Quality
- Question #145: Data Ingestion and Transformation
A data engineer is using an AWS Glue crawler to catalog data that is in an Amazon S3 bucket. The S3 bucket contains both .csv and .json files. The data engineer configured the crawl...
AWS Glue, Amazon Athena, Amazon S3, Query Optimization
- Question #146: Data Security and Governance
A data engineer set up an AWS Lambda function to read an object that is stored in an Amazon S3 bucket. The object is encrypted by an AWS KMS key. The data engineer configured the L...
KMS Encryption, S3 Access Control, IAM Permissions, Lambda Function
- Question #147: Data Operations and Support
A data engineer has implemented data quality rules in 1,000 AWS Glue Data Catalog tables. Because of a recent change in business requirements, the data engineer must edit the data...
AWS Glue Data Quality, AWS Lambda, API Automation, Operational Overhead
- Question #148: Data Operations and Support
Two developers are working on separate application releases. The developers have created feature branches named Branch A and Branch B by using a GitHub repository's master branch a...
Git, Version Control, Git Rebase, Branching Strategy
- Question #149: Data Store Management
A company stores employee data in Amazon Redshift. A table named Employee uses columns named Region ID, Department ID, and Role ID as a compound sort key. Which queries will MOST...
Redshift, Sort Keys, Query Optimization, Table Design
- Question #150: Data Ingestion and Transformation
A company receives test results from testing facilities that are located around the world. The company stores the test results in millions of 1 KB JSON files in an Amazon S3 bucket...
AWS Glue, Data Optimization, Small Files Problem, Dynamic Frames