DAS-C01 Exam Questions
190 real DAS-C01 exam questions with expert-verified answers and explanations. Page 1 of 4.
- Question #1Processing
A company has developed an Apache Hive script to batch process data stared in Amazon S3. The script needs to run once every day and store the output in Amazon S3. The company teste...
EMRLambdaCost OptimizationBatch Processing - Question #2Processing
A company wants to improve the data load time of a sales data dashboard. Data has been collected as .csv files and stored within an Amazon S3 bucket that is partitioned by date. Th...
Redshift data loadingPerformance optimizationRedshift COPY commandS3 integration - Question #3Processing
A mortgage company has a microservice for accepting payments. This microservice uses the Amazon DynamoDB encryption client with AWS KMS managed keys to encrypt the sensitive data b...
DynamoDB StreamsAWS LambdaKMS Encryption ClientData Integration - Question #4Collection
A company is building a data lake and needs to ingest data from a relational database that has time-series data. The company wants to use managed services to accomplish this. The p...
AWS GlueData IngestionIncremental Data LoadingRelational Databases - Question #5Security
An Amazon Redshift database contains sensitive user data. Logging is necessary to meet compliance requirements. The logs must contain database authentication attempts, connections,...
RedshiftAudit LoggingComplianceSecurity Logging - Question #6Collection
A company that monitors weather conditions from remote construction sites is setting up a solution to collect temperature data from the following two weather stations. Station A, w...
Kinesis Data StreamsPartition KeysSharding StrategyThroughput Optimization - Question #7Storage and Data Management
Once a month, a company receives a 100 MB .csv file compressed with gzip. The file contains 50,000 property listing records and is stored in Amazon S3 Glacier. The company needs it...
S3 SelectAmazon S3 GlacierCost OptimizationData Querying - Question #8Storage and Data Management
A retail company is building its data warehouse solution using Amazon Redshift. As a part of that effort, the company is loading hundreds of files into the fact table created in it...
Amazon RedshiftData LoadingCOPY commandPerformance Optimization - Question #9Processing
A data analyst is designing a solution to interactively query datasets with SQL using a JDBC connection. Users will join data stored in Amazon S3 in Apache ORC format with data sto...
Federated QueryInteractive SQLData FreshnessPresto/EMR - Question #10Analysis and Visualization
A company developed a new elections reporting website that uses Amazon Kinesis Data Firehose to deliver full logs from AWS WAF to an Amazon S3 bucket. The company is now seeking a...
AWS GlueData CatalogServerless AnalyticsCost Optimization - Question #11Security
A large company has a central data lake to run analytics across different departments. Each department uses a separate AWS account and stores its data in an Amazon S3 bucket in tha...
AWS Lake FormationData Lake SecurityCross-Account AccessFine-Grained Access Control - Question #12Collection
A company wants to improve user satisfaction for its smart home system by adding more features to its recommendation engine. Each sensor asynchronously pushes its nested JSON data...
Kinesis Data StreamsKinesis Producer Library (KPL)Real-time data ingestionLatency optimization - Question #13Analysis and Visualization
A global company has different sub-organizations, and each sub-organization sells its products and services in various countries. The company's senior leadership wants to quickly i...
QuickSightAthenaData VisualizationData Lake Query - Question #14Storage and Data Management
A company has 1 million scanned documents stored as image files in Amazon S3. The documents contain typewritten application forms with information including the applicant first nam...
Amazon OpenSearch ServiceMetadata IndexingFull-Text SearchData Lake Query - Question #15Collection
A mobile gaming company wants to capture data from its gaming app and make the data available for analysis immediately. The data record size will be approximately 20 KB. The compan...
Kinesis Data StreamsData IngestionReal-time StreamingAPI Throughput - Question #16Storage and Data Management
A marketing company wants to improve its reporting and business intelligence capabilities. During the planning phase, the company interviewed the relevant stakeholders and discover...
Data WarehousingData LakesCost OptimizationAmazon Redshift - Question #17Processing
A media company wants to perform machine learning and analytics on the data residing in its Amazon S3 data lake. There are two data transformation requirements that will enable the...
AWS GlueAmazon EMRData TransformationData Lake ETL - Question #18Collection
A hospital uses wearable medical sensor devices to collect data from patients. The hospital is architecting a near-real-time solution that can ingest the data securely at scale. Th...
Kinesis Data FirehoseAWS LambdaData TransformationStream Ingestion - Question #19Processing
A company is migrating its existing on-premises ETL jobs to Amazon EMR. The code consists of a series of jobs written in Java. The company needs to reduce overhead for the system a...
Amazon EMRCustom AMIRoot volume encryptionAWS CloudFormation - Question #20Processing
A transportation company uses IoT sensors attached to trucks to collect vehicle data for its global delivery fleet. The company currently sends the sensor data in small .csv files...
Data Format OptimizationApache ParquetAWS Glue ETLRedshift Data Loading - Question #21Analysis and Visualization
An online retail company with millions of users around the globe wants to improve its ecommerce analytics capabilities. Currently, clickstream data is uploaded directly to Amazon S...
Streaming IngestionReal-time AnalyticsOpenSearch ServiceKibana - Question #22Processing
A company is streaming its high-volume billing data (100 MBps) to Amazon Kinesis Data Streams. A data analyst partitioned the data on account_id to ensure that all records belongin...
Kinesis Data StreamsShard ReshardingConsumer Data OrderKinesis Client Library - Question #23Processing
A media analytics company consumes a stream of social media posts. The posts are sent to an Amazon Kinesis data stream partitioned on user_id. An AWS Lambda function retrieves the...
Kinesis Data StreamsLambda ConsumersStream Processing PerformanceParallelization Factor - Question #24Collection
A company launched a service that produces millions of messages every day and uses Amazon Kinesis Data Streams as the streaming service. The company uses the Kinesis SDK to write d...
Kinesis Data StreamsThrottlingScalabilityPartitioning - Question #25Processing
A smart home automation company must efficiently ingest and process messages from various connected devices and sensors. The majority of these messages are comprised of a large num...
Small File ProblemAWS Glue ETLEMR OptimizationData Lake Performance - Question #26Processing
A large financial company is running its ETL process. Part of this process is to move data from Amazon S3 into an Amazon Redshift cluster. The company wants to use the most cost-ef...
Redshift Data LoadingETL Best PracticesCost OptimizationAmazon S3 - Question #27Storage and Data Management
A university intends to use Amazon Kinesis Data Firehose to collect JSON-formatted batches of water quality readings in Amazon S3. The readings are from 50 sensors scattered across...
AthenaS3 Data FormatsData PartitioningCost Optimization - Question #28Processing
A company ingests a large set of clickstream data in nested JSON format from different sources and stores it in Amazon S3. Data Analysts need to analyze this data in combination wi...
AWS GlueETLNested JSONData Transformation - Question #29Processing
A Publisher website captures user activity and sends clickstream data to Amazon Kinesis Data Streams. The Publisher wants to design a cost-effective solution to process the data to...
Clickstream AnalyticsSession TrackingData ModelingKinesis Data Streams - Question #30Analysis and Visualization
A financial services company needs to aggregate daily stock trade data from the exchanges into a data store. The company requires that data be streamed directly into the data store...
Data StreamingData WarehousingAnalyticsBusiness Intelligence - Question #31Security
A financial company hosts a data lake in Amazon S3 and a data warehouse on an Amazon Redshift cluster. The company uses Amazon QuickSight to build dashboards and wants to secure ac...
Active Directory IntegrationSingle Sign-On (SSO)Amazon QuickSight AuthenticationHybrid Cloud Security - Question #32Storage and Data Management
A real estate company has a mission-critical application using Apache HBase in Amazon EMR. Amazon EMR is configured with a single master node. The company has over 5 TB of data sto...
EMREMRFSHigh AvailabilityS3 Storage - Question #33Collection
A software company hosts an application on AWS, and new features are released weekly. As part of the application testing process, a solution must be developed that analyzes logs fr...
Log CollectionReal-time StreamingKinesisEC2 Instances - Question #34Processing
A data analyst is using AWS Glue to organize, cleanse, validate, and format a 200 GB dataset. The data analyst triggered the job to run with the Standard worker type. After 3 hours...
AWS GlueJob Performance TuningCloudWatch MetricsDPU Optimization - Question #35Storage and Data Management
A company has a business unit uploading .csv files to an Amazon S3 bucket. The company's data platform team has set up an AWS Glue crawler to do discovery, and create tables and sc...
AWS GlueAmazon RedshiftData Ingestion PatternsIdempotent Loads - Question #36Storage and Data Management
A streaming application is reading data from Amazon Kinesis Data Streams and immediately writing the data to an Amazon S3 bucket every 10 seconds. The application is reading data f...
Athena performanceS3 data lake optimizationSmall files problemData compaction - Question #37Storage and Data Management
A company is currently using Amazon DynamoDB as the database for a user support application. The company is developing a new version of the application that will store a PDF file f...
DynamoDBAmazon S3Object StorageCost Optimization - Question #38Collection
A company needs to implement a near-real-time fraud prevention feature for its ecommerce site. User and order details need to be delivered to an Amazon SageMaker endpoint to flag s...
Streaming DataReal-time ProcessingLow LatencyData Ingestion - Question #39Storage and Data Management
A company uses Amazon Elasticsearch Service (Amazon ES) to store and analyze its website clickstream data. The company ingests 1 TB of data daily using Amazon Kinesis Data Firehose...
Elasticsearch PerformanceShard ManagementResource OptimizationCluster Sizing - Question #40Storage and Data Management
A manufacturing company has been collecting IoT sensor data from devices on its factory floor for a year and is storing the data in Amazon Redshift for daily analysis. A data analy...
Redshift data lifecycleData tieringCost optimizationRedshift Spectrum - Question #41Storage and Data Management
An insurance company has raw data in JSON format that is sent without a predefined schedule through an Amazon Kinesis Data Firehose delivery stream to an Amazon S3 bucket. An AWS G...
AWS Glue CrawlerS3 Event NotificationsAWS LambdaData Catalog - Question #42Storage and Data Management
A company that produces network devices has millions of users. Data is collected from the devices on an hourly basis and stored in an Amazon S3 data lake. The company runs analyses...
Data LakeColumnar StorageData PartitioningPerformance Optimization - Question #43Security
A banking company is currently using an Amazon Redshift cluster with dense storage (DS) nodes to store sensitive data. An audit found that the cluster is unencrypted. Compliance re...
Redshift EncryptionHSM IntegrationData MigrationCompliance - Question #44Collection
A company is planning to do a proof of concept for a machine earning (ML) project using Amazon SageMaker with a subset of existing on-premises data hosted in the company's 3 TB dat...
AWS DMSData IngestionData CurationData Warehouse Migration - Question #45Processing
A media company is migrating its on-premises legacy Hadoop cluster with its associated data processing scripts and workflow to an Amazon EMR environment running the latest Hadoop r...
EMRHadoop MigrationJob SubmissionJava Applications - Question #46Processing
An online retail company wants to perform analytics on data in large Amazon S3 objects using Amazon EMR. An Apache Spark job repeatedly queries the same data to populate an analyti...
Spark PerformanceS3 SelectEMR OptimizationData Caching - Question #47Security
A US-based sneaker retail company launched its global website. All the transaction data is stored in Amazon RDS and curated historic transaction data is stored in Amazon Redshift i...
QuickSightRedshiftSecurity GroupsNetwork Security - Question #48Processing
An airline has .csv-formatted data stored in Amazon S3 with an AWS Glue Data Catalog. Data analysts want to join this data with call center data stored in Amazon Redshift as part o...
Redshift SpectrumServerless QueryingQuery OffloadingAWS Glue Data Catalog - Question #49Analysis and Visualization
A data analyst is using Amazon QuickSight for data visualization across multiple datasets generated by applications. Each application stores files within a separate Amazon S3 bucke...
QuickSight PermissionsS3 Data AccessSPICE ImportData Visualization Setup - Question #50Analysis and Visualization
A team of data scientists plans to analyze market trend data for their company's new investment strategy. The trend data comes from five different data sources in large volumes. Th...
Kinesis Data StreamsKinesis Data AnalyticsKinesis Data FirehoseReal-time processing