DP-203 Exam Questions
311 real DP-203 exam questions with expert-verified answers and explanations. Page 3 of 7.
- Question #136
You are designing a streaming data solution that will ingest variable volumes of data. You need to ensure that you can change the partition count after creation. Which service shou...
- Question #137
You are designing a date dimension table in an Azure Synapse Analytics dedicated SQL pool. The date dimension table will be used by all the fact tables. Which distribution type sho...
- Question #138
You have an Azure data solution that contains an enterprise data warehouse in Azure Synapse Analytics named DW1. Several users execute ad hoc queries to DW1 concurrently. You regul...
- Question #139
You have an Azure Synapse Analytics dedicated SQL pool named Pool1 and a database named DB1. DB1 contains a fact table named Table1. You need to identify the extent of the data ske...
- Question #140
You are monitoring an Azure Stream Analytics job. You discover that the Backlogged Input Events metric is increasing slowly and is consistently non-zero. You need to ensure that th...
- Question #141
You are designing a star schema for a dataset that contains records of online orders. Each record includes an order date, an order due date, and an order ship date. You need to ens...
- Question #148Write Transact-SQL queries - specifically using conditional expressions (CASE/ELSE) to derive computed column values based on existing data, typically mapped to the 'Query Data Using T-SQL' or 'Implement Programmatic T-SQL Constructs' domain in Microsoft SQL/DP-900/70-761 certifications.
Drag and Drop Question You have the following table named Employees. You need to calculate the employee_type value based on the hire_date value. How should you complete the Transac...
CASE ExpressionConditional LogicT-SQLData Transformation - Question #149Design and Implement Data Storage / Query and Transform Data using Azure Synapse Serverless SQL Pools (DP-203 / Azure Data Engineer Associate)
Drag and Drop Question You have an Azure Synapse Analytics workspace named WS1. You have an Azure Data Lake Storage Gen2 container that contains JSON-formatted files in the followi...
Azure Synapse AnalyticsServerless SQL PoolOPENROWSETJSON Data Processing - Question #150Transform and query data using Apache Spark SQL - specifically reshaping tabular data with PIVOT and ensuring correct data types with CAST, typically aligned with Azure Databricks or DP-203 Data Engineering certification objectives.
Drag and Drop Question You have an Apache Spark DataFrame named temperatures. A sample of the data is shown in the following table. You need to produce the following table by using...
Spark SQLPIVOT transformationDataFrame reshapingType casting - Question #155Implement data security and encryption in Azure Synapse Analytics using Bring Your Own Key (BYOK) with customer-managed TDE protectors stored in Azure Key Vault
Drag and Drop Question You have an Azure Synapse Analytics SQL pool named Pool1 on a logical Microsoft SQL server named Server1. You need to implement Transparent Data Encryption (...
Transparent Data EncryptionAzure Synapse AnalyticsCustomer-Managed KeysAzure Key Vault - Question #156
Case Study 1 - Contoso, Ltd Overview Contoso, Ltd. is a clothing retailer based in Seattle. The company has 2,000 retail stores across the United States and an emerging online pres...
- Question #157Design and Implement Data Integration / Manage and Monitor Data Pipelines using Source Control and DevOps practices in Azure
Case Study 1 - Contoso, Ltd Overview Contoso, Ltd. is a clothing retailer based in Seattle. The company has 2,000 retail stores across the United States and an emerging online pres...
Azure Data FactoryCI/CD PipelineSource ControlGit Branching Strategy - Question #158
You build a data warehouse in an Azure Synapse Analytics dedicated SQL pool. Analysts write a complex SELECT query that contains multiple JOIN and CASE statements to transform data...
- Question #159
You have an Azure Synapse Analytics workspace named WS1 that contains an Apache Spark pool named Pool1. You plan to create a database named DB1 in Pool1. You need to ensure that wh...
- Question #160
You are planning a solution to aggregate streaming data that originates in Apache Kafka and is output to Azure Data Lake Storage Gen2. The developers who will implement the stream...
- Question #161
You are designing a financial transactions table in an Azure Synapse Analytics dedicated SQL pool. The table will have a clustered columnstore index and will include the following...
- Question #164
You have an Azure Data Lake Storage Gen2 account that contains two folders named Folder1 and Folder2. You use Azure Data Factory to copy multiple files from Folder1 to Folder2. You...
- Question #165
You are implementing a batch dataset in the Parquet format. Data files will be produced be using Azure Data Factory and stored in Azure Data Lake Storage Gen2. The files will be co...
- Question #166Design and Implement Data Storage – Configure and query external tables and data sources in Azure Synapse Analytics serverless SQL pools (DP-203 / Azure Data Engineer Associate)
Drag and Drop Question You need to build a solution to ensure that users can query specific files in an Azure Data Lake Storage Gen2 account from an Azure Synapse Analytics serverl...
Azure Synapse AnalyticsServerless SQL PoolAzure Data Lake Storage Gen2External Tables / PolyBase - Question #167
You are designing a data mart for the human resources (HR) department at your company. The data mart will contain employee information and employee transactions. From a source syst...
- Question #168Design and Implement Data Storage / Batch Data Ingestion using PolyBase in Azure Synapse Analytics
Drag and Drop Question You have data stored in thousands of CSV files in Azure Data Lake Storage Gen2. Each file has a header row followed by a properly formatted carriage return (...
PolyBaseAzure Synapse AnalyticsAzure Data Lake Storage Gen2External Tables - Question #171
You use Azure Stream Analytics to receive data from Azure Event Hubs and to output the data to an Azure Blob Storage account. You need to output the count of records received from...
- Question #175
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some q...
- Question #176
You have the following Azure Data Factory pipelines: Ingest Data from System1 Ingest Data from System2 Populate Dimensions Populate Facts Ingest Data from System1 and Ingest Data f...
- Question #177Design and Implement Data Storage / Configure data loading with PolyBase in Azure Synapse Analytics
Drag and Drop Question You are responsible for providing access to an Azure Data Lake Storage Gen2 account. Your user account has contributor access to the storage account, and you...
PolyBaseAzure Synapse AnalyticsAzure Data Lake Storage Gen2External Tables - Question #178
You are monitoring an Azure Stream Analytics job by using metrics in Azure. You discover that during the last 12 hours, the average watermark delay is consistently greater than the...
- Question #180
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some q...
- Question #181
You are designing an Azure Databricks cluster that runs user-defined local processes. You need to recommend a cluster configuration that meets the following requirements: Minimize...
- Question #183Design and implement data storage
Drag and Drop Question You are designing an Azure Data Lake Storage Gen2 structure for telemetry data from 25 million devices distributed across seven key geographical regions. Eac...
Azure Data Lake Storage Gen2Data PartitioningEvent HubsSynapse Serverless SQL Pools - Question #185
You are creating a new notebook in Azure Databricks that will support R as the primary language but will also support Scala and SQL. Which switch should you use to switch between l...
- Question #186
You have an Azure Data Factory pipeline that performs an incremental load of source data to an Azure Data Lake Storage Gen2 account. Data to be loaded is identified by a column nam...
- Question #187
You plan to build a structured streaming solution in Azure Databricks. The solution will count new events in five-minute intervals and report only events that arrive during the int...
- Question #188
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some q...
- Question #189
You are designing a security model for an Azure Synapse Analytics dedicated SQL pool that will support multiple companies. You need to ensure that users from each company can view...
- Question #190
You have a SQL pool in Azure Synapse that contains a table named dbo.Customers. The table contains a column name Email. You need to prevent nonadministrative users from seeing the...
- Question #191
You have an Azure Data Lake Storage Gen2 account named adls2 that is protected by a virtual network. You are designing a SQL pool in Azure Synapse that will use adls2 as a source....
- Question #193
You are designing an Azure Synapse solution that will provide a query interface for the data stored in an Azure Storage account. The storage account is only accessible from a virtu...
- Question #194
You are developing an application that uses Azure Data Lake Storage Gen2. You need to recommend a solution to grant permissions to a specific application for a limited time period....
- Question #197
You manage an enterprise data warehouse in Azure Synapse Analytics. Users report slow performance when they run commonly used queries. Users do not report performance changes for i...
- Question #198
You have an Azure Databricks resource. You need to log actions that relate to changes in compute for the Databricks resource. Which Databricks services should you log?
- Question #199
You are designing a highly available Azure Data Lake Storage solution that will include geo-zone- redundant storage (GZRS). You need to monitor for replication delays that can affe...
- Question #200
You have an Azure Synapse Analytics dedicated SQL pool. You run PDW_SHOWSPACEUSED('dbo.FactInternetSales'); and get the results shown in the following table. Which statement accura...
- Question #201
You have two fact tables named Flight and Weather. Queries targeting the tables will be based on the join between the following columns. You need to recommend a solution that maxim...
- Question #203
You are designing the folder structure for an Azure Data Lake Storage Gen2 account. You identify the following usage patterns: - Users will query data by using Azure Synapse Analyt...
- Question #204
You have the following Azure Data Factory pipelines: - Ingest Data from System 1 - Ingest Data from System2 - Populate Dimensions - Populate facts Ingest Data from System1 and Inge...
- Question #205
You are designing an Azure Synapse Analytics workspace. You need to recommend a solution to provide double encryption of all the data at rest. Which two components should you inclu...
- Question #206
You have an Azure Synapse Analytics dedicated SQL pool. You need to Create a fact table named Table1 that will store sales data from the last three years. The solution must be opti...
- Question #207
You are designing a folder structure for the files in an Azure Data Lake Storage Gen2 account. The account has one container that contains three years of data. You need to recommen...
- Question #208
You have an Azure Databricks workspace named workspace1 in the Standard pricing tier. Workspace1 contains an all-purpose cluster named cluster1. You need to reduce the time it take...
- Question #209
You have an Azure data factory. You need to examine the pipeline failures from the last 180 days. What should you use?