DP-203 Exam Questions
311 real DP-203 exam questions with expert-verified answers and explanations. Page 5 of 7.
- Question #267
Which one of the following T-SQL commands is useful for checking disk space usage and data skew for tables in a SQL data warehouse database?
- Question #268
Bryan needs an init script that runs a bootstrap script during Databricks Spark driver or worker node startup. Which kind of init script can he choose?
- Question #269
The complex event processing streaming solution that Jeffrey is working on for the IoT platform is a hybrid cloud platform where a few data sources are transformed into on-premises Big...
- Question #270
Nicole is working on migrating on-premises SQL Server databases to Azure SQL Data Warehouse (Synapse dedicated SQL pools) tables. The tables of the dedicated SQL pools of Synapse A...
- Question #271
Which of the following five kinds of log types can Phil select for storing Databricks Diagnostic logs?
- Question #273: Develop data processing
You have an Azure Databricks workspace and an Azure Data Lake Storage Gen2 account named storage1. New files are uploaded daily to storage1. Incrementally process new files as they...
Tags: Azure Data Factory, Databricks, Incremental Data Processing, Structured Streaming
- Question #275: Design and implement data storage
You are implementing a star schema in an Azure Synapse Analytics dedicated SQL pool. You plan to create a table named DimProduct. DimProduct must be a Type 3 slowly changing dimens...
Tags: Slowly Changing Dimensions, Type 3 SCD, Dimension Modeling, Azure Synapse Analytics
- Question #277: Design and implement data engineering workloads using Azure Databricks, including transforming semi-structured data (JSON) into structured tabular formats using PySpark DataFrame APIs - typically aligned with the 'Implement data transformation' or 'Process and serve data' domain of the DP-203 Azure Data Engineer Associate certification.
Drag and Drop Question: You use PySpark in Azure Databricks to parse the following JSON input. You need to output the data in the following tabular format. How should you complete t...
Tags: PySpark, Azure Databricks, JSON Parsing, DataFrame Operations, explode() function
- Question #279
You have an Azure Synapse Analytics dedicated SQL pool that contains a table named Table1. Table1 contains the following: - One billion rows - A clustered columnstore index - A has...
- Question #280
You have an Azure Databricks workspace that contains a Delta Lake dimension table named Table1. Table1 is a Type 2 slowly changing dimension (SCD) table. You need to apply updates...
- Question #281
You are designing an Azure Data Lake Storage solution that will transform raw JSON files for use in an analytical workload. You need to recommend a format for the transformed files...
- Question #282
You are designing a dimension table in an Azure Synapse Analytics dedicated SQL pool. You need to create a surrogate key for the table. The solution must provide the fastest query...
- Question #284
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some q...
- Question #285
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some q...
- Question #286: Design and implement data storage and processing pipelines - specifically managing deployment and scheduling of Azure Synapse pipelines using Git-based source control and the publish workflow (DP-203 / Azure Data Engineer Associate)
Drag and Drop Question: You have an Azure subscription that contains an Azure Synapse Analytics workspace named workspace1. Workspace1 connects to an Azure DevOps repository named r...
Tags: Azure Synapse Analytics, CI/CD Pipeline, Trigger Management, Azure DevOps Integration
- Question #288
You have an Azure subscription that contains an Azure Synapse Analytics dedicated SQL pool named SQLPool1. SQLPool1 is currently paused. You need to restore the current state of SQ...
- Question #289
You have an Azure Data Factory pipeline named Pipeline1. Pipeline1 contains a copy activity that sends data to an Azure Data Lake Storage Gen2 account. Pipeline1 is executed by a s...
- Question #290
You have an Azure Data Factory pipeline named pipeline1 that is invoked by a tumbling window trigger named Trigger1. Trigger1 has a recurrence of 60 minutes. You need to ensure tha...
- Question #292
You have an Azure subscription that contains an Azure Synapse Analytics dedicated SQL pool named Pool1. Pool1 receives new data once every 24 hours. You have the following function...
- Question #293
You are creating an Apache Spark job in Azure Databricks that will ingest JSON-formatted data. You need to convert a nested JSON string into a DataFrame that will contain multiple...
- Question #294: Implement and manage storage security in Azure, including configuring access control for Azure Data Lake Storage Gen2 using ACLs and RBAC roles (Azure AZ-104 / DP-203 - Secure and Monitor Data Storage)
Drag and Drop Question: You have an Azure subscription that contains an Azure Data Lake Storage Gen2 account named storage1. Storage1 contains a container named container1. Containe...
Tags: Azure Data Lake Storage Gen2, ACL Permissions, Least Privilege, Azure RBAC
- Question #298: Design and Implement Data Storage Security / Manage Authorization for Azure Storage
Drag and Drop Question: You have an Azure Data Lake Storage Gen2 account named storage1. You need to recommend a solution for accessing the content in storage1. The solution must m...
Tags: Azure Data Lake Storage Gen2, RBAC, Access Control Lists, Azure AD Authentication
- Question #299
A company purchases IoT devices to monitor manufacturing machinery. The company uses an Azure IoT Hub to communicate with the IoT devices. The company must be able to monitor the d...
- Question #300
A company purchases IoT devices to monitor manufacturing machinery. The company uses an Azure IoT Hub to communicate with the IoT devices. The company must be able to monitor the d...
- Question #301
You have an Azure Synapse Analytics dedicated SQL pool named Pool1. Pool1 contains a fact table named Table1. You need to identify the extent of the data skew in Table1. What shoul...
- Question #302
You have an Azure Synapse Analytics dedicated SQL pool named Pool1 that contains a table named Sales. Sales has row-level security (RLS) applied. RLS uses the following predicate f...
- Question #304
You have an Azure SQL database named DB1 and an Azure Data Factory data pipeline named pipeline. From Data Factory, you configure a linked service to DB1. In DB1, you create a stor...
- Question #305
You have a Microsoft Purview account. The Lineage view of a CSV file is shown in the following exhibit. How is the data for the lineage populated?
- Question #306
You have an Azure Synapse Analytics dedicated SQL pool named pool1. You plan to implement a star schema in pool and create a new table named DimCustomer by using the following code...
- Question #308
You have an Azure subscription that contains an Azure Data Lake Storage Gen2 account named account1 and an Azure Synapse Analytics workspace named workspace1. You need to create an...
- Question #310
You have an Azure Databricks workspace and an Azure Data Lake Storage Gen2 account named storage1. New files are uploaded daily to storage1. You need to recommend a solution that c...
- Question #311
You have an Azure subscription that contains the resources shown in the following table. You need to read the TSV files by using ad-hoc queries and the OPENROWSET function. The sol...
- Question #312
You have an Azure Synapse Analytics dedicated SQL pool. You plan to create a fact table named Table1 that will contain a clustered columnstore index. You need to optimize data comp...
- Question #313
You have an Azure Synapse Analytics dedicated SQL pool that contains a table named DimSalesPerson. DimSalesPerson contains the following columns: - RepSourceID - SalesRepID - First...
- Question #317: Design and Implement Data Storage / Query and Transform Data using Azure Synapse Analytics Serverless SQL Pools (DP-203: Data Engineering on Microsoft Azure)
Drag and Drop Question: You have an Azure Synapse Analytics serverless SQL pool. You have an Azure Data Lake Storage account named adls1 that contains a public container named conta...
Tags: Azure Synapse Analytics, Serverless SQL Pool, OPENROWSET, Data Lake Storage Query
- Question #318
You have an Azure Data Lake Storage Gen2 account named storage1. You plan to implement query acceleration for storage1. Which two file types support query acceleration? Each correc...
- Question #319
You have an Azure subscription that contains the resources shown in the following table. You need to read the files in storage1 by using ad-hoc queries and the OPENROWSET function....
- Question #321: Design and implement a data storage solution - specifically understanding and applying Slowly Changing Dimension (SCD) types in a data warehouse context (DP-203 / Azure Data Engineer or similar certification)
Drag and Drop Question: You have a data warehouse. You need to implement a slowly changing dimension (SCD) named Product that will include three columns named ProductName, ProductCo...
Tags: Slowly Changing Dimensions, Data Warehousing, Dimensional Modeling, SCD Types
- Question #323: Design and Implement Data Storage and Processing Pipelines with Source Control Integration in Azure Synapse Analytics
Drag and Drop Question: You have a project in Azure DevOps that contains a repository named Repo1. Repo1 contains a branch named main. You create a new Azure Synapse workspace named...
Tags: Azure Synapse Analytics, Git Integration, Source Control, Azure DevOps Repositories
- Question #324
You have an Azure subscription that contains an Azure SQL database named DB1 and a storage account named storage1. The storage1 account contains a file named File1.txt. File1.txt c...
- Question #325
You have an Azure data factory that connects to a Microsoft Purview account. The data factory is registered in Microsoft Purview. You update a Data Factory pipeline. You need to en...
- Question #326
You have an Azure subscription that contains a Microsoft Purview account named MP1, an Azure data factory named DF1, and a storage account named storage1. MP1 is configured to scan...
- Question #328: Design and Implement Data Storage - Managing and securing data in Azure Synapse Analytics dedicated SQL pools, including transactional integrity during batch data loading operations (DP-203 / Azure Data Engineer Associate)
Drag and Drop Question: You are batch loading a table in an Azure Synapse Analytics dedicated SQL pool. You need to load data from a staging table to the target table. The solution...
Tags: Azure Synapse Analytics, Dedicated SQL Pool, Distributed Transactions, Error Handling and Rollback
- Question #331
You use Azure Data Factory to create data pipelines. You are evaluating whether to integrate Data Factory and GitHub for source and version control. What are two advantages of the...
- Question #332: Design and Implement Data Storage and Processing Pipelines / Implement Azure Synapse Analytics Workspace Source Control and DevOps
Drag and Drop Question: You have an Azure Synapse Analytics workspace named Workspace1. You perform the following changes: - Implement source control for Workspace1. - Create a bran...
Tags: Azure Synapse Analytics, Source Control, CI/CD Pipeline, Branch Management
- Question #333
You have two Azure Blob Storage accounts named account1 and account2. You plan to create an Azure Data Factory pipeline that will use scheduled intervals to replicate newly created...
- Question #334
You have an Azure Data Factory pipeline named pipeline1 that contains a data flow activity named activity1. You need to run pipeline1. Which runtime will be used to run activity1?
- Question #336
You have an Azure data factory named ADF1 and an Azure Synapse Analytics workspace that contains a pipeline named SynPipeLine1. SynPipeLine1 includes a Notebook activity. You creat...
- Question #339
You have an Azure Synapse Analytics dedicated SQL pool. You need to create a pipeline that will execute a stored procedure in the dedicated SQL pool and use the returned result set...
- Question #340
You have an Azure SQL database named DB1 and an Azure Data Factory data pipeline named pipeline1. From Data Factory, you configure a linked service to DB1. In DB1, you create a sto...