nerdexam
DatabricksDatabricks

CERTIFIED-DATA-ENGINEER-PROFESSIONAL · Question #58

CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #58: Real Exam Question with Answer & Explanation

The correct answer is A: All logic expressed in the notebook associated with tasks A and B will have been successfully. The query uses the CREATE TABLE USING DELTA syntax to create a Delta Lake table from an existing Parquet file stored in DBFS. The query also uses the LOCATION keyword to specify the path to the Parquet file as /mnt/finance_eda_bucket/tx_sales.parquet. By using the LOCATION keywor

Data Orchestration

Question

A Databricks job has been configured with 3 tasks, each of which is a Databricks notebook. Task A does not depend on other tasks. Tasks B and C run in parallel, with each having a serial dependency on task A. If tasks A and B complete successfully but task C fails during a scheduled run, which statement describes the resulting state?

Options

  • AAll logic expressed in the notebook associated with tasks A and B will have been successfully
  • BAll logic expressed in the notebook associated with tasks A and B will have been successfully
  • CAll logic expressed in the notebook associated with task A will have been successfully completed;
  • DBecause all tasks are managed as a dependency graph, no changes will be committed to the
  • EUnless all tasks complete successfully, no changes will be committed to the Lakehouse; because

Explanation

The query uses the CREATE TABLE USING DELTA syntax to create a Delta Lake table from an existing Parquet file stored in DBFS. The query also uses the LOCATION keyword to specify the path to the Parquet file as /mnt/finance_eda_bucket/tx_sales.parquet. By using the LOCATION keyword, the query creates an external table, which is a table that is stored outside of the default warehouse directory and whose metadata is not managed by Databricks. An external table can be created from an existing directory in a cloud storage system, such as DBFS or S3, that contains data files in a supported format, such as Parquet or CSV. The resulting state after running the second command is that an external table will be created in the storage container mounted to /mnt/finance_eda_bucket with the new name prod.sales_by_store. The command will not change any data or move any files in the storage container; it will only update the table reference in the metastore and create a new Delta transaction log for the renamed table.

Topics

#Databricks Jobs#Task Dependencies#Job Execution#Failure Handling

Community Discussion

No community discussion yet for this question.

Full CERTIFIED-DATA-ENGINEER-PROFESSIONAL PracticeBrowse All CERTIFIED-DATA-ENGINEER-PROFESSIONAL Questions