DATABRICKS-CERTIFIED-DATA-ENGINEER-ASSOCIATE Exam Questions
83 real DATABRICKS-CERTIFIED-DATA-ENGINEER-ASSOCIATE exam questions with expert-verified answers and explanations. Page 2 of 2.
- Question #54Data Management
A data engineer needs access to a table new_table, but they do not have the correct permissions. They can ask the table owner for permission, but they do not know who the table own...
Unity CatalogTable OwnershipData ExplorerAccess Control - Question #55Data Management
A new data engineering team team has been assigned to an ELT project. The new data engineering team will need full privileges on the table sales to fully manage the project. Which...
Access ControlSQL GRANTTable PrivilegesDatabricks Security - Question #56Databricks Lakehouse Platform
A data organization leader is upset about the data analysis team's reports being different from the data engineering team's reports. The leader believes the siloed nature of their...
Databricks LakehouseSingle Source of TruthData ConsistencyData Architecture - Question #57Deployment and Operations
Which of the following describes a scenario in which a data team will want to utilize cluster pools?
Cluster poolsPerformance optimizationJob executionResource management - Question #58Databricks Lakehouse Platform
Which of the following is hosted completely in the control plane of the classic Databricks architecture?
Databricks architectureControl planeData planeWeb application - Question #59Databricks Lakehouse Platform
Which of the following benefits of using the Databricks Lakehouse Platform is provided by Delta Lake?
Delta LakeBatch ProcessingStreaming ProcessingLakehouse Platform - Question #60Databricks Lakehouse Platform
Which of the following describes the storage organization of a Delta table?
Delta LakeStorage OrganizationTable ArchitectureFile Format - Question #61Data Management
Which of the following code blocks will remove the rows where the value in column age is greater than 25 from the existing Delta table my_table and save the updated table?
SQL DELETEData ManipulationDelta Lake DML - Question #62Data Management
A data engineer has realized that they made a mistake when making a daily update to a table. They need to use Delta time travel to restore the table to a version that is 3 days old...
Delta LakeTime TravelVACUUM commandData Retention - Question #63Deployment and Operations
Which of the following Git operations must be performed outside of Databricks Repos?
Databricks ReposGit operationsVersion controlBranch management - Question #64Databricks Lakehouse Platform
Which of the following data lakehouse features results in improved data quality over a traditional data lake?
Data LakehouseData QualityACID Transactions - Question #65Deployment and Operations
An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query. For the first week following the project's release, the manager wants the...
SQL Query SchedulingCost ManagementDatabricks SQL EndpointResource Management - Question #66Databricks Lakehouse Platform
A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when...
Databricks SQL EndpointPerformance TuningConcurrencyAuto Scaling - Question #67Deployment and Operations
A data engineer wants to schedule their Databricks SQL dashboard to refresh once per day, but they only want the associated SQL endpoint to be running when it is necessary. Which o...
Databricks SQL EndpointsCost OptimizationAuto-Stop FeatureResource Management - Question #68Monitoring and Logging
A data engineer has been using a Databricks SQL dashboard to monitor the cleanliness of the input data to an ELT job. The ELT job has its Databricks SQL query that returns the numb...
Databricks SQL AlertsWebhook NotificationsData Quality MonitoringELT Monitoring - Question #69Monitoring and Logging
A single Job runs two notebooks as two separate tasks. A data engineer has noticed that one of the notebooks is running slowly in the Job's current run. The data engineer asks a te...
Databricks JobsJob MonitoringTroubleshootingUI Navigation - Question #70Deployment and Operations
A data engineer has a Job with multiple tasks that runs nightly. Each of the tasks runs slowly because the clusters take a long time to start. Which of the following actions can th...
Cluster PoolsJob OptimizationCompute ManagementPerformance Tuning - Question #71Data Management
A new data engineering team team. has been assigned to an ELT project. The new data engineering team will need full privileges on the database customers to fully manage the project...
Access ControlSQL GRANTPermissionsData Security - Question #72Databricks Lakehouse Platform
A new data engineering team has been assigned to work on a project. The team will need access to database customers in order to see what tables already exist. The team has its own...
SQL PermissionsAccess ControlDatabricks SecurityDatabase Management - Question #74ELT with Spark SQL and Python
A data engineer has a Python variable table_name that they would like to use in a SQL query. They want to construct a Python code block that will run the query using table_name. Th...
Spark SQLPython in SparkSparkSessionDatabricks Notebooks - Question #75Databricks Lakehouse Platform
A data engineer has created a new database using the following command: CREATE DATABASE IF NOT EXISTS customer360; In which of the following locations will the customer360 database...
Database StorageDBFSHive Metastore DefaultsCREATE DATABASE Command - Question #76Data Management
A data engineer is attempting to drop a Spark SQL table my_table and runs the following command: DROP TABLE IF EXISTS my_table; After running this command, the engineer notices tha...
Spark SQLTable ManagementManaged Tables - Question #78ELT with Spark SQL and Python
In which of the following scenarios should a data engineer use the MERGE INTO command instead of the INSERT INTO command?
MERGE INTOINSERT INTODelta LakeUPSERT - Question #80Data Management
A data engineer needs to create a table in Databricks using data from a CSV file at location /path/to/csv. They run the following command: Which of the following lines of code fill...
SQLCREATE TABLECSVExternal Tables - Question #81Databricks Lakehouse Platform
A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or version their project using Databricks Repos. Which of the following is an advanta...
Databricks ReposVersion ControlNotebooksCollaboration - Question #82Data Management
A data engineer has left the organization. The data team needs to transfer ownership of the data engineer's Delta tables to a new data engineer. The new data engineer is the lead e...
Databricks AdministrationDelta Table OwnershipPermissionsWorkspace Administrator - Question #83Databricks Lakehouse Platform
A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure...
PySparkDataFramesData AccessDelta Lake - Question #84Data Management
Which of the following commands will return the location of database customer360?
SQL CommandsDatabase MetadataDatabricks SQLDatabase Location - Question #85Data Management
A data engineer wants to create a new table containing the names of customers that live in France. They have written the following command: A senior data engineer mentions that it...
SQL DDLTable PropertiesMetadata Management - Question #86ELT with Spark SQL and Python
Which of the following benefits is provided by the array functions from Spark SQL?
Spark SQL functionsArray functionsNested dataSemi-structured data - Question #87Data Management
Which of the following commands can be used to write data into a Delta table while avoiding the writing of duplicate records?
Delta LakeMERGE statementUpsertData manipulation - Question #89ELT with Spark SQL and Python
A data analyst has a series of queries in a SQL program. The data analyst wants this program to run every day. They only want the final query in the program to run on Sundays. They...
PySparkPython Control FlowConditional ExecutionJob Orchestration - Question #90ELT with Spark SQL and Python
A data engineer runs a statement every day to copy the previous day's sales into the table transactions. Each day's sales are in their own file in the location "/transactions/raw"....
COPY INTOData IngestionIdempotencyDelta Lake