CERTIFIED-DATA-ANALYST-ASSOCIATE Practice Questions
94 real CERTIFIED-DATA-ANALYST-ASSOCIATE exam questions with expert-verified answers and explanations. Page 1 of 2.
- Question #1
A data analyst created and is the owner of the managed table my_table. They now want to change ownership of the table to a single other user using Data Explorer. Which of the follo...
- Question #2
A data analyst has a managed table table_name in database database_name. They would now like to remove the table from the database and all of the data files associated with the tab...
- Question #4
A data analyst runs the following command: INSERT INTO stakeholders.suppliers TABLE stakeholders.new_suppliers; What is the result of running this command?
- Question #5
A data engineer is working with a nested array column products in table transactions. They want to expand the table so each unique item in products for each row has its own row whe...
- Question #6
A data analysis team is working with the table_bronze SQL table as a source for one of its most complex projects. A stakeholder of the project notices that some of the downstream d...
- Question #9
In which of the following situations should a data analyst use higher-order functions?
- Question #10
Consider the following two statements: Statement 1: Statement 2: Which of the following describes how the result sets will differ for each statement when they are run in Databricks...
- Question #11
A data analyst has created a user-defined function using the following line of code: CREATE FUNCTION price(spend DOUBLE, units DOUBLE) RETURNS DOUBLE RETURN spend / units; Which of...
- Question #12
A data analyst has been asked to count the number of customers in each region and has written the following query: If there is a mistake in the query, which of the following descri...
- Question #14
Which of the following is a benefit of Databricks SQL using ANSI SQL as its standard SQL dialect?
- Question #15
How can a data analyst determine if query results were pulled from the cache?
- Question #16
Which of the following statements about a refresh schedule is incorrect?
- Question #17
A data analyst creates a Databricks SQL Query where the result set has the following schema: region STRING number_of_customer INT When the analyst clicks on the "Add visualization"...
- Question #18
A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Data...
- Question #19
A data analyst has been asked to provide a list of options on how to share a dashboard with a client. It is a security requirement that the client does not gain access to any other...
- Question #20
A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform. Which of the following projects should be complete...
- Question #21
A data organization has a team of engineers developing data pipelines following the medallion architecture using Delta Live Tables. While the data analysis team working on a projec...
- Question #22
Which of the following statements describes descriptive statistics?
- Question #23
In which of the following situations will the mean value and median value of variable be meaningfully different?
- Question #24
A data analyst is working with gold-layer tables to complete an ad-hoc project. A stakeholder has provided the analyst with an additional dataset that can be used to augment the go...
- Question #25
A manufacturing company wants to use data from sensors installed on the machinery to continually monitor the performance of its production line. Which of the following Databricks S...
- Question #26
A data analyst has been asked to create a Databricks SQL query that will summarize sales data by product category and month. Which SQL function can you use to accomplish this?
- Question #27
A data analyst of a large online retailer wants to integrate Databricks SQL with Partner Connect to obtain real-time data on customer behavior from a social media platform. Which o...
- Question #28
A company needs to analyze a large amount of data stored in its Hadoop cluster. Which of the following best describes the benefit of using Databricks SQL with a Hadoop cluster?
- Question #29
A Data analyst has been tasked with optimizing a Databricks SQL query for a large dataset. What should you consider when trying to improve query performance?
- Question #30
Which of the following layers of the medallion architecture is most commonly used by data analysts?
- Question #31
A data analyst has recently joined a new team that uses Databricks SQL, but the analyst has never used Databricks before. The analyst wants to know where in Databricks SQL they can...
- Question #32
Which of the following describes how Databricks SQL should be used in relation to other business intelligence (BI) tools like Tableau, Power BI, and looker?
- Question #33
Which of the following approaches can be used to connect Databricks to Fivetran for data ingestion?
- Question #34
Data professionals with varying titles use the Databricks SQL service as the primary touchpoint with the Databricks Lakehouse Platform. However, some users will use other services...
- Question #35
A data analyst has set up a SQL query to run every four hours on a SQL endpoint, but the SQL endpoint is taking too long to start up with each run. Which of the following change ca...
- Question #36
A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The microbatches are triggered every minut...
- Question #37
A data analyst has been asked to produce a visualization that shows the flow of users through a website. Which of the following is used for visualizing this type of flow?
- Question #38
An analyst writes a query that contains a query parameter. They then add an area chart visualization to the query. While adding the area chart visualization to a dashboard, the ana...
- Question #39
A data analyst has been asked to configure an alert for a query that returns the income in the accounts_receivable table for a date range. The date range is configurable using a Da...
- Question #40
Which of the following statements about adding visual appeal to visualizations in the Visualization Editor is incorrect?
- Question #41
Which of the following approaches can be used to ingest data directly from cloud-based object storage?
- Question #42
A data analyst wants to create a dashboard with three main sections: Development, Testing, and Production. They want all three sections on the same dashboard, but they want to clea...
- Question #43
A data analyst needs to use the Databricks Lakehouse Platform to quickly create SQL queries and data visualizations. It is a requirement that the compute resources in the platform...
- Question #44
A data analyst is attempting to drop a table my_table. The analyst wants to delete all table metadata and data. They run the following command: DROP TABLE IF EXISTS my_table; While...
- Question #45
After running DESCRIBE EXTENDED accounts.customers;, the following was returned: Now, a data analyst runs the following command: DROP accounts.customers; Which of the following des...
- Question #46
Which of the following should data analysts consider when working with personally identifiable information (PII) data?
- Question #47
Delta Lake stores table data as a series of data files, but it also stores a lot of other information. Which of the following is stored alongside data files when using Delta Lake?
- Question #48
Which of the following is an advantage of using a Delta Lake-based data lakehouse over common data lake solutions?
- Question #49
Which of the following benefits of using Databricks SQL is provided by Data Explorer?
- Question #50
The stakeholders.customers table has 15 columns and 3,000 rows of data. The following command is run: After running SELECT * FROM stakeholders.eur_customers, 15 rows are returned....
- Question #51
Which statement about subqueries is correct?
- Question #52
What describes Partner Connect in Databricks?
- Question #53
A data engineering team has created a Structured Streaming pipeline that processes data in micro- batches and populates gold-level tables. The microbatches are triggered every 10 m...
- Question #54
What does Partner Connect do when connecting Power Bl and Tableau?