nerdexam
DatabricksDatabricks

CERTIFIED-DATA-ANALYST-ASSOCIATE · Question #76

CERTIFIED-DATA-ANALYST-ASSOCIATE Question #76: Real Exam Question with Answer & Explanation

The correct answer is A: Open-source formats. Open-source formats like Parquet (used by Delta Lake) allow data to be accessed by any compatible tool - Spark, Presto, Hive, etc. - without vendor lock-in, which is a fundamental architectural advantage over proprietary EDW storage formats like those used by Teradata or Oracle.

Question

What is an advantage of using a Delta Lake-based data lakehouse over classic enterprise data warehouse solutions?

Options

  • AOpen-source formats
  • BSchema enforcement
  • CACID transactions
  • DGeneric optimizations

Explanation

Open-source formats like Parquet (used by Delta Lake) allow data to be accessed by any compatible tool - Spark, Presto, Hive, etc. - without vendor lock-in, which is a fundamental architectural advantage over proprietary EDW storage formats like those used by Teradata or Oracle. Schema enforcement (B) is not a differentiator - traditional EDWs have enforced schemas for decades; Delta Lake actually adds schema enforcement to data lakes, but that doesn't make it an advantage over EDWs. ACID transactions (C) are similarly a long-standing feature of classic EDWs, so Delta Lake achieving parity here is not a lakehouse-specific advantage over them. "Generic optimizations" (D) is nonsensical as an advantage - it implies less specialization, which would be a weakness, not a strength.

Memory tip: Associate Delta Lake with the word open - open formats, open ecosystem, no lock-in. Whenever a question asks what a lakehouse does better than a proprietary EDW, think "open-source" first.

Community Discussion

No community discussion yet for this question.

Full CERTIFIED-DATA-ANALYST-ASSOCIATE PracticeBrowse All CERTIFIED-DATA-ANALYST-ASSOCIATE Questions