CERTIFIED-DATA-ANALYST-ASSOCIATE · Question #76
CERTIFIED-DATA-ANALYST-ASSOCIATE Question #76: Real Exam Question with Answer & Explanation
The correct answer is A: Open-source formats. Open-source formats like Parquet (used by Delta Lake) allow data to be accessed by any compatible tool - Spark, Presto, Hive, etc. - without vendor lock-in, which is a fundamental architectural advantage over proprietary EDW storage formats like those used by Teradata or Oracle.
Question
What is an advantage of using a Delta Lake-based data lakehouse over classic enterprise data warehouse solutions?
Options
- AOpen-source formats
- BSchema enforcement
- CACID transactions
- DGeneric optimizations
Explanation
Open-source formats like Parquet (used by Delta Lake) allow data to be accessed by any compatible tool - Spark, Presto, Hive, etc. - without vendor lock-in, which is a fundamental architectural advantage over proprietary EDW storage formats like those used by Teradata or Oracle. Schema enforcement (B) is not a differentiator - traditional EDWs have enforced schemas for decades; Delta Lake actually adds schema enforcement to data lakes, but that doesn't make it an advantage over EDWs. ACID transactions (C) are similarly a long-standing feature of classic EDWs, so Delta Lake achieving parity here is not a lakehouse-specific advantage over them. "Generic optimizations" (D) is nonsensical as an advantage - it implies less specialization, which would be a weakness, not a strength.
Memory tip: Associate Delta Lake with the word open - open formats, open ecosystem, no lock-in. Whenever a question asks what a lakehouse does better than a proprietary EDW, think "open-source" first.
Community Discussion
No community discussion yet for this question.