nerdexam
SnowflakeSnowflake

DAA-C01 · Question #153

DAA-C01 Question #153: Real Exam Question with Answer & Explanation

The correct answer is B: Parquet format optimizes query performance and reduces storage requirements. Parquet is a columnar storage format purpose-built for analytics workloads: it enables column pruning (reading only the columns a query needs), supports efficient compression algorithms, and stores metadata like min/max values per column to enable predicate pushdown - all of whic

Performance Optimization

Question

When working with CSV, JSON, and Parquet data types in Snowflake, how does selecting the appropriate data format impact query performance and storage efficiency?

Options

  • ACSV format enhances query performance and minimizes storage needs
  • BParquet format optimizes query performance and reduces storage requirements
  • CAll formats have similar impacts on query performance and storage
  • DJSON format accelerates query execution but increases storage requirements

Explanation

Parquet is a columnar storage format purpose-built for analytics workloads: it enables column pruning (reading only the columns a query needs), supports efficient compression algorithms, and stores metadata like min/max values per column to enable predicate pushdown - all of which dramatically reduce both I/O and storage costs in Snowflake.

Why the distractors are wrong:

  • A (CSV): CSV is a flat, row-oriented text format with no compression or metadata - it's the worst performer for query optimization and consumes more storage than binary formats.
  • C (all formats similar): Meaningfully false - format choice is one of the most impactful decisions for query performance and storage in Snowflake; Parquet consistently outperforms both CSV and raw JSON.
  • D (JSON accelerates queries): JSON is semi-structured and schema-less, which adds parsing overhead at query time and stores redundant key names with every record, increasing storage rather than reducing it.

Memory tip: Think of Parquet as a pre-organized filing cabinet - data is sorted by column, pre-compressed, and indexed. CSV and JSON are like loose papers in a pile - you have to read everything to find anything.

Topics

#Data formats#Parquet#Query performance#Storage efficiency

Community Discussion

No community discussion yet for this question.

Full DAA-C01 PracticeBrowse All DAA-C01 Questions