SOL-C01 · Question #197
SOL-C01 Question #197: Real Exam Question with Answer & Explanation
The correct answer is C: Utilize Snowflake's data skipping features by partitioning the external stage based on document. Option C is correct because data skipping allows Snowflake to avoid scanning unnecessary files in the external stage, reducing the amount of data that needs to be processed. Option D is correct because pre-processing reduces the total data input to the PARSE_DOCUMENT function. Op
Question
You're using Snowflake Cortex's 'PARSE DOCUMENT function to extract data from a large number of PDF documents stored in an AWS S3 bucket. You want to optimize costs by minimizing the amount of data processed by the function. Which of the following strategies would be MOST effective? (Choose all that apply)
Options
- AUse a smaller virtual warehouse size to reduce compute costs, even if it increases the overall
- BImplement data masking policies to redact sensitive information (e.g., personally identifiable
- CUtilize Snowflake's data skipping features by partitioning the external stage based on document
- DPre-process the documents to extract only the specific sections containing the information you
- EDisable Snowflake's result caching to ensure that PARSE_DOCUMENT always processes the
Explanation
Option C is correct because data skipping allows Snowflake to avoid scanning unnecessary files in the external stage, reducing the amount of data that needs to be processed. Option D is correct because pre-processing reduces the total data input to the PARSE_DOCUMENT function. Option A might seem intuitive, but often a larger warehouse processes data much faster and cheaper overall due to parallelization and efficiency. Option B is relevant for security but doesn't directly minimize the amount of data processed by PARSE_DOCUMENT. Option E will increase the costs. Always review the Snowflake query history for error messages or contact Snowflake support for assistance.
Topics
Community Discussion
No community discussion yet for this question.