SOL-C01 Question #181: Real Exam Question with Answer & Explanation
The correct answers are A, B, and E: increase the virtual warehouse size to 'MEDIUM' or larger, split the large CSV file into smaller files and load them in parallel, and compress the CSV file before uploading it to the external stage. A larger warehouse provides more compute resources for the load, smaller files let Snowflake load the data in parallel, and compression reduces the amount of data transferred from the stage.
Question
You are using a virtual warehouse 'X-SMALL' to load data from a large CSV file (50GB) residing in an external stage into a Snowflake table. During the data loading process, you observe that the warehouse is consistently running at 100% utilization, and the loading process is taking an unacceptably long time. Identify the strategies that can improve data loading performance (Multiple Answers Possible).
Options
- A. Increase the virtual warehouse size to 'MEDIUM' or larger.
- B. Split the large CSV file into smaller files and load them in parallel.
- C. Use the 'VALIDATE' function before loading to ensure data quality, which will speed up the loading.
- D. Partition the Snowflake table based on a relevant column in the CSV file before loading the data.
- E. Compress the CSV file using gzip or bzip2 before uploading it to the external stage.
Explanation
Increasing the warehouse size (Option A) provides more compute resources for the data loading process. Snowflake parallelizes a load across files, so a larger warehouse pays off mainly when there are multiple files to process, which is why it works best in combination with Option B. Splitting the large CSV file into smaller files (Option B) allows Snowflake to load the data in parallel, leveraging its distributed architecture. Compressing the CSV file (Option E) reduces the amount of data that needs to be transferred from the external stage, resulting in faster load times. The 'VALIDATE' function (Option C) reports errors from a previous COPY INTO execution; it checks data quality but does not improve loading speed. Partitioning the table before loading (Option D) is not a valid operation: Snowflake divides table data into micro-partitions automatically, and a clustering key can be defined after loading if query patterns call for it.
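Options B and E can be prepared together before the files reach the external stage. The following is a minimal sketch, not a definitive implementation: it splits a local CSV into gzip-compressed chunks by row count. The function name `split_csv_gzip`, the `part_NNNNN.csv.gz` naming, and the default chunk size are all hypothetical choices, and the sketch assumes the file has a single header row, which it repeats in every chunk (so the load would need to skip one header line per file). Snowflake's general guidance is to aim for roughly 100-250 MB per compressed file, so `rows_per_chunk` would need tuning to the actual row width.

```python
import csv
import gzip
from pathlib import Path


def split_csv_gzip(src, out_dir, rows_per_chunk=1_000_000):
    """Split a CSV into gzip-compressed chunks, repeating the header in each.

    Assumes the first row of `src` is a header. Returns the chunk paths
    in the order they were written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    chunks = []
    out = None
    writer = None
    try:
        with open(src, newline="", encoding="utf-8") as f:
            # csv.reader keeps quoted fields with embedded newlines intact,
            # which a naive line-based split would break.
            reader = csv.reader(f)
            header = next(reader, None)
            if header is None:
                return chunks  # empty input: nothing to write
            for i, row in enumerate(reader):
                if i % rows_per_chunk == 0:
                    # Start a new chunk: close the previous one, open the next.
                    if out is not None:
                        out.close()
                    path = out_dir / f"part_{len(chunks):05d}.csv.gz"
                    out = gzip.open(path, "wt", newline="", encoding="utf-8")
                    writer = csv.writer(out)
                    writer.writerow(header)
                    chunks.append(path)
                writer.writerow(row)
    finally:
        if out is not None:
            out.close()
    return chunks
```

The resulting files could then be uploaded to the stage and loaded with a single COPY INTO statement that matches all of them, giving the warehouse several files to work on at once.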