A data engineering team is tasked with loading a large dataset (5TB) into Snowflake from an external S3 bucket. The data loading process is experiencing significant performance bottlenecks. Which of the following strategies would MOST effectively improve the data loading performance, assuming the network bandwidth between Snowflake and S3 is sufficient?

Question

Accepted Answer

A. Increase the size of the virtual warehouse to a larger size (e.g., from SMALL to LARGE) before E. Partition the data in S3 into smaller files and ensure the virtual warehouse is appropriately sized

Answer

B. Use multiple virtual warehouses concurrently to load different subsets of the data from S3.

Answer

C. Use a larger virtual warehouse indefinitely to handle any potential performance peaks, even after

Answer

D. Disable auto-suspend on the virtual warehouse to prevent it from idling during the data load.

A data engineering team is tasked with loading a large dataset (5TB) into Snowflake from an external S3 bucket. The data loading process is experiencing significant performance bottlenecks. Which of t

Question

Options

How the community answered

Explanation

Topics

Community Discussion