nerdexam
AmazonAmazon

DAS-C01 · Question #8

DAS-C01 Question #8: Real Exam Question with Answer & Explanation

The correct answer is D: Use a single COPY command to load the data into the Amazon Redshift cluster.. {"question_number": 5, "correct_answer": "D", "explanation": "Using a single COPY command to load all the data files (Option D) achieves the highest throughput and optimal cluster resource utilization. When a single COPY command references multiple files in S3 (e.g., via a manife

Storage and Data Management

Question

A retail company is building its data warehouse solution using Amazon Redshift. As a part of that effort, the company is loading hundreds of files into the fact table created in its Amazon Redshift cluster. The company wants the solution to achieve the highest throughput and optimally use cluster resources when loading data into the company's fact table. How should the company meet these requirements?

Options

  • AUse multiple COPY commands to load the data into the Amazon Redshift cluster.
  • BUse S3DistCp to load multiple files into the Hadoop Distributed File System (HDFS) and use an
  • CUse LOAD commands equal to the number of Amazon Redshift cluster nodes and load the data
  • DUse a single COPY command to load the data into the Amazon Redshift cluster.

Explanation

{"question_number": 5, "correct_answer": "D", "explanation": "Using a single COPY command to load all the data files (Option D) achieves the highest throughput and optimal cluster resource utilization. When a single COPY command references multiple files in S3 (e.g., via a manifest file or a common prefix), Redshift automatically distributes the load across all compute node slices in parallel. This is the AWS-recommended best practice for bulk loading. Option A (multiple simultaneous COPY commands) causes workload contention - Redshift serializes or queues concurrent COPY commands, which can reduce throughput rather than increase it. Option B introduces unnecessary complexity by adding an EMR/HDFS layer. Option C is invalid because 'LOAD' is not a Redshift command; the correct command is COPY.", "generated_by": "claude-sonnet", "llm_judge_score": 4}

Topics

#Amazon Redshift#Data Loading#COPY command#Performance Optimization

Community Discussion

No community discussion yet for this question.

Full DAS-C01 PracticeBrowse All DAS-C01 Questions