A Data Engineer needs to rewrite SQL code into Snowpark code using the Snowpark Python API. The data is stored in a DataFrame, df_orders. How can this SQL query be rewritten so that it runs in Snowpark?

Question

Accepted Answer

A. df_summary = df_orders.group_by('username').agg(count('id').alias('orders_count'),
 sum('price').alias('revenue'), sum('tax').alias('total_tax')).select('username', 'orders_count',
 'revenue', 'total_tax').filter(col('orders_count')>1)
 df_summary.show()

Answer

B. df_summary = df_orders.select('username', 'orders_count', 'revenue', 'total_tax', 
 count('id').as('orders_count'), sum('price').as('revenue'), sum('tax').as('total_tax')) 
 .group_by('username').sort('orders_count').desc().filter(having('orders_count')>1)
 df_summary.show()

Answer

C. df_summary = select(D.count('id').AS('orders_count'), 
 D.sum('price').AS('revenue'), D.sum('tax').AS('total_tax')) 
 .from_('orders') 
 .group_by('username').having('orders_count'>1) 
 .sort('orders_count').desc() 
 df_summary.show()

Answer

D. df_summary = df_orders.group_by('username').count('id').alias('orders_count'), 
 sum('price').alias('revenue'), sum('tax').alias('total_tax') 
 .select('username', 'orders_count', 'revenue', 'total_tax') 
 .sort('orders_count').desc().having(having('orders_count')>1)
 df_summary.show()
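
The accepted option A works because it chains calls in the order Snowpark evaluates them: group_by, then agg with aliased aggregates, then a filter on the aliased count, which plays the role of a SQL HAVING clause. As written, option A also needs col, count and sum imported from snowflake.snowpark.functions. The original SQL is not reproduced on this page, so the following is only a plain-Python sketch of what option A appears to compute, with no Snowflake connection required. The sample rows and the summarize helper are hypothetical and exist purely for illustration.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for df_orders (not from the source).
orders = [
    {"id": 1, "username": "alice", "price": 10.0, "tax": 1.0},
    {"id": 2, "username": "alice", "price": 20.0, "tax": 2.0},
    {"id": 3, "username": "bob", "price": 5.0, "tax": 0.5},
]


def summarize(rows):
    """Group by username, aggregate, then keep users with more than one order."""
    groups = defaultdict(
        lambda: {"orders_count": 0, "revenue": 0.0, "total_tax": 0.0}
    )
    for row in rows:
        agg = groups[row["username"]]
        # Mirrors count('id'): only non-null ids are counted.
        if row["id"] is not None:
            agg["orders_count"] += 1
        # Mirrors sum('price') and sum('tax').
        agg["revenue"] += row["price"]
        agg["total_tax"] += row["tax"]
    # Filtering after aggregation is the HAVING step of the query.
    return [
        {"username": name, **agg}
        for name, agg in groups.items()
        if agg["orders_count"] > 1
    ]


summary = summarize(orders)
print(summary)
```

With these sample rows only alice survives the filter, since bob has a single order. The other options fail for structural reasons visible in their text: they call select before group_by, use keyword-style names such as as, AS, from_ or having in ways option A never relies on, or apply desc() to the result of sort rather than to a column.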

DEA-C02 Question #143: Real Exam Question with Answer & Explanation