Databricks
DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK · Question #11
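The correct answer is E: DataFrame.drop_duplicates(subset = "all"). The subset parameter expects a list of column names or None, so the string "all" is not a valid value and the call raises an error instead of returning a deduplicated DataFrame. Options A through D all deduplicate across every column: distinct() and dropDuplicates() with no subset produce the same result, and drop_duplicates() is an alias for dropDuplicates().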
DataFrame Transformations
Question
Which of the following operations fails to return a DataFrame with no duplicate rows?
Options
- A. DataFrame.dropDuplicates()
- B. DataFrame.distinct()
- C. DataFrame.drop_duplicates()
- D. DataFrame.drop_duplicates(subset = None)
- E. DataFrame.drop_duplicates(subset = "all")
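The five options above can be compared directly on a small DataFrame. The following is a minimal sketch, not an official reference: the helper names `compare_dedup_calls` and `demo`, the sample rows, and the column names are all hypothetical, and the exact exception raised for option E is assumed to vary by Spark version, so the helper records only the exception's class name.

```python
def compare_dedup_calls(df):
    """Run the five calls from the options on `df`.

    Returns a dict mapping each option letter to the resulting row count,
    or to the exception class name if the call fails.
    """
    calls = {
        "A": lambda: df.dropDuplicates(),
        "B": lambda: df.distinct(),
        "C": lambda: df.drop_duplicates(),
        "D": lambda: df.drop_duplicates(subset=None),
        "E": lambda: df.drop_duplicates(subset="all"),
    }
    outcomes = {}
    for label, call in calls.items():
        try:
            # count() forces evaluation, so analysis errors also surface here
            outcomes[label] = call().count()
        except Exception as exc:  # exact type depends on the Spark version
            outcomes[label] = type(exc).__name__
    return outcomes


def demo():
    # Assumes a local PySpark installation; imported lazily so the helper
    # above can be read and reused without starting Spark.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[1]").appName("dedup-demo").getOrCreate()
    # Hypothetical sample data: three rows, one exact duplicate
    df = spark.createDataFrame([(1, "a"), (1, "a"), (2, "b")], ["id", "val"])
    print(compare_dedup_calls(df))
    spark.stop()
```

Calling `demo()` on a machine with PySpark should show options A through D reporting the same deduplicated row count, while option E is expected to report an exception class name rather than a count.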
Topics
#PySpark DataFrame API #Data Deduplication #DataFrame Transformations #Method Parameters