Associate-Developer-Apache-Spark Exam Free Question Set: "Databricks Certified Associate Developer for Apache Spark 3.0" Certification

Which of the following describes a narrow transformation?

Which of the following code blocks returns a DataFrame where columns predError and productId are removed from DataFrame transactionsDf?
Sample of DataFrame transactionsDf:
+-------------+---------+-----+-------+---------+----+
|transactionId|predError|value|storeId|productId|f   |
+-------------+---------+-----+-------+---------+----+
|1            |3        |4    |25     |1        |null|
|2            |6        |7    |2      |2        |null|
|3            |3        |null |25     |3        |null|
+-------------+---------+-----+-------+---------+----+

The code block displayed below contains an error. The code block should return the average of rows in column value grouped by unique storeId. Find the error.
Code block:
transactionsDf.agg("storeId").avg("value")

Which of the following code blocks creates a new DataFrame with two columns season and wind_speed_ms where column season is of data type string and column wind_speed_ms is of data type double?

Which of the following code blocks reduces a DataFrame from 12 to 6 partitions and performs a full shuffle?

Which of the following options describes the responsibility of the executors in Spark?

Which of the following statements about data skew is incorrect?

Which of the following describes Spark's standalone deployment mode?

Which of the following code blocks stores DataFrame itemsDf in executor memory and, if insufficient memory is available, serializes it and saves it to disk?

Which of the following is one of the big performance advantages that Spark has over Hadoop?
