[FEA] Support posexplode in nested type columns. #6025

nartal1 · 2020-08-18T16:43:56Z

This looks like extension of #2975. Please let me know if we could merge this FEA in #2975 itself.

When processing a nested column, create a new row for each element with position in the given array or map column.
Spark API docs

Example:
val df = Seq(List(1,2,3),List(3,4,5)).toDF("Integers")
df.show
+---------+
| Integers|
+---------+
|[1, 2, 3]|
|[4, 5, 6]|
+---------+

df.select(posexplode($"Integers")).show
+---+---+
|pos|col|
+---+---+
| 0| 1|
| 1| 2|
| 2| 3|
| 0| 4|
| 1| 5|
| 2| 6|
+---+---+

github-actions · 2021-02-16T21:18:12Z

This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

nartal1 · 2021-02-17T01:16:56Z

This is duplicate of #6151. Closing.

nartal1 added feature request New feature or request Needs Triage Need team to review and classify libcudf Affects libcudf (C++/CUDA) code. labels Aug 18, 2020

kkraus14 added Spark Functionality that helps Spark RAPIDS and removed Needs Triage Need team to review and classify labels Aug 18, 2020

nartal1 mentioned this issue Aug 18, 2020

[FEA] Audit GenerateExec NVIDIA/spark-rapids#228

Closed

github-actions bot added the rotten label Feb 16, 2021

nartal1 closed this as completed Feb 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Support posexplode in nested type columns. #6025

[FEA] Support posexplode in nested type columns. #6025

nartal1 commented Aug 18, 2020

github-actions bot commented Feb 16, 2021

nartal1 commented Feb 17, 2021

[FEA] Support posexplode in nested type columns. #6025

[FEA] Support posexplode in nested type columns. #6025

Comments

nartal1 commented Aug 18, 2020

github-actions bot commented Feb 16, 2021

nartal1 commented Feb 17, 2021