Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-17641][SQL] Collect_list/Collect_set should not collect null v…
…alues. ## What changes were proposed in this pull request? We added native versions of `collect_set` and `collect_list` in Spark 2.0. These currently also (try to) collect null values, this is different from the original Hive implementation. This PR fixes this by adding a null check to the `Collect.update` method. ## How was this patch tested? Added a regression test to `DataFrameAggregateSuite`. Author: Herman van Hovell <[email protected]> Closes #15208 from hvanhovell/SPARK-17641. (cherry picked from commit 7d09232) Signed-off-by: Reynold Xin <[email protected]>
- Loading branch information