Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-20232][Python] Improve combineByKey docs #17545

Closed

Conversation

dgingrich
Copy link

What changes were proposed in this pull request?

Improve combineByKey documentation:

  • Add note on memory allocation
  • Change example code to use different mergeValue and mergeCombiners

How was this patch tested?

Doctest.

Legal

This is my original work and I license the work to the project under the project’s open source license.

* Add note on memory allocation
* Change example code to use different mergeValue and mergeCombiners
@SparkQA
Copy link

SparkQA commented Apr 6, 2017

Test build #3643 has finished for PR 17545 at commit 74691d1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@holdenk
Copy link
Contributor

holdenk commented Apr 12, 2017

So the associated JIRA says "Doesn't include note about memory allocation (on aggregateBykey)" - what did you want to add in aggregateBykey?

@dgingrich
Copy link
Author

The note meant that combineByKey doesn't include the comment about memory allocation that's on aggregateByKey. I did add the memory allocation note to combineByKey, starting on line 1810.

@holdenk
Copy link
Contributor

holdenk commented Apr 13, 2017

Ok sounds good, I just wanted to make sure since the JIRA had it in two separate bullet points and I wasn't sure if it was a second thing you wanted to talk about.

LGTM.

@holdenk
Copy link
Contributor

holdenk commented Apr 13, 2017

Merged to master, thanks for working on this @dgingrich

@asfgit asfgit closed this in 8ddf0d2 Apr 13, 2017
peter-toth pushed a commit to peter-toth/spark that referenced this pull request Oct 6, 2018
## What changes were proposed in this pull request?

Improve combineByKey documentation:

* Add note on memory allocation
* Change example code to use different mergeValue and mergeCombiners

## How was this patch tested?

Doctest.

## Legal

This is my original work and I license the work to the project under the project’s open source license.

Author: David Gingrich <[email protected]>

Closes apache#17545 from dgingrich/topic-spark-20232-combinebykey-docs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants