[SPARK-4118] [MLlib] [PySpark] Python bindings for StreamingKMeans #6499
Test build #33738 has finished for PR 6499 at commit
|
cc @freeman-lab |
It's still a WIP. |
I would really like some help here. The However, the A small example to reproduce.
This does not seem to terminate. |
Test build #33807 has finished for PR 6499 at commit
|
I narrowed it down to this small test case.
Why does this run indefinitely? |
Figured it out thanks to @davies through mail. Will finish this up tomorrow. |
Cool, excited to look at this! Can definitely take a pass after you update. |
@freeman-lab @mengxr |
Test build #33985 has finished for PR 6499 at commit
|
Test build #33987 has finished for PR 6499 at commit
|
@@ -818,6 +830,78 @@ def test_model_transform(self):
        self.assertEqual(model.transform([1.0, 2.0, 3.0]), DenseVector([1.0, 2.0, 3.0]))


class StreamingKMeansTest(MLLibStreamingTestCase):
    def test_model_params(self):
@davies Could you please verify whether this is the best way to add MLlib Streaming tests? I copied the idea from the tests in streaming.py.
Test build #34090 has finished for PR 6499 at commit
|
Test build #34089 has finished for PR 6499 at commit
|
Test build #34156 has finished for PR 6499 at commit
|
Test build #34158 has finished for PR 6499 at commit
|
Test build #34190 has finished for PR 6499 at commit
|
Test build #34196 has finished for PR 6499 at commit
|
Test build #34206 has finished for PR 6499 at commit
|
jenkins retest this please |
Test build #34251 has finished for PR 6499 at commit
|
{% highlight python %}
model.trainOn(trainingData)
model.predictOnValues(testData.map(lambda lp: (lp.label, lp.features)))
There is no output from this example. Call print.
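A minimal sketch of what this comment asks for, assuming "print" refers to PySpark's `DStream.pprint()` output operation. The helper names `to_labeled_pair` and `print_predictions` and the `Point` stand-in are hypothetical and not part of the PR; `model` is assumed to be a `pyspark.mllib.clustering.StreamingKMeans` and the two data arguments are assumed to be DStreams.

```python
from collections import namedtuple

# Hypothetical stand-in so the helper below can be tried without Spark;
# in the docs example the records would be LabeledPoint objects.
Point = namedtuple("Point", ["label", "features"])


def to_labeled_pair(lp):
    """Turn a labeled point into the (label, features) pair used by predictOnValues."""
    return (lp.label, lp.features)


def print_predictions(model, training_data, test_data):
    """Wire up training and prediction, then register a printing output operation.

    Assumes `model` is a StreamingKMeans and both data arguments are DStreams.
    Nothing is computed until the StreamingContext is started.
    """
    model.trainOn(training_data)
    predictions = model.predictOnValues(test_data.map(to_labeled_pair))
    # pprint() prints the first elements of each batch; without it the
    # predictions are computed lazily and never shown.
    predictions.pprint()
    return predictions
```

With this shape, the docs example would show per-batch predictions once the streaming context is started, instead of running silently.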
Test build #35161 has finished for PR 6499 at commit
|
Test build #35163 has finished for PR 6499 at commit
|
@mengxr Updated! Please review. |
Test build #35232 has finished for PR 6499 at commit
|
jenkins retest this please. |
Test build #35235 has finished for PR 6499 at commit
|
Test build #35238 has finished for PR 6499 at commit
|
Test build #35246 has finished for PR 6499 at commit
|
jenkins retest this please. |
Test build #35269 has finished for PR 6499 at commit
|
test this please |
(run another time in case the streaming tests are flaky) |
LGTM pending Jenkins. |
Test build #35289 has finished for PR 6499 at commit
|
Merged into master. Thanks! |
Thanks for your reviews and help. |
Python bindings for StreamingKMeans
Will change status to MRG once docs, tests and examples are updated.