[SPARK-20348] [ML] Support squared hinge loss (L2 loss) for LinearSVC #17645
Conversation
Test build #75830 has finished for PR 17645 at commit
 * @group param
 */
@Since("2.3.0")
final val lossFunction: Param[String] = new Param(this, "lossFunction", "Specifies the loss " +
I'd prefer to move this out to shared params, since it can be used by other algorithms as well. Thanks.
Sure, we can do that.
But I'm thinking we should do an integrated refactor of the common optimization parameters at some point in the future, either through shared params or some other trait or abstract class.
OK, let's leave it as is and refactor in the future. One minor issue: what about renaming it to loss? I found that the corresponding param in sklearn.svm.LinearSVC is named loss. Thanks.
ping @hhbyyh, where are we on this?
Hi @HyukjinKwon, I think this is a feature we need, but currently we are still having some discussion about the optimizer interface.
I took this out of the list. Though, shouldn't we close this for now and reopen it when it's ready, if it takes quite long? That would probably be better than leaving it open without further updates for a long time.
ping @hhbyyh. WDYT?
OK. I'll close it for now and try to merge it with #17862.
What changes were proposed in this pull request?
While hinge loss is the standard loss function for linear SVM, squared hinge loss (a.k.a. L2 loss) is also popular in practice. L2-SVM is differentiable and imposes a larger (quadratic rather than linear) penalty on points that strongly violate the margin. An introduction can be found at http://mccormickml.com/2015/01/06/what-is-an-l2-svm/
LIBLINEAR and scikit-learn both offer squared hinge loss as the default loss function for linear SVM.
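The difference between the two losses can be sketched in a few lines. This is a minimal illustration, not the Spark implementation; the function names are mine, and `margin` stands for the signed margin y * (w . x + b):

```python
def hinge(margin: float) -> float:
    """Standard hinge loss: linear penalty for margin violations."""
    return max(0.0, 1.0 - margin)

def squared_hinge(margin: float) -> float:
    """Squared hinge (L2) loss: quadratic penalty, differentiable everywhere."""
    h = max(0.0, 1.0 - margin)
    return h * h

for m in (2.0, 0.5, -1.0):
    print(f"margin={m:5.1f}  hinge={hinge(m):.2f}  squared_hinge={squared_hinge(m):.2f}")
# margin= 2.0: both losses are 0 (correctly classified, outside the margin)
# margin= 0.5: hinge=0.50, squared_hinge=0.25
# margin=-1.0: hinge=2.00, squared_hinge=4.00
```

Note that the quadratic penalty only dominates once the hinge loss exceeds 1; for small margin violations the squared hinge is actually smaller, while for badly misclassified points it grows much faster.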
How was this patch tested?
Strengthened existing unit tests and added a new unit test for comparison.