This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

loss for training and evaluation in estimator could be different #16879

Closed
liuzh47 opened this issue Nov 21, 2019 · 3 comments

liuzh47 (Contributor) commented Nov 21, 2019

Description

In the current estimator implementation, fit_batch and evaluate_batch use the same loss function.
The code snippet in fit_batch is shown below:

    with autograd.record():
        pred = [self.net(x) for x in data]
        loss = [self.loss(y_hat, y) for y_hat, y in zip(pred, label)]

The code snippet for evaluate_batch is shown below:

    data, label = self._get_data_and_label(val_batch, self.context, batch_axis)
    pred = [self.net(x) for x in data]
    loss = [self.loss(y_hat, y) for y_hat, y in zip(pred, label)]

Both training and evaluation use the same loss function, self.loss, to compute the batch loss. In many use cases this assumption does not hold. For example, when training an LSTM, a user may use a joint regularization loss during training but standard cross-entropy during evaluation.

When writing a customized estimator, it is cumbersome to define a new loss whenever evaluation does not share the same loss as training. It would therefore be useful if the estimator API included two losses, self.train_loss and self.evaluate_loss, to handle the two cases separately.
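A minimal, framework-agnostic sketch of the idea (the Estimator class, train_loss, and evaluate_loss names here are illustrative stand-ins for the suggestion above, not the actual Gluon Estimator; autograd.record and the per-device list handling from the snippets are omitted for brevity):

```python
class Estimator:
    """Simplified stand-in for an estimator that keeps two separate losses."""

    def __init__(self, net, train_loss, evaluate_loss):
        self.net = net
        self.train_loss = train_loss        # e.g. joint regularization loss
        self.evaluate_loss = evaluate_loss  # e.g. plain cross-entropy

    def fit_batch(self, data, label):
        # Training computes the batch loss with self.train_loss.
        pred = self.net(data)
        return self.train_loss(pred, label)

    def evaluate_batch(self, data, label):
        # Evaluation computes the batch loss with self.evaluate_loss.
        pred = self.net(data)
        return self.evaluate_loss(pred, label)
```

With this split, the training loop and the evaluation loop can report different quantities for the same batch without the user having to subclass the estimator.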

liuzh47 added the Bug label Nov 21, 2019
leezu (Contributor) commented Nov 21, 2019

How about introducing a new evaluation_loss or evaluate_loss argument to the constructor? If it is None, we use the training loss during evaluation. For backwards compatibility, let's keep self.loss and just introduce a new self.evaluation_loss. What do you think?
Will you open a PR to fix this issue?

liuzh47 (Contributor, Author) commented Nov 21, 2019

> How about introducing a new evaluation_loss or evaluate_loss argument to the constructor? If it is None, we use the training loss during evaluation. For backwards compatibility, let's keep self.loss and just introduce a new self.evaluation_loss. What do you think?
> Will you open a PR to fix this issue?

It is viable to do that. I'll fix this issue.
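The backward-compatible fallback leezu describes can be sketched as a constructor change (a hypothetical simplification, not the actual Gluon Estimator signature; the net and loss arguments stand in for the real ones):

```python
class Estimator:
    """Sketch of the proposed backward-compatible constructor."""

    def __init__(self, net, loss, evaluation_loss=None):
        self.net = net
        # self.loss is kept unchanged for backwards compatibility and is
        # still used during training.
        self.loss = loss
        # When no separate evaluation loss is given, fall back to the
        # training loss, so existing code behaves exactly as before.
        self.evaluation_loss = evaluation_loss if evaluation_loss is not None else loss
```

Existing callers that pass only loss see no behavior change, while new callers can opt into a distinct evaluation loss.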

sxjscience (Member) commented
Fixed in #16888, Thanks @liuzh91

@Yiyan66 Yiyan66 mentioned this issue Feb 6, 2020