
How does model.fit() calculate loss and acc? Documentation will be helpful. #10426

Closed
whatdhack opened this issue Jun 14, 2018 · 19 comments

@whatdhack

I think it uses functions like binary_crossentropy, binary_accuracy, etc. Is any smoothing used? Some description in the documentation would be good (pointers to the exact code would also help).

@raymond-yuan

Loss and accuracy are calculated as you train, according to the loss and metrics specified when compiling the model. Before you train, you must compile your model to configure the learning process. This allows you to specify the optimizer, loss function, and metrics, which in turn are how the model's fit function knows what loss function to use, what metrics to keep track of, etc. The loss function (like binary cross entropy) documentation can be found here, and the metrics (like accuracy) documentation can be found here.

In summary, in order for you to train (and for fit to calculate loss and acc):

  1. Instantiate the model
model = Sequential()
...
  2. Compile the model. This is where loss functions like binary cross entropy are specified for fit to call. The built-in loss functions are documented and implemented here.
# For a binary classification problem
model.compile(optimizer='rmsprop',
              loss='binary_crossentropy',
              metrics=['accuracy'])
  3. Train by calling fit
model.fit(data, one_hot_labels, epochs=10, batch_size=32)

@whatdhack
Author

Thanks, that is a little more information, but not exactly an answer to what I was asking. My question is: what exactly is the formula used for calculating loss and accuracy per batch (e.g. the numbers that get displayed during model.fit())? If you know where exactly that is done in the code, the answer should be right there.

@raymond-yuan

Yes, as I linked, the formulas for calculating loss per batch can be found here. The exact implementation is backend specific; for example, if you were using TensorFlow, look here.

@whatdhack
Author

Thanks again - I have seen that code, by the way. Let me try again. The following is output from a model.fit() session:

5/125 [>.............................] - ETA: 4:35 - loss: 1.1342 - acc: 0.5250

Which exact variable holds the loss (e.g. 1.1342) and the accuracy (e.g. 0.5250)?

@raymond-yuan

I believe that the loss is stored here; similarly, the accuracy is handled by the handle_metrics function. However, you shouldn't work with these variables directly; if you want to calculate the loss and accuracy of your model given some data, use model.evaluate.

@whatdhack
Author

Thanks again. Basically my question is what, mathematically, the loss and accuracy displayed during model.fit() are. Are they averaged over the batch, or is some other function used to calculate the loss/acc that gets displayed on the screen? The answer is still not there, hence the need to keep looking until it is found, and to document it.

@raymond-yuan

For training loss, Keras does a running average over the batches. For validation loss, a conventional average over all the batches in the validation data is performed. The training accuracy is the average of the accuracy values for each batch of training data during training.
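The difference between a running average and the raw per-batch loss can be sketched in plain Python (the loss values here are hypothetical, not from a real run):

```python
# Hypothetical per-batch losses for equal-sized batches
batch_losses = [1.2, 1.0, 0.8, 0.6]

running = []
total = 0.0
for i, loss in enumerate(batch_losses, start=1):
    total += loss              # accumulate the loss over all batches seen so far
    running.append(total / i)  # running average shown after each batch

# running averages: approximately 1.2, 1.1, 1.0, 0.9 -
# each displayed value depends on all earlier batches, not just the current one
```

So the number printed after batch 4 (about 0.9) is not batch 4's own loss (0.6); it carries over the earlier, higher losses.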

@whatdhack
Author

whatdhack commented Jun 15, 2018

Good. Is the running average over just the batch, or is there carryover from the previous batch? Would be great to see the code for that.

@raymond-yuan

The code is here; BaseLogger is applied to every Keras model and updates things like loss and acc. The training loss is carried over from the previous batch: it's the average of the losses over each batch of training data.

@whatdhack
Author

You are probably close. If the code (see below) is indeed responsible for fit()'s per-batch loss/acc output, then the mystery is how 'v' got into the logs dict before entering on_batch_end(). This function primarily seems to keep track of variables for the on_epoch_end() calculations.

def on_batch_end(self, batch, logs=None):
    ...
    for k, v in logs.items():
        if k in self.stateful_metrics:
            self.totals[k] = v  # <----
        else:
            if k in self.totals:
                self.totals[k] += v * batch_size
            else:
                self.totals[k] = v * batch_size
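The accumulation in that snippet can be re-implemented as a minimal sketch (the class and variable names mirror BaseLogger, but this is a simplified stand-in, not the actual Keras code, and it ignores the stateful-metrics branch):

```python
# Simplified sketch of BaseLogger's per-epoch accumulation
class MiniBaseLogger:
    def on_epoch_begin(self):
        self.seen = 0      # total number of samples seen this epoch
        self.totals = {}   # per-metric sums, weighted by batch size

    def on_batch_end(self, logs):
        batch_size = logs.get('size', 0)
        self.seen += batch_size
        for k, v in logs.items():
            if k in ('batch', 'size'):
                continue  # bookkeeping keys, not metrics
            self.totals[k] = self.totals.get(k, 0.0) + v * batch_size

    def on_epoch_end(self):
        # epoch-level values: sample-weighted averages over all batches
        return {k: total / self.seen for k, total in self.totals.items()}

logger = MiniBaseLogger()
logger.on_epoch_begin()
logger.on_batch_end({'loss': 1.0, 'size': 32, 'batch': 0, 'acc': 0.5})
logger.on_batch_end({'loss': 0.5, 'size': 32, 'batch': 1, 'acc': 0.75})
print(logger.on_epoch_end())  # {'loss': 0.75, 'acc': 0.625}
```

Multiplying by `v * batch_size` and later dividing by `seen` is what makes the epoch numbers a per-sample average rather than a plain average of batch values.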

@raymond-yuan

'v' gets added into the batch logs here. outs is the output of the train function f, which is built when you call fit on your model and which outputs your loss and metrics from here, given some inputs. An example of the logs looks like this (if you were to add a print(logs) in the on_batch_end method):

{'loss': 1.8399812, 'batch': 2, 'size': 32, 'acc': 0.5625}
{'loss': 1.6450999, 'batch': 3, 'size': 32, 'acc': 0.53125}
{'loss': 1.6244895, 'batch': 4, 'size': 32, 'acc': 0.40625}
{'loss': 1.4333215, 'batch': 5, 'size': 32, 'acc': 0.5625}
{'loss': 1.1506481, 'batch': 6, 'size': 32, 'acc': 0.75}
{'loss': 1.0616728, 'batch': 7, 'size': 32, 'acc': 0.65625}
{'loss': 1.313815, 'batch': 8, 'size': 32, 'acc': 0.46875}
{'loss': 1.2656202, 'batch': 9, 'size': 32, 'acc': 0.5625}
{'loss': 1.2441672, 'batch': 10, 'size': 32, 'acc': 0.5625}
{'loss': 0.99731344, 'batch': 11, 'size': 32, 'acc': 0.65625}
{'loss': 1.0144608, 'batch': 12, 'size': 32, 'acc': 0.71875}
{'loss': 1.2803181, 'batch': 13, 'size': 32, 'acc': 0.4375}
{'loss': 1.2766386, 'batch': 14, 'size': 32, 'acc': 0.5}
{'loss': 0.87459761, 'batch': 15, 'size': 32, 'acc': 0.6875}
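Given the running-average behavior described earlier in the thread, the number shown in the progress bar can be reproduced from per-batch log entries like these. A plain-Python sketch using the first three entries above:

```python
# Per-batch values copied from the logs above (batches 2-4, all of size 32)
batch_logs = [
    {'loss': 1.8399812, 'size': 32},
    {'loss': 1.6450999, 'size': 32},
    {'loss': 1.6244895, 'size': 32},
]

seen, total = 0, 0.0
for logs in batch_logs:
    seen += logs['size']
    total += logs['loss'] * logs['size']
    # the sample-weighted running average is what the progress bar displays
    print(f"displayed loss: {total / seen:.4f}")
```

With equal batch sizes this reduces to the plain mean of the per-batch losses so far; with a smaller final batch the weighting matters.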

@whatdhack
Author

That is great. So, it is still not clear whether the per-batch loss/acc is the average over just that batch or something else.

@kaiche12

Sorry, how does Keras calculate loss (e.g. mse) when doing predict_on_batch? For example, if you have 10 batches, will the model normally return 10 losses?

@DayChan

DayChan commented Mar 24, 2019

@raymond-yuan When we choose "acc" to calculate accuracy, how does Keras choose the calculating function? I found that there are several accuracy functions.

@chrismaliszewski

@DayChan, check the comments on this answer: https://stackoverflow.com/a/59637143/5536388.

@NEGU93

NEGU93 commented Nov 25, 2021

After having some anxiety reading this, I believe the conclusion is:

  • Loss: displays the loss of the current batch
  • Validation loss: displays the loss over all the validation batches
  • Accuracy and validation accuracy: display the acc over all the batches (train and val respectively).

Did I understand correctly?

@chrismaliszewski

All good but the last point, the training part. I'll sum this up again + extras:

  • if the acc/accuracy metric is specified, TF automatically chooses it based on the loss function (LF); it can be tf.keras.metrics.BinaryAccuracy, tf.keras.metrics.CategoricalAccuracy, or tf.keras.metrics.SparseCategoricalAccuracy, and it's hidden under the name accuracy,
  • when a metric is calculated, it usually has two public variables, total and count; the first accumulates the sum of the metric's results, whereas count is how many times we calculated them; the displayed metric value comes from dividing one by the other,
  • (the following I'm not 100% sure about) most of the LFs inherit from MeanWrapper; if one doesn't, i.e. a custom LF, it's wrapped automatically so that mean results are calculated after each batch in the same way metrics are (see the previous point),
  • while training: all metrics (and the LF) are calculated on batch_end and displayed immediately,
  • while validating: all metrics (and the LF) are calculated on batch_end but displayed at the very end, after averaging.
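The total/count mechanic described above can be sketched in plain Python (this is a stand-in mimicking the behavior of tf.keras.metrics.Mean, not the real TF class):

```python
# Sketch of a total/count metric, in the style of tf.keras.metrics.Mean
class MeanMetric:
    def __init__(self):
        self.total = 0.0   # running sum of all values seen
        self.count = 0     # how many values have been accumulated

    def update_state(self, values):
        self.total += sum(values)
        self.count += len(values)

    def result(self):
        # the displayed value is always total / count
        return self.total / self.count

m = MeanMetric()
m.update_state([1.0, 0.0, 1.0, 1.0])  # e.g. per-sample correctness for batch 1
print(m.result())  # 0.75
m.update_state([0.0, 0.0, 1.0, 1.0])  # batch 2
print(m.result())  # 0.625 - a running mean over both batches, not batch 2 alone
```

Because total and count persist across batches within an epoch, each call to result() reflects everything seen so far, which matches the "calculated on batch_end and displayed immediately" point above.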

@NEGU93

NEGU93 commented Nov 26, 2021

I tried this line of code:

evaluate = model.evaluate(train_ds)
print(evaluate)

And got:

148/148 [==============================] - 23s 122ms/step - loss: 0.2275 - accuracy: 0.8943 
[0.23347871005535126, 0.8992502093315125] 

See how loss and acc are not the same. We have two values here: the 'computation' value (in this case 0.8943) and the 'returned' value (in this case 0.8992).
Therefore, I believe the 'computation' value is a per-batch value only, meaning the accuracy of the specific batch, and the returned value of the function is the actual total / count.
That is how I would explain the difference here.

Another option (which I think is less likely) is that the returned value is total_per_sample / (count * batch_size), and the computation metric is total / count.
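One way to see how two such averaging formulas can disagree: an unweighted mean of per-batch values and a per-sample weighted mean coincide when all batches have the same size, but a smaller final batch separates them. A quick sketch with hypothetical numbers:

```python
# Hypothetical per-batch accuracies and batch sizes; the last batch is smaller
batches = [(0.50, 32), (0.75, 32), (1.00, 8)]

# Option A: unweighted mean of the per-batch values
h1 = sum(acc for acc, _ in batches) / len(batches)

# Option B: per-sample weighted mean
h2 = sum(acc * n for acc, n in batches) / sum(n for _, n in batches)

print(h1)  # 0.75
print(h2)  # (16 + 24 + 8) / 72, i.e. about 0.6667
```

So a discrepancy between the progress-bar value and the returned value on the same data is exactly the kind of signal that would distinguish the two hypotheses.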

@chrismaliszewski

@NEGU93, I followed the code again, here is how it works:

  1. fit() runs.
  2. self.train_function runs (it's a bit different for evaluation, prediction, testing, and training); it returns logs.
  3. The method compute_metrics is called.
  4. It calls compiled_metrics.update_state(), which calls update_state for each metric.
  5. As I described in the previous post, most of the metrics accumulate their results in total and count variables, but they return the updated value.
  6. (We're back to the place from step 3.) After the update, there is a loop across all metrics, and their results are returned via each metric's result method; these are the logs.
  7. A callback on batch_end is called that updates the progress bar (method _batch_update_progbar) with the log values.

That's how the code works.

You can check that what I say is true by running this code for 2 "batches"/examples: https://colab.research.google.com/drive/12gTrW-k0TntAawVjXv-tVLp4lmM5lFXx?usp=sharing. I created it in the past to simulate how metrics and loss functions work in NumPy.

I'm not sure what causes the difference in your code.
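The seven steps above can be compressed into a runnable plain-Python sketch (all names here are hypothetical stand-ins for the Keras internals, not the real classes or functions):

```python
# End-to-end sketch of steps 1-7: train step -> metric update -> logs -> progress bar
class Mean:
    def __init__(self, name):
        self.name, self.total, self.count = name, 0.0, 0

    def update_state(self, value, n):
        self.total += value * n   # accumulate, weighted by batch size
        self.count += n

    def result(self):
        return self.total / self.count

loss_tracker = Mean('loss')
acc_metric = Mean('accuracy')

def train_function(batch):
    # step 2: a real gradient step would happen here; we just return batch stats
    return batch['loss'], batch['acc'], batch['size']

def compute_metrics(loss, acc, n):
    # steps 3-6: update each metric's state, then collect results as logs
    loss_tracker.update_state(loss, n)
    acc_metric.update_state(acc, n)
    return {m.name: m.result() for m in (loss_tracker, acc_metric)}

for batch in ({'loss': 1.0, 'acc': 0.5, 'size': 32},
              {'loss': 0.5, 'acc': 0.75, 'size': 32}):
    logs = compute_metrics(*train_function(batch))
    print(logs)  # step 7: the progress bar is updated with these running values
```

The key point the walkthrough makes is visible here: the values handed to the progress bar come from result(), i.e. accumulated state, not from the current batch alone.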


6 participants