[Feature Request] Keep delta_t as floats #437

Expertium · 2023-08-19T15:10:46Z

          Other day I was thinking that it might be worth to keep review intervals as floats, as opposed to calculating the day boundary and rounding them to days. Apart from helping those who travel a lot, it will remove necessity to enter timezone and day start, which is annoying an error prone, and also, it might improve accuracy for short interval reviews. Common sense suggests that if I learned a new card in evening and then reviewed it next day morning, I'll have better chance of remembering it if I was to learn it in morning and review it next day evening, however to the optimizer these two intervals are the same. I think this is also related to the spike on the stability to retention graph, which is at around 0.2 in my case. Despite this, I don't know if this change will bring any practical improvement.

Originally posted by @nb9618 in #429 (comment)

The text was updated successfully, but these errors were encountered:

Expertium · 2023-08-19T15:10:59Z

          No, I mean `delta_t` and `t_history`.

This obviously won't affect the scheduling as it is impossible to schedule a card at a specific time of a day, but it might improve accuracy of w and, if implemented in the add-on, accuracy of cards' memory states.

Originally posted by @nb9618 in #429 (comment)

L-M-Sherlock · 2023-08-19T15:41:51Z

And the scheduler also only knows the elapsed days, which is an integer.

user1823 · 2023-08-19T16:05:41Z

And the scheduler also only knows the elapsed days, which is an integer.

But things are changing. When the scheduler will get integrated into Anki's rust backend, several limitations would be removed. I realise that all this would not happen very soon. But, while discussing things like this one, we should think of the long term.

dae · 2023-08-27T10:05:01Z

I wonder whether it's really that useful, as once intervals grow above a few days, the fractional part is rather meaningless. For the training portion, revlog entries record days, so you'd need to infer the actual elapsed time based on the timestamps. For reviewing, the way the code is currently structured, only days are available to the scheduler.

L-M-Sherlock · 2023-10-09T11:00:09Z

https://github.com/open-spaced-repetition/fsrs-optimizer/tree/Feat/float-delta-t

Work in Progress.

ghost · 2023-10-09T13:10:17Z

I was thinking about how to differentiate between in-day and inter-day reviews when the information about day boundaries is not available. The easiest solution would be to pick a cut-off point, e.g. 12 hours, where any pair of reviews with the timestamp difference between them less than this point would be considered as having happened within the same day. There would be always a chance of misinterpreting corner cases, e.g. considering a review in the morning and then after 12 hours in the evening as having happened on different days, or a review late in the evening and then on the next day, in the early morning, as having happened on the same day. Depending on the chosen value of this point, the probability of these outcomes will change.

A more complicated approach would be to define something like an S-curve (note: here I mean the shape of the curve, and it has no relation to S as stability) with two end points, let's say 8 and 20 hours. This way, reviews with intervals of more than 20 hours, will be considered as inter-day, and with less than 8 hours as in-day, in which case S and D of the card after the later review will not be changed. For pairs of reviews within the range, the resulting S and D would be calculated as the intermediate values between the "previous" and the "updated as if it were an inter-day review" states, proportional to the value of the curve. If C = f(delta_t) is the value of the S-curve, and changes from 0 to 1 as delta_t increases, then stability after a "questionable" review will be S = C * S' + (1 - C) * S; and the same would stand for the difficulty. This way, there would be a smooth transition, as opposed to the simpler way of using a single cut-off point, which theoretically should alleviate the corner cases.

As a side note, I'm now convinced that with implementing float delta_t's there would probably be no significant improvement on the prediction accuracy for small stabilities, and taking into account the fact that the optimizer is now built into Anki, the only benefit of this change that I see is that it will improve calculation of memory states for people who change time zones.

L-M-Sherlock · 2023-10-10T11:41:52Z

Algorithm	Log Loss	RMSE	RMSE(bins)
FSRS v4	0.3820	0.3311	0.0547
FSRS v4 (float delta_t)	0.3870	0.3333	0.0557

The performance becomes worse.

Expertium · 2023-10-10T12:10:00Z

That's very strange. I wouldn't be very surprised if it was the same, but worse? That's unexpected.

L-M-Sherlock · 2023-10-10T12:20:26Z

Sorry, I make a mistake in the pretrain stage. float delta_t is incompatible with the current pretrain implementation. So I round the delta_t in the pretrain stage. The final result is:

Algorithm	Log Loss	RMSE	RMSE(bins)
FSRS v4	0.3820	0.3311	0.0547
FSRS v4 (float delta_t)	0.3840	0.3317	0.0542

Only RMSE(bins) is improved slightly.

L-M-Sherlock · 2023-10-10T12:24:45Z

I think it is not very strange. Because the memory consolidation and forgetting mainly happen during sleep.

L-M-Sherlock · 2023-11-14T10:15:16Z

Weighted by number of reviews

Algorithm	Log Loss	RMSE	RMSE(bins)
LSTM float delta_t	0.4219	0.3431	0.0680
LSTM	0.4193	0.3424	0.0662

Weighted by ln(number of reviews)

Algorithm	Log Loss	RMSE	RMSE(bins)
LSTM float delta_t	0.5886	0.3755	0.1387
LSTM	0.5934	0.3755	0.1382

It doesn't improve the performance to keep delta_t as floats. I think it's OK to close this issue.

L-M-Sherlock changed the title ~~[Feature Request] Kepp delta_t as floats~~ [Feature Request] Keep delta_t as floats Aug 19, 2023

L-M-Sherlock mentioned this issue Oct 11, 2023

[Feature Request] Ideas to further improve the accuracy of the algorithm #461

Closed

L-M-Sherlock closed this as completed Nov 14, 2023

Expertium mentioned this issue Jan 28, 2024

[Question] A “raw” version of the tiny_dataset.zip open-spaced-repetition/srs-benchmark#43

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Keep delta_t as floats #437

[Feature Request] Keep delta_t as floats #437

Expertium commented Aug 19, 2023

Expertium commented Aug 19, 2023

L-M-Sherlock commented Aug 19, 2023

user1823 commented Aug 19, 2023

dae commented Aug 27, 2023

L-M-Sherlock commented Oct 9, 2023

ghost commented Oct 9, 2023

L-M-Sherlock commented Oct 10, 2023

Expertium commented Oct 10, 2023

L-M-Sherlock commented Oct 10, 2023

L-M-Sherlock commented Oct 10, 2023

L-M-Sherlock commented Nov 14, 2023

[Feature Request] Keep delta_t as floats #437

[Feature Request] Keep delta_t as floats #437

Comments

Expertium commented Aug 19, 2023

Expertium commented Aug 19, 2023

L-M-Sherlock commented Aug 19, 2023

user1823 commented Aug 19, 2023

dae commented Aug 27, 2023

L-M-Sherlock commented Oct 9, 2023

ghost commented Oct 9, 2023

L-M-Sherlock commented Oct 10, 2023

Expertium commented Oct 10, 2023

L-M-Sherlock commented Oct 10, 2023

L-M-Sherlock commented Oct 10, 2023

L-M-Sherlock commented Nov 14, 2023

Weighted by number of reviews

Weighted by ln(number of reviews)