Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do more singular extensions. #4016

Closed
wants to merge 1 commit into from

Conversation

Vizvezdenec
Copy link
Contributor

This patch continues work on scaling - last VLTC tuning that was really bad at STC but had shown really good performance at LTC and beyond had the most massive change being lowering singular extensions depth threshold.
This patch does it to even bigger extent - lowers it even more. As a result this patch loses ~8 elo at STC, ~2 elo at LTC but gains elo in both 240+2.4 single thread test and 60+0.6 8 thread test, effectively passing double SPRT (240+2.4 test is fixed games test but it actually passed "STC" SPRT somewhere near 35k games).
So this patch is confirmed with usual fishtest confidence to be an elo gainer on time controls higher than 60+0.6 - which are almost always where stockfish is actually used.
Corresponding tests:
STC:
https://tests.stockfishchess.org/tests/view/626a98a88707aa698c008e7e
Elo: -8.27 +-2.5 (95%) LOS: 0.0%
Total: 20000 W: 5031 L: 5507 D: 9462
Ptnml(0-2): 113, 2522, 5179, 2100, 86
nElo: -15.92 +-4.8 (95%) PairsRatio: 0.83
LTC:
https://tests.stockfishchess.org/tests/view/62694d3309e8b84303aa8df3
LLR: -2.93 (-2.94,2.94) <0.50,3.00>
Total: 22000 W: 5681 L: 5819 D: 10500
Ptnml(0-2): 30, 2335, 6398, 2217, 20
VLTC:
https://tests.stockfishchess.org/tests/view/626abd7e8707aa698c0093a8
Elo: 2.35 +-1.5 (95%) LOS: 99.9%
Total: 40000 W: 10991 L: 10720 D: 18289
Ptnml(0-2): 8, 3534, 12648, 3799, 11
nElo: 5.47 +-3.4 (95%) PairsRatio: 1.08
VLTC multicore:
https://tests.stockfishchess.org/tests/view/6272a6afc8f14123163c1997
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 86808 W: 24165 L: 23814 D: 38829
Ptnml(0-2): 11, 7253, 28524, 7606, 10
bench 7040579

@Vizvezdenec
Copy link
Contributor Author

Vizvezdenec commented May 9, 2022

if this will be committed I invite to add @alabx to authors list :)
It's more of his idea with me supervising it.

@vondele
Copy link
Member

vondele commented May 11, 2022

Posting what I mentioned on discord:

I have some concerns indeed. This one is now clearly negative in our LTC testing conditions. A revert will pass as an Elo gainer. At that point we end in the situation were we will start to randomly test patches at various conditions, because 'they might scale better'.

@vondele
Copy link
Member

vondele commented May 11, 2022

I would like to see some work ongoing that refines the SE conditions, we know there is something there that makes this scale as such. If we can identify that, this will be worthwhile in the long run

@dsmsgms
Copy link
Contributor

dsmsgms commented May 11, 2022

How bad it is if you avoid double extensions in lower depths?

@vondele
Copy link
Member

vondele commented May 11, 2022

We don't know, but it is a reasonable idea to test on fishtest.

@Vizvezdenec
Copy link
Contributor Author

This conditions were attempted to be refined for a lot of times since first xoto LTC tuning which drastically changed them (it was >= 8).
Same with conthist based pruning.
If anything number of failed attempts to make it scale properly is > 100 - nothing has worked so far.

vondele pushed a commit to vondele/Stockfish that referenced this pull request May 14, 2022
This patch makes the SE depth condition more robust and allows it to scale with completed depth
from a previous search.

At long TC this patch is almost equivalent to official-stockfish#4016 which had

VLTC:
https://tests.stockfishchess.org/tests/view/626abd7e8707aa698c0093a8
Elo: 2.35 +-1.5 (95%) LOS: 99.9%
Total: 40000 W: 10991 L: 10720 D: 18289
Ptnml(0-2): 8, 3534, 12648, 3799, 11
nElo: 5.47 +-3.4 (95%) PairsRatio: 1.08

VLTC multicore:
https://tests.stockfishchess.org/tests/view/6272a6afc8f14123163c1997
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 86808 W: 24165 L: 23814 D: 38829
Ptnml(0-2): 11, 7253, 28524, 7606, 10

however, it is now also gaining at LTC:

LTC:
https://tests.stockfishchess.org/tests/view/627e7cb523c0c72a05b651a9
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 27064 W: 7285 L: 7046 D: 12733
Ptnml(0-2): 8, 2446, 8390, 2675, 13

and should have nearly no influence at STC as depth 27 is rarely reached.
It was noticed that initializing the threshold with MAX_PLY, had an adverse effect,
possibly because the first move is sensitive to this.

closes official-stockfish#4021
closes official-stockfish#4016

Bench:
@vondele vondele closed this in 5372f81 May 14, 2022
vondele pushed a commit to vondele/Stockfish that referenced this pull request May 15, 2022
This patch makes the SE depth condition more robust and allows it to scale with completed depth
from a previous search.

At long TC this patch is almost equivalent to official-stockfish#4016 which had

VLTC:
https://tests.stockfishchess.org/tests/view/626abd7e8707aa698c0093a8
Elo: 2.35 +-1.5 (95%) LOS: 99.9%
Total: 40000 W: 10991 L: 10720 D: 18289
Ptnml(0-2): 8, 3534, 12648, 3799, 11
nElo: 5.47 +-3.4 (95%) PairsRatio: 1.08

VLTC multicore:
https://tests.stockfishchess.org/tests/view/6272a6afc8f14123163c1997
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 86808 W: 24165 L: 23814 D: 38829
Ptnml(0-2): 11, 7253, 28524, 7606, 10

however, it is now also gaining at LTC:

LTC:
https://tests.stockfishchess.org/tests/view/627e7cb523c0c72a05b651a9
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 27064 W: 7285 L: 7046 D: 12733
Ptnml(0-2): 8, 2446, 8390, 2675, 13

and should have nearly no influence at STC as depth 27 is rarely reached.
It was noticed that initializing the threshold with MAX_PLY, had an adverse effect,
possibly because the first move is sensitive to this.

closes official-stockfish#4021
closes official-stockfish#4016

Bench: 6481017
dav1312 pushed a commit to dav1312/Stockfish that referenced this pull request Oct 21, 2022
This patch makes the SE depth condition more robust and allows it to scale with completed depth
from a previous search.

At long TC this patch is almost equivalent to official-stockfish#4016 which had

VLTC:
https://tests.stockfishchess.org/tests/view/626abd7e8707aa698c0093a8
Elo: 2.35 +-1.5 (95%) LOS: 99.9%
Total: 40000 W: 10991 L: 10720 D: 18289
Ptnml(0-2): 8, 3534, 12648, 3799, 11
nElo: 5.47 +-3.4 (95%) PairsRatio: 1.08

VLTC multicore:
https://tests.stockfishchess.org/tests/view/6272a6afc8f14123163c1997
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 86808 W: 24165 L: 23814 D: 38829
Ptnml(0-2): 11, 7253, 28524, 7606, 10

however, it is now also gaining at LTC:

LTC:
https://tests.stockfishchess.org/tests/view/627e7cb523c0c72a05b651a9
LLR: 2.94 (-2.94,2.94) <0.50,3.00>
Total: 27064 W: 7285 L: 7046 D: 12733
Ptnml(0-2): 8, 2446, 8390, 2675, 13

and should have nearly no influence at STC as depth 27 is rarely reached.
It was noticed that initializing the threshold with MAX_PLY, had an adverse effect,
possibly because the first move is sensitive to this.

closes official-stockfish#4021
closes official-stockfish#4016

Bench: 6481017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants