exit early when nan in grad or hessian #1046

Conversation

mohamed82008
Contributor

This PR checks that the gradient and Hessian are all finite; otherwise it exits early with a warning.
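
A minimal sketch in Julia of the kind of check being described; the function name and arguments are placeholders, not the PR's actual diff:

    # Hypothetical helper, not Optim's internals: `g` is the current gradient
    # and `H` the Hessian (or `nothing` if the method does not keep one).
    function gradient_hessian_finite(g, H)
        if !all(isfinite, g) || (H !== nothing && !all(isfinite, H))
            @warn "Terminating early: gradient or Hessian contains non-finite values."
            return false   # caller breaks out of the main iteration loop
        end
        return true
    end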

@pkofod
Member

pkofod commented Aug 10, 2023

I agree that this is a good idea because the current behavior will keep iterating until the iterations limit is reached. I am wondering if it would make sense to simply restart the Hessian approximation when the method is a quasi-Newton one instead of failing, but that's for another PR.
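
A rough Julia sketch of that alternative, assuming a BFGS-style method that keeps an inverse-Hessian approximation; the helper and its name are hypothetical, not Optim's internal API:

    using LinearAlgebra

    # Instead of exiting, reset the quasi-Newton inverse-Hessian approximation
    # to the identity whenever it has picked up non-finite entries.
    function maybe_reset!(invH::AbstractMatrix)
        if !all(isfinite, invH)
            n = size(invH, 1)
            invH .= Matrix{eltype(invH)}(I, n, n)
            @warn "Reset quasi-Newton approximation to the identity (non-finite entries)."
        end
        return invH
    end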

@pkofod
Member

pkofod commented Aug 11, 2023

Tests fail because TwiceDifferentiableHV does not have an H field, but you could check for NaNs in the Hessian-Vector product. I know it should be a different counter, but that can be another PR.

@mohamed82008
Contributor Author

I excluded TwiceDifferentiableHV from the check.

@codecov

codecov bot commented Aug 15, 2023

Codecov Report

Merging #1046 (8757ec9) into master (fd93033) will increase coverage by 0.02%.
The diff coverage is 100.00%.

❗ The current head 8757ec9 differs from the pull request's most recent head 141aa1c. Consider uploading reports for commit 141aa1c to get more accurate results.

@@            Coverage Diff             @@
##           master    #1046      +/-   ##
==========================================
+ Coverage   85.26%   85.29%   +0.02%     
==========================================
  Files          43       43              
  Lines        3210     3216       +6     
==========================================
+ Hits         2737     2743       +6     
  Misses        473      473              
Files Changed                            Coverage Δ
src/multivariate/optimize/optimize.jl    91.42% <100.00%> (+0.80%) ⬆️

@pkofod merged commit 2a44548 into JuliaNLSolvers:master on Aug 16, 2023
6 checks passed
@MilesCranmer
Contributor

Is there a way to hide this warning? PySR and SymbolicRegression.jl have started generating TONS of these warnings during the expression search (which is fine; they will just skip such an expression).

@MilesCranmer
Contributor

MilesCranmer commented Aug 17, 2023

Does this interfere with the behavior of backtracking line searches? If so, I think this is a breaking change and should be turned off by default, right? It seems to be interfering with PySR and SymbolicRegression.jl's optimization loop.

@MilesCranmer
Contributor

Yeah, I think this is a breaking change. See LineSearches.BackTracking: https://github.com/JuliaNLSolvers/LineSearches.jl/blob/ded667a80f47886c77d67e8890f6adb127679ab4/src/backtracking.jl#L64

    # Hard-coded backtrack until we find a finite function value
    iterfinite = 0
    while !isfinite(ϕx_1) && iterfinite < iterfinitemax
        iterfinite += 1
        α_1 = α_2
        α_2 = α_1/2

        ϕx_1 = ϕ(α_2)
    end

^ That is, it will backtrack when it hits a NaN, whereas this PR makes it exit when it hits a NaN, which is fundamentally different behavior.

Could you please roll this change back and make it optional? It has changed the algorithmic behavior of PySR and SymbolicRegression.jl.

@mohamed82008
Contributor Author

Can you share an example that is broken now but used to work?

@mohamed82008
Contributor Author

Gradient-based optimisers in Optim work as follows:

  1. Evaluate the objective, gradient and optionally Hessian at the current point x
  2. Find a search direction based on the gradient (and Hessian if available)
  3. Do a line search along the search direction and find the best solution on the line by objective value f(x)
  4. Move to the best solution on the line and go back to step 1

The check here is in step 1: it tests whether the evaluation produced a NaN gradient or Hessian. The current point in step 1 cannot have a NaN objective, because it is the output of the line search in step 3, and none of the line search algorithms will terminate at a point with a NaN objective if the initial point had a finite objective value; they can, however, terminate at a point with a NaN gradient and/or Hessian. If the gradient or Hessian in step 1 has NaNs, the search direction will have NaNs, and the line search will fail because every step along a NaN search direction leads to a NaN x and therefore a NaN objective value. The rest of the iterations would then all be spent failing line searches and doing unnecessary function evaluations.
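
A schematic of that loop in Julia, with the helpers passed in as functions (value_gradient, search_direction, and line_search are placeholders, not Optim's source), showing where the new check sits:

    # Schematic only: the three helpers are hypothetical callables.
    function schematic_descent(value_gradient, search_direction, line_search, x; max_iter = 1000)
        for _ in 1:max_iter
            f_x, g_x = value_gradient(x)   # step 1: objective and gradient at x
            if !all(isfinite, g_x)         # the check added by this PR
                @warn "Gradient contains NaN/Inf; stopping early."
                break
            end
            d = search_direction(g_x)      # step 2: e.g. -g_x, or -H \ g_x with a Hessian
            α = line_search(x, d)          # step 3: best step length by objective value
            x = x .+ α .* d                # step 4: move to the best point and repeat
        end
        return x
    end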

Given the above reasoning, I am curious what exactly this PR broke.

@MilesCranmer
Contributor

Hm, I wonder if it was due to the special handling of NaNs in my package; I treat NaNs as a signal. I will have a look this weekend and see what the issue is. I have pinned Optim to version 1.7.6 in SymbolicRegression.jl for now.

In any case, it would be nice to be able to turn the warnings off; is that possible?

@mohamed82008
Contributor Author

It should be possible to turn the warnings off with a flag, but that requires a new option in Optim, so the decision will be Patrick's. It is also possible to turn off warnings from your end as a user, as sketched below.
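
For the user-side route, one option (a sketch assuming Julia's standard Logging machinery, not an Optim feature) is to run the call under a logger that drops warnings:

    using Logging, Optim

    # Silence everything below Error level for this call only, so the
    # early-exit warnings are not printed.
    result = with_logger(ConsoleLogger(stderr, Logging.Error)) do
        optimize(x -> sum(abs2, x), randn(2), LBFGS())
    end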

@MilesCranmer
Contributor

MilesCranmer commented Aug 29, 2023

Thanks @mohamed82008. I made a PR (#1049) to turn off the warning messages when show_trace = false; this avoids the need for a new option by reusing the verbosity option that is already present.
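
Since show_trace is an existing Optim.Options field, the usage (assuming #1049's approach of tying the warning to it) would just be an ordinary options call:

    using Optim

    rosenbrock(x) = (1.0 - x[1])^2 + 100.0 * (x[2] - x[1]^2)^2

    # With #1049, show_trace = false (the default) would also suppress the
    # non-finite gradient/Hessian warning.
    result = optimize(rosenbrock, zeros(2), BFGS(), Optim.Options(show_trace = false))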

@pkofod
Member

pkofod commented Oct 7, 2023

"Yeah I think this is a breaking change."

I think that is false: this check happens after the loop that backtracks out of non-finite values.

@MilesCranmer
Contributor

Thanks. Could you please see #1049? That is blocking me from updating to the latest Optim.
