Add ScalarNonlinearFunction support #2059

odow · 2022-12-08T02:07:11Z

Overview

This PR is an implementation of the idea in #846, and is based on our work at the JuMP-level: jump-dev/JuMP.jl#3106.

Preview links

There are two key bits of documentation to read. The actual implementation is very straight forward:

https://jump.dev/MathOptInterface.jl/previews/PR2059/reference/standard_form/#MathOptInterface.ScalarNonlinearFunction

One complicating issue is how to deal with user-defined function. I've added a new attribute:

https://jump.dev/MathOptInterface.jl/previews/PR2059/reference/models/#MathOptInterface.UserDefinedFunction

I've also added ListOfSupportedNonlinearOperators <: AbstractModelAttribute and UnsupportedNonlinearOperator <: UnsupportedError

Solver support

Ipopt is adds support for ScalarNonlinearFunction in jump-dev/Ipopt.jl#346.

What to do with NLPBlock

There's an open question about what we do with the existing NLPBlock stuff. The easiest answer is just to leave it, and say that solvers should support both, although they can error if both appear in the same model.

It'll be a harder decision at the JuMP-level, but we can wait until this is merged to figure that out.

Blockers

There's nothing major blocking this from my end. It's actually pretty lightweight. The harder part is JuMP and documenting the two different nonlinear approaches. But this can be added without breaking anything because it is strictly a feature addition.

Needs:

Tests to improve code coverage

odow · 2022-12-13T07:45:25Z

Here's a current demo with the Ipopt PR:

julia> import Ipopt

julia> const MOI = Ipopt.MOI
MathOptInterface

julia> function main()
           model = Ipopt.Optimizer()
           x = MOI.add_variable(model)
           con_f = MOI.ScalarNonlinearFunction{Float64}(:^, Any[x, 2])
           MOI.add_constraint(model, con_f, MOI.LessThan(1.0))
           MOI.set(model, MOI.ObjectiveSense(), MOI.MIN_SENSE)
           my_f(x) = (x - 0.5)^2
           MOI.set(model, MOI.UserDefinedFunction(:my_f, 1), (my_f,))
           obj_f = MOI.ScalarNonlinearFunction{Float64}(:my_f, Any[x])
           MOI.set(model, MOI.ObjectiveFunction{typeof(obj_f)}(), obj_f)
           MOI.optimize!(model)
           return MOI.get(model, MOI.VariablePrimal(), x)
       end
main (generic function with 1 method)

julia> main()
This is Ipopt version 3.14.4, running with linear solver MUMPS 5.4.1.

Number of nonzeros in equality constraint Jacobian...:        0
Number of nonzeros in inequality constraint Jacobian.:        1
Number of nonzeros in Lagrangian Hessian.............:        2

Total number of variables............................:        1
                     variables with only lower bounds:        0
                variables with lower and upper bounds:        0
                     variables with only upper bounds:        0
Total number of equality constraints.................:        0
Total number of inequality constraints...............:        1
        inequality constraints with only lower bounds:        0
   inequality constraints with lower and upper bounds:        0
        inequality constraints with only upper bounds:        1

iter    objective    inf_pr   inf_du lg(mu)  ||d||  lg(rg) alpha_du alpha_pr  ls
   0  2.5000000e-01 0.00e+00 1.00e+00  -1.0 0.00e+00    -  0.00e+00 0.00e+00   0
   1  6.2500000e-02 0.00e+00 4.90e-01  -1.7 2.50e-01    -  1.00e+00 1.00e+00f  1
   2  1.1441994e-04 0.00e+00 1.74e-03  -1.7 2.39e-01    -  1.00e+00 1.00e+00h  1
   3  7.8304570e-06 0.00e+00 2.89e-04  -2.5 6.50e-02    -  1.00e+00 1.00e+00h  1
   4  1.5182438e-08 0.00e+00 2.74e-05  -3.8 2.72e-03    -  1.00e+00 1.00e+00h  1
   5  1.6255746e-12 0.00e+00 5.28e-08  -5.7 1.29e-04    -  1.00e+00 1.00e+00h  1
   6  2.8086636e-18 0.00e+00 6.35e-12  -8.6 1.29e-06    -  1.00e+00 1.00e+00h  1

Number of Iterations....: 6

                                   (scaled)                 (unscaled)
Objective...............:   2.8086636097379302e-18    2.8086636097379302e-18
Dual infeasibility......:   6.3507046478545526e-12    6.3507046478545526e-12
Constraint violation....:   0.0000000000000000e+00    0.0000000000000000e+00
Variable bound violation:   0.0000000000000000e+00    0.0000000000000000e+00
Complementarity.........:   2.5091160166026672e-09    2.5091160166026672e-09
Overall NLP error.......:   2.5091160166026672e-09    2.5091160166026672e-09


Number of objective function evaluations             = 7
Number of objective gradient evaluations             = 7
Number of equality constraint evaluations            = 0
Number of inequality constraint evaluations          = 7
Number of equality constraint Jacobian evaluations   = 0
Number of inequality constraint Jacobian evaluations = 7
Number of Lagrangian Hessian evaluations             = 6
Total seconds in IPOPT                               = 0.045

EXIT: Optimal Solution Found.
0.4999999983240932

Things are plumbed end-to-end, including user-defined functions.

I'm still undecided on how to implement nonlinear parameters and expressions.

dourouc05

Only minor comments in the code. I'm wondering what the API would look like to rewrite such functions using bridges, for instance for CP. (Or this issue has been discussed elsewhere and I'm not aware of it.)

src/Nonlinear/parse.jl

src/attributes.jl

odow · 2022-12-14T21:52:01Z

I'm wondering what the API would look like to rewrite such functions using bridges, for instance for CP.

I think, instead of a bridge, it'd likely be a transformation layer, like Dualization.Optimizer.

We don't really have tools for bridging based on the content of a constraint, only on the type, so you'd likely need to be able to see the entire problem in order to construct a valid transformation.

blegat · 2022-12-15T10:35:39Z

We don't really have tools for bridging based on the content of a constraint, only on the type, so you'd likely need to be able to see the entire problem in order to construct a valid transformation.

I don't think it's about content vs type. If a transformation can be done independently for each constraint then it can be done by a bridge.

For the non-breaking path, I don't need we need to be non-breaking for the solvers. Once we have the NL API set up, we can drop support for NLPBlock in Ipopt and tag Ipopt v2.0. What we need is to continue supporting solvers requiring NLPBlock from MOI until MOI v2.0.
One way we could do that is to have an MOI layer that transforms ScalarNonlinearFunction into NLPBlock.
In MOI.instantiate, if supports(::Optimizer, ::NLPBlock) is true then we add this layer.

I still think transforming from NL expression to gradient/hessian oracle dealing with AD, UserDefinedFunction, etc... is best done by LazyBridgeOptimizer instead of each NL solver as detailed in #846. Do you still agree with this long term goal ?

odow · 2022-12-15T20:48:52Z

Do you still agree with this long term goal ?

No. Computing derivatives really needs the entirety of the problem in one go, whereas the bridges work on a constraint-by-constraint basis. Dealing with the derivatives is not complicated (look how small the Ipopt PR is jump-dev/Ipopt.jl#346) because we've abstracted all of the helpers into MOI.Nonlinear.Model.

blegat · 2022-12-16T10:09:46Z

Computing derivatives really needs the entirety of the problem in one go, whereas the bridges work on a constraint-by-constraint basis.

Why computing derivatives cannot be done on a constraint-by-constraint basis ?

odow · 2022-12-16T21:15:25Z

Why computing derivatives cannot be done on a constraint-by-constraint basis ?

Because that's a muuuuuch larger change to our AD engines. It also makes it a lot more complicated to deal with sparsity etc. We don't really want to have to deal with 10,000 constraints, each with 10,000 instances of an AD tape. JuMP's reverse diff amortizes a lot of allocations over the constraints. It also makes it hard to have shared common subexpressions.

mlubin · 2022-12-16T23:33:28Z

Shared sub-expressions is a good reason. Sparsity of the hessian of the lagrangian is another one, although this could probably be refactored if we really wanted to.

blegat · 2022-12-17T13:15:33Z

Could an MOI layer that does the transformation from MOI.ScalarNonlinearFunction to MOI.NLPBlock be a good first step ? I would be needed for retro-compatibility anyway and we can see if it can produce oracles as MOI.AbstractScalarFunction later and if it can be done as bridged even later

frapac · 2023-01-11T20:45:35Z

we can see if it can produce oracles as MOI.AbstractScalarFunction later and if it can be done as bridged even later

To me, the question is: what would be the use case for this?

Usually, the nonlinear solver requires only access to the full Jacobian and to the full Hessian, discarding the individual constraints. Seeing all the constraints as a single multivalued function has also its advantages, such as:

we can evaluate the full Jacobian J efficiently using forward-mode AD to evaluate p vector products J v, with p a number of colors associated to the sparsity pattern of the Jacobian. If we look at the individual constraints, evaluating the full Jacobian would require m evaluations of reverse-mode AD to evaluate the gradients of the m constraints one by one.
we can evaluate the Hessian of the Lagrangian efficiently using forward-over-reverse AD, using the same coloring trick to compress the number of Hessian-vector product required.

odow · 2023-01-24T00:22:07Z

We should check and confirm support for complex-valued nonlinear functions (polynomials of complex variables)

odow · 2023-02-01T00:09:44Z

Here's a proof of concept of PATHSolver solving a nonlinear complementarity problem with VectorNonlinearFunction-in-Complements: chkwon/PATHSolver.jl#82

Still an open question as to what the proper definition is. We can also revert the commit and start it in a separate PR if we decide to merge ScalarNonlinearFunction first.

src/functions.jl

chriscoey · 2023-02-01T00:41:40Z

Still an open question as to what the proper definition is.

For VNF here you mean? This seems like the obvious approach to me - what alternatives are there?

odow · 2023-02-01T00:46:15Z

what alternatives are there

Some sort of vector-valued expression tree? Not sure. You could imagine A * x being VectorNonlinearExpr(:*, Any[A, x]).

odow · 2023-02-15T23:52:23Z

#2094 is a resolution to the "how to we deal with parameters" question. Ipopt would declare support for MOI.Parameter, and then it could add parameters to the MOI.Nonlinear.Model object, and replace any parameter variables in the expression with the Nonlinear.ParameterIndex. It doesn't resolve the nonlinear subexpression question, but perhaps that's for the solver or MOI.Nonlinear.Model to deal with.

odow · 2023-02-19T23:03:52Z

Parameter is added, so I'll have a go at updating this PR to use it.

Co-authored-by: Chris Coey <[email protected]>

mlubin

LGTM to merge.

In terms of breaking this in the future, I think we should do our best to stick to semver. So let's test this out and see what needs to change. Then MOI 2.0 can include any breaking changes to this API, plus deleting the old NLPBlockData interface?

odow · 2023-05-28T22:49:32Z

So let's test this out and see what needs to change. Then MOI 2.0 can include any breaking changes to this API, plus deleting the old NLPBlockData interface?

Sounds good. The actual interface is pretty minimal, so I think there are probably a bunch of missing methods and things that we'll need to add, rather than breaking changes.

odow · 2023-05-29T01:57:42Z

I'll merge now, but hold off from tagging for a couple of days to see if anything else pops up once I update other other packages to MathOptInterface#master.

odow added the Project: next-gen nonlinear support Issues relating to nonlinear support label Dec 8, 2022

odow mentioned this pull request Dec 8, 2022

Add support for MOI.ScalarNonlinearFunction jump-dev/Ipopt.jl#346

Merged

dourouc05 reviewed Dec 14, 2022

View reviewed changes

src/Nonlinear/parse.jl Outdated Show resolved Hide resolved

src/attributes.jl Outdated Show resolved Hide resolved

odow marked this pull request as draft January 11, 2023 19:53

chriscoey mentioned this pull request Jan 15, 2023

experiment with converting quadratic functions to nonlinear, to simplify/streamline code jump-dev/Pavito.jl#67

Open

odow force-pushed the od/nlp-expr branch from af40d84 to 69eae72 Compare January 30, 2023 22:27

odow changed the title ~~WIP: explore ScalarNonlinearFunction support~~ Add ScalarNonlinearFunction support Jan 30, 2023

odow marked this pull request as ready for review January 31, 2023 21:06

odow mentioned this pull request Feb 1, 2023

Add support for VectorNonlinearFunction chkwon/PATHSolver.jl#82

Merged

1 task

chriscoey reviewed Feb 1, 2023

View reviewed changes

src/functions.jl Outdated Show resolved Hide resolved

odow force-pushed the od/nlp-expr branch from f5fdc67 to 1f3ca0f Compare February 2, 2023 21:39

odow changed the title ~~Add ScalarNonlinearFunction support~~ Add {Vector,Scalar}NonlinearFunction support Feb 2, 2023

This was referenced Feb 13, 2023

[Bridges] Add FixParametricVariablesBridge #2092

Closed

Add Parameter{T} <: AbstractScalarSet #2094

Closed

odow force-pushed the od/nlp-expr branch from b6a4030 to 1a2ed78 Compare February 15, 2023 23:46

odow and others added 23 commits May 29, 2023 09:33

More tweaks

c0b1632

Add more tests

4e6a810

Fix formatting

5708f06

Fix docs

7912fec

Drop type parameter in ScalarNonlinearFunction

5d2a5ca

Update

b643b1a

More code coverage

4e60dc2

Fix tests

7cc4c75

Test ListOfSupportedNonlinearOperators

4009904

More docstrings

700895f

Apply suggestions from code review

fce34aa

Co-authored-by: Chris Coey <[email protected]>

Fix printing

ba54d06

Updates

283f74a

Fix formatting

3908638

Fix typo

3b2c61f

ListOfSupportedNonlinearOperators is optimizer attribute

4f22a7b

More parsing of ScalarNonlinearFunction

67a9034

Fix changing objective to nonlinear

59c68ba

Update

e3b3c39

Update

f080282

Update

098c85d

Add another promote_operation method

9b227af

Add more docs for UserDefinedFunction

95adb08

odow force-pushed the od/nlp-expr branch from 6df73da to 95adb08 Compare May 28, 2023 21:38

mlubin approved these changes May 28, 2023

View reviewed changes

odow merged commit a189a89 into master May 29, 2023

odow deleted the od/nlp-expr branch May 29, 2023 01:58

odow mentioned this pull request Jun 9, 2023

Add VectorNonlinearFunction #2201

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ScalarNonlinearFunction support #2059

Add ScalarNonlinearFunction support #2059

odow commented Dec 8, 2022 •

edited

Loading

odow commented Dec 13, 2022

dourouc05 left a comment

odow commented Dec 14, 2022

blegat commented Dec 15, 2022

odow commented Dec 15, 2022

blegat commented Dec 16, 2022

odow commented Dec 16, 2022

mlubin commented Dec 16, 2022

blegat commented Dec 17, 2022

frapac commented Jan 11, 2023

odow commented Jan 24, 2023

odow commented Feb 1, 2023

chriscoey commented Feb 1, 2023

odow commented Feb 1, 2023

odow commented Feb 15, 2023

odow commented Feb 19, 2023

mlubin left a comment

odow commented May 28, 2023

odow commented May 29, 2023

Add ScalarNonlinearFunction support #2059

Add ScalarNonlinearFunction support #2059

Conversation

odow commented Dec 8, 2022 • edited Loading

Overview

Preview links

Solver support

What to do with NLPBlock

Blockers

odow commented Dec 13, 2022

dourouc05 left a comment

Choose a reason for hiding this comment

odow commented Dec 14, 2022

blegat commented Dec 15, 2022

odow commented Dec 15, 2022

blegat commented Dec 16, 2022

odow commented Dec 16, 2022

mlubin commented Dec 16, 2022

blegat commented Dec 17, 2022

frapac commented Jan 11, 2023

odow commented Jan 24, 2023

odow commented Feb 1, 2023

chriscoey commented Feb 1, 2023

odow commented Feb 1, 2023

odow commented Feb 15, 2023

odow commented Feb 19, 2023

mlubin left a comment

Choose a reason for hiding this comment

odow commented May 28, 2023

odow commented May 29, 2023

odow commented Dec 8, 2022 •

edited

Loading