Rosenbrock function for testing Nonlinear Conjugate Gradient Method #1879

varunagrawal · 2024-10-17T17:37:09Z

I implemented the Rosenbrock function as a 1D factor graph to test the correctness and efficiency of NonlinearConjugateGradientOptimizer.

The good news is that it converges to the correct value $(x, y) = (a, a^2)$. The bad news is that it takes many iterations if the initial estimate is not close enough.
For example, given a=12 and x=3, y=5, it takes over 300 iterations to converge to the correct solution, which seems like a lot for Conjugate Gradient in the 1D case.
I'll test this against simple steepest descent to see how many iterations that takes so we can compare the two.
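For reference, here is a minimal standalone sketch of the objective being discussed, assuming the standard form $f(x, y) = (x - a)^2 + b(x^2 - y)^2$. The names `rosenbrock`, `Gradient` and `rosenbrockGradient` are hypothetical helpers for illustration and are not part of GTSAM or this PR.

```cpp
// Rosenbrock objective f(x, y) = (x - a)^2 + b * (x^2 - y)^2.
// Hypothetical helper for illustration only, not the GTSAM factor code.
double rosenbrock(double x, double y, double a, double b) {
  const double r1 = x - a;      // residual of the first term
  const double r2 = x * x - y;  // residual of the second term
  return r1 * r1 + b * r2 * r2;
}

// Analytic gradient of the objective above.
struct Gradient {
  double dx;
  double dy;
};

Gradient rosenbrockGradient(double x, double y, double a, double b) {
  const double r1 = x - a;
  const double r2 = x * x - y;
  // df/dx = 2 (x - a) + 4 b x (x^2 - y),  df/dy = -2 b (x^2 - y)
  return {2.0 * r1 + 4.0 * b * x * r2, -2.0 * b * r2};
}
```

Both the value and the gradient vanish at $(a, a^2)$, which is the minimum the optimizer is expected to reach. With b=100 (an assumed value, as used in the tests), the starting point a=12, x=3, y=5 has a cost of 1681 and a gradient of (4782, -800), so the first steps are dominated by the steep valley walls.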

I think this can be significantly improved by implementing additional line search methods with theoretical bounds, such as those based on the Wolfe conditions. The Golden Section Search that is currently performed is robust but slow. @mehregandor and I can analyze this further to improve the convergence rates.
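To make the trade-off concrete, here is a generic textbook sketch of both ideas on a 1D function $\phi(\alpha)$ along the search direction. This is not the code in `NonlinearConjugateGradientOptimizer`; the function names, the tolerance, and the constants `c1` and `c2` are assumptions chosen for illustration.

```cpp
#include <cmath>
#include <functional>

// Golden section search for the minimizer of a unimodal phi on [lo, hi].
// Each iteration shrinks the bracket by a fixed factor of about 0.618 and
// costs one new function evaluation, so convergence is only linear.
double goldenSectionSearch(const std::function<double(double)>& phi,
                           double lo, double hi, double tol = 1e-6) {
  const double invPhi = (std::sqrt(5.0) - 1.0) / 2.0;  // 1 / golden ratio
  double c = hi - invPhi * (hi - lo);
  double d = lo + invPhi * (hi - lo);
  double fc = phi(c), fd = phi(d);
  while (hi - lo > tol) {
    if (fc < fd) {
      // Minimum lies in [lo, d]; the old c becomes the new d.
      hi = d;
      d = c;
      fd = fc;
      c = hi - invPhi * (hi - lo);
      fc = phi(c);
    } else {
      // Minimum lies in [c, hi]; the old d becomes the new c.
      lo = c;
      c = d;
      fc = fd;
      d = lo + invPhi * (hi - lo);
      fd = phi(d);
    }
  }
  return 0.5 * (lo + hi);
}

// Strong Wolfe test for a candidate step alpha, given phi(0), phi'(0),
// phi(alpha) and phi'(alpha). c1 and c2 are assumed typical values.
bool satisfiesStrongWolfe(double phi0, double dphi0, double phiAlpha,
                          double dphiAlpha, double alpha,
                          double c1 = 1e-4, double c2 = 0.1) {
  const bool sufficientDecrease = phiAlpha <= phi0 + c1 * alpha * dphi0;
  const bool curvature = std::fabs(dphiAlpha) <= c2 * std::fabs(dphi0);
  return sufficientDecrease && curvature;
}
```

A Wolfe-based line search can accept the first step that passes `satisfiesStrongWolfe` instead of shrinking the bracket all the way down to `tol`, which is where the savings in function evaluations would come from.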

mehregandor

I think there's an issue with the full Rosenbrock function formulation.

mehregandor · 2024-10-17T20:48:21Z

gtsam/nonlinear/tests/testNonlinearConjugateGradientOptimizer.cpp

+  using namespace rosenbrock;
+  double a = 1.0, b = 100.0;
+  Rosenbrock1Factor f1(X(0), a, noiseModel::Unit::Create(1));
+  Rosenbrock2Factor f2(X(0), Y(0), noiseModel::Isotropic::Sigma(1, b));


The way this graph is currently constructed, it seems to me that the full objective function becomes $$f(x,y) = 2(x-a)^2 + 2\frac{1}{b^2}(x^2-y)^2$$. Instead, the second term should be $$b(x^2-y)^2$$, so the noise model sigma should be $$\frac{1}{\sqrt{b}}$$, no? Also, why do we need the $$\sqrt{2}$$? Is it because the full cost is computed as $$\frac{1}{2}f_1^\top \Sigma_1^{-1} f_1 + \frac{1}{2}f_2^\top \Sigma_2^{-1} f_2$$? If so, consider taking the $$\sqrt{2}$$ out of the error and putting it into the covariance directly. Thanks for clarifying.

Yes, this one is wrong. The correct definition is in GetRosenbrockGraph. That's a good suggestion to put the 2 in the covariance/precision.

Please do note that GTSAM error is defined as 0.5*r^2, so please be consistent with that. No point in having 2 definitions of the objective function.

Yup, I updated the tests to always use the same definition.

JD-Florez · 2024-10-18T01:44:22Z

gtsam/nonlinear/tests/testNonlinearConjugateGradientOptimizer.cpp

+  graph.emplace_shared<Rosenbrock1Factor>(
+      X(0), a, noiseModel::Isotropic::Precision(1, 2));
+  graph.emplace_shared<Rosenbrock2Factor>(
+      X(0), Y(0), noiseModel::Isotropic::Precision(1, 2 * b));


Maybe I am missing something, why is b scaled here?

The error function for factors is $\frac{1}{2}e^2$, so with a precision of $\Sigma^{-1} = 2b$ we have $e^2 = (x^2 - y)\Sigma^{-1} (x^2 - y) = (x^2 - y)2b (x^2 - y) = 2b(x^2 - y)^2$, and the second term becomes $b(x^2 - y)^2$.
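The same arithmetic can be checked numerically without GTSAM. The sketch below assumes scalar residuals and the $\frac{1}{2} r \Sigma^{-1} r$ cost convention mentioned in this thread. `factorError` and `rosenbrockGraphError` are hypothetical names, not GTSAM functions.

```cpp
// Cost of a single scalar factor under the 0.5 * r * precision * r convention.
double factorError(double residual, double precision) {
  return 0.5 * precision * residual * residual;
}

// Total cost of the two-factor graph with precisions 2 and 2 * b,
// mirroring the Isotropic::Precision(1, 2) and Precision(1, 2 * b) models.
double rosenbrockGraphError(double x, double y, double a, double b) {
  const double e1 = factorError(x - a, 2.0);          // = (x - a)^2
  const double e2 = factorError(x * x - y, 2.0 * b);  // = b (x^2 - y)^2
  return e1 + e2;
}
```

For a=12, b=100, x=3, y=5 this gives 81 + 1600 = 1681, which matches $(x - a)^2 + b(x^2 - y)^2$ evaluated directly, so the factor of 2 in each precision exactly cancels the $\frac{1}{2}$ in the cost.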

varunagrawal · 2024-10-19T14:53:48Z

I re-ran the test with a=2, b=100, x=1.0, y=1.0 and it converges in 13 steps.
Similarly, for a=12, b=100 and x=10.0, y=135.0, it takes about 23 steps, so the optimization seems to be highly dependent on how close the initial estimate is to the final solution.

mehregandor

Looks good to me.

varunagrawal added 4 commits October 17, 2024 11:10

Rosenbrock function

872eae9

lots of small improvements

591e1ee

Rosenbrock function implemented as factor graph

83e3fda

test optimization with Rosenbrock function

8a0bbe9

varunagrawal requested review from dellaert, JD-Florez and mehregandor October 17, 2024 17:37

varunagrawal self-assigned this Oct 17, 2024

minor improvements

631fa2b

mehregandor requested changes Oct 17, 2024

varunagrawal added 4 commits October 17, 2024 18:27

fix test

58ae5c6

move 2 to the Precision matrix and check if error is correct

b5b5e15

fix jacobians

6d6d392

remove sqrt definition

893e69d

varunagrawal requested a review from mehregandor October 17, 2024 23:16

JD-Florez reviewed Oct 18, 2024

Merge branch 'cg-methods' into rosenbrock

3c2ddc8

Base automatically changed from cg-methods to develop October 20, 2024 20:44

mehregandor approved these changes Oct 20, 2024

varunagrawal commented Oct 17, 2024

mehregandor left a comment

mehregandor Oct 17, 2024

varunagrawal Oct 17, 2024

varunagrawal Oct 17, 2024

dellaert Oct 18, 2024

varunagrawal Oct 18, 2024

JD-Florez Oct 18, 2024

varunagrawal Oct 18, 2024

varunagrawal commented Oct 19, 2024

mehregandor left a comment
