You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Whether we should use A for trajectory optimization has been a long-standing mystery, ever since Terry discovered that setting the q_u block of A to identity and the rest of A to 0 leads to better convergence for iRS-MPC. We looked at the problem again from a calculus perspective.
Background
It is assumed that the system consists of an un-actuated part, q_u, and an actuated part, q_a. There is a large part of the configuration space that results in inter-penetration (configuration space obstacles). The standard notation is C_obs vs C_free.
In order to change q_u, contact between the robot and the object is necessary. The effect of this simple observation on bundled dynamics is that, in order to get informative samples to estimate the B that is useful for planning, the samples need to live on the boundary between C_obs vs C_free.
What we think the problem is.
Traditional non-linear control assumes that the dynamics function,
x_{t+1} = f(x_t, u_t),
has a valid linearization in a small neighborhood of a nominal state, i.e.
x_{t+1} = A * x_t + B * u_t,
which can then be used for synthesizing linear controllers.
But this is not true for x that's in contact as it lives on the boundary of C_free. Furthermore, derivatives on the boundary of the domain of the dynamics function is not well-defined. One-sided derivatives (and partial derivatives?) can be defined in C_free, and might be what differentiable simulators actually compute. But when A and B are used in MPC, the C_free constraint is dropped, and the optimizer may take the system into C_obs in order to minimize cost.
How to test our hypotheses
Correctly implement the chain rule for computing A in QuasistaticSimulator. Right now it's missing the term on the derivatives w.r.t. contact points and normals.
Work through the differentiable simulator implementation line by line on a simple 1D, 2-cart system to confirm/disprove the theory that the auto-diffed derivative is some form of one-sided derivative.
In iRS-MPC, after each iteration, observe how much collision there is in the x trajectory solved by the optimizer.
Test whether adding to iRS-MPC the non-penetration constraints which appear in the quasistatic dynamics QP improves convergence.
The text was updated successfully, but these errors were encountered:
Whether we should use
A
for trajectory optimization has been a long-standing mystery, ever since Terry discovered that setting theq_u
block of A to identity and the rest ofA
to 0 leads to better convergence for iRS-MPC. We looked at the problem again from a calculus perspective.Background
It is assumed that the system consists of an un-actuated part,
q_u
, and an actuated part,q_a
. There is a large part of the configuration space that results in inter-penetration (configuration space obstacles). The standard notation isC_obs
vsC_free
.In order to change
q_u
, contact between the robot and the object is necessary. The effect of this simple observation on bundled dynamics is that, in order to get informative samples to estimate theB
that is useful for planning, the samples need to live on the boundary betweenC_obs
vsC_free
.What we think the problem is.
Traditional non-linear control assumes that the dynamics function,
has a valid linearization in a small neighborhood of a nominal state, i.e.
which can then be used for synthesizing linear controllers.
But this is not true for
x
that's in contact as it lives on the boundary ofC_free
. Furthermore, derivatives on the boundary of the domain of the dynamics function is not well-defined. One-sided derivatives (and partial derivatives?) can be defined inC_free
, and might be what differentiable simulators actually compute. But when A and B are used in MPC, theC_free
constraint is dropped, and the optimizer may take the system intoC_obs
in order to minimize cost.How to test our hypotheses
QuasistaticSimulator
. Right now it's missing the term on the derivatives w.r.t. contact points and normals.x
trajectory solved by the optimizer.The text was updated successfully, but these errors were encountered: