[02] Feedback Motion Planning using LQR-Trees

[02] Feedback Motion Planning using LQR-Trees

  1. Reference papers

    1. Russ Tedrake, Ian R. Manchester, Mark M. Tobenkin, and John W. Roberts. LQR-Trees: Feedback motion planning via sums of squares verification. International Journal of Robotics Research, 29:1038-1052, July 2010

    2. Philipp Reist, Pascal V. Preiswerk, and Russ Tedrake. Feedback-motion-planning with simulation-based lqr-treesTo appear in the International Journal of Robotics Research, 2016.

      2. Implementation Strategies

    1. Implementation using MATLAB Toolbox provided Russ Tedrake Lab
    2. Presenting results of LQR-Tree in case of very simple system (1DOF inverted Pendulum model)
    3. Programming LQR-Trees in Python or other language
    4. Applying LQR-Trees to control mobile robot based on the trust model

      3. Toolbox provided by Russ Tedrak Lab

 a. MATLAB

       4.  1DOF inverted Pendulum Model

a. System description

 

 

 

 

 

 

 

 

 

b. The system dynamics

c. The state equation 

  • msspoly deon't support sin and cos function so that we approximate the sin value using Taylor series 

5. Conditions for simulation

a. m = 1.0 kg, l=0.5 m, b=0.1 Nms, g= 9.81 m/s^2 

b. u = -0.001*t*(t - 5) *(t + 5)

c. t= [0, 5]

d x0 = [0, 0]

6. Simulation Results under different conditions of R, Q and inputs

7. Issues for Programming

a. In MATLAB Toolbox, SPOT does't provide 'sin' for input arguments of type 'msspoly'. -> replace sin(th) approx (th)

b. In Python, there are some issues of CVXPY in the process of implementation of J*

 

8. Implementations in detail

 a. Approximation of the state equation 

b. The linearized the system equation becomes

 

 

 

 

 

 

 

c. The LQR Problem of Cost-to-go function

    and Optimal feedback policy is 

In case of initial conditions are x0=[pi ; 0], u0 =0  and the final goal state is xG=[0; 0], we can get results of S, K,

d.Time-Invariant LQR Verification

: The time derivative of Lyapunov function becomes 

and then, we should search for h which must be of sufficiently large order to counteract higher order terms in \dot J*

In this problem, the system dynamics is not polynomial, we can approximate the condition using Taylor expansion. In the paper, 2nd order expansion was applied but we utilized 3rd order in this example.

 

 

As a result, We can obtain the figure which is exactly same to the Figure 3. in the Paper. (The Region of Attraction: ROA)

 

5. Our Application Scenario ( Time-varying Motion Planning considering Pedestrians)

: we are going to apply this LQR-tree to safety guaranteed robot navigation. Our work has very similar purpose to that of  "Nonlinear Controller Synthesis and Automatic Workspace Partitioning for Reactive High-Level Behaviors.

We want to modify ROA in terms of trust zone for pedestrians, and then, to make a shooting method for stable motion planning under dynamic environment where the pedestrians are moving.