Previous Work - PowerPoint PPT Presentation

1 / 11
About This Presentation
Title:

Previous Work

Description:

DDP provides local feedback controllers and exhibit '2nd-order convergence. ... 3-link Biped ankle, hip, shoulder. 15Ns Impulse. 0.01s timestep (Euler) N=50:25:150 ... – PowerPoint PPT presentation

Number of Views:24
Avg rating:3.0/5.0
Slides: 12
Provided by: scie5
Category:
Tags: biped | previous | work

less

Transcript and Presenter's Notes

Title: Previous Work


1
(No Transcript)
2
Previous Work
  • Hamilton-Jacobi-Bellman equation methods globally
    solve complex problems, but high computational
    cost
  • DDP provides local feedback controllers and
    exhibit 2nd-order convergence.
  • Iterative LQR iteratively linearizes dynamics and
    is more efficient than DDP
  • DDP and ILQR cannot handle constraints.
  • All methods assume deterministic dynamics

3
Features of Iterative LQG
  • Nonlinear Dynamics
  • Non-quadratic cost function
  • Control-dependent noise
  • Constraints
  • Un-biased nonadaptive linear filter
  • Computes optimal feedback control law
  • Computes optimal filter given control law

4
Overview
  • Nonlinear Dynamics
  • Linearized about nominal trajectory
  • Optimal Control Estimator

5
Computing Cost-to-go
  • Partially Observable
  • Fully Observable

6
Controller Design
  • Portion of Cost-to-go depending on control
  • Expectation over state trajectory
  • In absense of control constraints

7
Estimator Design
  • Given a control law (u1, u2,,uN) and previous
    filter gains, (K1,K2,KN), compute new filter
    gains that minimize estimated error results in
    a minimized expected cost.

8
Example ILQR (in MATLAB)
  • 3-link Biped ankle, hip, shoulder
  • 15Ns Impulse
  • 0.01s timestep (Euler)
  • N5025150
  • Q diag(1e1,1e1,1e1,1e0,1e0,1e0)
  • R diag(1e-2,1e-3,1e-3)
  • Qf 1e3Q
  • Rf 1e1R

9
(No Transcript)
10
(No Transcript)
11
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com