responsiveness and manipulability of formation of...

17
Responsiveness and Manipulability of Formation of Multi-Robot Networks Hiroaki Kawashima *1 , Guangwei Zhu *2 , Jianghai Hu *2 , Magnus Egerstedt *3 *1 Kyoto University *2 Purdue University *3 Georgia Institute of Technology CDC2012 Session: Network Identification and Analysis 1

Upload: hatuong

Post on 21-Mar-2018

217 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Responsiveness and Manipulability of Formation of Multi-Robot Networks

Hiroaki Kawashima*1, Guangwei Zhu*2, Jianghai Hu*2, Magnus Egerstedt*3

*1 Kyoto University *2 Purdue University *3 Georgia Institute of Technology

CDC2012

Session: Network Identification and Analysis

1

Page 2: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Energ

y

𝑥𝑖 − 𝑥𝑗

Distance-Based Formation Control

• Interaction rule for follower agent i :

2

: pair-wise edge-tension energy weighted consensus protocol

( depends on )

: state (position)

Underlying graph

𝑑𝑖𝑗 W

eig

ht

𝑥𝑖 − 𝑥𝑗

: desired distance

between i and j

ℰ𝑖𝑗 𝑥𝑖 − 𝑥𝑗 = 0 if 𝑥𝑖 − 𝑥𝑗 = 𝑑𝑖𝑗

𝑥 𝑖 𝑡 = − 𝜕ℰ𝑖𝑗 𝑥𝑖 − 𝑥𝑗

𝜕𝑥𝑖

𝑇

𝑗∈𝒩 𝑖

= − 𝑤𝑖𝑗 𝑥𝑖 − 𝑥𝑗 (𝑥𝑖 − 𝑥𝑗)

𝑗∈𝒩 𝑖

Page 3: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Motivation

• Assume some agents (leaders) can be arbitrarily controlled – Remaining agents (followers) obey the original interaction rule

– Leaders’ motion is considered as the inputs to the network

3

How the network can be adaptively changed to

maximize the effectiveness of leader’s inputs?

How can we measure the impact of leader’s input to the network?

(𝑁𝑓 followers and 𝑁ℓ leaders)

ℰ𝑖𝑗 = 0 if i, j ∉ 𝐄

𝑥𝑓 ∈ ℝ𝑁𝑓𝑑

𝑥ℓ ∈ ℝ𝑁ℓ𝑑

ℰ 𝑥(𝑡) =1

2 ℰ𝑖𝑗 𝑥𝑖 − 𝑥𝑗

𝑁

𝑗=1

𝑁

𝑖=1

𝑥 𝑓 𝑡 = −

𝜕ℰ 𝑥𝑓, 𝑥ℓ

𝜕𝑥𝑓

𝑇

𝑥 ℓ 𝑡 = 𝑢(𝑡)

Page 4: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Evaluate Leader’s Influence on the Network

• Reachability/controllability [Rahmani,Mesbahi&Egerstedt 2009]

– Global point-to point property (arbitrary configurations)

– Long-term index

• Instantaneous network response

– Local property (depends on a particular configuration)

– Short-term index

4

The instantaneous index can be more useful under a dynamic situation (e.g., change the topology adaptively to maximize the leader’s effect)

Manipulability index of leader-follower networks [Kawashima&Egerstedt, CDC2011]

Responsiveness

Page 5: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

r

Robot-arm manipulability [Yoshikawa 1985, Bicchi, et al. 2000]

Leader-follower manipulability

Kinematic relation

Velocity of end-effector is directly connected with the angular velocity

leaders’ vel.

followers’ vel.

angular velocity of joints

end-effector velocity

We need integral action w.r.t. time to get the followers’ response

5

𝑟 𝑇𝑊𝑟𝑟

𝜃 𝑇𝑊𝜃𝜃

𝑚 =𝑥 𝑓

𝑇𝑄𝑓𝑥 𝑓

𝑥 ℓ𝑇𝑄ℓ𝑥 ℓ

𝑟 = 𝑓 𝜃 , 𝑟 =𝜕𝑓

𝜕𝜃 𝜃

𝜃

𝑟 : states of end-effector 𝜃 : joint angles 𝑊𝑟 ,𝑊𝜃 ≻ 0 : weight matrices

Can we get more direct connection between 𝑥ℓ 𝑡 and 𝑥𝑓 𝑡 ?

𝑥𝑓 (𝑡) = −𝜕ℰ 𝑥𝑓, 𝑥ℓ

𝜕𝑥𝑓

𝑇

𝑥ℓ 𝑡 = 𝑢 𝑡 : given

Dynamics of agents

Page 6: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Approximation of the Dynamics

Def: Rigid-link approximation [Kawashima&Egerstedt CDC2011]

– Followers are fast enough to perfectly maintain the desired distance 𝑑𝑖𝑗 ∀ 𝑖, 𝑗 ∈ 𝐄 ∀𝑡

Constraint on each

connected agent pair

6

∀ 𝑖, 𝑗 ∈ 𝐄

(𝐄: edge set)

∀ 𝑖, 𝑗 ∈ 𝐄

• Rewrite with the rigidity matrix [B.Roth 1981]

• Under this approximation,

Page 7: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Manipulability under the Rigid-link Approximation

• Theorem [Kawashima&Egerstedt CDC2011]

Approximate manipulability Manipulability of leader-

follower networks

Given the rigidity matrix

7

𝑚 can evaluate the instantaneous effectiveness leader’s input 𝑥ℓ by considering current config.(𝑥) and network topology (𝐄).

– Provides us the direct connection between 𝑥ℓ 𝑡 and 𝑥𝑓 𝑡

𝑅 𝑥 = 𝑅𝑓 𝑥 𝑅ℓ(𝑥)

𝑁𝑓𝑑

𝑥𝑓 𝑡 = −𝜕ℰ 𝑥

𝜕𝑥𝑓

𝑇

𝑥𝑓 𝑡 = −𝑅𝑓†𝑅ℓ 𝑥ℓ 𝑡 𝐽 𝑥, 𝐸 ≜ −𝑅𝑓

†𝑅ℓ

𝑁ℓ𝑑

𝑚 =𝑥𝑓

𝑇𝑄𝑓𝑥𝑓

𝑥ℓ 𝑇𝑄ℓ𝑥ℓ

𝑚 (𝑥ℓ , 𝑥, 𝐄) =𝑥ℓ

𝑇𝐽𝑇𝑄𝑓𝐽𝑥ℓ

𝑥ℓ 𝑇𝑄ℓ𝑥ℓ

Page 8: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Application Example: Effective Topology

• Given leader’s input direction, what is the most effective network topology in terms of maximizing the manipulability?

8

Topology optimization argmax𝐄

𝑚 (𝑥ℓ , 𝑥, 𝐄) =𝑥ℓ

𝑇𝐽𝑇𝑄𝑓𝐽𝑥ℓ

𝑥ℓ 𝑇𝑄ℓ𝑥ℓ

(The number of edges,

𝐄 , is constrained)

After t=2

After t=2

Page 9: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Application Example: Online Leader Selection

9

Online leader selection

• To move the network toward a target point, which is the most effective leader

[Kawashima&Egerstedt, ACC2012]

𝑖ℓ 𝑡 = argmax𝑖

𝑚 𝑒(𝑖, 𝑥 𝑡 , 𝐄)

Page 10: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Limitation of Manipulability

1. Manipulability cannot compare “rigid formations” with the same agent configurations.

2. Manipulability cannot deal with “edge weights” (i.e., gains of the pair-wised edge-tension energies).

10

Why? In the rigid-link approximation, we only considered the convergence point of the followers when the leader is moved.

𝑚 = 3.0

𝑚 = 3.0

𝑚 = 1.3

How can we overcome these limitations?

Page 11: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Stiffness and Rigidity Indices

• Stiffness matrix is defined based on the spring-mass analogy, and it coincides with the Hessian of the edge-tension energy ℰ, in the formation-control context:

• Rigidity indices are defined by the eigenvalues of the Hessian

– Worst-case rigidity index (WRI): smallest eigenvalue of 𝐻: 𝜆𝑟 𝐻 , 𝑟 = rank(𝐻)

11

𝛿𝑥 (𝑡) = −𝐻𝛿𝑥(𝑡), 𝛿𝑥 0 = 𝛿𝑥0 where 𝐻 ≜𝜕2ℰ

𝜕𝑥2 𝑥=𝑥∗

[Zhu&Hu, CDC09, CDC11]

It dictates how fast the formation can be recovered

(𝜆𝑘 ≥ 𝜆𝑘+1)

Complementary with the manipulability

Page 12: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Responsiveness

• Unifies the notions of stiffness and manipulability – Analyze the convergence process of the followers

12

𝛿𝑥𝑓 (𝑡) = −𝐻𝑓𝑓𝛿𝑥𝑓(𝑡) − 𝐻𝑓ℓ𝛿𝑥ℓ,

𝜈 𝑡, 𝑥, 𝐺, 𝛿𝑥ℓ =𝛿𝑥𝑓(𝑡)

2

𝛿𝑥ℓ2

Solve the following zero-state response with 𝛿𝑥𝑓 0 = 0:

using the fact 𝐻𝑓ℓ = −𝐻𝑓𝑓 𝟏𝑁𝑓⊗ 𝐼𝑑 and the decomposition 𝐻𝑓𝑓 = 𝑉𝛬𝑉𝑇

𝛿𝑥𝑓 𝑡 = 𝑉Diag 1 − 𝑒−𝜆𝑘𝑡 𝑉𝑇 𝟏𝑁𝑓⊗ 𝐼𝑑 𝛿𝑥ℓ

𝛬 = 𝐃𝐢𝐚𝐠 𝜆1, … , 𝜆𝑟 :

𝑉 = [𝑣1, … , 𝑣𝑟]: eigen vec.

non-zero eigenvalues

where 𝐻𝑓𝑓 ≜𝜕2ℰ

𝜕𝑥𝑓2

𝑥∗, 𝐻𝑓ℓ ≜

𝜕2ℰ

𝜕𝑥𝑓𝜕𝑥ℓ 𝑥∗

t is often finite (leader cannot wait

followers for infinite time)

𝑡 ⟶ ∞ Manipulability 𝑚

Page 13: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Responsiveness

• Responsiveness takes into account the rate of convergence and the convergence point simultaneously.

– How fast followers respond? (evaluated by given time 𝑡)

– How much followers finally respond? (evaluated by given input, i.e., leader’s motion, 𝛿𝑥ℓ)

13

Manipuability

(convergence point)

Stiffness

(rate of convergence)

𝜈 𝑡, 𝛿𝑥ℓ, 𝑥, 𝐄 =𝛿𝑥𝑓(𝑡)

2

𝛿𝑥ℓ2

=𝛿𝑥ℓ

𝑇𝐽 𝑡 𝑇𝐽 𝑡 𝛿𝑥ℓ

𝛿𝑥ℓ𝑇𝛿𝑥ℓ

= 1 − 𝑒−𝜆𝑘𝑡 2𝑣𝑘

𝑇 𝟏𝑁𝑓⊗ 𝐼𝑑 𝛿𝑥ℓ

2

𝑟

𝑘=1

𝐻𝑓𝑓 =𝜕2ℰ

𝜕𝑥𝑓2

𝑥∗

= 𝑉𝛬𝑉𝑇

𝛬 = 𝐃𝐢𝐚𝐠 𝜆1, … , 𝜆𝑟

𝑉 = [𝑣1, … , 𝑣𝑟]

Non-zero eigs.

𝛿𝑥𝑓 𝑡 = 𝑉Diag 1 − 𝑒−𝜆𝑘𝑡 𝑉𝑇 𝟏𝑁𝑓⊗ 𝐼𝑑 𝛿𝑥ℓ

J(t)

Page 14: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Comparison of Responsiveness

14

𝜈 𝑡, 𝛿𝑥ℓ, 𝑥, 𝐄 = 1 − 𝑒−𝜆𝑘𝑡 2𝑣𝑘

𝑇 𝟏𝑁𝑓⊗ 𝐼𝑑 𝛿𝑥ℓ

2

𝑟

𝑘=1

t

t

Resp

on

siv

eness

Resp

on

siv

eness

G4

G5

G1

G2

G4

1. Responsiveness can be used to compare rigid formations

Page 15: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Optimal Edge Weights (Link Resource Allocation)

• Link weights can be viewed as the amount of resource.

• Given limited amount of total resource, find the optimal link resource allocation in terms of maximizing the convergence rate

of the responsiveness 𝜆𝑟 𝐻𝑓𝑓 .

maximize𝑐𝑖𝑗

𝜆𝑟 𝐻𝑓𝑓

subject to 𝑐𝑖𝑗2

𝑖,𝑗 ∈𝖤

≤ 𝐶 and 𝑐𝑖𝑗 = 0 for 𝑖, 𝑗 ∉ 𝖤

• 𝑟 – rank of 𝐻𝑓𝑓

• 𝐶 – total amount of resource

• 𝑐𝑖𝑗 – weight for link {𝑖, 𝑗}

i

j

𝑐𝑖𝑗2

ℰ𝑖𝑗

𝑥𝑖 − 𝑥𝑗

(𝜆𝑘 ≥ 𝜆𝑘+1)

Page 16: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Optimal Edge Weights (Link Resource Allocation) Numerical Results

optimize optimize optimize

2. Responsiveness can deal with edge weights

Page 17: Responsiveness and Manipulability of Formation of …vision.kuee.kyoto-u.ac.jp/.../Responsiveness_CDC12slide.pdfMotivation • Assume some agents (leaders) can be arbitrarily controlled

Conclusion

• We proposed the responsiveness of leader-follower networks, which unifies the previously defined notions of stiffness and manipulability.

• It measures the impact of leader’s input in terms of – how fast / how large (finally) followers respond.

• Future work – Application to human-swarm interaction (e.g., networked

robots adaptively changes gains and topologies to maximize human inputs injected through the leader).

17

Manipulability Stiffness

𝜈 𝑡, 𝛿𝑥ℓ, 𝑥, 𝐆 = 1 − 𝑒−𝜆𝑘𝑡 2𝑣𝑘

𝑇 𝟏𝑁𝑓⊗ 𝐼𝑑 𝛿𝑥ℓ

2

𝑟

𝑘=1

Thank you for your attention!