adaptivelab talk1

42
Adaptive Lab Talk Series Electrical and Computer Engineering Outline Problem Definition System Model Overall Model Theory of Planned Behavior Cognitive Dissonance Theory of Overjustification Controller Design Assumptions and Initial Conditions Controller Design: Stage1 Controller Design: Stage2 Simulation A Model-Based Feedback-Control Approach to Behaviour Modification Through Reward-Induced Attitude Change J.Ni, D. Kulic, and D. Davison presented by: Noha El-Prince April 16, 2013 Electrical and Computer Engineering Adaptive Lab Talk Series

Upload: noha-elprince

Post on 23-Jun-2015

258 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

A Model-Based Feedback-Control Approach toBehaviour Modification ThroughReward-Induced Attitude Change

J.Ni, D. Kulic, and D. Davison

presented by: Noha El-Prince

April 16, 2013

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 2: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

1 Outline

2 Problem Definition

3 System ModelOverall ModelTheory of Planned BehaviorCognitive DissonanceTheory of Overjustification

4 Controller DesignAssumptions and Initial ConditionsController Design: Stage1Controller Design: Stage2

5 Simulation Results

6 Conclusion

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 3: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Problem Definition

Trying to change the behavior of a person to a desiredbehavior.

The person may have either a negative/positive attitudetowards the desired behavior.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 4: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Methodology

Model the internal cognitive psychological state of aperson.Design a controller based on the cognitive model.Goal: Tracking desired behavior via a sequence ofrewards.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 5: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Overall System Model

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 6: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Theory of Planned Behavior

Aout[k] = Aout[k − 1] + ∆Aout[k − 1], (1)

∆Aout[k] = ∆ACDout [k] + ∆AOJ

out[k], (2)

Arew[k] = r1Arew[k − 1] + µ1(1− r1)R[k − 1], (3)

BI[k] = Aout[k] +Arew[k], (4)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 7: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Theory of Planned Behavior

B[k] =

Bd[k] if BI[k] ≥ Bd[k] and Aout[k] ≤ Bd[k]

Aout[k] if (BI[k] < Bd[k] and Aout[k] ≥ 0)

or Aout[k] > Bd[k]

0 otherwise.(5)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 8: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Cognitive Dissonance Theory (Block A)

A person’s behavior is inconsistent with one of hisattitudes ⇒ dissonance pressure

A person trying to reduce dissonance pressure by changingattitude/behavior

In our case : Inconsistency arises in 2 situations:

� The child declines the reward vs. value money� The child accepts the reward vs. feeling bored

How to quanitify dissonance pressure ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 9: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Cognitive Dissonance Theory (Block A)

A person’s behavior is inconsistent with one of hisattitudes ⇒ dissonance pressure

A person trying to reduce dissonance pressure by changingattitude/behavior

In our case : Inconsistency arises in 2 situations:

� The child declines the reward vs. value money� The child accepts the reward vs. feeling bored

How to quanitify dissonance pressure ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 10: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Cognitive Dissonance Theory (Block A)

A person’s behavior is inconsistent with one of hisattitudes ⇒ dissonance pressure

A person trying to reduce dissonance pressure by changingattitude/behavior

In our case : Inconsistency arises in 2 situations:

� The child declines the reward vs. value money� The child accepts the reward vs. feeling bored

How to quanitify dissonance pressure ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 11: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Cognitive Dissonance Theory (Block A)

A person’s behavior is inconsistent with one of hisattitudes ⇒ dissonance pressure

A person trying to reduce dissonance pressure by changingattitude/behavior

In our case : Inconsistency arises in 2 situations:

� The child declines the reward vs. value money� The child accepts the reward vs. feeling bored

How to quanitify dissonance pressure ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 12: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Cognitive Dissonance Theory (Block A)

A person’s behavior is inconsistent with one of hisattitudes ⇒ dissonance pressure

A person trying to reduce dissonance pressure by changingattitude/behavior

In our case : Inconsistency arises in 2 situations:

� The child declines the reward vs. value money� The child accepts the reward vs. feeling bored

How to quanitify dissonance pressure ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 13: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Quantifying Dissonance Pressure

Dissonance = ”%” of inconsistent cognitive pairs

PCDraw [k] =

Bsgn[k] Mincon[k]

Mincon[k]+Mcon[k]if Mincon[k] +Mcon[k] > 0

0 otherwise.

(6)

Bsgn[k] =

{+1 if B[k] ≥ Bd[k] or Aout[k] ≥ 0−1 otherwise.

(7)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 14: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Quantifying Dissonance Pressure - cont.

M1incon[k] =

{|Arew[k]| if sgn(Arew [k]) 6= Brel[k]

0 otherwise,(8)

M2incon[k] =

{|Aout[k]| if sgn(Aout[k]) 6= Bsgn[k]

0 otherwise,(9)

M1con[k] =

{|Arew[k]| if sgn(Arew [k]) = Brel[k]

0 otherwise,(10)

M2con[k] =

{|Aout[k]| if sgn(Aout[k]) = Bsgn[k]

0 otherwise,(11)

Mincon[k] =2∑

i=1

Miincon[k], Mcon[k] =

2∑i=1

Micon[k], (12)

Brel[k] =

{+1 if B[k] ≥ Bd[k]−1 otherwise.

(13)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 15: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Special Case: Attitude Reversal

Aout[k] small, R[k] small, Bd[k] is high ⇒ Child declinesthe reward

To reduce Diss. pressure: increase Aout OR “give up”jogging ⇒ Aout[k] <<<

r[k] =

+1 if Bd[k]−BI[k] > αrevAout[k], Aout[k] ≥ 0,

K1PCD[k] > 2Aout[k], and Arew[k] > 0,

−1 otherwise.

(14)

PCD[k] =

{(1− r2)PCD

raw [k] if r[k − 1] = 1

r2PCD[k − 1] + (1− r2)PCD

raw [k] otherwise.(15)

PrawCD

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 16: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Quantifying ∆Aout

Assume the change in Aout[k] is proportional to dissonancepressure, with proportionality constant K1 > 0:

∆ACDout [k] =

{−K1P

CD[k] if r[k] = 1

+K1PCD[k] otherwise.

(16)

PCD

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 17: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Overjustification Theory (Block B)

Overjustification Theory

when a reward is given to a person to do something thatshe/he already enjoys doing, such rewards arecounter-productive in that they reduce the intrinsic desire ofthe person towards that behavior.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 18: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Overjustification Theory - cont.

Let Bt[k] = minimal attitude level to which theoverjustification effect can drive Aout[k].Assume Bt[k] is a constant fraction of Bd[k], i.e.,

Bt[k] = αBd·Bd[k], (17)

for some constant 0 < αBd< 1.

If Bt[k] > Aout[k] ⇒ overjustification pressure does notdecrease Aout, and the reverse is true i.e.

Arelout[k] = max{0, Aout[k]−Bt[k]}. (18)

where Arelout[k]: a relative attitude with respect to Bt[k]

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 19: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Overjustification Theory - cont.

Then the raw and filtered overjustification pressures, and the resulting change in intrinsic attitude, arecomputed just as in our previous work, but using Arel

out instead of Aout, as follows:

POJraw [k] =

Arelout[k]Arew[k] if Arel

out[k] > 0 and Arew[k] > 0and B[k] ≥ Bd[k]

0 otherwise,

(19)

POJ

[k] = r3POJ

[k − 1] + (1− r3)POJraw[k], (20)

∆AOJout[k] =

{−K2P

OJ [k] if K2POJ [k] ≤ Arel

out[k]

−Arelout[k] otherwise.

(21)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 20: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 21: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 22: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 23: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 24: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 25: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 26: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Assumptions

Mother Knows varoius plant parameters(µ1, r1, r2, r3, αrev, αBd, k1, k2)andA

∗0.

The child do not know the value of B∗d .

Bd[k + 1] is assigned to the child by end of day k.

i.c: PCD[0] = POJ [0] = Arew[0] = 0, Aout = A∗0.

Reward is not given everyday: N= Settling time

If impulsive reward applied at time 0, a transient(1− rk−1

2 ) appears.

Approach: wait for the transient to settle before applyingthe next impulsive reward.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 27: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage1

BI[k + 1] ≥ Bd[k + 1].

⇓R[k] >>> enough to force B[k + 1] > 0. >>

⇓Bsgn[k + 1] = +1.

⇓PCDraw [k + 1] > 0. >>

⇓Goal: increase Aout from −ve to +ve.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 28: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage1- cont.

BI[k] = Aout[k] +Arew[k]

= A0 + µ1R[k] ≥ Bd[k + 1]

R[k] =Bd[k + 1] + |A0|

µ1(22)

The associated dissonance pressure is:

PCDraw [k + 1] =

Bsgn[k + 1] · |Aout[k + 1]

|Aout[k + 1]|+Arew[k + 1]=

|A0||A0|+ µ1R[k]

.

(23)

Maximizing (23) subject to (22) results in Bd[k + 1] = 0 andR[k] = |A0|/µ1. For improved robustness:

Bd[k + 1] = 2ε (24)

R[k] =2Bd[k + 1] + |Aout[k]|

µ1=

2ε+ |Aout[k]|µ1

(25)

(26)Electrical and Computer Engineering Adaptive Lab Talk Series

Page 29: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2

Goal: (0 ≤ Aout[k] ≤ B∗d) for k = 0, N, 2N, 3N, . . .

Use sequence of reward impulses, each impulse appliedevery N days.

Inorder to raise Aout[k], give the child R[k]<<< enoughto be :

Rejected by the child ⇒ PCD < 0⇒ Aout ⇑ .

Avoid exciting the OVJ dynamics that makes Aout ⇓ .Avoid attitude reversal.

Q. What is the appropriate value of R[k] that guaranteeabove three conditions satisfied ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 30: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2

Goal: (0 ≤ Aout[k] ≤ B∗d) for k = 0, N, 2N, 3N, . . .

Use sequence of reward impulses, each impulse appliedevery N days.

Inorder to raise Aout[k], give the child R[k]<<< enoughto be :

Rejected by the child ⇒ PCD < 0⇒ Aout ⇑ .Avoid exciting the OVJ dynamics that makes Aout ⇓ .

Avoid attitude reversal.

Q. What is the appropriate value of R[k] that guaranteeabove three conditions satisfied ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 31: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2

Goal: (0 ≤ Aout[k] ≤ B∗d) for k = 0, N, 2N, 3N, . . .

Use sequence of reward impulses, each impulse appliedevery N days.

Inorder to raise Aout[k], give the child R[k]<<< enoughto be :

Rejected by the child ⇒ PCD < 0⇒ Aout ⇑ .Avoid exciting the OVJ dynamics that makes Aout ⇓ .Avoid attitude reversal.

Q. What is the appropriate value of R[k] that guaranteeabove three conditions satisfied ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 32: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2 - cont.

To enforce the child to reject the reward R[k] force:

BI[k + 1] < Bd[k + 1]

Aout[k] +Arew[k] < Bd[k + 1]

Aout[k] + r1Arew[k − 1] + µ1(1− r1)R[k − 1] < Bd[k + 1]

R[k] <Bd[k + 1]− r1Arew[k]−Aout[k]

µ1(1− r1)

R[k] <Bd[k + 1]−Aout[k]

µ1(27)

Equation(27) gurantees child reject reward and OJ = 0.

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 33: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2 - cont.

Attitude reversal is avoided on day k+1 if R[k] is chosen s.t :

Bd[k]−BI[k] ≤ αrevAout[k], Aout[k] ≥ 0

Bd[k] +Aout[k]−Arew[k] ≤ αrevAout[k]

Aout[k] + r1Arew[k − 1] + µ1(1− r1)R[k − 1] ≤ Bd[k + 1]

R[k] ≥ Bd[k + 1]− (αrev + 1)Aout[k]

µ1(28)

Equation(28) gurantees avoidance of attitude reversal.Q. How to keep R[k] at a reasonable level ?

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 34: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2 - cont.

By introducing a controller tuning parameter β ∈ (0, 1), theaggressiveness of attitude increase can be adjusted:

Ad = βAout[k] + (1− β)(Aout[k] +K1(1− rN−12 )).

R[k] =Aout[k]

µ1

(K1(1− rN−1

2 )

Aout[k] +K1(1− rN−12 )−Ad

− 1

). (29)

To avoid driving the attitude higher than needed (i.e., beyondB∗

d), we add a saturator as follows:

Ad = min{B∗d , βAout[k] + (1− β)(Aout[k] +K1(1− rN−1

2 ))}.(30)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 35: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Controller Design: Stage2 - cont.

Get the value of Bd[k + 1] from the formulas of R[k] :

Bdmin[k] = Aout[k](K1(1− r2)N−1

Aout[k] +K1(1− r2)N−1 −Ad(31)

Bdmax[k] = Aout[k](K1(1− r2)N−1

Aout[k] +K1(1− r2)N−1 −Ad+ αrev

(32)

Bdmin[k] < Bd[k + 1] ≤ Bdmax[k]. (33)

Bd[k + 1] = γBdmin[k] + (1− γ)Bdmax[k]. (34)

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 36: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Simulation Results

0 5 10 15 20 25 30 350

50

100

150

Day number (k)

Behavio

r (m

ins)

Bd

*

B[k]

Bd[k]

Open−Loop Implementation

0 5 10 15 20 25 30 35

0

50

100

YESYES

YESYES NO

NO

NO

Day number (k)

Rew

ard

Offere

d (

$)

R[k]

0 5 10 15 20 25 30 35

−50

0

50

Day number (k)

Attitude (

min

s)

Aout

[k]

∆ Aout

CD[k]

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 37: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Simulation Results

0 5 10 15 20 25 30 350

50

100

150

Day number (k)

Behavio

r (m

ins)

Bd

*

B[k]

Bd[k]

Open−Loop Implementation

0 5 10 15 20 25 30 35

0

50

100

YESYES

YESYES

NONO

NO

NO

Day number (k)

Rew

ard

Offere

d (

$)

R[k]

0 5 10 15 20 25 30 35

−50

0

50

Day number (k)

Attitude (

min

s)

Aout

[k]

∆ Aout

CD[k]

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 38: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Simulation Results

0 5 10 15 20 25 30 350

50

100

150

Day number (k)

Behavio

r (m

ins)

Bd

*

B[k]

Bd[k]

Open−Loop Implementation

0 5 10 15 20 25 30 35

0

50

100

YESYES

YESYES

NO NO NO NO NO NONO

Day number (k)

Rew

ard

Offere

d (

$)

R[k]

0 5 10 15 20 25 30 35

−50

0

50

Day number (k)

Attitude (

min

s)

Aout

[k]

∆ Aout

CD[k]

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 39: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Conclusion and Future Work

A new model-based behavior-modification algorithm havebeen developed.

Pros:

No reward are required in the long term.

Good transient behavior (i.e. no overshoot).Flexible timing of the control scheme.

Cons:

The approach requires good knowledge of the plantparameters.

In case closed-loop implementation: A regularmeasurement of Aout is needed.Lacks experimental validation of the plant model.

Future work:

Online parameter estimation of plant parameters.

Experimental validation of plant model

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 40: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Conclusion and Future Work

A new model-based behavior-modification algorithm havebeen developed.

Pros:

No reward are required in the long term.Good transient behavior (i.e. no overshoot).

Flexible timing of the control scheme.

Cons:

The approach requires good knowledge of the plantparameters.In case closed-loop implementation: A regularmeasurement of Aout is needed.

Lacks experimental validation of the plant model.

Future work:

Online parameter estimation of plant parameters.Experimental validation of plant model

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 41: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Conclusion and Future Work

A new model-based behavior-modification algorithm havebeen developed.

Pros:

No reward are required in the long term.Good transient behavior (i.e. no overshoot).Flexible timing of the control scheme.

Cons:

The approach requires good knowledge of the plantparameters.In case closed-loop implementation: A regularmeasurement of Aout is needed.Lacks experimental validation of the plant model.

Future work:

Online parameter estimation of plant parameters.Experimental validation of plant model

Electrical and Computer Engineering Adaptive Lab Talk Series

Page 42: AdaptiveLab Talk1

Adaptive LabTalk Series

Electrical andComputer

Engineering

Outline

ProblemDefinition

System Model

Overall Model

Theory ofPlannedBehavior

CognitiveDissonance

Theory ofOverjustification

ControllerDesign

Assumptionsand InitialConditions

ControllerDesign: Stage1

ControllerDesign: Stage2

SimulationResults

Conclusion

Conclusion and Future Work

A new model-based behavior-modification algorithm havebeen developed.

Pros:

No reward are required in the long term.Good transient behavior (i.e. no overshoot).Flexible timing of the control scheme.

Cons:

The approach requires good knowledge of the plantparameters.In case closed-loop implementation: A regularmeasurement of Aout is needed.Lacks experimental validation of the plant model.

Future work:

Online parameter estimation of plant parameters.Experimental validation of plant model

Electrical and Computer Engineering Adaptive Lab Talk Series