learning and using contextual information in the control ...pilarski/docs/talks/... · unsupervised...
TRANSCRIPT
![Page 1: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/1.jpg)
Patrick M. Pilarski
Reinforcement Learning & Artificial Intelligence LaboratoryAlberta Innovates Centre for Machine Learning
Learning and Using Contextual Information in the Control of Assistive Devices
![Page 2: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/2.jpg)
Known Barriers
“Three main problems were mentioned as reasons that amputees stop using their ME prostheses: nonintuitive control, lack of sufficient feedback, and insufficient functionality.”
— Peerdeman et al., JRRD, 2011.
Also: cost!
![Page 3: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/3.jpg)
Known Barriers
“Three main problems were mentioned as reasons that amputees stop using their ME prostheses: nonintuitive control, lack of sufficient feedback, and insufficient functionality.”
— Peerdeman et al., JRRD, 2011.
Also: cost!
![Page 4: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/4.jpg)
Adaptation & Scalability“Supervised adaptation should be considered for incorporation into any clinically viable pattern recognition controller, and unsupervised adaptation should receive renewed interest in order to provide transparent adaptation.”
— Sensinger et al., 2009.
“Completely stable, unsupervised [adaptation] has yet to be realized but is of great clinical interest.”
— Scheme and Englehart, 2011.
![Page 5: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/5.jpg)
Adaptive Prosthetics Project• Develop new machine
learning methods to improve human-machine interaction.
• Translate these techniques to preliminary use by amputee and non-amputee subjects.
• Demonstrate clinical impact in studies with amputee participants.
![Page 6: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/6.jpg)
Our Ongoing Approaches
• Real-time control learning without a priori information about a user or device.
• Prediction and anticipation of signals during amputee-device interaction.
• Collaborative algorithms for the online human improvement of limb controllers.
![Page 7: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/7.jpg)
!KEY IDEA
Temporally Extended Predictions are important for
improving and adapting control systems.
![Page 8: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/8.jpg)
Pilarski et al., IEEE RAM, 2013.
Anticipating Human and Robot Dynamics
![Page 9: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/9.jpg)
Pilarski et al., IEEE RAM, 2013.
Anticipating Human and Robot Dynamics
![Page 10: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/10.jpg)
Prediction Learning with General Value Functions• Thousands of accurate predictions can be
made and learned in real time (i.e.,100Hz)
• A single stream of data be used to accurately predict many different sensors at many different time scales.
• Rapid learning that is non-episodic and that continue indefinitely (incremental learning).
Sutton et al., AAMAS, 2011.
Multi-timescale Nexting in a Reinforcement Learning Robot, Modayil, White, and Sutton, 2012.
![Page 11: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/11.jpg)
Parameters
Controller
StateInformation
ControlActions
Mapping
![Page 12: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/12.jpg)
Parameters
Controller
StateInformation
ControlActions
MappingPredictions
![Page 13: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/13.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
![Page 14: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/14.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
Parameters
PredictionSignals
![Page 15: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/15.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
Mapping
Predictive StateInformation
![Page 16: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/16.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
Mapping
Predictive StateInformation
![Page 17: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/17.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
![Page 18: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/18.jpg)
Parameters
Controller
Facts (Predictive Model)
StateInformation
ControlActions
Mapping
Prediction 1, Prediction 2Prediction 3, Prediction 4
...Prediction N
Learning and Blending Multiple Contexts
![Page 19: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/19.jpg)
Example 1:Prediction to Enhance Conventional
Control Systems
![Page 20: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/20.jpg)
Prediction-based Improvement of a Switched Control Interface
![Page 21: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/21.jpg)
Predicting what a user wants ...
Pilarski et al., BioRob, 2012.Pilarski and Sutton, AAAI-FS, 2012.
![Page 22: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/22.jpg)
Predicting what a user wants ...
Pilarski et al., BioRob, 2012.Pilarski and Sutton, AAAI-FS, 2012.
Increased ideal switching suggestions (+23%) Decreased switching overhead (-%14)
![Page 23: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/23.jpg)
... and when they want it.
![Page 24: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/24.jpg)
... and when they want it.
![Page 25: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/25.jpg)
... and when they want it.Un-normalizedPredictions
Switching Signal from User
![Page 26: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/26.jpg)
Example 2:Using Predictions as State Information(Predictive Representations of State)
![Page 27: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/27.jpg)
Simultaneous, anticipatory myoelectric control of multiple actuators.
Coupled Prediction and Control Learning
Pilarski, Dick, and Sutton, ICORR, 2013.
Direct W-Predictive Control (0.25x Speed)
![Page 28: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/28.jpg)
Coupled Prediction and Control Learning
Pilarski, Dick, and Sutton, ICORR, 2013.
![Page 29: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/29.jpg)
Example 3:Detecting and Using Context During Learning and Control
![Page 30: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/30.jpg)
Learning during Contextual Shifts
Context 1 Context 2
True JointAngle
Pred. JointAngle
![Page 31: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/31.jpg)
Learning during Contextual Shifts
Context 1 Context 2
![Page 32: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/32.jpg)
Conclusions
![Page 33: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/33.jpg)
Potential Utility for Extended Predictions
• Maintain consistency in controller or control interface for the user (as in Carmena et al.; Mataric et al.) ...
• ... yet adapt quickly to things that are impossible or challenging for a user to learn about or model.
• Recognize context or different use domains (situation aware controllers and predictions).
• Avenues discussed here: controller enhancement, state enhancement, control learning, contextual learning.
![Page 34: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/34.jpg)
Summary• Learning and using temporally extended
predictions (sensorimotor knowledge) is a promising area for enhancing assistive devices.
• Strong preliminary results to show unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining HMIs that use switching.
• Big picture: a move toward more advanced, persistent machine intelligence in NiPNS-HMIs.
Also: general value functions with TD-learning are a practical way to build up and maintain a diverse predictive model during the real-time operation of a system.
![Page 35: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/35.jpg)
Richard S. Sutton, Mike Bowling, Travis Dick, Ann L. Edwards, Alexandra Kearney, Adam Parker, Anna Koop, RLAI, Dept. Computing Science, University of Alberta
Thomas Degris, INRIA, Bordeaux, France
Michael R. Dawson, Jacqueline S. Hebert, K. Ming ChanGlenrose Rehabilitation Hospital & University of Alberta
Jason P. CareyDept. of Mechanical Engineering, University of Alberta
![Page 36: Learning and Using Contextual Information in the Control ...pilarski/docs/talks/... · unsupervised adaptation, facilitation of simultaneous multi-joint control, and streamlining](https://reader036.vdocuments.net/reader036/viewer/2022081600/6045c5fff3356922c07e41a8/html5/thumbnails/36.jpg)
Funders: Alberta Innovates Centre for Machine Learning (AICML), the Natural Sciences and Engineering Research Council (NSERC), Alberta Innovates – Technology Futures (AITF), and the Glenrose Rehabilitation Hospital Foundation.
?QUESTIONS
http://www.ualberta.ca/~pilarski/