collaboration development through interactive learning between human and robot
DESCRIPTION
Collaboration Development through Interactive Learning between Human and Robot. Tetsuya OGATA, Noritaka MASAGO, Shigeki SUGANO, Jun TANI. Introduction. “Recent” studies about welfare robots or robots as pets attracted lots of attention They must work flexibly and cooperatively with humans - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/1.jpg)
Collaboration Development through Interactive Learning between Human and Robot
Tetsuya OGATA, Noritaka MASAGO, Shigeki SUGANO, Jun TANI.
![Page 2: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/2.jpg)
Introduction
“Recent” studies about welfare robots or robots as pets attracted lots of attention
They must work flexibly and cooperatively with humans
They would also have a establish relations with people in daily life
![Page 3: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/3.jpg)
The Aims
To demonstrate interactive learning between a human operator and a robot system
Both human and robot are in the role of the learner
But...These sorts of systems are usually difficult to
stabilize over long operation times
![Page 4: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/4.jpg)
Previous Work
Most similar studies focus on short operations Exploring collapse and modification of
relationships between people
![Page 5: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/5.jpg)
The Robot
Robovie 2 arms
4 DOF 'Human-like' head
Audiovisual sensors Many tactile sensors
attached to its body
![Page 6: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/6.jpg)
The Environment
A 4x4m course The outside walls
marked alternately red and blue
![Page 7: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/7.jpg)
The Experiment
The human and robot join arms and attempt to travel clockwise through the maze without hitting obstacles
Try to do it in the shortest time The movement is a
combination of the human's influence and the robot's neural network
![Page 8: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/8.jpg)
Limited Senses
Both the human and robot have very limited sensory information
Robot has poor vision, and only local information such as ultrasonic sensors.
It has no global position information The human has a blindfold on
But can see the space before the experiment begins
Both sides are anticipating future sensory input and generating the next motor commands
![Page 9: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/9.jpg)
The Model
A Recurrent Neural Network (RNN)
The input consists of: Current sensory input Current motor values
The output is predictions of: Next sensory input Next motor values
![Page 10: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/10.jpg)
Their model can run in one of two modes
It can work in Open Loop Mode which directly maps inputs to outputs
Closed Loop Mode takes the output and puts it straight into the input
Can generate predictions of arbitrary length
Similar to mental rehearsal
![Page 11: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/11.jpg)
Consolidation Learning
When a RNN tries to learn something new, it severely damages everything it already knows.
One way to avoid this: Save all past teaching data in a database Add new data Use all of the data to retrain
Learning time increases with data
![Page 12: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/12.jpg)
Consolidation Learning
Analogous to biology Temporary memory stored in hippocampus Consolidated into long-term memory during sleep
New data is stored in a database The RNN corresponds to the long-term memory The RNN is trained using both the rehearsed
patterns and the sequence of the new experience This enables the incremental learning without
damaging the structure of the RNN
![Page 13: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/13.jpg)
Navigation
In initial stages, performs very badly Has a collision avoidance system to help with
the training Simplified reinforcement learning for initial
training Robot and human go around workspace Time measured If performance is better, train RNN to
incorporate new trial
![Page 14: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/14.jpg)
Experiments
A feed forward neural network (FFNN) A RNN A RNN with consolidation learning Trials interlaced with questionnaires meant to
judge workloads Effort, workload, complexity, performance,
concentration...
![Page 15: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/15.jpg)
Results
FFNN ultimately deteriorates
RNN ultimately stagnates
RNN with consolidation learning continued to improve
![Page 16: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/16.jpg)
Robustness
The analyze the effect of consolidation learning they compared the conventional RNN to the consolidation learning RNN when subject to noise
Used the closed loop mode and introduced different amounts of noise to the inputs
Consolidation learning proved far more robust Linked to 'operability' – Robots which don't
cope well with noise can seem unwieldy
![Page 17: Collaboration Development through Interactive Learning between Human and Robot](https://reader036.vdocuments.net/reader036/viewer/2022070410/56814657550346895db37221/html5/thumbnails/17.jpg)
Collaboration
Miwa showed that human collaboration was developed through repeated phases (Miwa et. al, 2001)
The consolidation-learning method arguably demonstrates these phases
The RNN with consolidation learning might have similarity with human learning