rewards-driven control of robot arm by decoding eeg...

13
Rewards-Driven Control of Robot Arm by Decoding EEG Signals Ajay K. Tanwani * , José del R. Millán ** , Aude Billard * {ajay.tanwani, jose.millan, aude.billard}@epfl.ch 1 * Learning Algorithms and Systems Laboratory (LASA), EPFL, Switzerland ** Chair in Non-Invasive Brain-Machine Interface (CNBI), EPFL, Switzerland {ajay.tanwani, jose.millan, aude.billard}@epfl.ch

Upload: vuduong

Post on 19-Apr-2018

218 views

Category:

Documents


3 download

TRANSCRIPT

Rewards-Driven Control of Robot Arm by DecodingEEG Signals

Ajay K. Tanwani*, José del R. Millán**, Aude Billard*

{ajay.tanwani, jose.millan, aude.billard}@epfl.ch

1

* Learning Algorithms and Systems Laboratory (LASA), EPFL, Switzerland** Chair in Non-Invasive Brain-Machine Interface (CNBI), EPFL, Switzerland

{ajay.tanwani, jose.millan, aude.billard}@epfl.ch

Problem Scope

High-dimensionality Reward function

Brain Controlled Prosthesis

2

Non-stationary Large variabilityacross users

Brain Controlled Prosthesis

Experimental Set-Up

- Self-paced planar center-out reaching movement task

- One healthy subject performed 3 runs for a total of 230 trials in the interval [-2 1] seconds

3

64 Channel EEG Data(2 kHz)

Kinematic Data(100 Hz)

Cartesian planar position and velocity of robot arm64 Channel EEG data

Framework

Detect Intention Generate Motion

4

Intention Decoder

ClassificationEEG Signals

- EEG data is sampled with moving window of 250 ms overlapping every 62.5 ms

Feature Selection

- 10 features with the highest discriminating power for every window using CanonicalVariant Analysis (CVA)

5t

u

- 10 features with the highest discriminating power for every window using CanonicalVariant Analysis (CVA)

- Brain activity is dominant in frontal-parietal regions

Intention Decoder

- LDA classifier is used to predict goalestimate given the EEG featurevector in every time window

ClassificationEEG Signals Feature Selection

xgt

6

- LDA classifier is used to predict goalestimate given the EEG featurevector in every time window

Classification accuracyexceeds chance level

- Goal is estimated starting from 687.5ms before the movement onset

Framework

Detect Intention Generate Motion

7

Trajectory Decoder

Kinematic data Initial Policy Optimized Policy

- Collect data of 230 trials re-sampled at 5 Hz

- Learn the initial policy using Support Vector Regression (SVR),

8¹x

_¹x

- Hyper-parameters of SVR found through grid search,

Trajectory Decoder

- Support vectors of initial policy act as basis functions for optimized policy

Kinematic data Initial Policy Optimized Policy

minimize squared acceleration normduring movement

reach goal position at t=T 9

- Weights of the support vector are optimized by stochastic gradient ascent on valuefunction

- Reward function for reaching the goal

J ( ¹x)

zero final velocity at t=T

Trajectory Decoder

10

- Optimal motion plans are generated away from the training data

Combined Framework

_x x

11

utxgt

Intention Decoder

Trajectory Decoder

Experimental Set-Up

12

Conclusions

We have proposed a brain signal decoder that:

generates optimal motion plans for robot control away from training data

starts to detect intention before the movement onset

13

starts to detect intention before the movement onset