training scheme analysis for memristor-based neuromorphic design

1
POSTER TEMPLATE BY: www.PosterPresentations.com Training Scheme Analysis for Memristor-Based Neuromorphic Design Miao Hu 1 , Hai Li 1 , Yiran Chen 1 , Qing Wu 2 and Garrett S. Rose 2 1.Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, USA 15261. 2.2. Air Force Research Laboratory, Information Directorate, 525 Brooks Road, Rome, New York, USA 13441. Abstract Memristor Theory Simulation Results Physical constraints in hardware training Embedded BSB Training Circuit In this work, we propose a hardware realization of the Brain-State-in-a-Box (BSB) neural network model training algorithm. This method can be implemented as an analog/digital mixed-signal circuit to train memristor crossbar arrays within BSB circuits. The training effect is demonstrated through experimentation and the quality as an auto-associative memory is also analyzed and compared with software based training methods. The impacts of non- ideal device characteristics and fabrication defects in crossbar arrays are discussed. Our hardware architecture shows great potential for low power, high speed, small hardware size computations, and provides inherent security features. Table 1. Quality of domain of attraction. STEP 1: Initializing the crossbar arrays. At the beginning of a training procedure, all memristance values in M 1 and M 2 are initialized to approximate the average of R L and R H . The initialization doesn’t have to be precisely accurate. Indeed, even when all the memristors are all at LRS or HRS, the crossbar arrays can still be successfully trained but it requires more time to reach convergence. Convergence Speed Analysis Limited data access: A crossbar array represents the terms in a weight matrix by analog states of memristors – memristance. Reading the exact memristance of the entire array is significantly costly in terms of design complexity as well as performance overhead. Limited accuracy of signal detection : The output signals of the memristor crossbar in analog format shall be used to control the training scheme. The accuracy in detecting these output signals affects the quality, speed, and cost of the overall training procedure. Non-ideal device characteristics: For example, the evolution of the memristance for actual memristive devices tends to be nonlinear. Therefore, applying the same excitation can result in different memristance change, depending on the state of memristor. Process variations and defects : Due to process variations, memristors in a crossbar array are not always the same. For example, the upper- and lower-bounds of memristance varies [16][17]. Moreover, “dead cells” stuck at high-resistance state (HRS) or low-resistance state (LRS) could exist. Memristor-based BSB Recall Circuit In 2008, HP Lab demonstrated the first memristive device, in which the memristive effect was achieved by moving the doping front along a TiO 2 thin-film. Figure (a) and (b) illustrate the conceptual view of the TiO 2 thin-film memristor and the corresponding variable resistor model, which is equivalent to two serially-connected resistors. Here, R L (R H ) is used to denote the low (high) resistance state (LRS and HRS). The overall resistance or memristance can be expressed as: where is the relative doping front position, which is the ratio of doping front position over the total thickness of the TiO 2 thin-film. When the electrical excitation through a memristor is greater The training effect is achieved. An clear convergence gap can be observed between trained pattern and other untrained patterns. Acknowledgement This material is based upon work supported by the National Science Foundation under Grant No. EECS-1311747, ECCS-1202225, and CNS-1253424. Received and cleared for public release by AFRL on May 1, 2012, case number 88ABW-2012-2568. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of NSF, Error correction rate. Doping front Voltage L z h R L R H (a) (b) Pt Pt TiO 2 TiO 2-x Doping front Voltage L z h RL RH (a) (b) Pt Pt TiO2 TiO2-x Doping front Voltage L z h R L R H (a) (b) Pt Pt TiO 2 TiO 2-x Doping front Voltage L z h RL RH (a) (b) Pt Pt TiO2 TiO2-x Doping front Voltage L z h RL RH (a) (b) Pt Pt TiO 2 TiO2-x Doping front Voltage L z h RL RH (a) (b) Pt Pt TiO2 TiO2-x Doping front Voltage L z h RL RH (a) (b) Pt Pt TiO2 TiO2-x L H R R M ) 1 ( ) ( M 1 & M 2 , ST Initialization BSB recallcircuit Errordetection, refresh D iff Training signalgeneration & program mem ristor crossbar Training pattern selection ST = m NO Training ends YES D iff= 0 ST++ ST = 0 YES NO Training starts Registers γ (k) , k=1...m T raining complete V in V in R/W Control BSB R ecall Circuit Error Detection V out Training Signal Generate ST & A rbiter M sel D iff V th_ h ± V th_ l ± (a) (b) (c) X 0 0 V in ( i )>0 V out ( i )>0 V out ( i )> V th_ h V out ( i )> V th_ l V out ( i )> V th_ h V out ( i )> V th_ l D iff( i )[0] D iff( i )[1] D iff( i ) X 1 0 0 1 1 0 1 -1 D iff( i )[0] D iff( i )[1] )) ( ) ( ) ( ( ' ) 1 ( 2 1 1 t G t G t G S t A A V V V V . if , if , if , ) ( ' op op op op op op V V V V V V V V V V v S i j j ij y t w ) ( ) ( ) ( i j j ij sign y t sign w STEP 2: Selecting a prototype pattern (k) B n (k=1,…,m). Here, B is the n-dimension binary space (1, 1). Assume a training set includes m prototype patterns and each pattern (k) has the same probability to be chosen every time. Here, we use a counter ST to record the number of patterns that have passed training in sequence. When ST > 0, the corresponding patterns that passed training are excluded from the selection. STEP 3: Sending (k) to BSB recall circuit. We convert (k) in binary space (1, 1) to a set of input voltages within the boundary (0.1V, 0.1V). These input signals are supplied to the two memristor crossbars simultaneously. The output signals can be obtained at the end of the BSB recall circuit. STEP 4: Error detection. Since an input signal V in (i) is ±0.1V, the preset threshold voltages can be used to compare V out (i) and λ∙ V in (i),. We employ four threshold voltages, including: Here, θ represents the tolerable difference. 1 . 0 _ h th V 1 . 0 _ l th V 1 . 0 _ h th V 1 . 0 _ l th V Algorithm modification: STEP 5: Program memristor crossbar. Note that training the memristor crossbar array is conducted column by column. To train the j th column, the polarity and amplitude of the training pulse is determined by Diff(j). The training pulses are supplied on the rows of memristor crossbar arrays. The j th column is connected to ground and all the other columns are floating. For a column, the training pattern is either the current selected prototype pattern or the opposite pattern. STEP 6: If training is completed? The counter ST increases by 1 if a prototype pattern goes through STEP 25 and reports no error without further tuning M 1 and M 2 . Otherwise, ST is reset to 0 whenever an error is detected. Repeat STEP 25 until ST reaches m. 0% 20% 40% 60% 80% 100% 0 1 2 3 Errorcorrection rate Ham m ingdistance( l ) Hardw are (linear) Hardw are(nonlinear) Lillo(1994) Perfetti (1995) Park(2010) Best Good Negati ve Bad Hardware (linear) 419 6 465 134 Hardware (nonlinear) 465 0 473 86 Lillo (1994) 164 1 859 0 Perfetti (1995) 478 34 512 0 Park (2010) 502 10 512 0 0% 20% 40% 60% 80% 100% r1 r2 r3 r4 r5 U niform sizeofdom ain of attraction Prototypepatternsγ (k) , k =1,...,5 Hardw are (linear) Hardw are (nonlinear) Lillo (1994) Perfetti (1995) Park(2010) Uniform size of domain of attraction. 0 500 1000 1500 2000 2500 3000 1 2 3 4 5 Q ualityof D om ain of attraction D ead cellpercentage(% ) LRS/LRS(G1=30) LRS/HRS(G1=30) HRS/HRS(G1=30) HRS/HRS(G1=50) HRS/HRS(G1=50,digonal)

Upload: graiden-burt

Post on 02-Jan-2016

42 views

Category:

Documents


2 download

DESCRIPTION

Training Scheme Analysis for Memristor-Based Neuromorphic Design Miao Hu 1 , Hai Li 1 , Yiran Chen 1 , Qing Wu 2 and Garrett S. Rose 2 Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, USA 15261. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Training Scheme Analysis for Memristor-Based Neuromorphic Design

POSTER TEMPLATE BY:

www.PosterPresentations.com

Training Scheme Analysis for Memristor-Based Neuromorphic DesignMiao Hu1, Hai Li1, Yiran Chen1, Qing Wu2 and Garrett S. Rose2

1.Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, PA, USA 15261.2.2. Air Force Research Laboratory, Information Directorate, 525 Brooks Road, Rome, New York, USA 13441.

Abstract

Memristor Theory

Simulation Results

Physical constraints in hardware training

Embedded BSB Training Circuit

In this work, we propose a hardware realization of the Brain-State-in-a-Box (BSB) neural network model training algorithm. This method can be implemented as an analog/digital mixed-signal circuit to train memristor crossbar arrays within BSB circuits. The training effect is demonstrated through experimentation and the quality as an auto-associative memory is also analyzed and compared with software based training methods. The impacts of non-ideal device characteristics and fabrication defects in crossbar arrays are discussed. Our hardware architecture shows great potential for low power, high speed, small hardware size computations, and provides inherent security features.

Table 1. Quality of domain of attraction.

STEP 1: Initializing the crossbar arrays. At the beginning of a training procedure, all memristance values in M1 and M2 are initialized to approximate the average of RL and RH. The initialization doesn’t have to be precisely accurate. Indeed, even when all the memristors are all at LRS or HRS, the crossbar arrays can still be successfully trained but it requires more time to reach convergence.

Convergence Speed Analysis

Limited data access: A crossbar array represents the terms in a weight matrix by analog states of memristors – memristance. Reading the exact memristance of the entire array is significantly costly in terms of design complexity as well as performance overhead. Limited accuracy of signal detection: The output signals of the memristor crossbar in analog format shall be used to control the training scheme. The accuracy in detecting these output signals affects the quality, speed, and cost of the overall training procedure. Non-ideal device characteristics: For example, the evolution of the memristance for actual memristive devices tends to be nonlinear. Therefore, applying the same excitation can result in different memristance change, depending on the state of memristor. Process variations and defects: Due to process variations, memristors in a crossbar array are not always the same. For example, the upper- and lower-bounds of memristance varies [16][17]. Moreover, “dead cells” stuck at high-resistance state (HRS) or low-resistance state (LRS) could exist.

Memristor-based BSB Recall Circuit

In 2008, HP Lab demonstrated the first memristive device, in which the memristive effect was achieved by moving the doping front along a TiO2 thin-film. Figure (a) and (b) illustrate the conceptual view of the TiO2 thin-film memristor and the corresponding variable resistor model, which is equivalent to two serially-connected resistors. Here, RL (RH) is used to denote the low (high) resistance state (LRS and HRS). The overall resistance or memristance can be expressed as:

where is the relative doping front position, which is the ratio of doping front position over the total thickness of the TiO2 thin-film. When the electrical excitation through a memristor is greater than the threshold voltage, the memristance changes (in training). Otherwise, a memristor behaves like a resistor.

The training effect is achieved. An clear convergence gap can be observed between trained pattern and other untrained patterns.

Acknowledgement

This material is based upon work supported by the National Science Foundation under Grant No. EECS-1311747, ECCS-1202225, and CNS-1253424. Received and cleared for public release by AFRL on May 1, 2012, case number 88ABW-2012-2568. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of NSF, AFRL or its contractors.

Error correction rate.

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

Doping front

VoltageLzh

RL

RH

(a) (b)

Pt

Pt

TiO2

TiO2-x

LH RRM )1()(

M1& M2, ST Initialization

BSB recall circuit

Error detection, refresh Diff

Training signal generation& program memristor

crossbar

Training pattern selection

ST = mNO

Training endsYES

Diff = 0

ST++

ST = 0

YES

NO

Training starts

Reg

iste

rs

γ(k) , k

=1.

..m

Training complete

Vin

Vin

R/WControl

BSBRecallCircuit

Error Detection

Vout

Training Signal

Generate

ST &Arbiter

Msel

Diff

Vth_h±

Vth_l±

(a) (b) (c)

11

111

X 00

0

Vin(i)>0

Vout(i)>0

Vout(i) > Vth_h

Vout(i) > Vth_l

Vout(i) > Vth_h

Vout(i) > Vth_l

Diff(i)[0]Diff(i)[1] Diff(i)

X10

011

01-1

Diff(i)[0]

Diff(i)[1]

))()()((')1( 211 tGtGtGSt AA VVVV

.

if ,

if ,

if ,

)('

opop

opop

opop

VVV

VVVV

VVV

vS

ijjij ytw )( )()( ijjij signytsignw

STEP 2: Selecting a prototype pattern (k) Bn(k=1,…,m). Here, B is the n-dimension binary space (1, 1). Assume a training set includes m prototype patterns and each pattern (k) has the same probability to be chosen every time. Here, we use a counter ST to record the number of patterns that have passed training in sequence. When ST > 0, the corresponding patterns that passed training are excluded from the selection.

STEP 3: Sending (k) to BSB recall circuit. We convert (k) in binary space (1, 1) to a set of input voltages within the boundary (0.1V, 0.1V). These input signals are supplied to the two memristor crossbars simultaneously. The output signals can be obtained at the end of the BSB recall circuit.

STEP 4: Error detection. Since an input signal Vin(i) is ±0.1V, the preset threshold voltages can be used to compare Vout(i) and λ∙ Vin(i),. We employ four threshold voltages, including:

Here, θ represents the tolerable difference.

1.0_ hthV 1.0_ lthV

1.0_ hthV 1.0_ lthV

Algorithm modification:

STEP 5: Program memristor crossbar. Note that training the memristor crossbar array is conducted column by column. To train the jth column, the polarity and amplitude of the training pulse is determined by Diff(j). The training pulses are supplied on the rows of memristor crossbar arrays. The jth column is connected to ground and all the other columns are floating. For a column, the training pattern is either the current selected prototype pattern or the opposite pattern.STEP 6: If training is completed? The counter ST increases by 1 if a prototype pattern goes through STEP 25 and reports no error without further tuning M1 and M2. Otherwise, ST is reset to 0 whenever an error is detected. Repeat STEP 25 until ST reaches m.

0%

20%

40%

60%

80%

100%

0 1 2 3

Err

or c

orre

ctio

n r

ate

Hamming distance (l)

Hardware (linear)Hardware (nonlinear)Lillo (1994)Perfetti (1995)Park (2010)

Best Good Negative BadHardware (linear) 419 6 465 134

Hardware (nonlinear) 465 0 473 86Lillo (1994) 164 1 859 0

Perfetti (1995) 478 34 512 0Park (2010) 502 10 512 0

0%

20%

40%

60%

80%

100%

r1 r2 r3 r4 r5

Un

ifor

m si

ze o

f dom

ain

of

attr

acti

on

Prototype patterns γ(k), k =1,...,5

Hardware (linear)Hardware (nonlinear)Lillo (1994)Perfetti (1995)Park (2010)

Uniform size of domain of attraction.

0

500

1000

1500

2000

2500

3000

1 2 3 4 5

Qua

lity

of D

omai

n of

at

trac

tion

Dead cell percentage (%)

LRS/LRS (G1=30) LRS/HRS (G1=30)HRS/HRS (G1=30) HRS/HRS (G1=50)HRS/HRS (G1=50,digonal)