offline test and diagnose of network-on-chips

30
Offline Test and Diagnosis of Network-on-Chips Presenter: Supervisor: Reviewers: Haupt-Seminar June 09, 2009 Tarek Zaki Dipl.-Inf. Stefan Holst Prof. Dr. M. Radetzki, Prof. Dr. H.-J. Wunderlich Reliable Networks-on-Chip in the Many-Core Era 6/9/2009

Upload: others

Post on 03-Feb-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Offline Test and Diagnose of Network-on-Chips

Offline Test and Diagnosis of Network-on-Chips

Presenter:

Supervisor:

Reviewers:

Haupt-Seminar

June 09, 2009

Tarek Zaki

Dipl.-Inf. Stefan Holst

Prof. Dr. M. Radetzki,

Prof. Dr. H.-J. Wunderlich

Reliable Networks-on-Chip in the Many-Core Era

6/9/2009

Page 2: Offline Test and Diagnose of Network-on-Chips

Agenda

1. Motivation

2. Testing Inter-Switch Wire Segments

3. Testing Switch Blocks

4. Conclusion

Offline Test and Diagnosis of Network-on-Chips6/9/2009 2

Page 3: Offline Test and Diagnose of Network-on-Chips

Agenda

1. Motivation

2. Testing Inter-Switch Wire Segments

3. Testing Switch Blocks

4. Conclusion

Offline Test and Diagnosis of Network-on-Chips6/9/2009 3

Page 4: Offline Test and Diagnose of Network-on-Chips

1. Motivation

Offline Test and Diagnosis of Network-on-Chips6/9/2009 4

Network-on-Chip (NoC) TestingTechnology miniaturization

Post fabrication testing

Low cost testing

Use NoCs in critical applications

Types of Test:Offline Test

Online Test

Page 5: Offline Test and Diagnose of Network-on-Chips

1. Tradeoffs

NoC NoC

Test Source

Test Source

External Test

Methodology

Built-In-Self-Test

Methodology

Offline Test and Diagnosis of Network-on-Chips6/9/2009 5

Small Area OverheadHigher Test Time

Large Area OverheadLower Test Time

The network is used as a Test-Access-Mechanism (TAM)

Minimized by using parallelism (Multicast)

Unlimited parallel test

Minimized by using shared test units instead of dedicatedunits

Page 6: Offline Test and Diagnose of Network-on-Chips

1. Tradeoffs

NoC NoC

Test Source

Test Source

External Test

Methodology

Built-In-Self-Test

Methodology

Offline Test and Diagnosis of Network-on-Chips6/9/2009 5

Small Area OverheadHigher Test Time

Large Area OverheadLower Test Time

Tradeoff(test area overhead & testing time)

Page 7: Offline Test and Diagnose of Network-on-Chips

1. NoC Testing

Blocks to be tested in a NoC:1. Inter-switch wire segments

2. Switches

3. IP Cores with the RNIs

IP Core

RNI

SW

IP Core

RNI

SW

IP Core

RNI

SW

IP Core

RNI

SW

Offline Test and Diagnosis of Network-on-Chips6/9/2009 6

IP: Intellectual PropertyRNI: Resource Network Interface

Page 8: Offline Test and Diagnose of Network-on-Chips

Agenda

1. Motivation

2. Testing Inter-Switch Wire Segments

3. Testing Switch Blocks

4. Conclusion

Offline Test and Diagnosis of Network-on-Chips6/9/2009 7

Page 9: Offline Test and Diagnose of Network-on-Chips

2.1 Maximum Aggressor Fault Model (MAF)

This fault model addresses worst case

Crosstalk errors during data transmission

Y2 is called Victim Line & Y1,Y3 are called Aggressor Lines

86/9/2009

Y1

Y2

Y3

Offline Test and Diagnosis of Network-on-Chips

[Cuviello, ICCAD’99]

Falling delay ( )

Positive glitch ( )

Rising delay ( )

Negative glitch ( )

Rising speed-up ( )

Falling speed-up ( )

Page 10: Offline Test and Diagnose of Network-on-Chips

2.2 Testing Inter-Switch Wire Segments

Built-In-Self-Test (BIST) Structure

TDG: Test Data Generator

TED: Test Error Detector

96/9/2009

Driver Load

TEDTDG

Swit

ch

Swit

ch Inter-switchLink

Offline Test and Diagnosis of Network-on-Chips

[Grecu, VTS’06]

Page 11: Offline Test and Diagnose of Network-on-Chips

2.2 The BIST Structure

Implementation of the TDG and TEDFinite state machine

Counter

Barrel shifter

XOR network for comparison (TED only)

106/9/2009

0 0 0 0 0

1 1 1 1 1

0 0 0 0 0

1 1 0 1 1

0 0 1 0 0

1 1 0 1 1

1 1 1 1 1

0 0 1 0 0

s1

s2

s3

s4

s5

s6

s7

s8

statei-2 … i … i+2

s0

s1s2

s3

s4s5

s8

s6

s7

Count = n-1T_start

T_startLine:

Offline Test and Diagnosis of Network-on-Chips

[Grecu, VTS’06]

Reminder

Y1

Y2

Y3

Falling delay ( )

Page 12: Offline Test and Diagnose of Network-on-Chips

Agenda

1. Motivation

2. Testing Inter-Switch Wire Segments

3. Testing Switch Blocks

4. Conclusion

Offline Test and Diagnosis of Network-on-Chips6/9/2009 11

Page 13: Offline Test and Diagnose of Network-on-Chips

3.1 Switch Architecture

Main Blocks:1. FIFO Buffers

2. Routing Logic Block

3. Multiplexers withintra-switch links

MUX

M U X

M U X

MUX

FIFO Out

FIFO In

FIFO

In

FIFO

Out

FIFO Out

FIFO In

FIFO

Out

FIFO

In

East

South

North

West

NERS

(W)

NRSW(E)

N E R W (S)

E R S W (N)

RoutingLogicBlock(RLB)

Offline Test and Diagnosis of Network-on-Chips6/9/2009 12

[Raik, ETS’07]

Page 14: Offline Test and Diagnose of Network-on-Chips

3.3 Testing the RLBs

External-based test structure

Test source is connected to theTest Access Switch (TAS)

Each Switch forwards the test-vectors to all neighbor switches NoC

SW1

SW2

SW3

SW4

TestSource

TAS

Offline Test and Diagnosis of Network-on-Chips6/9/2009 13

CLK

SW2Input

Output

t0 t1 t2 t3 cap

t0 t1 t2 t3 idle r0.1 r0.2 r0.3

Comparing the incoming

result with the captured

result

SW1Input

Output

t0 t1 t2 t3 cap r0.1 r0.2 r0.3

SW3Input

Output

t0 t1 t2 t3 cap

t0 t1 t2 t3 idle

r0.1 r0.2 r0.3

r1.1 r1.2 r1.3

SW4Input

Output

t0 t1 t2 t3 cap r1.1 r1.2 r1.3

[Hosseinabady, DATE’06]

Page 15: Offline Test and Diagnose of Network-on-Chips

3.3 Diagnose the Faulty RLB

Assume single fault

Offline Test and Diagnosis of Network-on-Chips6/9/2009 14

NoC

SW1

SW2

SW3

SW4

TestSource

TAS

SW1 is Faulty

+ve

-ve

feedback

feedback

Page 16: Offline Test and Diagnose of Network-on-Chips

3.3 Diagnose the Faulty RLB

Assume single fault

Offline Test and Diagnosis of Network-on-Chips6/9/2009 14

NoC

SW1

SW2

SW3

SW4

TestSource

TAS

SW1 is FaultySW3 is Faulty

+ve

-ve

feedback

feedback

Page 17: Offline Test and Diagnose of Network-on-Chips

3.3 Diagnose the Faulty RLB

Assume single fault

Offline Test and Diagnosis of Network-on-Chips6/9/2009 14

NoC

SW1

SW2

SW3

SW4

TestSource

TAS

SW1 is FaultySW3 is Faulty

??SW2 is Faulty +ve

-ve

feedback

feedback

Page 18: Offline Test and Diagnose of Network-on-Chips

3.3 Broadcast routing tree

Two different NoC topologies configured with optimum broadcast routing tree:

TAS

(1)

(1)(1)

(2)

(2) (2)

(2)

(2)

(2)

(3)(3)(3)

(3)

(1)

(1)

(1)

(2)

(2)(2)

(2)

TAS

Topology 1 Topology 2

Offline Test and Diagnosis of Network-on-Chips6/9/2009 15

[Hosseinabady, DATE’06]

Page 19: Offline Test and Diagnose of Network-on-Chips

3.4 Testing the Multiplexers and Intra-Switch Links

Multiplexers:Distinguished value at the tested input

All possible inputs should be tested

Intra-Switch Link:Physical link connecting I/O portsinside the switch

Offline Test and Diagnosis of Network-on-Chips6/9/2009 16

MUX

00

01

10

11

In1=X

In2=X

In3=X

In4=X

Out

Address = 00 Targeted

structural

faults

N

S R

EW

Link

[Raik, ETS’07]

Page 20: Offline Test and Diagnose of Network-on-Chips

3.4 Testing Configurations

Offline Test and Diagnosis of Network-on-Chips6/9/2009 17

N

S R

EW

Deterministic

XY/YX-Routings

N

S R

EW

Straight Path

Routings

N

S R

EW

Resource

I/O-Routings

Page 21: Offline Test and Diagnose of Network-on-Chips

N

R

EW

3.4 Testing Configurations

Configuration 1: Straight Path Routings

Offline Test and Diagnosis of Network-on-Chips6/9/2009 18

S

N

S R

EW

N

S R

EW

4-possibilities

Deterministic

XY/YX-Routings

Straight Path

Routings

Resource

I/O-Routings

Page 22: Offline Test and Diagnose of Network-on-Chips

N

R

EW

3.4 Testing Configurations

Configuration 2: Deterministic XY/YX Routings

Offline Test and Diagnosis of Network-on-Chips6/9/2009 19

S

N

S R

EW

N

S R

EW

4-possibilities8-possibilities

Deterministic

XY/YX-Routings

Straight Path

Routings

Resource

I/O-Routings

Page 23: Offline Test and Diagnose of Network-on-Chips

N

R

EW

3.4 Testing Configurations

Configuration 3: Resource I/O Routings

Offline Test and Diagnosis of Network-on-Chips6/9/2009 20

S

N

S R

EW

N

S R

EW

4-possibilities8-possibilities 8-possibilities

20 total possibilities

Deterministic

XY/YX-Routings

Straight Path

Routings

Resource

I/O-Routings

Page 24: Offline Test and Diagnose of Network-on-Chips

3.4 Testing Configurations

Three test configurations for mesh-based network

Offline Test and Diagnosis of Network-on-Chips6/9/2009 21

2.

Deterministic

XY/YX-Routings

1.

Straight Path

Routings

3.

Resource

I/O-Routings

[Raik, ETS’07]

Page 25: Offline Test and Diagnose of Network-on-Chips

End

Start

Row: r++

Error E-Wr,?

Error E-Wr,k-1 Error E-Wr,n

Error E-Nr,k

Error N-Er,k

Error R-E,W,Sr,k

Error N-Rr,k

E-W Test

E-N Test

N-E Test

ResourceTest

E-N Test

Fail Pass

Pass all turnsFailat

Turn‘k’ Fail

atTurn‘k’ Mismatch

at singleO/P Mismatch

betweenI/P & O/P

Pass

Pass all turns

Failat

Turn‘k’

All rows tested: r=n

r<n

6/9/2009 22Offline Test and Diagnosis of Network-on-Chips

3.4 Diagnose Faulty Intra-Switch Link

Passall

turns

Intra-switch faulty linkTest vector 101010…

[Raik, ETS’07]

Example

Page 26: Offline Test and Diagnose of Network-on-Chips

End

Start

Row: r++

Error E-Wr,?

Error E-Wr,k-1 Error E-Wr,n

Error E-Nr,k

Error N-Er,k

Error R-E,W,Sr,k

Error N-Rr,k

E-W Test

E-N Test

N-E Test

ResourceTest

E-N Test

Fail Pass

Pass all turnsFailat

Turn‘k’ Fail

atTurn‘k’ Mismatch

at singleO/P Mismatch

betweenI/P & O/P

Pass

Pass all turns

Failat

Turn‘k’

All rows tested: r=n

r<n

6/9/2009 22Offline Test and Diagnosis of Network-on-Chips

3.4 Diagnose Faulty Intra-Switch Link

Passall

turns

Intra-switch faulty linkTest vector 101010…

[Raik, ETS’07]

Example

Page 27: Offline Test and Diagnose of Network-on-Chips

End

Start

Row: r++

Error E-Wr,?

Error E-Wr,k-1 Error E-Wr,n

Error E-Nr,k

Error N-Er,k

Error R-E,W,Sr,k

Error N-Rr,k

E-W Test

E-N Test

N-E Test

ResourceTest

E-N Test

Fail Pass

Pass all turnsFailat

Turn‘k’ Fail

atTurn‘k’ Mismatch

at singleO/P Mismatch

betweenI/P & O/P

Pass

Pass all turns

Failat

Turn‘k’

All rows tested: r=n

r<n

6/9/2009 22Offline Test and Diagnosis of Network-on-Chips

3.4 Diagnose Faulty Intra-Switch Link

Passall

turns

Intra-switch faulty linkTest vector 101010…

[Raik, ETS’07]

Example

Page 28: Offline Test and Diagnose of Network-on-Chips

Agenda

1. Motivation

2. Testing Inter-Switch Wire Segments

3. Testing Switch Blocks

4. Conclusion

Offline Test and Diagnosis of Network-on-Chips6/9/2009 23

Page 29: Offline Test and Diagnose of Network-on-Chips

4. Conclusion

Steps for the overall test strategy:

Offline Test and Diagnosis of Network-on-Chips6/9/2009 24

SW

TEDTDG

TEDTD

G

TED

TDG

SW

SWSW

1.Testing theInter-switch linksBy BIST structure

TEDTDG

BIS

TC

ontr

olle

r

SWFIFO

SWFIFO

SWFIFO

SWFIFO

LRA LRA

LRA LRA

2.Testing theFIFO BuffersBy BIST structure

2a 3

2b1TestSource

3.Testing the RLBBy external test

Test Source

VH-Test

XY-Test

4.Testing theMUXs andIntra-switch linksBy 3-test config.& Diagnose Algor.

Page 30: Offline Test and Diagnose of Network-on-Chips

Offline Test and Diagnosis of Network-on-Chips

Thank you

6/9/2009