the challenges of correlating silicon and models in … challenges of correlating silicon and models...

34
1 The Challenges of Correlating Silicon and Models in High Variability CMOS Processes Rob Aitken ARM R&D [email protected]

Upload: vunhi

Post on 29-Apr-2018

214 views

Category:

Documents


1 download

TRANSCRIPT

1

The Challenges of Correlating Silicon and Models in High

Variability CMOS Processes

Rob Aitken

ARM R&D

[email protected]

222

©2009Rob Aitken

Outline

§ Background

§ Variability

§ Characterization

§ Library validation

§ Debug examples

§ Conclusions

333

©2009Rob Aitken

Benchmark Results

Google Scholar Citations

100

1000

10000

100000

n10 n30 n50 n100 n300

Benchmark

Cit

ati

on

s unmodified

route

benchmark

opportunity

optimal

444

©2009Rob Aitken

Physical IP is more than NAND gates

§ 15+ foundries

§ Up to 8 process generations per foundry

§ Up to 11 variants in each generation

§ 6+ memory generators

§ 900+ standard cells per library

§ 200+ I/Os

§ 10+ views per cell

§ Plus PHY, analog IP, etc.

§ 300+ libraries per year, each with thousands of individual data elements

555

©2009Rob Aitken

Moore’s Law

§ Original Paper:

§ “Cramming more components onto integrated circuits”

§ Electronics, 38-8, 4/19/65

§ Tracks changes from 1959

to 1965 and predicts trend going forward

§ It’s still going…

666

©2009Rob Aitken

Nintendo GameBoy (1989)

CPU: 8-bit Z-80 processor, 1.05 MHz

Screen: 2.6" 160 x 144 LCD 4 b/w

Connectivity: 4 players by serial cable

Introductory price - $169

Nintendo DSi (2008/2009)

CPU: ARM9™ (133 MHz), ARM7™ (33MHz)

Screen: Two 3" 256 X 192 color LCDs

256MB Flash, AAC audio, 2 VGA cameras

Connectivity: Wifi, web browser, shopping

Introductory price - $169

>1000x performance

for the same price

Feedback: Moore’s Law and Consumer Expectations

© Nintendo

777

©2009Rob Aitken

Variability trends

§ Non Gaussian behavior

§ Local spread close to global

§ Reduced correlation

1000 Samples of Variation

0.00014

0.00015

0.00016

0.00017

0.00018

0.00019

0.0002

0.00E+00 2.00E-08 4.00E-08 6.00E-08 8.00E-08 1.00E-07 1.20E-07

Leakage

Idsat

global

local

1000 Samples of Variation

0.00014

0.00015

0.00016

0.00017

0.00018

0.00019

0.0002

0.00021

0.00E+00 2.00E-09 4.00E-09 6.00E-09 8.00E-09 1.00E-08 1.20E-08

Leakage

Idsat

global

local

45

90

888

©2009Rob Aitken

Background

§ Classes of test chip:

§ ARM has multiple classes of test chip

§Library qualification chips

§Processor qualification chips

§Experimental test chips

§ Chips:

§ 2 32nm tapeouts since 9/08

§ ~40 Tapeouts in 2008, mainly first group

§ Part of shuttle or multi-project wafer

§ Usually 40-100 packaged chips

§ Challenge:

§ Silicon validation of library elements (standard cells, memory, IO)

999

©2009Rob Aitken

Background cont.

§ The library qualification test chip program objectives

§ Verify basic functionality

§ Validate new architectures in silicon

§ Provide silicon correlation for timing and power measurements

§ Objectives for other programs (not these chips)

§ Serve as yield predictors or measure defect density

§ Evaluate transistor properties or develop SPICE models

§ Study lithography issues

§ Determine the statistical properties of a process (FEOL or BEOL)

§Measure reliability

101010

©2009Rob Aitken

Library characterization history

200

450475

550 550

650

900

0

100

200

300

400

500

600

700

800

900

1000

.35um .25um .18um .15um .13um 90nm 65nm

Cell Count

Disk Usage

CPU Usage

900*

45nm

111111

©2009Rob Aitken

Characterization History

§ Three delay numbers: slow, typ, fast

§ Just too inaccurate

§ Linear delay: f(cap)

§What about slew rates?

§ NLDM: table f(cap, slew), interpolate between points

§Which table? How many points?

§Multiple voltage support complicated

§ Complex interconnects not modeled well

§ Current source models (CCS, ECSM)

§More complex modeling of device behavior

§ Allows for more accuracy, especially at intermediate points

§Giant files

§ Statistical models…

121212

©2009Rob Aitken

How much accuracy do you need?

§ Depends on tools, stage of design flow

§ Need for accuracy varies

§ Preliminary floorplanner needs different accuracy than final extraction flow

§ Interconnect modeling needs good, but not perfect accuracy

§ 3D field solver not required, but need more than interpolation between table data points, especially for complex shapes, long distances

§ Accuracy is important, but needs to be defined correctly"0.015667, 0.020832, 0.030359, 0.048810, 0.086296, 0.161212, 0.311795", \

"0.017252, 0.023636, 0.033981, 0.052357, 0.089780, 0.164684, 0.315177", \

These are not “within X% of SPICE”, they are SPICE!

values("7.117695e-03, 1.329720e-02, 3.667810e-02, 4.492980e-02, 5.068780e-02, 5.180790e-02, 5.086270e-02, 4.749920e-02, 5.026640e-02, 4.425650e-02, 2.832520e-02, 2.129720e-02,

1.605390e-02, 9.854060e-03, 5.124280e-03, 2.224300e-03, 1.424661e-03");

So are these.

131313

©2009Rob Aitken

§ Interpolation accuracy: 2-3%

§ Temperature, voltage variation handled with current source models

§ Variability: SS versus TT: -20% to 40%

12

34

56

7

0.032

0.048

0.08

0.14

0.264

0.512

1

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

12

34

56

7

0.032

0.048

0.08

0.14

0.264

0.512

1

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

1234567

0.032

0.048

0.08

0.14

0.264

0.512

1

0.8

0.85

0.9

0.95

1

1.05

1.1

1.15

1.2

1.25

1.3

1.35

1.4

SS

TT

Accuracy and precision:NLDM interpolation versus transistor variability

141414

©2009Rob Aitken

0.0040.0120.0280.0640.1480.344

0.0003

0.0009

0.0027

0.0083

0.0253

0.0778

0.2387

40%

60%

80%

100%

120%

140%

160%

180%

200%

perf

orm

an

ce

rati

o A

to

B

transition (scaled)

load (scaled)

1.8-2

1.6-1.8

1.4-1.6

1.2-1.4

1-1.2

0.8-1

0.6-0.8

0.4-0.6

“Scaling” of standard cell delay

§ Relative delay

for equivalent cells in two different

technologies A and B

§ Is A slower

than B or

faster?

151515

©2009Rob Aitken

Silicon Validation of Libraries

§ Basic idea

§Measure silicon, compare with model prediction

§ Things to measure

§ Delay

§ Power

§Leakage

§Dynamic

§ Challenges

§Where does silicon fit in “corners”

§Measurement accuracy

§ Single point versus table

§Model versus SPICE

§ SPICE versus silicon

§ Parametric variation

§ Presence of “soft” defects

161616

©2009Rob Aitken

Overcoming challenges

§ Challenges

§ Where does silicon fit in “corners”

§Oscillator data, test structure data

§ Measurement accuracy

§Understand equipment, measure deltas

§Big challenge for power

§ Single point versus table

§Carefully select design point

§Shmoo across voltage, temperature

§ Model versus SPICE

§Understand characterization issues

§ SPICE versus silicon

§Work with foundries to understand

§ Parametric variation

§Design around local variation

§ Presence of “soft” defects

§Look for trends across chips

171717

©2009Rob Aitken

Validation in practice

§ Variability observed for similar objects across chips

§ “Correct” value somewhere in the middle

§ Does this validate it?

110

115

120

125

130

135

140

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

181818

©2009Rob Aitken

How Close is it to SPICE?

§ Significant difference between simulators

observed

§ Variety of issues represented

§Model file interpretation

§ Performance options

§ Extraction issues

§ Silicon variability

Delay versus Silicon

50%

60%

70%

80%

90%

100%

110%

120%

130%

140%

150%

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

Instance

Simulator 1

Simulator 2

Remember this thenext time some toolclaims to be within

X% of SPICE

191919

©2009Rob Aitken

Number of Contacts (Yield vs Speed)

§Typical questions after early data collection (small number of units)

§3% shift in mean but what is mean – 3σ?

Units 208/304/501 contact experiments

92%

94%

96%

98%

100%

102%

104%

106%

108%

oscila

tor_

inv_x1_cont1

1

oscila

tor_

inv_x1_cont1

2

oscila

tor_

inv_x1_cont2

1

oscila

tor_

inv_x1_cont2

2

Sta

ge D

ela

y, n

orm

alize

d

202020

©2009Rob Aitken

Min VDD for Different Design Styles

§ Question: Are any of these designs better?

§ Statistically: No

§ But: More data might give the edge to design 4

0.46

0.47

0.48

0.49

0.5

0.51

0.52

0.53

0.54

1 2 3 4 5

Instance Class

Min

VD

D

Design 1

Design 2

Design 3

Design 4

212121

©2009Rob Aitken

Variability and Validation

§ Look for correlation

§ 65nm data

§ Result: small, but measurable, distance-based effect observed

Delay Correlation vs. Circuit Distance (SEC, RVT, -40C, 1.08V, H2L)

0.955

0.96

0.965

0.97

0.975

0.98

0.985

0.99

0.995

1

1.005

0 200 400 600 800 1000 1200 1400 1600 1800 2000

Circuit block distance (um)

Co

rre

lati

on

co

eff

icie

nt

222222

©2009Rob Aitken

Sources of Variability

§ Lithography

§ Line edge roughness

§ CD variation

§ Influence of neighbors

§ Device

§Well boundary effects

§ Variation between N and P

§ Stress/strain effects

§ Interconnect

§ Dielectric variation

§ Via/contact quality

§Metal width/height variation

§ Deterministic versus Random

232323

©2009Rob Aitken

Effects of Variability

§Leakage

§ Variation in L, Vt, µ, tox

§Performance

§ Changes in L, W, R, C, Vt, µ,

§Min VDD

§ Changes in Vt, L, W

§ SRAM bit cell main limiter

§Dynamic power

§ Changes in C

§ Side effect of changes in performance, leakage

§Yield

§ Indirect result of others

§ Parameter goes beyond spec + tolerance

αµ )( tgsoxd VVL

WCI −

⋅≈

+3σσσσ

-3σσσσ

242424

©2009Rob Aitken

Local Variation Dominates in VDSM

§ Local variation (within chip) is nearly as much as global variation (between chips) at 45nm

Histogram of Leakage

0

50

100

150

200

250

0.5

1.5

2.5

3.5

4.5

5.5

6.5

7.5

8.5

9.5

10.5

local

global

Histogram of Idsat

0

20

40

60

80

100

120

140

160

1.40

E-0

41.

44E

-04

1.48

E-0

41.

52E

-04

1.56

E-0

41.

60E

-04

1.64

E-0

41.

68E

-04

1.72

E-0

41.

76E

-04

1.80

E-0

41.

84E

-04

1.88

E-0

4

local

global

1000 Monte Carlo samples, 45nm technology

252525

©2009Rob Aitken

Critical Defect Behavior

17201730

1750

1770

18002000good

250ps

1750

600ps

1720

400ps

1730

<10ps70ps150psSA1Delay

3000200018001700Resistance (Ω)

Nominal delay Worst-case delay

262626

©2009Rob Aitken

Critical Variability

§ Designed for a predetermined operating point, plus margin

§ Example: SS Corner, 0.9V, 125C, 0 slack

§ Case 1: TT silicon, 0.9V, 85C

§>2X nominal delay will still function correctly

§ Case 2: SS silicon, 0.9V, 85C

§~20% extra delay will cause failure

§ Influential factors:

§Process, voltage, temperature, slack, noise

§ Expected silicon distribution: SS < 1%, TT(+/-) > 50%

§Might expect a 1-2% yield hit if SS corner just misses timing

§Guaranteed silicon distribution: None (usually)

§Could wind up with no yield 1-2% of the time (1 week per year)

§Or worse…

272727

©2009Rob Aitken

Outline

§ Background

§ Variability

§ Characterization

§ Library validation

§ Debug examples

§ Conclusions

282828

©2009Rob Aitken

Key to Memory Debug: Bit Mapping

§ Pass/Fail information is adequate for production testing

§ For yield improvement, debug, system bring-up, etc., it is also

useful to be able to identify each failing address and data

pattern

§ Need logical to physical mapping for this also

§ Some common patterns are shown below:

Single Cell Entire Column Half Row

292929

©2009Rob Aitken

Learning From Bit Maps

Vertical Pair:

Bit Line ContactPartial Column:

Resistive Bit Line Short

Catastrophic:

Timing circuit

Multi-Row:

Address Decoder

Swath:

CMP ScratchEntire Bit:

Sense amp, I/O

303030

©2009Rob Aitken

What can happen?

§ Temperature related leakage problem

§ Root causes

§ 5-10X worst case leakage

§ Circuit marginality

0.001

0.01

0.1

1

Worse case

sub-threshold

current

Normalizedsub-threshold

current

-20 0 20 40 60 80

Temperature

313131

©2009Rob Aitken

Lithography troubles

§Early silicon may be affected by

incomplete or improper processing

§This is less common later in

process cycle (memory optimized

first)

§May still be an issue for logic

323232

©2009Rob Aitken

Power Design

§ The good and the bad of power connection

§ Memory provider can only account for so much!

§ More margin = less performance

333333

©2009Rob Aitken

Analog problem: Read disturb fault

§ Data node voltage increases with successive reads

§ Given time, settles back to zero

§ Root cause: defective ground contact

1st read 250 mV

2nd read 520 mV

343434

©2009Rob Aitken

Conclusions

§ Validation is more complicated than you’d think

§ Variation increasingly important

§ Understanding sources of variation helps

§ An effective debug methodology helps when new troubles

arise

§Tools

§Infrastructure (silicon, equipment,

software)

§Experience

§ Feel free to send questions to [email protected]