good dram interface tutorial

91
Introduction on Introduction on DRAM Interface DRAM Interface Won-Joo Yun 2011. 11. 18

Upload: ravisguptaji

Post on 16-Jul-2016

103 views

Category:

Documents


13 download

DESCRIPTION

DRAM Basics

TRANSCRIPT

Page 1: GOOD DRAM Interface Tutorial

Introduction onIntroduction onDRAM InterfaceDRAM Interface

Won-Joo Yun2011. 11. 18

Page 2: GOOD DRAM Interface Tutorial

What is What is ‘‘MemoryMemory ’’??

• 1. the mental capacity or faculty of retaining and reviving facts, events, impressions, etc., or of recalling or recognizing previous experiences.

• …

• Also called computer memory, storage.– a. the capacity of a computer to store information subject to recall.– b. the components of the computer in which such information is stored.

[dictionary.com]

2011-11-18 2

Page 3: GOOD DRAM Interface Tutorial

What is What is ‘‘DRAMDRAM’’??

• Dynamic Random Access Memory– RAM

• Unlike electromagnetic tape or disk, it allows stored data to be accessed in any order (i.e. at random)

• “Random” refers to the idea that any piece of data can be returned in a constant time, regardless of its physical location and whether it is related to the previous piece of data [wikipedia.com]

– Dynamic• vs. static

• needs refresh• the charge stored on the input

capacitance will leak off over time

2011-11-18 3

[3Tr Cell of 1k DRAM]

Page 4: GOOD DRAM Interface Tutorial

What is What is ‘‘DRAMDRAM’’??

2011-11-18 4

Sequential access

long time to accessdifferent access time of locations

Random access

constant time to access regardless of locations

Page 5: GOOD DRAM Interface Tutorial

Semiconductor memorySemiconductor memory

2011-11-18 5

RAM

DRAM 1 Tr. + 1 Cap. Dynamic (Need refresh)Volatile

SRAM 4 Tr. or 6 Tr. Static

FeRAM 1 Tr. + 1 Cap. Almost Static

Non-VolatileROM

Mask ROM1 Tr. (Single

Poly)Not Erasable

EPROM 1 Tr. (Dual Poly) Erasable by UV

EEPROM 1 Tr. (Dual Poly) Electrically Erasable (by bit)

FLASH 1 Tr. (Dual Poly)Electrically Erasable (by

block)

Page 6: GOOD DRAM Interface Tutorial

Memory comparisonMemory comparison

2011-11-18 6

DRAMDRAM SRAMSRAM FLASHFLASH FeRAMFeRAM MRAMMRAM PRAMPRAM

Mechanism for data storage

charge and discharge of

Cap.

switching of cross-coupled

inv.

charge and discharge of

F.G.

Dipole switching of Ferro-Cap

resistivity with magnetic

polarization state

resistivity with chalcogenide

material phase change

Access Time < 100ns < 50ns < 100ns < 100ns < 50ns < 100ns

Write Time < 100ns < 50ns < 10us < 100ns < 50ns < 500ns

Erase Time No need No need ~ms No need No need No need

# of RD/WR operation

R&W infinite(> 1015)

R&W infinite(> 1015)

106 ~ 1010 1012 ~ 1016 R&W infinite(> 1015)

109 ~ 1011

Data Retention

Timeneed refresh

need not refresh

~ 10 years ~ 10 years ~ 10 years ~ 10 years

Operating Current

~ 100mA ~ 100mA ~ 10mA ~ 10mA ~ 10mA ~ 10mA

Standby Current

~ 200uA ~ 10uA ~ 10uA ~ 10uA ~ 10uA ~ 10uA

[Hynix]

Page 7: GOOD DRAM Interface Tutorial

Memory cell structureMemory cell structure

2011-11-18 7

DRAM : Dynamic Random Access MemorySRAM : Static Random Access MemoryNVM : Non-Volatile Memory, Flash [Hynix]

Page 8: GOOD DRAM Interface Tutorial

ComparisonsComparisons

2011-11-18 8

Intel Penryn Dual Coreprocess : 45nmdie area : 107mm 2

6MB L2 cache� 48Mb/38.5mm 2 = 1.25Mb/mm 2

Micron DDR3 SDRAMprocess : 42nmdie area : 49.2mm 2

4Gb� 4Gb/43.3mm 2 = 92Mb/mm 2

Intel-Micron (IM) Flashprocess : 25nmdie area : 167mm 2

64Gb� 64Gb/141mm 2 = 454Mb/mm 2

[Intel, Micron]

Page 9: GOOD DRAM Interface Tutorial

Standard DRAM genealogyStandard DRAM genealogy

2011-11-18 9

Asynchronous

Synchronous

Page 10: GOOD DRAM Interface Tutorial

DRAM technology evolutionDRAM technology evolution

2011-11-18 10

Density 1K 4K 16K 64K 256K 1M 4M 16M 64M 256M 2G

Year 1971 1975 1979 1982 1985 1988 1991 1994 1997 2000 2005

Design Rule(um)

10 8 5 3 2 1 0.8 0.5 0.30 0.18 0.08

Chip Size(mm2)

10 13 26 30 35 50 70 110 140 160 200

Cell Size(um2)

3000 860 400 180 65 25 10 2.5 0.72 0.26 0.05

Power Supply(V)

20 12 5 3.3 2.5 1.5

OperationMode

SRAM Page Mode Fast Page Mode EDO SDRDDRDRD

DDR3

Gate Oxide(nm)

120 100 75 35 30 20 16 12 9 7 4

Cell Type 3Tr 1Tr Planar Capacitor 3-D Capacitor High ε

[Hynix]

Page 11: GOOD DRAM Interface Tutorial

DRAM voltage trendDRAM voltage trend

2011-11-18 11

[Hynix]

Page 12: GOOD DRAM Interface Tutorial

Lower voltage means slower device speedLower voltage means slower device speed

2011-11-18 12

[Hynix]

Page 13: GOOD DRAM Interface Tutorial

DRAM CellDRAM Cell

2011-11-18 13

DRAM unit cell : 1 Cell Transistor + 1 Capacitor Invented in 1968 – R. H. Dennard/IBMUS Patent # 3,387,286

Page 14: GOOD DRAM Interface Tutorial

DRAM core structureDRAM core structure

2011-11-18 14

1) Memory Cell : 1T, 1C

2) X Decoder & Word Line

3) Bit Line

4) Sense Amp

5) Column Select

Page 15: GOOD DRAM Interface Tutorial

DRAM core operationDRAM core operation

2011-11-18 15

1) Bit line floating

2) Word line select: Charge sharing

3) DRAM sensing: Write recovery

4) Column select: data read (or write)

Page 16: GOOD DRAM Interface Tutorial

Charge sharingCharge sharing

2011-11-18 16

BL

BL

Cell

Cell

‘1’

‘0’

Page 17: GOOD DRAM Interface Tutorial

Charge sharingCharge sharing

2011-11-18 17

BLCs

Cb

WLCb Cs

VBLP=½ VCC VCELL=VCC, 0

Stand byQ=C*V=Cb*VBLP + Cs*VCELL

Cb Cs

VBL=VCELL=Vout

Word line turn onQ=C*V=(Cb + Cs)*Vout

Vout =Cb*VBLP + Cs*VCELL

Cb + Cs

Page 18: GOOD DRAM Interface Tutorial

BL SA operationBL SA operation

2011-11-18 18

Cross-coupled sense amp

[Hynix]

Page 19: GOOD DRAM Interface Tutorial

Memory I/O Interface

Page 20: GOOD DRAM Interface Tutorial

Why Why ““ SynchronousSynchronous ”” ??

2011-11-18 20

• Asynchronous DRAM

– Page Mode DRAM

– Fast Page Mode DRAM– EDO(Extended Data Out) DRAM

• Synchronous DRAM

– SDRAM– DDR SDRAM

– Rambus DRAM

• Synchronous DRAM can output more data[Hynix]

Page 21: GOOD DRAM Interface Tutorial

Conventional DRAM circuitsConventional DRAM circuits• Cell Array

– (sub) Matrix Array,, Cell (1T1C), WL, bit line (Folded )– Cap. -Data retention DRAM Tech. Core part.– SA array -- DRAM sensing, Refresh all page Cell

• Decoder Mux, Add input– Pre decoding � Decoding.– Redundancy, internal refresh counter,– Row address path, Col add path

• Data I/O– Read/write , + Data bus sense amp , block write Driver– Various DRAM according to Col.(add./data path) control– Fast page, EDO, SDRAM, DDR, DDR2,– PKG option x4, x8, x16

• Control circuits– Operation of Read, Write, Refresh (Timing & Selection) according to /RAS, /CAS, /WE

• Internal bias voltage– Vbb, Vpp, Vblp, Vcp, Vint, Vref

2011-11-18 21

Page 22: GOOD DRAM Interface Tutorial

SDRAM features +SDRAM features +• Pipeline

– In previous DRAM, column address path time determines data freq.– With partitioning internal path, data are outputted every clock cycle after 2 or 3 clocks

• Clock input– Up to EDO, input signals are directly controlled by /RAS, /CAS, /WE– Changed to command (referenced rising edge of clock) � various operation and simple

spec.• Multi bank (2/4)

– Independent row access is consisted of multiple bank � increase the size of page– capable of continuous operation with hiding pre-charge time

• Mode register set– Programmable /CAS latency and burst length suitable for system environments (clock

frequency)• Internal address generator

– Internally generates sequential column address for Burst (fast column access) operation• I/O Power

– Dedicated power of data (Vccq, Vssq) for stable operation

2011-11-18 22

Page 23: GOOD DRAM Interface Tutorial

PipelinePipeline

2011-11-18 23

-. separate signals having long access time for faster input command

Page 24: GOOD DRAM Interface Tutorial

MultiMulti --bank Architecturebank Architecture

2011-11-18 24

-. Bank is a unit which can be active independently and has same data bus width as external output bus width-. Interleaving bank operation � while one bank accessed, another active

[Hynix]

Page 25: GOOD DRAM Interface Tutorial

DRAM clock speed trendsDRAM clock speed trends

2011-11-18 25

[K-h Kim, et al. JSSC 2007] [Hynix]

Page 26: GOOD DRAM Interface Tutorial

DDR features +DDR features +• DDR data I/O

– Double data rate = rising & falling edge of clock– Twice performance compared to SDRAM

• DDR performance by 2n-bit pre-fetch• On chip clock by DLL

– Frequency not limited to the access time• SSTL interface

– Input reference voltage– guarantee of dout data window, termination

• Differential input– reference by VREF

• data strobe by DQS– edge align, bi-directional, source synchronous

• Differential clock– CLK, /CLK

• EMRS control– Dout driver size & DLL

2011-11-18 26

Page 27: GOOD DRAM Interface Tutorial

SDR/DDR/DDR2/DDR3 operationSDR/DDR/DDR2/DDR3 operation

2011-11-18 27

Data Rate

Page 28: GOOD DRAM Interface Tutorial

SDR/DDR/DDR2/DDR3 operationSDR/DDR/DDR2/DDR3 operation

2011-11-18 28

DDR(2b pre-fetch)

DDR2(4b pre-fetch)

DDR3(8b pre-fetch)

Page 29: GOOD DRAM Interface Tutorial

SDR/DDR/DDR2/DDR3 operationSDR/DDR/DDR2/DDR3 operation

2011-11-18 29

High bandwidth concept : pre-fetch-. Fetch cycle means one column cycle that is executed by read or write command issue

Page 30: GOOD DRAM Interface Tutorial

Memory I/O interfaceMemory I/O interface

• Memory clocking system– Source synchronous scheme– DLL supports– Impedance control

• Design trends on graphics memory– Low power techniques

• Input – clock – output

– Low cost techniques• clock

– Low jitter & high performance techniques• clock – output• power distribution network

2011-11-18 30

Page 31: GOOD DRAM Interface Tutorial

Common clock schemeCommon clock scheme

• Data transfer is performed relative to a single master “clock”signal (Synchronously)

2011-11-18 31

[Hynix]

Page 32: GOOD DRAM Interface Tutorial

Common clock schemeCommon clock scheme

• Timing budget

2011-11-18 32

[Hynix]

Page 33: GOOD DRAM Interface Tutorial

SSource ource SSynchronousynchronous

2011-11-18 33

• Common (master) clock is not used for data transfer• Devices have an additional strobe pin• Minimizing differences in routed length & layer characteristics between strobe and data signals is required

• Data / STB are synchronized at driver• Device speed (fast, slow) is irrelevant since data & STB are supplied by the same device• The significant issue is the accumulated skew between data & STB as the signals travel between devices

Page 34: GOOD DRAM Interface Tutorial

Source SynchronousSource Synchronous

• Timing budget– ideal case : tSTB=tDATA

• � Maximum speed is limited only by setup + hold time

– real : Maximum speed is limited by setup + hold + |tSTB - tDATA|max + …

2011-11-18 34

Page 35: GOOD DRAM Interface Tutorial

DLL supports SS schemeDLL supports SS scheme

2011-11-18 35

External CLK

Internal CLK(no DLL)

Internal CLK(w/ DLL)

DQ

Desired DQ

tD1

tD2

Data 1 Data 2 Data 3 Data 4

Data 1 Data 2 Data 3 Data 4

tD2

tCK-(tD1 +tD

2 )

tD1 + tD2 have large P.V.T. variations

Page 36: GOOD DRAM Interface Tutorial

DRAM interface on channelDRAM interface on channel

2011-11-18 36

[Hynix]

Page 37: GOOD DRAM Interface Tutorial

TX driver (with impedance control)TX driver (with impedance control)

2011-11-18 37

[Hynix]

Page 38: GOOD DRAM Interface Tutorial

Read data eye measurement (DDR)Read data eye measurement (DDR)

2011-11-18 38

[Hynix]

Page 39: GOOD DRAM Interface Tutorial

DDR2 OnDDR2 On--Die Termination (ODT)Die Termination (ODT)

2011-11-18 39

[Samsung]

On board termination resistance is integrated inside of DRAM

Page 40: GOOD DRAM Interface Tutorial

ODT value selection and on/off ctrl.ODT value selection and on/off ctrl.

2011-11-18 40

[Samsung]

Page 41: GOOD DRAM Interface Tutorial

ODT case study @DDR2ODT case study @DDR2 --667 writes667 writes

2011-11-18 41

[Samsung]

• For two slot population, 50ohm seems to be better than 75ohm in terms of signal integrity• For one slot population, 150ohm seems OK

Page 42: GOOD DRAM Interface Tutorial

Signal integritySignal integrity

2011-11-18 42

[Hynix]

Page 43: GOOD DRAM Interface Tutorial

Interface for Graphics memory

Page 44: GOOD DRAM Interface Tutorial

GDDR3 applicationsGDDR3 applications

2011-11-18 44

Game ConsolesGame Consoles Laptop / MobileLaptop / Mobile

High-End / D-THigh-End / D-T

DDR

data

rate (

Gbps

)

Page 45: GOOD DRAM Interface Tutorial

Interface for Graphics memoryInterface for Graphics memory

• Challenges for Graphics Memory– High-speed over 2Gbps for GDDR3, 4 Gbps for GDDR5– Low voltage under 1.5V for GDDR5, 1.8V for GDDR3– Low current consumption– Good quality of clock itself– Robust operation against various noisy environments– guarantee of operation under various power down mode

2011-11-18 45

71.35V

Page 46: GOOD DRAM Interface Tutorial

Design trendsDesign trends

2011-11-18 46

Low CostLow Cost-- Small areaSmall area-- Design for testabilityDesign for testability

High PerformanceHigh Performance-- Robust DLLRobust DLL-- Low jitter DLLLow jitter DLL-- Good quality of DCCGood quality of DCC-- Low SSO noiseLow SSO noise

Low PowerLow Power-- Reduce operating currentReduce operating current-- Guarantee operations at low voltageGuarantee operations at low voltage-- Data outputData output

Low heat, low voltage dropDVS (Dynamic Voltage Swing) at mobile app.Data Bus Inversion

Die cost downTest cost down

Wide data valid window

Page 47: GOOD DRAM Interface Tutorial

Clocking systems for DRAM interfaceClocking systems for DRAM interface

2011-11-18 47

Input• Input clock buffer

• robust clock generation from poor input signal• support low power mode

Clocks• DLL / PLL

• delay (phase) compensation• wide operation range (voltage / frequency)• good quality of clock signal � low jitter, duty-corrected

clock

Output• Clock control

• output enable • Driver

• Impedance matching• Multi slew-rate• Data Bus Inversion

Page 48: GOOD DRAM Interface Tutorial

Low power techniquesLow power techniques

• General concept– Power consumption = V(supply voltage) X I(current)

• Input– Buffer

• in mobile : just inverters• in graphics : low current two-stage amps

– Buffer with low power mode• guarantee of low power function in mobile applications• stable operation under off-terminated environments

• Clock– DLL

• Architecture for low power consumption• Systematically low power operation

• Output– Data Bus Inversion DC mode

2011-11-18 48

Page 49: GOOD DRAM Interface Tutorial

Low power in clock (DLL)Low power in clock (DLL)

• Architecture – compact circuits and architecture– In digital DLL, Dual-loop � Single-loop– lower VDD than external VDD

• Vperi using regulated power

– decrease internal frequency [GDDR4]

– minimize voltage drop

• Smart power down control [GDDR3]

2011-11-18 49

Power Consumption vs. tCK

0

10

20

30

40

50

60

70

80

90

100

1.0 1.1 1.2 1.4 1.6 2.0 10.0tCK (ns)

mW

Proposed onePrevious one

VDD = 1.5V, Temp= 25 °C

Power Consumption vs. tCK

tCK (ns)

4.2mW 20mW

Power Consumption vs. tCK

0

10

20

30

40

50

60

70

80

90

100

1.0 1.1 1.2 1.4 1.6 2.0 10.0tCK (ns)

mW

Proposed onePrevious one

VDD = 1.5V, Temp= 25 °C

Power Consumption vs. tCK

tCK (ns)

4.2mW 20mW

79% reduction

[ISSCC ‘08]

Page 50: GOOD DRAM Interface Tutorial

Low power in outputLow power in output

• DBI DC mode [GDDR4]

– data ‘0’ consumes current– maximum # of ‘0’ ≤ 4

2011-11-18 50

[SJ Bae. JSCC ‘08]

pseudo open drain I/O system

Page 51: GOOD DRAM Interface Tutorial

Receiver type comparisonReceiver type comparison

• Pseudo open drain (GDDR3) vs. Push-Pull (GDDR2)

– Assumption : Same channel condition for both cases.– It doesn’t represent absolute number of ODT power difference between

pseudo open drain case and push-pull case.2011-11-18 51

[Samsung]

Page 52: GOOD DRAM Interface Tutorial

Low power in outputLow power in output

• DBI DC mode [GDDR4]

– data ‘0’ consumes current– maximum # of ‘0’ ≤ 4

2011-11-18 52

[SJ Bae. JSCC ‘08]

pseudo open drain I/O system

Page 53: GOOD DRAM Interface Tutorial

Low jitter / High performanceLow jitter / High performance

• Clock– DLL

• Low jitter operation with dual-loop architecture• Power noise tolerant replica

• Dual DCC for stable duty error correction

– Dual-mode with DLL and PLL• DLL for phase lock, PLL for jitter reduction [ISSCC ‘09]

– Meshed power plan

• Output– Driver

• Data Bus Inversion AC mode : reduce SSO noise

2011-11-18 53

Page 54: GOOD DRAM Interface Tutorial

Low jitter in outputLow jitter in output

• DBI AC mode [GDDR4]

– reduce SSO noise– In data byte sequence, maximum # of change ≤ 4

2011-11-18 54

[SJ Bae. JSCC ‘08]

Power supply noise generation

Page 55: GOOD DRAM Interface Tutorial

GDDR5

Page 56: GOOD DRAM Interface Tutorial

JEDEC GDDR SGRAM comparisonJEDEC GDDR SGRAM comparison

2011-11-18 56

[AMD(ATi)]

Page 57: GOOD DRAM Interface Tutorial

JEDEC GDDR SGRAM comparisonJEDEC GDDR SGRAM comparison

2011-11-18 57

[AMD(ATi)]

Page 58: GOOD DRAM Interface Tutorial

Industry signal interface trendIndustry signal interface trend

2011-11-18 58

[AMD(ATi)]

Page 59: GOOD DRAM Interface Tutorial

GDDR5 GDDR5 –– key elements for reliable high key elements for reliable high speed data transmissionspeed data transmission

2011-11-18 59

[AMD(ATi), Qimonda]

Page 60: GOOD DRAM Interface Tutorial

Comparison GDDR3 vs. GDDR5Comparison GDDR3 vs. GDDR5

2011-11-18 60

Synchronization issues on every pin� “combs” on PCB

No needs of “combs” on PCB� cheaper solution with higher performance

[AMD(ATi)]

Page 61: GOOD DRAM Interface Tutorial

Clamshell mode (x16 mode)Clamshell mode (x16 mode)

2011-11-18 61

[AMD(ATi)]

Page 62: GOOD DRAM Interface Tutorial

Recent researches on DRAM I/FRecent researches on DRAM I/F

Ref Applications Conf. Year Issues

[6] GDDR3 ISSCC 2006 Latency control, 2.5Gbps

[5] GDDR3 ASSCC 2006Low power/Wide range DLL architecture,

3Gbps

[2] GDDR3 ISSCC 2008 Dual DCC, 3Gbps

[10] GDDR3 ISSCC 2008Multi-slew-rate output driver, impedance

control, 3Gbps

[1] GDDR3 ISSCC 2009 Dual PLL/DLL, pseudo-rank, 3.3Gbps

[9] Graphics ISSCC 2009pseudo-differential, common mode rejection,

referenceless, 6Gbps

[7] GDDR5 ESSCIRC 2009 CML CDN, 5.2Gbps

[8] GDDR5 VLSI 2009 Fast DCC, 7Gbps

[11] GDDR5 ISSCC 2010 GDDR5 Architecture, Bank control, 7Gbps

[12] GDDR5 VLSI 2010 Jitter and ISI reduction, 7Gbps

2011-11-18 62

Page 63: GOOD DRAM Interface Tutorial

DDR4

Page 64: GOOD DRAM Interface Tutorial

DDR4DDR4

2011-11-18 64

Page 65: GOOD DRAM Interface Tutorial

DDR4DDR4

2011-11-18 65

[PCwatch]

Page 66: GOOD DRAM Interface Tutorial

2011-11-18 66

Page 67: GOOD DRAM Interface Tutorial

2011-11-18 67

Page 68: GOOD DRAM Interface Tutorial

2011-11-18 68

Page 69: GOOD DRAM Interface Tutorial

SummarySummary

• DRAM Introduction• DRAM Evolutions• Memory Interface• Interface for graphics memory

– GDDR3• Low power, low cost, low jitter / high performance

– GDDR5• CDR for read (data training), external VPP, error correction, clamshell

mode

• DDR4 preview

2011-11-18 69

Page 70: GOOD DRAM Interface Tutorial

ReferencesReferences

• Web sites and published data from Hynix, Samsung, Rambus, Elpida, Micron, AMD(ATi), Intel, nVidia, SONY, Nintendo, Microsoft, Pcwatch, JEDEC

• “DRAM Circuit Design,”B. Keeth, R. J. Baker, B. Johnson, F. Lin, IEEE Press

• [1] H. W. Lee, et al. “A 1.6V 3.3Gb/s GDDR3 DRAM with Dual-Mode Phase- and Delay-Locked Loop Using Power-Noise Management with Unregulated Power Supply in 54nm CMOS,” ISSCC 2009

• [2] W. J. Yun, et al. “A 0.1-to-1.5GHz 4.2mW All-Digital DLL with Dual Duty-Cycle Correction Circuit and Update Gear Circuit for DRAM in 66nm CMOS Technology,”ISSCC 2008

• [3] S. J. Bae, et al. “An 80 nm 4 Gb/s/pin 32 bit 512 Mb GDDR4 Graphics DRAM With Low Power and Low Noise Data Bus Inversion,”JSSC 2008

• [4] K. h. Kim, et al. “An 8 Gb/s/pin 9.6 ns Row-Cycle 288 Mb Deca-Data Rate SDRAM With an I/O Error Detection Scheme,”JSSC 2007

• [5] W. J. Yun, et al. “A Low Power Digital DLL with Wide Locking Range for 3Gbps 512Mb GDDR3 SDRAM,”ASSCC 2006

2011-11-18 70

Page 71: GOOD DRAM Interface Tutorial

ReferencesReferences

• [6] D. U. Lee, et al. “A 2.5Gb/s/pin 256Mb GDDR3 SDRAM with Series Pipelined CAS Latency Control and Dual-Loop Digital DLL,”ISSCC 2006

• [7] K. H. Kim, et al. “A 5.2Gb/s GDDR5 SDRAM with CML Clock Distribution Network,” ESSCIRC 2009

• [8] D. Shin, et al. “Wide-Range Fast-Lock Duty-Cycle Corrector with Offset-Tolerant Duty-Cycle Detection Scheme for 54nm 7Gb/s GDDR5 DRAM Interface,”VLSI 2009

• [9] K. S. Ha, et al. “A 6Gb/s/pin Pseudo-Differential Signaling Using Common-Mode Noise Rejection Techniques Without Reference Signal for DRAM Interface,”ISSCC 2009

• [10] D. U. Lee, et al. “Multi-Slew-Rate Output Driver and Optimized Impedance-Calibration Circuit for 66nm 3.0Gb/s/pin DRAM Interface,” ISSCC 2008

• [11] T. Y. Oh, et al. “A 7Gb/s/pin GDDR5 SDRAM with 2.5ns Bank-to-Bank Active Time and No Bank-Group Restriction,”ISSCC 2010

• [12] S. J. Bae, et al. “A 40nm 7Gb/s/pin Single-ended Transceiver with Jitter and ISI Reduction Techniques for High-Speed DRAM Interface,” VLSI 2010

2011-11-18 71

Page 72: GOOD DRAM Interface Tutorial

Thank you

Page 73: GOOD DRAM Interface Tutorial

Appendix

Page 74: GOOD DRAM Interface Tutorial

SDRAM categorizationSDRAM categorization

• by Speed / Applications– … / DDR1 / DDR2 / DDR3 / DDR4 / …– GDDR1 / GDDR2 / GDDR3 / GDDR4 / GDDR5 / GDDR5+ / GDDR6 …– mDDR / LPDDR2 / …

• by Density– … / 256Mb / 512Mb / 1Gb / 2Gb / 4 ~ 8Gb / …

• by Bus-Width– x4 / x8 / x16 / x32 / …

2011-11-18 74

Page 75: GOOD DRAM Interface Tutorial

DRAM density & busDRAM density & bus --widthwidth

• bus-width – # of data output pins– determined by applications– for examples,

• PC : x64• Server : x64

• Graphics card : x64 / x128 / x256 / x512 / …

• Game consoles : x32 / x128 / …

2011-11-18 75

Page 76: GOOD DRAM Interface Tutorial

DRAM total densityDRAM total density

• DRAM device density X number of devices• for servers

– x4 configurations � to increase total memory capacity– x4 4Gb 16 devices can be used (64bit) : 4Gb X 16 = 8GB

• for laptops– x16 4Gb 4 devices can be used (64bit) : 4Gb X 4 = 2GB

• for PCs– x4 / x8 / x16 configurations

• for Graphics applications– x16 / x32 configurations– wide bus-width > total amounts of memory

2011-11-18 76

16GB

[Hynix]

Page 77: GOOD DRAM Interface Tutorial

DRAM total densityDRAM total density

2011-11-18 77

Component (bit) Module (byte=x8) Applications1Gb 256M x4 16ea. 256M x64 2GB Server2Gb DDP 512M x4 32ea. 1024M x64 8GB1Gb 128M x8 8ea. 128M x64 1GB PC1Gb 64M x16 4ea. 64M x64 512MB Notebook

Component (bit) Number Bus-width Total dens. Applications512Mb 16M x32 16ea./12ea. 512bit/384bit 1GB/768MB High-End512Mb 16M x32 8ea. 128bit (mirror) 512MB XBOX 360512Mb 16M x32 4ea. 128bit 256MB PS3512Mb 16M x32 1ea. 32bit 64MB Nintendo Wii

Graphics memory

Conventional memory

Page 78: GOOD DRAM Interface Tutorial

Data Data bandwidthbandwidth

• For example of GDDR3 on PS3– 700MHz/pin– � 1.4Gb/s/data channel(pin)– Each device has 32bit data I/O– � 1.4Gb/s X 32 = 44.8Gb/s/component– 4 components configurations (32bit X 4 = 128bit)– � 44.8Gb/s/component X 4 = 179.2Gb/s– Data bandwidth is 22.4GB/s

• To increase data bandwidth– Clock speed– Wide I/O– More components (in other words, wide I/O in total)

2011-11-18 78

[SONY]

Page 79: GOOD DRAM Interface Tutorial

Data bandwidthData bandwidth

• Increasing clock speed per pin– 700MHz � 1GHz– � 2.0Gb/s X 32 X 4 / 8 = 32GB/s– � ex) High-end graphics cards use 1.3GHz (2.6Gbps) [GDDR3]– � ex) 3.6 ~ 4.8Gbps [GDDR5] / up-to 7Gbps (@ES)

• Increasing I/O bits per component (wide I/O per component)– 32bit � 64bit– � 1.4Gb/s X 64 X 4 / 8 = 44.8GB/s– � 32bit is maximum in mass production– � x4 x 128 (x512) in TSV

• Increasing # of components (in other words, wide I/O in total)– 4 � 8– � 1.4Gb/s X 32 X 8 / 8 = 44.8GB/s– � to increase bus-width : ex) High-end graphics cards use 16

components (=512bit)

2011-11-18 79

Page 80: GOOD DRAM Interface Tutorial

Mirror functionMirror function

• To increase total density without increasing data bus-width

2011-11-18 80

an example of 512Mb GDDR3

[Hynix][ISSCC ‘09]

Page 81: GOOD DRAM Interface Tutorial

XBOX 360XBOX 360

2011-11-18 81

8 x 32b

128b

[Microsoft]

Page 82: GOOD DRAM Interface Tutorial

Prefetch operationPrefetch operation

2011-11-18 82

Page 83: GOOD DRAM Interface Tutorial

DDR2/3 ArchitectureDDR2/3 Architecture

2011-11-18 83

Page 84: GOOD DRAM Interface Tutorial

DDR2 block diagramDDR2 block diagram

2011-11-18 84

Page 85: GOOD DRAM Interface Tutorial

Simulation schematicSimulation schematic

2011-11-18 85

[Hynix]

Page 86: GOOD DRAM Interface Tutorial

Measurement vs. SimulationMeasurement vs. Simulation

2011-11-18 86

[Hynix]

Page 87: GOOD DRAM Interface Tutorial

DRAM core speed pathDRAM core speed path

2011-11-18 87

Page 88: GOOD DRAM Interface Tutorial

Internal voltagesInternal voltages

2011-11-18 88

Page 89: GOOD DRAM Interface Tutorial

ZQ CalZQ Cal

2011-11-18 89

Page 90: GOOD DRAM Interface Tutorial

Design trendsDesign trends

2011-11-18 90

Page 91: GOOD DRAM Interface Tutorial

Receiver type comparisonReceiver type comparison

2011-11-18 91