the semiconductor memory saga – 50 years

84
The Semiconductor Memory Saga – 50 years Betty Prince, PhD. Memory Strategies International <www.memorystrategies.com> [email protected]

Upload: others

Post on 16-Apr-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Semiconductor Memory Saga – 50 years

The Semiconductor Memory Saga –50 years

Betty Prince, PhD.Memory Strategies International<www.memorystrategies.com>

[email protected]

Page 2: The Semiconductor Memory Saga – 50 years

The Saga of Semiconductor Memories – 50 years

Overall - The Evolution of the Cache Hierarchy

1. Main Memory for Computers – SRAMs and DRAMs

2. The Memory Wall – feeding enough data to the Computer

3. The Advent of Non-Volatile Memory (NOR and NAND)

4. Embedded Memories, Single Chip (MCU + SRAM + Flash)

5. NAND Flash Memory – Flash Passes DRAM in Volume

6. The New Moore’s Law* - Vertical NAND, Vertical PCM

7. The End of Moore’s Law. - Emerging Memories

8. Neuromorphic Memory, Neural Synapses, and AI

[email protected] 2*Moore’s Law - The number of transistors on a chip doubles every two years.

Page 3: The Semiconductor Memory Saga – 50 years

1. Main Memory for Computers– SRAMs and DRAMs

[email protected] 3

Page 4: The Semiconductor Memory Saga – 50 years

6T SRAM – First Main Memory

The original MOS memory was a 6T cell SRAM invented in 1964 at Fairchild as part of their CMOS logic portfolio. A flipflop with transistors accessing the storage nodes.

It competed initially for Main Memory with a variety of memories: 4T SRAMs, 4T DRAMs and 3T DRAMs.

[email protected] 4

Page 5: The Semiconductor Memory Saga – 50 years

6T SRAM – Too Big for Main MemoryAdvantages: Fast Access/Cycle Time

Logic CompatibleLow Standby Power Random Access Non-Destructive Read

Disadvantages: Dynamic (lost data when power off)High active powerIn Shrunk technology

- High Subthreshold leakage - Standby power increased- Process Variation increased

Standalone – large cell size increased cost Embedded – large cell size (6 T) not as important

[email protected] 5

Page 6: The Semiconductor Memory Saga – 50 years

SRAM Market Split Standalone and Embedded in 2000.By 2006, the SRAM was primarily an embedded memory as Cache in Processors and as Data and Code RAM in the Microcontroller.

[email protected] 6

Page 7: The Semiconductor Memory Saga – 50 years

SRAM and MPU form first cache hierarchy

MPU

Cache SRAM

The SRAM is made of transistors so it is compatible with a logic process. The MPU and SRAM can be on a single chip. This was the first embedded Memory.

[email protected] 7

Page 8: The Semiconductor Memory Saga – 50 years

Cache Hierarchy

The theory of the cache hierarchy is that the data that will be next requested by the processor can be contained in a small and very fast memory that sits next to the processor.

If this small fast cache can supply the data fast enough that the processor never, or seldom, needs to go to the main memory bank, then the main memory can use slower, cheaper memory and the average speed of the system still remain high.

[email protected] 8

Page 9: The Semiconductor Memory Saga – 50 years

High Density MegabytesLow Cost Small cell size /more chips on a wafer High Speed Random Access

Candidates: Dynamic RAM (DRAM) Charge-Coupled Devices (CCD)

The 1T DRAM was patented in 1968 at IBM. The Charge Coupled device (CCD) was invented at Bell Labs in 1969.

In 1970 Computers Needed Working Memory

[email protected] 9

Page 10: The Semiconductor Memory Saga – 50 years

Intel 1103, A 3T Cell DRAMFirst Commercial DRAM Chip – Intel 1103.Presented at the 1970 ISSCC.Best Selling Semiconductor Chip by 1972Density – 1024 bitsUsed in the HP 9800 Computer Organization: 1Kx1System: 64Kbytes (512 chips)Advantages: Small Cell Size

Low Price, Random AccessDisadvantages: Not enough memory capacity, slow.

COMPUTERS NEEDED RAM. IT WAS IN HIGH [email protected] 10

Page 11: The Semiconductor Memory Saga – 50 years

1T1C DRAMHigh memory demand meant volume production from beginningAmericans -> Japanese -> Korean -> (Taiwan, Europe small)Today: Micron, Samsung, Hynix

All the problems were dealt with in production:12V, 5V 5V only 3.3V Soft Errors – Cosmic RaysWide I/O – for smaller systems

JEDEC DRAM standardization driven by large users, set standards and followed them. The users didn’t buy non-standard parts.

By 1980 the 1T DRAM was 27% of the total memory market.By 1990 the DRAM dominated the main memory market. The CCD Memory had essentially vanished.

Word Line

Bit Line

Capacitor

Ground

Transistor

[email protected] 11

Page 12: The Semiconductor Memory Saga – 50 years

DRAM Shipments 1970-2005

N.M. Victor, J.H. Ausubel, Science Direct, April 2002

4Kb16Kb

64Kb256Kb

1Mb

4Mb

16Mb64Mb

128Mb256Mb

[email protected] 12

10 GENERATIONS IN 30 YEARS

Page 13: The Semiconductor Memory Saga – 50 years

Size Reduction of successive generations of DRAM chips

Lg 1.8um 1.2um 0.8 um 0.5um1980 1983 1986 1989 1992 1995

[email protected] 13

Page 14: The Semiconductor Memory Saga – 50 years

Cost of a DRAM Fab

Source: Morgan [email protected] 14

Page 15: The Semiconductor Memory Saga – 50 years

2. The Memory WallFeeding Enough Memory to the Computer

[email protected] 15

Page 16: The Semiconductor Memory Saga – 50 years

The Memory Wall (Processor-Memory Speed Gap)

M.S. Parekh, P Thadesar, M. Kakir, ECTC, 2011

CPU

Memory

[email protected] 16

Page 17: The Semiconductor Memory Saga – 50 years

WHAT MATTERS TO THE SYSTEM IS THE AMOUNT OF DATA THAT ARRIVES PER SECOND

BANDWIDTH = SPEED OF BUS X WIDTH OF BUS

WIDE SLOW BUS EQUALS NARROW FAST BUS

X64 1600 MB/SEC X16200 MHZ 800 MHz

NOTE: "DATARATE" USED TO MEAN BANDWIDTHIT IS BEING USED NOW TO MEAN SPEED OF ONE BIT OF DATA ON THE BUSI.E. THE SPEED OF THE BUS

BANDWIDTH CONSIDERATIONS

[email protected] 17

Page 18: The Semiconductor Memory Saga – 50 years

[email protected]

SIMPLIFIED DIAGRAM OF INTERNAL DRAM

Bit Line Bit Line\

Bit Line Precharge

Word Line

column switch

DRAM Cell

18

NOTE: Destructive Read. After reading the Data, Data must be restored to the cell and the WL closed before precharging the bit-lines for the next read. Charge leaks away and must be refreshed.

The InternalInternal Function of the DRAM is Slow

Page 19: The Semiconductor Memory Saga – 50 years

[email protected] 19

Simplified Circuit Diagram of 16-bit DRAM

Page 20: The Semiconductor Memory Saga – 50 years

20

SYNCHRONOUS(b) AND ASYNCHRONOUS(a) CONTROL

[email protected]

Page 21: The Semiconductor Memory Saga – 50 years

[email protected]

Basic Asynchronous DRAMREFRESH COUNTER

ROW ADDRESSLATCH

DATA CONTROL X32 LOGIC

COLUMN ADDRESS DECODER

MEMORY ARRAY

COLUMN ADDRESSLATCH

ROW ADDRESSDECODER

I/O

21B. Prince, “High Performance Memories”, 1998, Wiley

Page 22: The Semiconductor Memory Saga – 50 years

[email protected]

Add SDRAM Command Interface SDRAM COMMAND INTERFACE DRAM ARRAY

REFRESH COUNTER CLOCK GENERATOR

COMMAND DECODE

CONTROL LOGIC

MODE ROW ADDRESS REGISTER MUX

BANK DATA CONTROL X32

CONTROL LOGIC

ADDRESS COLUMN ADDRESS REGISTERS DECODER

BANK 0 MEMORY ARRAY

COLUMN ADDRESS

ROW ADDRESSLATCH&DECOD

B. Prince, “High Performance Memories”, 1998, Wiley22

Page 23: The Semiconductor Memory Saga – 50 years

[email protected] 23

Add DDR SDRAM Interface SDRAM COMMAND INTERFACE DRAM ARRAY DDR SDRAM INTERFACE

REFRESH COUNTER CLOCK GENERATOR CK

COMMAND DECODE DLL

CONTROL LOGICX16

X32 READ MUX DRIVERS MODE ROW ADDRESS LATCH X16 REGISTER MUX

DQS EXTENDED DQS MODE REGISTER BANK DATA CONTROL X32 GENERATOR

CONTROL LOGIC

ADDRESS COLUMN ADDRESS WRITE X16 REGISTERS DECODER X32 FIFO & INPUT RECEIVERS

DRIVERS X16 REGISTER

CK

BANK 0 MEMORY ARRAY

COLUMN ADDRESS

ROW ADDRESSLATCH&DECOD

B. Prince, “High Performance Memories”, 1998, Wiley

Page 24: The Semiconductor Memory Saga – 50 years

Memory Wall Solution – Clocked MemoryIssues: Memory Wall – Processors faster than DRAMSSolutions:1994 Synchronous Interface – ran from computer clock

Improved Data Rate, but not latency1998 DDR – doubled datarate , latency same.

[email protected] 24

Page 25: The Semiconductor Memory Saga – 50 years

DRAM APPLICATIONS (2000)

[email protected] 25

Page 26: The Semiconductor Memory Saga – 50 years

Processor

Cache

Main Memory

NV Mass Storage

Mass Storage Archive

Register

SRAM

DRAM

Non-Volatile Memory

HDD

Access Latency

1ns

10 ns

100 ns

1 us

10 us

100 us

>2 ms

Cache Memory Hierarchy

[email protected] 26

NV Mass Storage

Page 27: The Semiconductor Memory Saga – 50 years

3. Non-Volatile Memory

1970’s 1980’s 1990’s

ROM EPROM EEPROM NOR Flash eFlash in MCU

1990’s 2000 2010 2020

NAND Flash Highest Volume End of Scaling 3D Vertical NAND

[email protected] 27

Page 28: The Semiconductor Memory Saga – 50 years

EPROM – Erased by UV Light

[email protected] 28

Page 29: The Semiconductor Memory Saga – 50 years

UV-EPROM Cell Structure The UV-EPROM was electrically programmed, but erased by irradiation with ultra-violet (UV) light.

It had two polysilicon gates. The upper control gate is used for selection and is connected to the wordline. The isolated floating gate is capacitively coupled to the control gate and the substrate.

A charge on the floating gate alters the threshold voltage as seen from the control gate. When the floating gate is charged, the threshold voltage increases and a higher voltage is needed on the control gate to turn the transistor on.

CGFG

[email protected] 29

Page 30: The Semiconductor Memory Saga – 50 years

[email protected]

EPROM PROGRAM/ERASE I-V CHARACTERISTIC

30

Normally ON Normally OFF

Page 31: The Semiconductor Memory Saga – 50 years

[email protected]

Early FLASH Cell vs. EPROM Cell

Flash Cell EPROM Cell

Control Gate Control Gate

Floating Gate Floating Gate

n+ n+ n+ n+

p-type substrate p-type substrate

xxgate oxide

FLASH EPROM

Erased byUV Light

Thick Oxide

Electrically Erased

31

Page 32: The Semiconductor Memory Saga – 50 years

[email protected]

Operation of the Stacked Flash EEPROM Cell

Programming EraseBy CHE Injection By FN Tunneling from Floating Gate to Source

The NOR Flash had three contacts to each cell: Source, Drain, Gate

32

Page 33: The Semiconductor Memory Saga – 50 years

NOR Flash Circuits

1 contact to every cell

A NOR Flash array could be integrated onto the processor chip along with the SRAM. The era of the Microcontroller had begun

[email protected] 33

Page 34: The Semiconductor Memory Saga – 50 years

34

NOR FLASH APPLICATIONSCOMMUNICATIONS CELL PHONES

MODEMS

NETWORKING EQUIPMENT

COMPUTER PC BIOS

FLASH CARDS

DISK DRIVES

CONSUMER SET TOP BOXES

DIGITAL CAMERA’S

WIRELESS

PDA’S

INDUSTRIAL AUTOMOTIVE

DISTRIBUTED CONTROL

CONFIGURATION [email protected]

Page 35: The Semiconductor Memory Saga – 50 years

4. Embedded Memory

Single Chip Microcontrollers(MCU + SRAM + Flash)

[email protected] 35

Page 36: The Semiconductor Memory Saga – 50 years

Microcontrollers (MCU + SRAM + Flash)

[email protected] 36

Page 37: The Semiconductor Memory Saga – 50 years

MCU Applications for Smart Society and Smart City

B.Prince, D. Prince, Adapted from Memories for the Intelligent Internet of [email protected] 37

Page 38: The Semiconductor Memory Saga – 50 years

System Problems solved by integration

System Problem On-Chip Solution

Tight System Form Factor Reduces package countReduces board size

Power Reduction Reduces weight of batteryIncreases life of batteryReduces cost of cooling

Bandwidth limited Permits wider internal busFewer external I/Os and wiresReduces System EMIReduces Ground Bounce

Granularity No silicon wasted on standard sizesNon standard configurations okay

[email protected] 38

Page 39: The Semiconductor Memory Saga – 50 years

[email protected]

Embedded Memory Reduces Silicon and Board Area Resulting in Smaller Formfactor

Memory Array

I/O BUFFERSBONDING PADS

I/O BUFFERSBONDING PADS

LOGIC

Memoryand Logic

Separate Chips Integrated Chip

Circuit Board

Page 40: The Semiconductor Memory Saga – 50 years

[email protected] 40

Power Reduction by Embedding Memory

(Power = 1/2 Capacitance x Voltage2 x Frequency)

Embedding Memory:

Reduces Capacitance of the Interface

Reduces Required Speed of Operation (wider bus)

May reduce voltage swing

Page 41: The Semiconductor Memory Saga – 50 years

41

EXAMPLE OF POWER SAVING IN EMBEDDED DRAM

891 MW100

DRAM

DRAM-MPU I/O

POWER(MW) 500

240 MWSPEECH/DSP

DRAMVIDEO LOGIC +

0 PROCESSOR SRAM

SEPARATE INTEGRATEDCOMPONENTS SYSTEM

TOSHIBA 2000

VIDEO PHONE APPLICATION

[email protected]

Page 42: The Semiconductor Memory Saga – 50 years

42

POWER SUPPLY VOLTAGE INTERFACE TRENDS

0 POWER SUPPLY VOLTAGE0 POWER FACTOR

12V1

10 5V0.17

3.3V0.074

5 2.5V0.0425

1.8V0.022

1975 1980 1985 1990 1995 2000 2005 2010 2015

[email protected]

Page 43: The Semiconductor Memory Saga – 50 years

[email protected] 43

Integrating for Higher System BandwidthHigher Speed and Wider Bus on Chip

1. Reduce RAM – Processor Speed Mismatch

2. Solve Narrow RAM I/O Bandwidth Limitations

3. Eliminate Inter-Chip Transmission Line Effectso Ground Bounce (I/O’s Switching) (L DI/DT Effect)o Reflections on System Wires

4. Reduce System Level EMI Radiation

Page 44: The Semiconductor Memory Saga – 50 years

5. NAND Flash

[email protected] 44

Page 45: The Semiconductor Memory Saga – 50 years

NOR and NAND Flash Circuits

3 contacts to every cell

1 contact to every cell

[email protected] 45

Page 46: The Semiconductor Memory Saga – 50 years

NAND Flash From Emerging Memory to Highest Volume Memory

WSTS, IC Insights

NAND flash was first shown at the IEDM in 1987 by Toshiba. It became the Largest Memory Market in 2011 when it passed the DRAM in Market Value, 24 years later.

[email protected] 46

Page 47: The Semiconductor Memory Saga – 50 years

NAND Flash Bits per Chip by Technology and Year

220

200

180

160

140

120

100

80

60

40

20

2000 2002 2004 2006 2008 2010 2012

Bits per Cell

200 mm 300 mm

256Mb

1 Gb

2 Gb

4 Gb

8 Gb

16 Gb32 Gb

64 Gb

Technology (nm)

220

200

180

160

140

120

100

80

60

40

20

2000 2002 2004 2006 2008 2010 2012

Bits per Cell

200 mm 300 mm

256Mb

1 Gb

2 Gb

4 Gb

8 Gb

16 Gb32 Gb

64 Gb

Technology (nm)

Sandisk ISSCC 2012

Bits per Chip

[email protected] 47

Page 48: The Semiconductor Memory Saga – 50 years

NAND Cell Operation

Program Erase

Double Poly Flash Memory UsingFowler Nordheim Channel Tunneling

n- n- n- n-

CG

grnd

FG

- --

- - -

V++

CG

V++grnd

[email protected]

Page 49: The Semiconductor Memory Saga – 50 years

NAND Flash String

Contact

One Independant Contact per Cell

BL

WL

SL

16 or 32

Common Source

Cell

SL

WL

[email protected]

Page 50: The Semiconductor Memory Saga – 50 years

[email protected] 50

Comparison of conventional NOR cell with NAND cell layout in same technology

Page 51: The Semiconductor Memory Saga – 50 years

NAND Flash Density Trend by Year 2000 - 2013

1 0 0 0

1 0 0

1 0

1

D e n s i t y

( M b / m m 2 )

2 0 0 0 2 0 0 1 2 0 0 2 2 0 0 3 2 0 0 4 2 0 0 5 2 0 0 6 2 0 0 7 2 0 0 8 2 0 0 9 2 0 1 0 2 0 1 1 2 0 1 2 2 0 1 3

N A N D F l a s h D e n s i t y T r e n d1 0 0 0

1 0 0

1 0

1

D e n s i t y

( M b / m m 2 )

2 0 0 0 2 0 0 1 2 0 0 2 2 0 0 3 2 0 0 4 2 0 0 5 2 0 0 6 2 0 0 7 2 0 0 8 2 0 0 9 2 0 1 0 2 0 1 1 2 0 1 2 2 0 1 3

N A N D F l a s h D e n s i t y T r e n d

Year2000 2005 2010 2015

[email protected] 51

Page 52: The Semiconductor Memory Saga – 50 years

Options:

1. Vertical technology (3D Vertical NAND Flash)

2. New technology (MRAM, FeRAM, RRAM, PCM)

Nearing End of Moore’s Law*

[email protected] 52

*Moore’s Law - The number of transistors on a chip doubles every two years.

Page 53: The Semiconductor Memory Saga – 50 years

6. Vertical Technology

[email protected] 53

Page 54: The Semiconductor Memory Saga – 50 years

3D Vertical NAND FlashThe successor to the NAND Flash, the GAA Vertical NAND Flash was introduced in 2009 and by 2019 dominated the NV solid state storage market using four new technologies:

1. SONOS Charge Trapping Memory2. Barrier Engineered SONOS3. Gate All Around (GAA) Nanowire Technology4. Junctionless Nanowire Channel

What it took was three large companies putting the device into production, standing behind it and continuing to engineer it. (similar to the introduction of the DRAM years before)

[email protected] 54

Page 55: The Semiconductor Memory Saga – 50 years

Vertical Channel NAND Flash - BICSIn the BiCS technology, an entire stack of electrode plates are punched through in one operation. A plug consisting of a string of charge trapping (SONOS) NAND flash transistors is built up in the via holes that are created.

This stacked CT memory array has a constant number of critical lithography steps regardless of the number of stacked layers used so the number of NAND transistors can be increased without increasing the number of process steps.

Toshiba,VLSI Symp. June [email protected] 55

Page 56: The Semiconductor Memory Saga – 50 years

Two Bit/Cell Charge Trapping Storage (SONOS)

-CHE Program / HH Erase - Reverse Read

-Symmetric Operation

- Possible 2 bit storage

Saifun/Spansion, 1999

Early SONOS had data retention issues, through the tunnel oxide and back tunneling through the blocking oxide during erase.

[email protected]

Page 57: The Semiconductor Memory Saga – 50 years

TCAT Uses Barrier Engineered SONOS

(a)Band-Offset occurs (b) No Band-Offset in Data (c ) BE-SONOS at high electric field Retention Mode

With Barrier Engineered tunnel technology, at high electric field during erase, the band offset reduces the hole tunneling barrier so it is just that of the O1 layer. During data retention the full O1/N1/O2 barrier stack prohibits electron detrapping.

[email protected] 57

Page 58: The Semiconductor Memory Saga – 50 years

GAA structure has faster erase performance with 1/R effect.(V across Blocking Oxide is 1/R of V across Tunnel Oxide which reduces the back- tunneling during erase)

If the Tunnel Oxide is replaced with thin nitride ONO, then the Barrier Engineered SONOS improves data retention

3D Vertical Channel NAND Flash

K.T. Park, Samsung, ISSCC, [email protected] 58

R = Radius

Page 59: The Semiconductor Memory Saga – 50 years

Junctionless Nanowire Channel

WL WLGSL GSL

n+ n+

Virtual S/D Virtual S/DVirtual S/D

The field from the gates induces a virtual source and drain in the nanowire. This eliminates the need to diffuse a source drain in a 3D buried nanowire channel

[email protected] 59S.J. Choi, KAIST, VLSI 6/2011

Page 60: The Semiconductor Memory Saga – 50 years

One Page of Vertical NAND Flash

G S L ( e v e n ) = - 1 V

V p a s s

V p r g

V p a s s

G S L ( o d d ) = V C CC S L = V C C

S S L ( 1 ) = - 1 V

P r o g r a m P a g e 0

S e l e c t e d P a g e

S S L ( 3 ) = - 1 V

S S L ( 0 ) = V C C S S L ( 2 ) = - 1 V

C S L = V C C

B L ( 0 ) = 0 o r V C C

G S L ( e v e n ) = - 1 V

V p a s s

V p r g

V p a s s

G S L ( o d d ) = V C CC S L = V C C

S S L ( 1 ) = - 1 V

P r o g r a m P a g e 0

S e l e c t e d P a g e

S S L ( 3 ) = - 1 V

S S L ( 0 ) = V C C S S L ( 2 ) = - 1 V

C S L = V C C

B L ( 0 ) = 0 o r V C C

Macronix IEDM !2/[email protected] 60

Page 61: The Semiconductor Memory Saga – 50 years

2000

1500

1000

500

Effective Cell Area (nm2)

Number of Stacked Layers (NLayer)

0 8 16 24 32

Planar 20 nm

Planar 15 nm

50 nm

40 nm

30 nm

Effective Cell Area vs. Number of Stacked Layers

16 stacked layers in 30 nm technology same as planar 15 nm technology

(Based on Y. Yanagihara, et al, U. of Tokyo, IMW, May 20, 2013) [email protected] 61

Page 62: The Semiconductor Memory Saga – 50 years

7. New Technology – Emerging Memories

[email protected] 62

Page 63: The Semiconductor Memory Saga – 50 years

New Technology – Emerging Memories

FeRAM Still Emerging

MRAM became new era embedded memory

PC-RAM became new era high density memory

RRAM Became the synapse in new era AI.

[email protected] 63

Page 64: The Semiconductor Memory Saga – 50 years

64

FeRAM Basics

Adapted from B. Prince, “Emerging Memories”, 2002, Springer [17]

Zr/Ti Ion displaced by external applied voltage gives rise to Hysteresis Curve.

[email protected]

Page 65: The Semiconductor Memory Saga – 50 years

FeRAM

Advantage: Low Power Embedded Memory

Disadvantages: Ferroelectric is contaminant in the Wafer Fab.Read is destructive and cell must be rewritten

FeRAM/FeFET has been “emerging” for 35 years.

The FeRAM Remains an Emerging Memory

[email protected] 65

Page 66: The Semiconductor Memory Saga – 50 years

Magnetic Tunnel Junction (MTJ) MRAM

For the MTJ:Current Tunnels through the Insulator

Advantages: Signal Resistance change between states is 40% or more using a fixed FM layer and a free FM layer

Disadvantage: If WRITE is by Field program with perpendicular external wires,The Write Current is very high about 4 mA/cell.

[email protected] 66

Page 67: The Semiconductor Memory Saga – 50 years

The MTJ Field Programmable MRAM

J.M. Slaughter, Freescale, IEDM, 2005

The free layer of the MTJ MRAM can be programmed by the magnetic fields of two currents running in perpendicular copper cladded external wires near the MTJ. The current per cell for field programming is about 4 mA per cell.

[email protected] 67

Page 68: The Semiconductor Memory Saga – 50 years

Spin Torque Transfer (STT) MRAMIn 2005, Sony used the direction of current running through the MRAM to switch its polarization. This Spin Torque Transfer (STT) switching cell used spin-polarized electrons to flip the polarization of the cell. Power was reduced to less than 100 uAfor a 45 nm technology.

Source: Grandis/Samsung, 2007

Sony, IEDM, 2005

[email protected] 68

Page 69: The Semiconductor Memory Saga – 50 years

Perpendicular (P)-STT MRAM Cell

The p STT-MRAM has even lower power and can be used in L3 cache memory or as a microcontroller universal memory for Internet of Things (IoT) applications such as medical, industrial, mobile processor and automotive. MRAM Foundries: TSMC, Global Foundries, Samsung, UMC

Toshiba, VLSI-T June 2008

[email protected] 69

Page 70: The Semiconductor Memory Saga – 50 years

28 nm P-STT MRAM Macro for eMCU Applications

Adapted from Samsung, IEDM, Dec. 2016

28 nm CMOS LogicCell Size – 0.364 um2P-MTJ (MgOFeB) Diameter: 38-45 nmClock Frequency–40 MhzV Core/IO=1.0V/1.8VI/O – 32/64Endurance 108 cycles 1.8V

[email protected] 70

The MRAM Macro replaces both the NOR Flash and the SRAM in the eMCU

Page 71: The Semiconductor Memory Saga – 50 years

Using p-STT-eMRAM to Replace eSRAM Cache

Adapted from Fujita, IMW, May [email protected] 71

The P-STT MRAM has a comparable speed to the SRAM

Page 72: The Semiconductor Memory Saga – 50 years

LLC Energy for eSRAM and eSTT-MRAM

Adapted from Toshiba, VLSI-DAT April [email protected] 72

The eSTT-MRAM LCC uses 78.3% less energy than the SRAM LLC

Page 73: The Semiconductor Memory Saga – 50 years

PCM 3D Xpoint Memory (Intel )

This Storage Class Memory (SCM) decreases Latency for storage and improves performance for Cloud applications

It is best when run with Vertical NAND SSD Storage which reduces access latency from the mass storage and in theCache hierarchy is just below the PCM 3D XPoint in the SCM gap.

It is Non-Volatile Memory so it does not need to be reloaded.

It is a Vertical 3D Cross-point Phase Change Memory

[email protected] 73

Page 74: The Semiconductor Memory Saga – 50 years

Register

Cache

Main Memory

Performance Gap

Mass StorageMass Storage Archive

Register

SRAM

DRAM

SCM

Flash

HDD

Access Latency

1ns

10 ns

100 ns

1 us

10 us

100 us

>2 ms

Cache Memory Hierarchy

PCM 3D Xpoint3D Vertical NAND Flash

[email protected] 74

Page 75: The Semiconductor Memory Saga – 50 years

Background of Phase Change Memory

Phase Change Memory (PCM) or Chalcogenide RAM has been around since the 1960’s. Intel and IBM have both investigated PCM.This non-volatile RAM uses the property of Chalcogenide glass to beIn either an amorphous (high resistance) or a crystalline (low resistance) state.

Development of the Intel high density 3D Xpoint began around 2012. Both the selector and the storage part of the memory are made from chalcogenide materials

[email protected] 75

Page 76: The Semiconductor Memory Saga – 50 years

[email protected]

Simple PC-Memory CellSchematic Cross-Section

Top Electrode

Bottom Electrode

GSTHeaterProgrammable volume

Ovonyx, Intel, ISSCC, 2002

Data storage is by a thermally induced phase change between amorphous and polycrystalline states in a thin film chalcogenide alloy.

*GST = Ge2Sb2Te5 (Germanium-Antimony-Telurium

76

Page 77: The Semiconductor Memory Saga – 50 years

ON - LRS

SET

RESETCrystalline Material

Amorphous Material

Vth VRead V

Threshold Shift

Phase Change Shift

OFF- HRS

Phase Change Memory I-V Curve

I

Ovonyx, Intel, ISSCC, [email protected] 77

Page 78: The Semiconductor Memory Saga – 50 years

8 Neural Synapses, Neuromorphic Memory and AI

[email protected] 78

Page 79: The Semiconductor Memory Saga – 50 years

Resistive RAM (ReRAM) Devices

D.C. Gilmer, SEMATECH, Stanford, CNSE, U. of Albany, IMW, May 2012

TE

BE

TE

BE

TE

BE

TE

BE

a, b. c. d,Initial State Forming RESET SET

Formation of the Conductive Filament

(a)starting defects in the dielectric enable filament formation.

(b)voltage is applied across the element and a conductive filament forms with diameter determined by the compliance current limit.

(c)during RESET a gap forms in the filament. A HRS/OFF state results.

(d)during SET the filament reforms across the gap with a LRS/ON state

[email protected] 79

Page 80: The Semiconductor Memory Saga – 50 years

Typical Bipolar ReRAM Device IV Curve

Bipolar ReRAM SETs at one voltage polarity and RESETs at the opposite polarity. The voltage can be changed for analog.

D.C. Gilmer, SEMATECH, Stanford, CNSE, U. of Albany, IMW, May 2012

Vrd

[email protected] 80

Page 81: The Semiconductor Memory Saga – 50 years

Resistance (Ohm)

100K

10K

1K

0 5 10 15 20

Pulse Number

Gradual RESET

Repetitive Pulse-1.3V/10ns

(a)Hybrid neuromorphic system with CMOS neurons & RRAM synapses. (b) During RESET a CF is ruptured and a variable tunnel gap forms between electrode and residual filament. Variation in tunnel gap results in multilevel resistance states.

An ReRAM Can be Used as an Analog Synapse

Based on S. Yu, Arizona State, ISCAS, June 2014

Neurons

Neurons

RRAM Synapse

(a) (b)

GAP

[email protected] 81

Page 82: The Semiconductor Memory Saga – 50 years

High Density Cross-Point Array for Neural Aps.

Theoretically this stack can be very high and the wire grid very small to store a high density of data.

B. Prince, 3D Vertical Memory Technology, (Wiley 2014)[email protected] 82

Page 83: The Semiconductor Memory Saga – 50 years

PRE & POST Connections permit overlaps between spikes enabling STDP* with 1T1R Synapse

Based on D. Ielmini, P. Di Milano, IUNET, ISCAS, May 2016

*STDP is Spike Timing Dependent Plasticity

B. Prince, D. Prince, Memories for the Intelligent Internet of Things, 2018, (Wiley)

[email protected]

RRAM

Page 84: The Semiconductor Memory Saga – 50 years

Diagnostic Tool MedicationR&A

Controller

Intelligent Personal Medical Controllers

information Response

Judgement

*32b RISC MCU, coin battery 4KB eEEPROM, 12KB eSRAM256KB eFlashor 1MB eMRAM

Neuromorphic RRAM or PCM

Embedded FlashOr eMRAM

Adapted from “Memories for Intelligent IoT” B. Prince & D. Prince 2018 (Wiley) [email protected] 84