interleaved multi-bank scratchpad memories: a probabilistic description of access conflicts dac...

21
Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Upload: shannon-wilkinson

Post on 12-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of

Access Conflicts

DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Page 2: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Background

• Shared on-chip memory• with multiple separately accessible banks• having a common address space for all

processors

• Advantage: efficient communication between processors• Disadvantage: interference among the

processors• Solution: more banks, optimizing the

address mapping

Page 3: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Address Mapping

• contiguous mapping• pseudo-random mapping• sequentially interleaved mapping (SIM)

The aim of this work is to quantitatively evaluate the properties and characteristics of SIM systems.

Page 4: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Outline

• Background• Problem definition• Occupancy distribution• Markov model • Evaluation

Page 5: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Problem Definition

• We consider a platform with c processor cores and b independently accessible memory banks.• the access probability and the

sequential access probability .• A denotes the random number of

accesses requested in any given cycle, and I represents the number of banks serving accesses in any given cycle.• Given c, b, , and , compute the distribution of the number I of memory banks serving accesses.

ap

seqp

ap seqp

Page 6: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

The classic occupancy distribution

• Actual memory accesses a, a=c

Page 7: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Adding access probabilities

A follows the binomial distribution

Page 8: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Limitations of the model

• Sequential access patterns of the applications cannot be taken into account.• It ignores the fact that accesses that

cannot be immediately served are served in subsequent cycles, then interfering with new accesses.

Page 9: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Markov Model

1 2           1. {s ,s ,...,  s }ka s setate St

,           2. (s     s )k k j ii ja transition matrix T R with T P

         ,T3. ksteady stat Re

Page 10: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Memory throughput by Markov steady state

1 21. {s ,s ,...,s         s

              

}

        c

n

set

s is the number of banks ha

st

ving n accesses

ate

Page 11: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Memory throughput by Markov steady state

Transition probabilities

For a state s, the associated throughput is 0(s) b si

Page 12: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Adding sequential access patterns

Page 13: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Adding sequential access patterns• st• 1. s s’, one access request removed from each’s

queue• 2. s’ t, distributing new access requests

Page 14: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Adding access probabilities

Page 15: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Experiment evaluation

• gem5 ARM simulator.• The GSM, FFT, blowfish, string search and

JPEG examples were chosen to obtain a high diversity in behaviour.

Page 16: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Accuracy of the occupancy model

• For a small number of banks, the throughput is likely to be close to that number.• For a sufficiently large number of banks,

the number of waiting accesses is small.• The maximum relative error is of 12.0%

for b=8.

Page 17: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Benchmark Results

Page 18: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Conclusions from the occupancy model

• As long as the ratio of banks and cores is constant, a system can be arbitrarily scaled without changing the throughput expectation per bank or per core.• The throughput converges exponentially

with the product pa*r to the maximum value b.• For pa*r <0.3, the throughput can be

regarded as growing approximately linearly with pa.

Page 19: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Application example: System design

• System with c=16 cores and b=32 banks.• System 1: interleaving over all 32 banks.• System 2: interleaving for 16 banks + one

“private” memory bank for each core.

1

1[I] b b (1 )occ a

SystemEb

Page 20: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Application example: System design

• System 2 performs better for0.2875privp

Page 21: Interleaved Multi-Bank Scratchpad Memories: A Probabilistic Description of Access Conflicts DAC '15, June 07 - 11, 2015, San Francisco, CA, USA

Discussion of the synchronisation effect

• For <0.4, the synchronisation effect is insignificant.• Even the speedup from pseq=0 to pseq=1 is less than 5% in this

system.• There are only few cases in which performance is likely to be a

decisive factor for opting for a SIM system rather than for pseudo-random mapping.

seqp