uic montone thesis

52
Time-driven reconfiguration-aware floorplacer BY Alessio Montone [email protected] Thesis committee: S. Dutt (chair), A. Khokhar, M.D. Santambrogio UIC Thesis Defense: 05/08/2008

Upload: usrdresd

Post on 13-Jun-2015

199 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Uic Montone Thesis

Time-driven reconfiguration-aware

floorplacer

BY

Alessio Montone

[email protected]

Thesis committee:

S. Dutt (chair), A. Khokhar, M.D. Santambrogio

UIC Thesis Defense: 05/08/2008

Page 2: Uic Montone Thesis

2

Rationale and InnovationRationale and Innovation

Problem statementGiven a reconfigurable architecture, find an on-chip position for each functional unit

Innovative contribution: taking into accountTarget Device HeterogeneityTarget Device reconfiguation capabilities Inter-FU Communication

Page 3: Uic Montone Thesis

3

AimsAims

Considering the area assignment problem tailored for reconfigurable architectures, providea formalization of the problem, andan approach (in 3 algorithms) for solving

Page 4: Uic Montone Thesis

4

OutlineOutline

IntroductionFloorplacementThe Proposed ApproachExperimental ResultsComparison with the state of the artConclusions and Future WorksQuestions

Page 5: Uic Montone Thesis

INTRODUCTIONINTRODUCTION

5

Page 6: Uic Montone Thesis

Reconfigurable Architectures - IReconfigurable Architectures - I

On FPGAsReconfigurable DevicesHeterogeneousReconfiguration Limits

Different types of Reconfigurable Architectures:

TotalPartial (Static)Partial (Dynamic)

6

Page 7: Uic Montone Thesis

Reconfigurable Architectures - IReconfigurable Architectures - I

Total

7

Page 8: Uic Montone Thesis

Reconfigurable Architectures - IReconfigurable Architectures - I

Total

Partial (Static)

8

Page 9: Uic Montone Thesis

Reconfigurable Architectures - IIReconfigurable Architectures - II

Partial Dynamic

9

Page 10: Uic Montone Thesis

Area Assignment ProblemArea Assignment Problem

Let consider a Reconfigurable ArchitectureGiven a scheduled task graph (TG) of the application

Node: Reconfigurable Functional Unit (RFU) [*], A netlist obtained after post synthesis and technology mapping (i.e., before placement and routing)

Aim: find an area assignment for each RFU

10

[*] K. Bazargan, R. Kastner, M.S.: 3-d floorplanning: Simulated annealing and greedy placement methods for reconfigurable computing systems. IEEE Rapid Systems Prototyping (1999)

Page 11: Uic Montone Thesis

Related Works - IRelated Works - I

[*] introduced the concept of 3D floorplanning for reconfigurable systems

SA in order to solve HW/SW codesign problemFor each task choose between

HW implementationSW implementation

LimitsNo device limits consideredNo communicationinfrastructure

11

[*] K. Bazargan, R. Kastner, M.S.: 3-d floorplanning: Simulated annealing and greedy placement methods for reconfigurable computing systems. IEEE Rapid Systems Prototyping (1999)

Page 12: Uic Montone Thesis

Related Works - IIRelated Works - II

[*] is the state of art in 3D floorplanningSimulated Annealing over Transitive Closure GraphTakes into account device reconfiguration limits

LimitsNo heterogeneity consideredHigh overhead communicationinfrastructure solution [**]

12

[*] Ping-Hung Yuh, Chia-Lin Yang, Yao-Wen Chang, Hsin-Lung Chen: Temporal Floorplanning Using 3D-subTCG, Design Automation Conference, 2004

[**] S. P. Fekete, E. Kohler, and J. Teich: Optimal FPGA Module Placement with Temporal Precedence Constraints, Proc. DATE, 2001.

Page 13: Uic Montone Thesis

FLOORPLACEMENTFLOORPLACEMENT

13

Page 14: Uic Montone Thesis

Floorplanning vs. PlacementFloorplanning vs. PlacementCharacteristic Floorplanning Placement

# items <100 >10.000

Items (for FPGAs) IP-Core Slice, CLB

Aim Find a position for each item

obj. function depends

mainly on Area mainly on Wirelength

Constraints Items can be positioned everywhere

There is a set of possible positions

14

PlacementFloor plan

Page 15: Uic Montone Thesis

Floorplacement - IFloorplacement - I

Hierarchical Approach (Floorplanning + Partitioning)

15

S. N. Adya, I. L. Markov, Fixed-outline Floorplanning: Enabling Hierarchical Design, IEEE Transaction on VLSI System, 2002

S. N. Adya, S. Chaturvedi, J. A. Roy, David A. Papa, I. L. Markov: Unification of Partitioning, Placement and Floorplanning , IEEE Intl. Conf. on CAD, 2004

Page 16: Uic Montone Thesis

Floorplacement - IIFloorplacement - II

Reconfigurable Functional Unit (RFU)A netlist obtained after post synthesis and technology mapping (i.e., before placement and routing)

Reconfigurable Region (RR)

16

Page 17: Uic Montone Thesis

Floorplacement - IIIFloorplacement - III

Resource Aware (i.e., not all positions are feasible)

Device heterogeneity

Device Reconfigurationcapabilities

17

Page 18: Uic Montone Thesis

THE PROPOSED THE PROPOSED APPROACHAPPROACH

18

Page 19: Uic Montone Thesis

Proposed Problem DefinitionProposed Problem Definition

19

AimDefine RRsFor each task find

find a RRA position inside RR

Objective FunctionMin. Fragmentation

ConstraintsCommunication issuesDevice limits

Page 20: Uic Montone Thesis

Target Devices: Xilinx Virtex 4 - 5Target architecture based on EAPR design flow

Target Architecture and DevicesTarget Architecture and Devices

20

Page 21: Uic Montone Thesis

InputInput

21

Page 22: Uic Montone Thesis

Proposed Approach: overviewProposed Approach: overview

22

Page 23: Uic Montone Thesis

11stst Algorithm: Partitioning into Algorithm: Partitioning into RRRR

Aim: identify the RRs and associate each RFU to one RRHow: partitioning the TG minimizing resource requirement variance of the RRs (moving and swapping nodes)

23

Resource of type t required by RFU n, at static photo p

Page 24: Uic Montone Thesis

22ndnd Algorithm:TFiRR - I Algorithm:TFiRR - I

24

Temporal Floorplacement inside RR (TFiRR)Aim: for each RR find a set of feasible width-height pairsHow: floorplacing RFUs inside corresponding RR Assumption: RFUs’ height = height of the RR they belong to

Pseudo Code:

Page 25: Uic Montone Thesis

22ndnd Algorithm:TFiRR - II Algorithm:TFiRR - II

25

Let consider an iteration:

Page 26: Uic Montone Thesis

22ndnd Algorithm:TFiRR - III Algorithm:TFiRR - III

26

Let consider an iteration:

Page 27: Uic Montone Thesis

22ndnd Algorithm:TFiRR - IV Algorithm:TFiRR - IV

27

Let consider an iteration:

Page 28: Uic Montone Thesis

22ndnd Algorithm:TFiRR - V Algorithm:TFiRR - V

28

Let consider an iteration:

Page 29: Uic Montone Thesis

22ndnd Algorithm:TFiRR - VI Algorithm:TFiRR - VI

29

Page 30: Uic Montone Thesis

22ndnd Algorithm: TFiRR – Example Algorithm: TFiRR – Example

30

Page 31: Uic Montone Thesis

33rdrd Algorithm: RR floorplacement - I Algorithm: RR floorplacement - I

Simulated AnnealingObjective Function

Data Structure4 Constraint Lists (one per row)

31

Page 32: Uic Montone Thesis

33rdrd Algorithm: RR floorplacement - II Algorithm: RR floorplacement - II

Simulated Annealing: movesSwap two RRsMove one RRsSpan over one more rowUn-Span over one less row

After each move packing is performed(i.e., the floorplacement is compressed)

32

Page 33: Uic Montone Thesis

33rdrd Algorithm: RR floorplacement – Example - I Algorithm: RR floorplacement – Example - I

33

Page 34: Uic Montone Thesis

33rdrd Algorithm: RR floorplacement – Example - II Algorithm: RR floorplacement – Example - II

34

Page 35: Uic Montone Thesis

ImplementationImplementation

Three simulated annealers written in C++ STL

35

Page 36: Uic Montone Thesis

Output ExamplesOutput Examples

TFiRR

RR Floorplacement

36

Page 37: Uic Montone Thesis

EXPERIMENTAL RESULTSEXPERIMENTAL RESULTS

37

Page 38: Uic Montone Thesis

Identification of the number of RRs Identification of the number of RRs

38

Page 39: Uic Montone Thesis

Partitioning’s impact on TFiRRPartitioning’s impact on TFiRR

TFiRR on Partitioned

TG

TFiRR on TG

Execution Time 125ms 114ms 4m54s

Width (normalized)

1.00 1.19 1.04

39

Increasing the number of RFUs decreases the possibility to pick up the right one

Partitioning is a precondition of the 3rd algorithm in order to better exploit FPGA’s area (2D Floorplacement)

Page 40: Uic Montone Thesis

Tests performed directly floorplacing RFUs Execution time about 100 ms (100K iterations)

Floorplacement – Success Floorplacement – Success RateRate

40

Page 41: Uic Montone Thesis

Floorplacement – Aspect RatioFloorplacement – Aspect Ratio

41

Tests performed directly floorplacing RFUs

Page 42: Uic Montone Thesis

COMPARISON WITH THE COMPARISON WITH THE STATE OF THE ARTSTATE OF THE ART

42

Page 43: Uic Montone Thesis

State of the artState of the art

43

Authors Comm. Infrastructure

ResourceAware

Reconfiguration Aware

Device LimitsAware

Bazargan et al. No No Yes No

Yuh et al. Limited, w/ High Overhead

No Yes Yes

Singhal et al. No No Yes No

Feng et al. No Yes No No

Page 44: Uic Montone Thesis

NotesNotes

The comparsion is performed with respect to the description given by Yuh et al in [*]

Yuh’s approach does not support– Multiple Resources– Existence of a Static side

In order perform the comparison– the case study has been chosen in order to avoid

multiple resource limitation– Yuh’s approach has been extended to support a

static side

44

[*] Ping-Hung Yuh, Chia-Lin Yang, Yao-Wen Chang, Hsin-Lung Chen: Temporal Floorplanning Using 3D-subTCG, Design Automation Conference, 2004

Page 45: Uic Montone Thesis

The Case StudyThe Case Study

A Reconfigurable Architecture (for Biomedical Purpose) on XC5VLX30T 1. Collecting data from sensor2. Elaborating them3. Sending to a host computer thorough the net

45

Page 46: Uic Montone Thesis

The Proposed SolutionThe Proposed Solution

It is a reconfigurable architecture with 2 static photos

46

Page 47: Uic Montone Thesis

Area assignments comparisonArea assignments comparison

47

Proposedsolution

Yuh’s solution

Page 48: Uic Montone Thesis

Communication performances comparisonCommunication performances comparison

Considering 100 M samples, 32 bit each, at 75 MHz.

The entire data are transferred by– The Proposed Approach in 2.6 seconds– Yuh’s approach in 416.0 seconds

48

Page 49: Uic Montone Thesis

Conclusions - IConclusions - I

An algorithm for the identification of area constraint for reconfigurable architectures has been introduced

Novelties: taking into account– Target device heterogeneity– Target device reconfiguration capabilities– Communication issues

49

Page 50: Uic Montone Thesis

Conclusions - IIConclusions - II

Results have been published– A. Montone, M.D. Santambrogio, D. Sciuto,

A Design Workflow for the Identification of Area Constraints in Dynamic Reconfigurable Systems, IEEE International Symposium on Electronic Design, Test and Applications (DELTA), 2008

– A. Montone, M.D. Santambrogio, Area Constraint Evaluation for FPGAs, The Syndicated Q1-2008, A technical newsletter for FPGA, ASIC Verification and DSP Designers, Synplicity Incorporation

Under revision:– A Reconfiguration-aware Floorplacer for FPGAs,

IEEE Field Programmable Logic (FPL), 2008

50

Page 51: Uic Montone Thesis

Future WorksFuture Works

Take into consideration IOBs and inter modules communications

Partitioning considering clock regions

51

Page 52: Uic Montone Thesis

QuestionsQuestions

52