fpga-acceleration on cots x86 platforms university of mannheim, 16...

25
Slide 1: XtremeData Inc.: Confidential Information XtremeData, Inc.: Confidential Slide 1 FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 Feb 2007

Upload: vananh

Post on 16-Mar-2018

229 views

Category:

Documents


5 download

TRANSCRIPT

Page 1: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 1: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 1

FPGA-Acceleration on COTS x86 Platforms

University of Mannheim,16 Feb 2007

Page 2: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 2: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 2

TodayTodayTodayToday’’’’s s s s

AgendaAgendaAgendaAgenda

XtremeData Corporate & Team background

Why FPGAs in COTS x86?

Issues and XDI Solution

FPGA acceleration markets

FPGAs in HPC

Summary

Page 3: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 3: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 3

XtremeDataXtremeDataXtremeDataXtremeData: Corporate History: Corporate History: Corporate History: Corporate History…………

Market research & POC completed: target markets identified.

2006

2005

Jan

Apr

Jul

Oct

Apr

Jul

Oct

Apr

2007

SeriesA raised and development started with two teams:

hardware in Chicago and software in Bangalore, India.

System architecture defined: commodity hardware platform,

accelerator and database engine.

FPGA Module offered as a stand-alone product, press releases;

strategic partnerships made, shipments started…

Jan

Jan

Incorporated 2003, Seed funds raised.

SeriesB fund raise closing Jan 2007 for Go-To-Market financing

2004

Page 4: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 4: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 4

Images courtesy of: Toshiba Medical Systems, Philips Medical Systems & BIR Inc.

Team Background

1 in 5 CT scanners worldwide have an

imaging system designed by our team

Ravi Chandran, CEO

� BE Electronics, India, MS EE, University of Texas, Arlington, MBA, Kellogg School, Northwestern University, IL

� President, Binary Machines, Inc., Schaumburg, IL

� COO, VP of Engineering., Bio-Imaging Research, Inc., Lincolnshire, IL (www.bio-imaging.com)

20+ years of product development & design services in medical & industrial (NDT) imaging markets. 20+ years

experience with Toshiba Medical Systems – 20% of worldwide CT scanner installed base.

Images courtesy of: Toshiba Medical Systems, Philips Medical Systems & BIR Inc.

Page 5: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 5: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 5

VisionVisionVisionVision

“Appliance” implies� Easy installation – “plug and use”

� No disruption to existing process

x86 CPU (“PC”) + FPGA“Accelerated Computing” implies

“Accelerated Computing Appliances”XtremeData’s vision is to build

Page 6: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 6: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 6

StrategyStrategyStrategyStrategy

Our strategy is to enable “Accelerated Computing Appliances” by:

1. coupling off-the-shelf x86 hardware (“PC’s”)

2. with FPGA accelerators

3. via a software middleware layer that enables ease-of-use.

We believe that the combination of these 3 key concepts gives us a

compelling and sustainable price/performance advantage over the long term.

High Volume / Low Cost

High Performance

Page 7: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 7: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 7

FPGAs in Computing:Market Environment & Challenges

Page 8: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 8: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 8

� Specialized CPUs & DSPs + FPGA

� Specialized interconnect (Myrinet, Race++, RapidIO..)

� Custom boards, backplanes

� x86 CPUs

� Standard interconnect (PCI-X, GigE)

Low Volume / High CostLow Performance

High-Performance Embedded Systems Commodity “PC” systems

High Volume / Low CostHigh Performance

Market Market Market Market

EnvironmentEnvironmentEnvironmentEnvironment………….~2002.~2002.~2002.~2002

Page 9: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 9: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 9

� Specialized CPUs & DSPs + FPGA

� Specialized interconnect (Myrinet, Race++, RapidIO..)

� Custom boards, backplanes

� x86 CPUs – multi-core, 3Ghz

� Standard interconnect (PCIe, IB, 10GigE)

Low Volume / High Cost

High-Performance Embedded Systems Commodity “PC” systems

High Volume / Low CostHigh Performance

High Performance

Outperformed by IB

Best choice: x86 CPU+FPGA.

How to do this?

Bring FPGA forward to x86-COTS world

Take x86 CPU back to embedded world

Market Market Market Market

EnvironmentEnvironmentEnvironmentEnvironment………….today.today.today.today

Outperformed by ~3Ghz x86 CPUs

Outperformed by FPGAs at high-end

Page 10: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 10: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 10

Physical factors: form factor,

power supply, cooling

PHYSICAL

Radisys, Gidel, Nallatech

External interfaces: I/O, Memory

EXTERNAL I/F

Annapolis Microsystems

Communication & Data

exchange between

FPGA & CPU

COMMUNICATION

Mercury Computer

FPGA interconnect between

Computing Blocks

INTERNAL I/F

INNER

LOOP

FPGA Computing

Blocks

Soft-CPU, IP,

ESL tools

System Architect issues

• How to integrate FPGA into x86-COTS?

Design engineer (s/w & h/w) issues

• How to transition from DSP world to FPGAs?

FPGA in COTS x86:FPGA in COTS x86:FPGA in COTS x86:FPGA in COTS x86:

ChallengesChallengesChallengesChallenges

Page 11: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 11: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 11

Idea: build a simple, minimalist board

with interfaces to HyperTransport and

memory: (Patent Pending)

drop-in replacement for an AMD

Opteron with no changes to

motherboard!

Dual-socket AMD Opteron Motherboard:

Simply remove Opteron &

replace with FPGA Module !

� FPGA uses all motherboard resources meant for CPU:

� HyperTransport Links, Memory interface, power supply, heat-sink

� Usable with any AMD Opteron (or future Intel CPU) server

� Mix & match FPGAs, CPUs on quad-socket systems

Our SolutionOur SolutionOur SolutionOur Solution

Usable in rack-mount or high-density “blade” server systems ( including

ATCA), where plug-in boards are not feasible

Page 12: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 12: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 12

Mechanical• Plugs directly into socket-940• Fits within AMD-specified retention frame• 68 x 60 mm form factor• Can use off-the-shelf Opteron™ heat sink

HyperTransport Interfaces (HT)• Multiple HT interfaces• 16 bits wide @ 800 M Transfers/s • Bridging to additional XD1000™modules

Memory Interface• 128 bits wide DDR-333 memory• 5.4 GBytes/s bandwidth• Up to four 4GB DIMMs of ECC memory

SRAM• 8 MB of Zero Bus Turn-around (ZBT) SRAM• 800 Mbytes/s bandwidth• 32 bits wide with parity• 5 clock cycle latency for reads @ 200MHz

FPGA Configuration• Auto FPGA configuration on power-up• Host triggered FPGA reconfiguration

Monitoring• FPGA mastered I2C bus• Voltage monitoring• Temperature monitoring

Test Support• JTAG test port• 4 programmable LEDs• 8 programmable test pads

Flash ROM• 32 MB of CFI FLASH

• Use for FPGA configuration files, or application data

Development Package• HyperTransport core• Memory controller core• Linux device driver• FPGA messaging infrastructure

TodayTodayTodayToday’’’’s 940s 940s 940s 940----pin pin pin pin

solutionsolutionsolutionsolution…………

Newer Opteron socket solutions on the roadmap…

Page 13: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 13: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 13

FPGA-Acceleration Markets

Page 14: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 14: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 14

FPGA acceleration FPGA acceleration FPGA acceleration FPGA acceleration

marketsmarketsmarketsmarkets

Some examples of companies / applications that are using FPGA acceleration today

• TimeLogic

• Tarari

• Progeniq

Financial, Database,

Geoscience

• Cray

• SGI

• Mercury

• Linux NetworX

High Performance

Computing

Scientific

BioInformatics

Medical Imaging

• Toshiba

• GE Medical

• Siemens

• Phillips

“Embedded”

Computing

Telecom

• Motorola

• Nokia

• Ericcson

Broadcast

• Set Top Box

• Video on Demand

• IPTV

Emerging market:

Video

Consumer

• DVD creation

• HD Video

Page 15: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 15: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 15

FPGAs in High-Performance Computing (HPC)

Page 16: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 16: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 16

“ … more than 80% of data centers are already constrained by electrical power, physical space, or cooling capacity. Simply adding more of the same kinds of systems is clearly no solution, …”

[Sun Whitepaper on Throughput Computing, Nov 2005]

HPC:HPC:HPC:HPC:

““““BurningBurningBurningBurning”””” IssuesIssuesIssuesIssues

“ … IBM Fellow Bernard Meyerson told the crowd at the Hot Chips conference yesterday that he expects a power crisis of sorts to occur in the server market come 2007. That's when the overall cost of powering and cooling all the servers in the US will outpace the amount of money spent on new server sales.…”

[The Register, 23 Aug 2006]

“ … Google is rumored to have a million servers around the world and, according to a knowledgeable source, is already the top electricity user in at least one large U.S. state. .…”

[Fortune, 1 May 2006]

"Just think about where there are windmills, dams, and other natural power sources around the world, and that's where you're going to see server farms,” Ray Ozzie, Chief Software Architect, Microsoft

[Fortune, 1 May 2006]

Performance/Watt is key…..FPGAs are a viable alternative.

Page 17: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 17: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 17

FPGAsFPGAsFPGAsFPGAs in HPC:in HPC:in HPC:in HPC:

ChallengesChallengesChallengesChallenges

HDL-based design flow NOT acceptable!

A software-oriented programming model is NECESSARY

High-level FPGA design tool (ESL) is a must: C-based, MatLab, etc.,?

We have ongoing efforts to enable high-level design flow…

Page 18: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 18: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 18

C Source Code

Compile C to HDL

SOPC Builder

Impulse C

Module

XDI Memory

Controllers

XDI Hypertransport

Interface

Calls SOPC Builder drives

it with scripts

All Components

Connected via Avalon

Fabric

Single and double-precision IEEE 754

floating-point arithmetic supported in

FPGA and inferred from code

1) All RTL for FPGA on XD1000

2) Complete Quartus II Project

3) Complete Quartus II Compile Script

ESL: Our first step:ESL: Our first step:ESL: Our first step:ESL: Our first step:

PSP for PSP for PSP for PSP for ImpulseCImpulseCImpulseCImpulseC

No user-supplied HDLsource code required

Page 19: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 19: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 19

Some FPGA Demo applications…

Page 20: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 20: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 20

� Key Applications

� Derivatives Trading

� Black Scholes Model

� BGM/LIBOR Market Model

� Monte Carlo Simulations

� System Requirements

� Reduce total power consumption

� Cooling capacity is a major concern!

� Run in Tier 1 blade/chassis servers

� Accelerate cycle for trading & risk management decisions

HPC: Financial HPC: Financial HPC: Financial HPC: Financial

AnalyticsAnalyticsAnalyticsAnalytics

02

12

222

=−∂

∂+

∂+

∂rV

S

VrS

S

VS

t

Black-Scholes models the behavior of the price of an

asset (share price of a traded stock, for instance) as a

stochastic process which is described linear parabolic

partial differential equation:

Accelerate mathematical modeling starting from HLL source code

Page 21: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 21: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 21

HPC: Financial HPC: Financial HPC: Financial HPC: Financial

AnalyticsAnalyticsAnalyticsAnalytics…………

Demonstrated 20x speedup today, 50x possible very soon

C to HDL Compile,

HDL synthesis,

Place & Route

16-Lane, 400Mhz DDR,

3.2 GB/s

C Application SoftwareC Inner Loop

Call

Monte Carlo Black-Scholes

Simulation

For 1 to M

Simulations:

Compute Black-

Scholes price

For each

pathway:

Generate Log-Normal

asset price pathways

FPGA

Whitepaper on this case

study available on

XtremeData, ImpulseC and

HT Consortium websites.

Page 22: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 22: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 22

� Hi-Def Video Encoding/Decoding

� Broadcast

� Content Creation

� DVD Authoring

� SD video: (PAL) 704 x 576 at 25 fps

� CPUs

� CPU (~2 GHz) maxed out at 10 fps

� Will not linearly scale up for HD

� FPGAs

� 32 fps

� 14K LEs out of 140K = ~10% utilization of biggest 2006 FPGA.

� Can scale up to 9x with 6 cores: ~300 fps for SD.

� CPU @ 10 fps will encode 2-hour movie in 5 hours

� FPGA @ 300 fps will encode in 10 mins !

� Can easily handle HD video: 1920 x1080 at 30 fps , with < 50% utilized

� 2007 FPGA 2x larger 2006

Video: H.264 EncodingVideo: H.264 EncodingVideo: H.264 EncodingVideo: H.264 Encoding

Page 23: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 23: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 23

FPGA Module:FPGA Module:FPGA Module:FPGA Module:

In the newsIn the newsIn the newsIn the news…………

XtremeData featured in AMD Annual Analyst Day

presentation on “Torrenza” initiative, 1 June 2006

“Altera Launches High-Performance Computing

University Program with AMD, Sun and XtremeData”,

press release, 21 Aug 2006

Plus much more exposure in print and internet media….

Page 24: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 24: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 24

• Objectives:

– Research and drive adoption of:

• FPGA co-processing

• Medical imaging, data analytics, text searches, network security, bioinformatics, & energy apps.

• Programming tool chain

• Partners:

– Altera, AMD, Sun, XtremeData

• 20 Kits Planned (Maybe more in 2007)

– U. of Illinois, Stanford, U. of Florida, Purdue, CMU, UCLA, Boston U, U of Mannheim, etc.

Sun Ultra 40 Workstation

XtremeData XD1000 FPGA co-processor module

AMD Opteron

University ProgramUniversity ProgramUniversity ProgramUniversity Program

Page 25: FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 …ra.ziti.uni-heidelberg.de/coeht/pages/events/20070216/... ·  · 2018-02-02FPGA-Acceleration on COTS x86 Platforms

Slide 25: XtremeData Inc.: Confidential Information

XtremeData, Inc.: Confidential Slide 25

Thank You!