ibm watson - rochester institute of technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf ·...

23
IBM Watson A Power 775 High Performance Computing Cluster Dan Christiani Ben Mihevc

Upload: tranxuyen

Post on 20-Jul-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

IBM WatsonA Power 775 High Performance Computing Cluster

Dan ChristianiBen Mihevc

Page 2: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

● What is Watson● Power 775● Software Stack● Communication● Applications and Motivation● Wrap-up

Agenda

Page 3: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

● Watson is a project to use natural language processing to analyse data.

● The goal is to search through data and draw meaning from it.

● Made famous on Jeopardy in 2011 where it successfully defeated top opponents.

● As an adaptive system, it learns when it is wrongand corrects itself

● A cognitive AI system for understanding complex data and information.

● Multiple Incarnations in IBM research facilities● Involved Universities: RPI, MIT, Carnegie Mellon, UT Austin, ...

What is the IBM Watson

[8]

Page 4: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

The Power 775 Details● 10 Drawers per rack

o 8 Nodes per drawero 4 Processors per node

3.83 GHzo 8 Cores per processor

64 KB L1 cache (32 data, 32 instruction)

256 KB L2 cache 4 MB private L3 cache 32 MB shared L3 cache.

[1][3]

Page 5: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

The Power 775 Details cont.● Makes for an impressive rack

o 80 Nodeso 320 processorso 2560 compute coreso 4 hardware threads per coreo Theoretical computational limit: 78.43 Teraflops

● Interconnected racks can add powero A small setup could utilize 10 Racks

That is 25600 compute cores.

[1][3]

Page 6: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Software Stack● The IBM Parallel Environment

o MPI, LAPI, OpenSHMEM Cross Core Communication

o Parallel Operating Environment Provides job support across nodes

o Load Leveler Scheduler Manages the job scheduling based on

priority and available resourceso IBM Parallel Debugger (pdb)

similar to gdb, but for parallel applications

o Engineering and Scientific Subroutine Library (ESSL) and Parallel ESSL ESSL is a library of optimised

mathematical functions that have been optimised.

Parallel ESSL is ESSL implemented with MPI for scalable parallel computation.

[2][7][9][10]

Page 7: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

IBM Parallel Environment● Parallel Operating Environment (POE)

o Scalably manages over 1,000,000 tasks.o Organizes core dumpso Supports many communication protocols

MPI, LAPI, OpenSHMEM, etc…o Runs on Linux or AIX 7.1

● IBM Parallel Debugger (pdb)o Uses most of the same commands as gdbo Capable of observing the state of running taskso It enables the user to start jobs under debug

control or take running jobs and place them under debug control.

[2][7][10]

Page 8: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

IBM Parallel Environment● LoadLeveler

o A parallel job scheduling application.o Creates a priority queue to determine which

job is running. Considers: execution time, available

resources, time of day, and administrative instructions.

o Also ensures that jobs are only scheduled to functional machines.

[2][7][10]

Page 9: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical Communication● We will start from the lowest physical

component and work our way up○ Processor level 0○ Node level 1○ Cabinet level 2○ Rack level 3

● Collective Acceleration Unit (CAU)○ A Coprocessor for parallel applications.

[2]

Page 10: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical Communication Level 0

[2]

Page 11: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical CommunicationLevel 1

[2]

Page 12: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical CommunicationLevel 2

[2]

Page 13: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical CommunicationLevel 3

Page 14: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Physical CommunicationLevel 3

Basic Modeland CAU flow

[2][6]

Page 15: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

CAU Step 1

[2][6]

Page 16: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

CAU Step 2

[2][6]

Page 17: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

CAU Step 3

[2][6]

Page 18: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

CAU Step 4

[2][6]

Page 19: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

What does this all create?● Watson system were designed for Data

Intensive Super Computing (DISC) ● Search and understand massive stores of

data ○ Telescope Imagery○ Medical Records○ Financial Transaction Records○ Web Pages or any other text

● DISC influences unique research○ System design, resource management○ Programming models, Algorithms

[14][4][5]

Page 20: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

● Pattern Recognitiono Focused on large unstructured data sources

● Healthcare○ Patient Records○ Diagnostics ○ Nurses Notes○ Discharge Papers

Motivation

● Outcomes: +Diagnosis +Treatment +Care Intervention +Machine Learned Research

[13][4][5]

Page 21: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Oh! And we Forgot Jeopardy!● The Jeopardy Challenge

o Competed against historic champions Ken Jennings and Brad Rutter

o 3 Day Tournament in 2011.Results:

Ken Jennings: $24,000Brad Rutter: $21,600Watson: $77,147

Outcome: Definitive Win for Watson“I for one welcome our new computer overlords” - Ken Jennings

[12]

Page 22: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

To Wrap Up...● Much more than Jeopardy

○ Cross industry solution for understanding huge amounts of structured and unstructured data Information.

○ Machine Learning combined with advanced NaturalLanguage Processing allows for powerful and accurate Question-Answer applications.

● Watson’s special software is driven by Parallel High Performance POWER 775 systems developed for Data Intensive Supercomputing.

[5][12]

Page 23: IBM Watson - Rochester Institute of Technologymeseec.ce.rit.edu/756-projects/spring2014/2-2.pdf · o IBM Parallel Debugger (pdb) similar to gdb, but for parallel applications

Citations[1] Maciej Cytowski. (2014, Feb.) PRACE. [Online]. http://www.prace-ri.eu/Best-Practice-Guide-

IBM-Power-775-HTML#id-1.3[2] IBM. (2012, Oct.) IBM RedBooks. [Online].

http://www.redbooks.ibm.com/redbooks/pdfs/sg248003.pdf[3] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/775/specs.html[4] IBM. IBM.com. [Online]. http://www-

03.ibm.com/systems/power/software/i/db2/products/smp.html[5] Mike Cain. (2011, Dec.) IBM.com. [Online].

http://public.dhe.ibm.com/partnerworld/pub/pdf/courses/4aea.pdf[6] (2013, Apr.) wikipedia.com. [Online]. http://en.wikipedia.org/wiki/IBM_RSCT[7] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/software/aix/resources.html

[8] IBM. IBM.com. [Online]. http://www-03.ibm.com/systems/power/hardware/hpc.html

[9] IBM, "MPI Programming Guide. Parallel Environment Runtime Edition," 2013.

[10] IBM. (2013) IBM.com. [Online]. http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.pe.doc%2Fpebooks.html

[11] Manoj Saxena, "IBM Watson Progress and 2013 Roadmap," IBM, 2014. [Online]. http://www.slideshare.net/manojsaxena2/ibm-watson-progress-and-roadmap-saxena

[12] Eric Brown, Jennifer Chu-Carroll, James Fan, David Gondek, Aditya A. Kalyapur, Adam Lally, J. Willian Murdock, Eric Nyberg, John Pragen, Nico Schlaefer, Chris Welty David Ferrucci, "Building Watson: An Overview of the DeepQA Project," Association for the Advancement of Artificial Intelligence, 2010.

[13] IBM Corporation, "IBM Content and Predictive Analytics for Healthcare," IBM Cororation, Somers, 2012.