Do More, Code Less with Parallel Computing Libraries

Post on 10-Aug-2015


Do More, Code Less with Parallel Computing Libraries

Fu Jie, 2012.Dec.15

General Artificial Intelligence

A Large-Scale Model of the Functioning Brain, Science, 2012

How can we get a competitive advantage with data?
• More data
• Better algorithms

HOW?

If you have a lot of time on your hands

Parallel Computing with Jacket GPU library

Easy GPU Acceleration of MATLAB code

No GPU-specific stuff involved

no kernels, no threads, no blocks, just regular M code
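Jacket itself accelerates MATLAB's M code, so the snippet below is not Jacket code; it is a minimal Python sketch of the same drop-in philosophy, using NumPy's array API (which GPU array libraries such as CuPy deliberately mirror, so the same lines could target a GPU by swapping the import):

```python
import numpy as np  # GPU libraries such as CuPy mirror this API

# Plain array code: no kernels, no threads, no blocks.
A = np.arange(16.0).reshape(4, 4)
B = np.eye(4)

C = A @ B               # matrix multiply, dispatched to the library's backend
total = float(C.sum())  # reduction, also handled by the library
print(total)            # sum of 0..15 = 120.0
```

The point mirrors the slide: the programmer writes ordinary array expressions, and each new library release can speed them up without any change to this code.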

Easy to Maintain

• Each new library release improves the speed of our code, without any code modification
• Each new library release leverages the latest GPU hardware, without any code modification

Needless to Say, We Need Machine Learning for Big Data

48 Hours a Minute: YouTube
24 Million Wikipedia Pages
750 Million Facebook Users
6 Billion Flickr Photos

“… data a new class of economic asset, like currency or gold.”

How will we design and implement parallel learning systems?

Big Learning

A Shift Towards Parallelism

GPUs Multicore Clusters Clouds Supercomputers

• ML experts repeatedly solve the same parallel design challenges:
  • Race conditions, distributed state, communication…
• The resulting code is:
  • difficult to maintain, extend, debug…

Graduate students

Avoid these problems by using high-level abstractions

Data Parallelism (MapReduce)

[Figure: a table of values split across CPU 1–4, each core computing on its own slice of the data]

Solve a huge number of independent subproblems
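The slide's picture of data split across four CPUs can be sketched with Python's standard-library process pool (the data and the `square` function are illustrative): the map phase runs the independent subproblems in parallel, and the reduce phase combines the partial results.

```python
from multiprocessing import Pool

def square(x):
    # Each subproblem is independent, so workers need no coordination.
    return x * x

if __name__ == "__main__":
    data = [12.9, 42.3, 21.3, 25.8, 24.1, 84.3]
    with Pool(4) as pool:                # four workers, like CPU 1..4 on the slide
        mapped = pool.map(square, data)  # map phase: runs in parallel
    result = sum(mapped)                 # reduce phase: combine partial results
    print(result)
```

Because the subproblems share no state, the same program scales from one multicore machine to a cluster simply by swapping the pool implementation.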

What is this?

It’s next to this…

Addressing Graph-Parallel ML

Data-Parallel (Map Reduce):
• Cross Validation
• Feature Extraction
• Computing Sufficient Statistics

Graph-Parallel (Map Reduce? Graph-Parallel Abstraction):
• Graphical Models: Gibbs Sampling, Belief Propagation, Variational Opt.
• Semi-Supervised Learning: Label Propagation, CoEM
• Data-Mining: PageRank, Triangle Counting
• Collaborative Filtering: Tensor Factorization

• Designed specifically for ML:
  • Graph dependencies
  • Iterative
  • Asynchronous
  • Dynamic
• Simplifies design of parallel programs:
  • Abstract away hardware issues
  • Automatic data synchronization
  • Addresses multiple hardware architectures
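As a concrete instance of the vertex-centric style such an abstraction encourages, here is PageRank on a toy graph (the graph and variable names are illustrative, not any framework's actual API): each vertex repeatedly updates its own value from its neighbors' values, and the framework, rather than the ML expert, would handle scheduling and synchronization.

```python
# Toy directed graph: vertex -> list of vertices it links to (illustrative data).
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
vertices = list(links)
d = 0.85  # standard PageRank damping factor

rank = {v: 1.0 / len(vertices) for v in vertices}

for _ in range(50):  # synchronous rounds; a real engine can schedule asynchronously
    incoming = {v: 0.0 for v in vertices}
    for v, outs in links.items():
        for u in outs:                       # scatter this vertex's rank to neighbors
            incoming[u] += rank[v] / len(outs)
    # "Vertex program": each vertex updates itself from gathered neighbor values.
    rank = {v: (1 - d) / len(vertices) + d * incoming[v] for v in vertices}

print(max(rank, key=rank.get))  # "c" collects the most incoming links here
```

The per-vertex update is the only code the ML expert writes; race conditions, distributed state, and communication stay inside the engine.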

Know how to solve ML problem on 1 machine → Efficient parallel predictions
