[html5j Robot Club, 7th Study Session] Microsoft Cognitive Toolkit (CNTK) Overview

Post on 07-Jan-2017


The Microsoft Cognitive Toolkit (CNTK)

Microsoft’s open-source deep-learning toolkit

• ease of use: what, not how

• fast

• flexible

• 1st-class on Linux and Windows

• internal=external version

deep learning at Microsoft

• Microsoft Cognitive Services

• Skype Translator

• Cortana

• Bing

• HoloLens

• Microsoft Research

ImageNet: Microsoft 2015 ResNet

ImageNet classification top-5 error (%):

ILSVRC2010  NEC America  28.2
ILSVRC2011  Xerox        25.8
ILSVRC2012  AlexNet      16.4
ILSVRC2013  Clarifai     11.7
ILSVRC2014  VGG           7.3
ILSVRC2014  GoogLeNet     6.7
ILSVRC2015  ResNet        3.5

Microsoft took first place in all five categories that year: ImageNet classification, ImageNet localization, ImageNet detection, COCO detection, and COCO segmentation


Microsoft’s historic speech breakthrough

• Microsoft 2016 research system for conversational speech recognition

• 5.9% word-error rate

• enabled by CNTK’s multi-server scalability

[W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke, D. Yu, G. Zweig: “Achieving Human Parity in Conversational Speech Recognition,” https://arxiv.org/abs/1610.05256]

• CNTK is Microsoft’s open-source, cross-platform toolkit for learning and evaluating deep neural networks.

• CNTK expresses (nearly) arbitrary neural networks by composing simple building blocks into complex computational networks, supporting relevant network types and applications.

• CNTK is production-ready: State-of-the-art accuracy, efficient, and scales to multi-GPU/multi-server.

CNTK “Cognitive Toolkit”

• open-source model inside and outside the company

• created by Microsoft Speech researchers (Dong Yu et al.) in 2012 as the “Computational Network Toolkit”

• open-sourced (CodePlex) in early 2015

• on GitHub since Jan 2016 under permissive license

• Python support since Oct 2016 (beta), rebranded as “Cognitive Toolkit”

• used by Microsoft product groups; but virtually all code development is out in the open

• external contributions e.g. from MIT and Stanford

• Linux, Windows, Docker, cuDNN 5; next: CUDA 8

• Python and C++ API (beta; C#/.Net on roadmap)

“CNTK is Microsoft’s open-source, cross-platform toolkit for learning and evaluating deep neural networks.”

“CNTK expresses (nearly) arbitrary neural networks by composing simple building blocks into complex computational networks, supporting relevant network types and applications.”




“CNTK expresses (nearly) arbitrary neural networks by composing simple building blocks into complex computational networks, supporting relevant network types and applications.”

example: 2-hidden-layer feed-forward NN

h1 = σ(W1 x + b1)               h1 = sigmoid (x @ W1 + b1)
h2 = σ(W2 h1 + b2)              h2 = sigmoid (h1 @ W2 + b2)
P  = softmax(Wout h2 + bout)    P  = softmax (h2 @ Wout + bout)

with input x ∈ R^M and one-hot label y ∈ R^J

and cross-entropy training criterion

ce = y^T log P                  ce = cross_entropy (P, y)

Σ_corpus ce = max
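The formulas above can be sketched as a toy forward pass in plain NumPy (this is not CNTK code; the dimensions M=4, H=8, J=3 and the random weights are made up for illustration):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())            # shift by max for numerical stability
    return e / e.sum()

def cross_entropy(P, y):
    # the slide's criterion ce = y^T log P, maximized over the corpus
    # (the usual cross-entropy *loss* is its negation, minimized)
    return float(y @ np.log(P))

rng = np.random.default_rng(0)
M, H, J = 4, 8, 3                      # input dim, hidden dim, classes (made up)
W1, b1 = 0.1 * rng.standard_normal((M, H)), np.zeros(H)
W2, b2 = 0.1 * rng.standard_normal((H, H)), np.zeros(H)
Wout, bout = 0.1 * rng.standard_normal((H, J)), np.zeros(J)

x = rng.standard_normal(M)             # input x in R^M
y = np.eye(J)[1]                       # one-hot label y in R^J

h1 = sigmoid(x @ W1 + b1)
h2 = sigmoid(h1 @ W2 + b2)
P = softmax(h2 @ Wout + bout)
ce = cross_entropy(P, y)
```

Note that the left column of the slide is linear algebra (W x) while the right column is the Python convention (x @ W); the two differ only in whether vectors are treated as columns or rows.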


h1 = sigmoid (x @ W1 + b1)

h2 = sigmoid (h1 @ W2 + b2)

P = softmax (h2 @ Wout + bout)

ce = cross_entropy (P, y)

[diagram: computational graph of the network above — inputs x and y; parameters W1, b1, W2, b2, Wout, bout; nodes (+, sigmoid) → h1, (+, sigmoid) → h2, (+, softmax) → P, cross_entropy(P, y) → ce]


• nodes: functions (primitives)
  • can be composed into reusable composites
• edges: values
  • incl. tensors, sparse
• automatic differentiation
  • ∂F / ∂in = ∂F / ∂out ∙ ∂out / ∂in
• deferred computation execution engine
• editable, clonable

LEGO-like composability allows CNTK to support a wide range of networks & applications
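The chain rule in the bullet above is the whole trick behind automatic differentiation: each node supplies only its local derivative ∂out/∂in, and the gradient flowing back from above is multiplied through it. A minimal hand-rolled sketch for a single sigmoid node (not CNTK internals), checked against a finite-difference estimate:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([0.5, -1.0, 2.0])
out = sigmoid(z)                        # forward pass through one node

# Backward pass: dF/din = dF/dout * dout/din (the slide's chain rule).
# Take F = sum(out), so the incoming gradient dF/dout is all ones.
dF_dout = np.ones_like(out)
dout_din = out * (1.0 - out)            # local derivative of sigmoid
dF_din = dF_dout * dout_din

# Sanity check: central finite differences along the all-ones direction
# approximate the sum of the partial derivatives.
eps = 1e-6
numeric = (sigmoid(z + eps).sum() - sigmoid(z - eps).sum()) / (2 * eps)
```

Because every node only needs this local rule, arbitrary compositions of nodes can be differentiated mechanically by walking the graph backwards.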

Cognitive Toolkit Demo

MNIST Handwritten Digit Recognition on Jupyter Notebook

https://notebooks.azure.com/library/cntkbeta2

Benchmarking on a single server by HKBU

“CNTK is production-ready: State-of-the-art accuracy, efficient, and scales to multi-GPU/multi-server.”

            FCN-8   AlexNet         ResNet-50       LSTM-64
CNTK        0.037   0.040 (0.054)   0.207 (0.245)   0.122
Caffe       0.038   0.026 (0.033)   0.307 (-)       -
TensorFlow  0.063   - (0.058)       - (0.346)       0.144
Torch       0.048   0.033 (0.038)   0.188 (0.215)   0.194

(G980; seconds per minibatch, lower = better)

“CNTK is production-ready: State-of-the-art accuracy, efficient, and scales to multi-GPU/multi-server.”

speed comparison (samples/second), higher = better [note: December 2015]

[chart: throughput of CNTK, Theano, TensorFlow, Torch 7, and Caffe on 1 GPU, 1 x 4 GPUs, and 2 x 4 GPUs (8 GPUs); y-axis 0 to 80,000 samples/second]

• Theano only supports 1 GPU

• achieved with 1-bit gradient quantization algorithm
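The 1-bit gradient quantization mentioned above can be sketched as follows: each gradient element is transmitted between servers as a single sign bit times one shared scale, and the quantization error is added back into the next minibatch's gradient (error feedback), so nothing is lost over time. A toy illustration with made-up names and data, not CNTK's actual 1-bit SGD implementation:

```python
import numpy as np

def quantize_1bit(grad, residual):
    """Quantize a gradient to one sign bit per element plus one shared
    scale, carrying the quantization error into the next call."""
    g = grad + residual                   # add error left over from last step
    scale = np.mean(np.abs(g))            # one shared magnitude per tensor
    q = np.where(g >= 0, scale, -scale)   # effectively 1 bit per element
    new_residual = g - q                  # remember what was lost
    return q, new_residual

rng = np.random.default_rng(0)
residual = np.zeros(5)
total_true, total_sent = np.zeros(5), np.zeros(5)
for _ in range(200):
    grad = rng.standard_normal(5)         # fake per-minibatch gradient
    q, residual = quantize_1bit(grad, residual)
    total_true += grad
    total_sent += q
# With error feedback, the accumulated quantized gradients equal the
# accumulated true gradients up to the residual still in flight.
```

This is why the compression barely hurts convergence while cutting the communication volume per gradient element from 32 bits to roughly 1 bit.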

Azure NC-Instances

          NC6                NC12               NC24               NC24r
Cores     6                  12                 24                 24
GPU       1 K80 GPU          2 K80 GPUs         4 K80 GPUs         4 K80 GPUs
          (1/2 phys. card)   (1 phys. card)     (2 phys. cards)    (2 phys. cards)
Memory    56 GB              112 GB             224 GB             224 GB
Disk      ~380 GB SSD        ~680 GB SSD        ~1.5 TB SSD        ~1.5 TB SSD
Network   Azure Network      Azure Network      Azure Network      InfiniBand

• ease of use
  • what, not how
  • powerful stock library

• fast
  • optimized for NVIDIA GPUs & libraries
  • best-in-class multi-GPU/multi-server algorithms

• flexible
  • powerful & composable Python and C++ API

• 1st-class on Linux and Windows

• internal=external version

CNTK: democratizing the AI tool chain

• Web site: https://cntk.ai/

• Github: https://github.com/Microsoft/CNTK

• Wiki: https://github.com/Microsoft/CNTK/wiki

• Issues: https://github.com/Microsoft/CNTK/issues

CNTK: democratizing the AI tool chain

Seeing AI
