tensorflow - la ia detrás de google

+Israel Blancas@iblancasa

TensorFlowLa IA detrás de Google

#DevFestGRX

GDG Granada

How Can You Get Started with Machine Learning?Three ways, with varying complexity:

(1) Use a Cloud-based or Mobile API (Vision, Natural Language, etc.)

(2) Use an existing model architecture, and retrain it or fine tune on your dataset

(3) Develop your own machine learning models for new problems

More flexible,

but more effort

required

#DevFestGRX

GDG Granada

Cloud Machine Learning APIs See, Hear and Understand the world

#DevFestGRX

GDG Granada

Cloud Natural Language

Cloud Speech

Cloud Translate

Cloud Vision

#DevFestGRX

GDG Granada

55

Vision APIhttps://cloud.google.com/vision/

#DevFestGRX

GDG Granada

FacesFaces, facial landmarks, emotions

OCRRead and extract text, with support for > 10 languages

LabelDetect entities from furniture to transportation

LogosIdentify product logos

Landmarks & Image PropertiesDetect landmarks & dominant color of image

Safe SearchDetect explicit content - adult, violent, medical and spoof

Cloud Vision API#DevFestGRX

GDG Granada

API Usage: Detect Objects in an Image

Image DetectedItems

Vision API

Create JSON request with the image or pointer to an image

Process the JSON response

Call the REST API1 2 3

#DevFestGRX

GDG Granada

88

Cloud Natural Language API

https://cloud.google.com/natural-language/

#DevFestGRX

GDG Granada

Confidential & ProprietaryGoogle Cloud Platform 9

Cloud Natural Language API

Extract sentence, identify parts of speech and create dependency parse trees for each sentence.

Identify entities and label by types such as person, organization, location, events, products and media.

Understand the overall sentiment of a block of text.

Syntax Analysis Entity Recognition

Sentiment Analysis

#DevFestGRX

GDG Granada

1010

Cloud Speech API Demo

https://cloud.google.com/speech/

#DevFestGRX

GDG Granada

Confidential & ProprietaryGoogle Cloud Platform 11

Cloud Speech API

Automatic Speech Recognition (ASR) powered by deep learning neural networking to power your applications like voice search or speech transcription.

Recognizes over 80 languages and variants with an extensive vocabulary.

Returns partial recognition results immediately, as they become available.

Filter inappropriate content in text results.

Audio input can be captured by an application’s microphone or sent from a pre-recorded audio file. Multiple audio file formats are supported, including FLAC, AMR, PCMU and linear-16.

Handles noisy audio from many environments without requiring additional noise cancellation.

Audio files can be uploaded in the request and, in future releases, integrated with Google Cloud Storage.

Automatic Speech Recognition Global Vocabulary Inappropriate Content Filtering

Streaming Recognition

Real-time or Buffered Audio Support Noisy Audio Handling Integrated API

#DevFestGRX

GDG Granada

Mobile Vision APIProviding on-device vision for applications

#DevFestGRX

GDG Granada

Face APIfaces, facial landmarks, eyes open, smiling

Barcode API1D and 2D barcodes

Text APILatin-based text / structure

Common Mobile Vision APISupport for fast image and video on-device detection and tracking.

#DevFestGRX

GDG Granada

Googly Eyes Android App

Video credit Google

1. Create a face detector for facial landmarks (e.g., eyes)

3. For each face, draw the eyes

FaceDetector detector = new FaceDetector.Builder() .setLandmarkType(FaceDetector.ALL_LANDMARKS) .build();

SparseArray<Face> faces = detector.detect(image);

for (int i = 0; i < faces.size(); ++i) { Face face = faces.valueAt(i); for (Landmark landmark : face.getLandmarks()) { // Draw eyes

2. Detect faces in the image

#DevFestGRX

GDG Granada

Face APIPhoto credit developers.google.com/vision

#DevFestGRX

GDG Granada

Easy to use Java API

image detecteditems

Detector

1. Create a detector object2. detectedItems = detector.detect(image)

Photo credit developers.google.com/vision

#DevFestGRX

GDG Granada

Text DetectionLatin based languageUnderstand text structure

Photo credit Getty Images

#DevFestGRX

GDG Granada

Text Structure

Blocks

Lines

Words

Lines

Words Words Words

#DevFestGRX

GDG Granada

Barcode Detection1D barcodes

EAN-13/8 UPC-A/E Code-39/93/128 ITF Codabar

2D barcodes QR Code Data Matrix PDF-417 AZTEC

UPC

DataMatrix

QR Code

PDF 417

Video and image credit Google

#DevFestGRX

GDG Granada

Combined Vision & Translation#DevFestGRX

GDG Granada

● Open source Machine Learning library

● Especially useful forDeep Learning

● For research and production

● Apache 2.0 license

#DevFestGRX

GDG Granada

Architecture● Core in C++● Different front ends

○ Python and C++ today, community may add more

Core TensorFlow Execution System

CPU GPU Android iOS ...

C++ front end Python front end ...

#DevFestGRX

GDG Granada

Raspberry PiDatacentersYour laptop Android iOS

Portable & Scalable#DevFestGRX

GDG Granada

/Serving

Open-source solution for serving trained models

#DevFestGRX

GDG Granada

From the whitepaper: “TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms.”

In short: TensorFlow is Theano++.

● Symbolic ML dataflow framework that compiles to native / GPU code

What is TensorFlow?

#DevFestGRX

GDG Granada

Data Flow Graphs

Computation is defined as a directed acyclic graph (DAG) to optimize an objective function

● Graph is defined in high-level language (Python)● Graph is compiled and optimized● Graph is executed (in parts or fully) on available low

level devices (CPU, GPU)● Data (tensors) flow through the graph● TensorFlow can compute gradients automatically

#DevFestGRX

GDG Granada

Graph?#DevFestGRX

GDG Granada

Variables are 0-ary stateful nodes which output their current value.

(State is retained across multiple executions of a graph.)

(parameters, gradient stores, eligibility traces, …)

Graph?#DevFestGRX

GDG Granada

Placeholders are 0-ary nodes whose value is fed in at execution time.

(inputs, variable learning rates, …)

Graph?#DevFestGRX

GDG Granada

Mathematical operations:MatMul: Multiply two matrix values.Add: Add elementwise (with broadcasting).ReLU: Activate with elementwise rectified linear function.

Graph?#DevFestGRX

GDG Granada

In code, please!

1. Create model weights, including initialization

a. W ~ Uniform(-1, 1); b = 02. Create input placeholder x

a. m * 784 input matrix3. Create computation graph

import tensorflow as tf

b = tf.Variable(tf.zeros((100,)))

W = tf.Variable(tf.random_uniform((784, 100),

-1, 1))

x = tf.placeholder(tf.float32, (None, 784))

h_i = tf.nn.relu(tf.matmul(x, W) + b)

1

23

#DevFestGRX

GDG Granada

So far we have defined a graph.

We can deploy this graph with a session: a binding to a particular execution context (e.g. CPU, GPU)

How do we run it?#DevFestGRX

GDG Granada

Getting output

sess.run(fetches, feeds)

Fetches: List of graph nodes. Return the outputs of these nodes.

Feeds: Dictionary mapping from graph nodes to concrete values. Specifies the value of each graph node given in the dictionary.

import numpy as np

import tensorflow as tf

b = tf.Variable(tf.zeros((100,)))

W = tf.Variable(tf.random_uniform((784,

100),-1, 1))

x = tf.placeholder(tf.float32, (None, 784))

h_i = tf.nn.relu(tf.matmul(x, W) + b)

1

23

sess = tf.Session()

sess.run(tf.initialize_all_variables())

sess.run(h_i, {x: np.random.random(64, 784)})

#DevFestGRX

GDG Granada

1. Build a grapha. Graph contains parameter specifications, model architecture, optimization process, …

2. Initialize a session3. Fetch and feed data with

Session.runa. Compilation, optimization, etc. happens at this step — you probably won’t notice

Basic flow#DevFestGRX

GDG Granada

Questions?

#DevFestGRX

GDG Granada

tensorflow.orggithub.com/tensorflow

Want to learn more?Udacity class on Deep Learning, goo.gl/iHssII

Guides, codelabs, videosMNIST for Beginners, goo.gl/tx8R2bTF Learn Quickstart, goo.gl/uiefRnTensorFlow for Poets, goo.gl/bVjFILML Recipes, goo.gl/KewA03TensorFlow and Deep Learning without a PhD, goo.gl/pHeXe7

What's Next#DevFestGRX

GDG Granada

http://tensorflow.org

http://tensorflow.org

http://github.com/tensorflow

http://github.com/tensorflow

http://goo.gl/iHssII

http://goo.gl/tx8R2b

http://goo.gl/uiefRn

http://goo.gl/UDFGzh

https://goo.gl/KewA03

http://goo.gl/pHeXe7

+Israel Blancas@iblancasa

TensorFlowLa IA detrás de Google

#DevFestGRX

GDG Granada

tensorflow - la ia detrás de google

Software