applying machine learning and artificial intelligence to business

©GoDataScience 1

Applying Machine Learning and AI for Business

• www.GoDataScience.io• Peter Morgan – Chief Data Scientist• Russell Miles – CEO• @godatascience• @pmzepto• @russmiles

http://www.godatascience.io/

©GoDataScience 2

I. ML - Overview• Definition – “Field of study that gives computers the

ability to learn without being explicitly programmed” - Arthur Samuel, 1959• Used for fitting lines/hyperplanes (regression), finding

models, classifying objects, hypothesis testing, etc.• Three main categories of learning• Supervised (labelled data, classifying)• Unsupervised (unlabelled data, clustering)• Reinforcement learning (reward/penalty)

©GoDataScience 3

ML Algorithm Classes• Regression, e.g., linear, logistic• Decision Trees, e.g., CART• Ensemble, e.g., Random Forests• Bayesian, e.g., Naïve Bayes• Artificial Neural Networks, e.g., RNN• Instance, e.g., k-Nearest Neighbour (kNN)• Support Vector Machines (SVM)• Evolutionary, e.g., genetic (mimics natural selection)• Dimensionality Reduction, e.g., PCA• Clustering, e.g., K-means• Reinforcement, e.g., Q-learning• List of ML algos

https://en.wikipedia.org/wiki/List_of_machine_learning_concepts

https://en.wikipedia.org/wiki/List_of_machine_learning_concepts

©GoDataScience 4

ML Applications• Speech recognition• Object recognition and tracking• Spam filtering• Self-driving cars • Recommendation engines • Fraud detection• Search engines, e.g., PageRank• Ad placement• Financial forecasting

©GoDataScience 5

Algorithm References

• http://en.wikipedia.org/wiki/Machine_learning• http://en.wikipedia.org/wiki/Predictive_analytics• http://en.wikipedia.org/wiki/Pattern_recognition• http://en.wikipedia.org/wiki/Support_vector_machine• http://en.wikipedia.org/wiki/Regression_analysis• http://en.wikipedia.org/wiki/Random_forest• http://en.wikipedia.org/wiki/Non-parametric_statistics• http://en.wikipedia.org/wiki/Decision_tree_learning

http://en.wikipedia.org/wiki/Machine_learning



http://en.wikipedia.org/wiki/Predictive_analytics

http://en.wikipedia.org/wiki/Pattern_recognition

http://en.wikipedia.org/wiki/Pattern_recognition

http://en.wikipedia.org/wiki/Support_vector_machine

http://en.wikipedia.org/wiki/Regression_analysis

http://en.wikipedia.org/wiki/Random_forest

http://en.wikipedia.org/wiki/Non-parametric_statistics

http://en.wikipedia.org/wiki/Decision_tree_learning

http://en.wikipedia.org/wiki/Decision_tree_learning

©GoDataScience 6

Open Source ML Toolkits• Brain https://github.com/harthur/brain• Concurrent Pattern http://www.cascading.org/projects/pattern/• Convnetjs https://github.com/karpathy/convnetjs• Decider https://github.com/danielsdeleo/Decider• etcML www.etcml.com• Etsy Conjecture https://github.com/etsy/Conjecture• Google Sibyl https

://plus.google.com/+ResearchatGoogle/posts/7CqQbKfYKQf• GraphX https://amplab.cs.berkeley.edu/publication/graphx-grades/• KNIME http://www.knime.org/• List https://github.com/showcases/machine-learning• ML software http://www.cs.ubc.ca/~murphyk/Software/index.html• MLPNeuralNet https://github.com/nikolaypavlov/MLPNeuralNet

https://github.com/harthur/brain

http://www.cascading.org/projects/pattern/

https://github.com/karpathy/convnetjs

https://github.com/danielsdeleo/Decider

http://www.etcml.com/

https://github.com/etsy/Conjecture

https://plus.google.com/+ResearchatGoogle/posts/7CqQbKfYKQf

https://plus.google.com/+ResearchatGoogle/posts/7CqQbKfYKQf

https://amplab.cs.berkeley.edu/publication/graphx-grades/

https://amplab.cs.berkeley.edu/publication/graphx-grades/

http://www.knime.org/

https://github.com/showcases/machine-learning

http://www.cs.ubc.ca/~murphyk/Software/index.html

https://github.com/nikolaypavlov/MLPNeuralNet

https://github.com/nikolaypavlov/MLPNeuralNet

©GoDataScience 7

Open Source ML Tookits (cont)• MOA http://moa.cs.waikato.ac.nz/• Neurokernel http://neurokernel.github.io/• NuPic https://github.com/numenta/nupic• Orange http://orange.biolab.si/• RapidMiner http://rapidminer.com• Scikit-learn http://scikit-learn.org/stable/• Spark http://spark.apache.org/mllib/• TunedIT http://tunedit.org/• Vahara https://github.com/thedatachef/varaha • Viv http://viv.ai/ • Vowpal Wabbit https://github.com/JohnLangford/vowpal_wabbit/wiki • Weka http://www.cs.waikato.ac.nz/ml/weka/

http://moa.cs.waikato.ac.nz/

http://neurokernel.github.io/

https://github.com/numenta/nupic

https://github.com/numenta/nupic

http://orange.biolab.si/

http://rapidminer.com/

http://scikit-learn.org/stable/

http://spark.apache.org/mllib/

http://spark.apache.org/mllib/

http://tunedit.org/

https://github.com/thedatachef/varaha

http://viv.ai/

https://github.com/JohnLangford/vowpal_wabbit/wiki

https://github.com/JohnLangford/vowpal_wabbit/wiki

http://www.cs.waikato.ac.nz/ml/weka/

http://www.cs.waikato.ac.nz/ml/weka/

©GoDataScience 8

Open Source ML Libraries• Dlib http://dlib.net/ml.html• MADLib http://madlib.net/• Mahout http://mahout.apache.org/• MCMLL http://mcmll.sourceforge.net/• MLC++ http://www.sgi.com/tech/mlc/• mloss http://mloss.org/software/• mlpack http://mlpack.org/• Shogun http://www.shogun-toolbox.org/• Stan http://mc-stan.org/

http://dlib.net/ml.html

http://dlib.net/ml.html

http://madlib.net/

http://madlib.net/

http://mahout.apache.org/

http://mahout.apache.org/

http://mcmll.sourceforge.net/

http://mcmll.sourceforge.net/

http://www.sgi.com/tech/mlc/

http://www.sgi.com/tech/mlc/

http://mloss.org/software/

http://mloss.org/software/

http://mlpack.org/

http://mlpack.org/

http://www.shogun-toolbox.org/

http://www.shogun-toolbox.org/

http://mc-stan.org/

http://mc-stan.org/

©GoDataScience 9

Proprietary ML Toolkits• Ayasdi http://www.ayasdi.com/• BigML https://bigml.com/• H2O http://h2o.ai• IBM Watson http://www.ibm.com/smarterplanet/us/en/ibmwatson/• Matlab http://uk.mathworks.com/solutions/machine-learning/• Nutonian http://www.nutonian.com/• Prediction.io http://prediction.io/• Rocketfuel http://rocketfuel.com/• Skytree http://www.skytree.net/• Trifacta http://www.trifacta.com/• Wolfram Alpha http://www.wolframalpha.com/ • Wise.io http://www.wise.io/• Yhat https://yhathq.com/

http://www.ayasdi.com/

http://www.ayasdi.com/

https://bigml.com/

https://bigml.com/

http://h2o.ai/

http://h2o.ai/

http://www.ibm.com/smarterplanet/us/en/ibmwatson/



http://uk.mathworks.com/solutions/machine-learning/



http://www.nutonian.com/

http://www.nutonian.com/

http://prediction.io/

http://prediction.io/

http://rocketfuel.com/

http://rocketfuel.com/

http://www.skytree.net/

http://www.skytree.net/

http://www.trifacta.com/

http://www.trifacta.com/

http://www.wolframalpha.com/

http://www.wolframalpha.com/

http://www.wise.io/

http://www.wise.io/

https://yhathq.com/

https://yhathq.com/

https://yhathq.com/

©GoDataScience 10

Other ML Resources• MLaaS (Cloud based)

• Microsoft http://azure.microsoft.com/en-us/documentation/services/machine-learning/• Google https://cloud.google.com/products/prediction-api/• AWS https://aws.amazon.com/marketplace/search?page=1&searchTerms=machine+learning

• Conferences• ICML• NIPS

• ML Journals• ML Journal http://www.springer.com/computer/ai/journal/10994• JMLR http://jmlr.org/papers/• Pattern Recognition http://www.jprr.org/index.php/jprr• arXiv http://arxiv.org/list/stat.ML/recent • gitXiv http://gitxiv.com

http://azure.microsoft.com/en-us/documentation/services/machine-learning/




https://cloud.google.com/products/prediction-api/

https://cloud.google.com/products/prediction-api/

https://aws.amazon.com/marketplace/search?page=1&searchTerms=machine+learning

https://aws.amazon.com/marketplace/search?page=1&searchTerms=machine+learning

http://www.springer.com/computer/ai/journal/10994

http://www.springer.com/computer/ai/journal/10994

http://jmlr.org/papers/

http://jmlr.org/papers/

http://www.jprr.org/index.php/jprr



http://arxiv.org/list/stat.ML/recent

http://arxiv.org/list/stat.ML/recent

http://gitxiv.com/

http://gitxiv.com/

http://gitxiv.com/

©GoDataScience 11

References – Machine Learning• Abu-Mostafa, Yaser et al – Learning from Data, AML, 2012• Alpaydın, Ethem - Introduction to Machine Learning, 2nd ed, MIT Press,

2009 • Bishop, Christopher - Pattern Recognition and Machine Learning, Springer,

2007• Domingos, Pedro - The Master Algorithm, Allen Lane, 2015• Flach, Peter - Machine Learning: The Art and Science of Algorithms that

Make Sense of Data, Cambridge University Press, 2012• Mitchell, Tom – Machine Learning, McGraw-Hill, 1997• Murphy, Kevin - Machine Learning: A Probabilistic Perspective, MIT Press,

2012• Rickhert, Willi and Luis Coelho, Building Machine Learning Systems with

Python, Packt, 2013• Witten, Ian et al - Data Mining, Practical Machine Learning Tools and

Techniques, 3rd ed, Morgan Kaufman, 2011

©GoDataScience 12

II. Deep Learning•Aims• To understand what Deep Learning is• Look at some of the common toolkits• How is it being used today• Challenges to overcome

©GoDataScience 13

Deep Learning Overview• Extract patterns and meaning from data• Modeled after how the human brain processes data• DL methods have gained notable successes in the field of speech and

image recognition as well as in cognitive computing• Outperforming other algorithms• They are essentially ANNs • CNN = Convolutional Neural Networks (images)• RNN = Recurrent Neural Networks (speech & text)• LSTM = Long Short Term Memory

©GoDataScience 14

Deep Learning Progress

Progress in machine classification of images - error rate by year. Red line is the error rate of a trained human

©GoDataScience 15

DL Resources• People• Yann LeCun http://yann.lecun.com/• Geff Hinton http://www.cs.toronto.edu/~hinton/• Yoshua Bengio

http://www.iro.umontreal.ca/~bengioy/yoshua_en/index.html• Andrew Ng http://cs.stanford.edu/people/ang/• Quoc Le http://cs.stanford.edu/~quocle/• Jurgen Schmidhuber http://people.idsia.ch/~juergen/

• Blogs & Communities• FastML http://fastml.com/• Chris Olah http://colah.github.io/• Andrej Karparthy http://karpathy.github.io• DeepLearning.net http://deeplearning.net/

http://yann.lecun.com/

http://yann.lecun.com/

http://www.cs.toronto.edu/~hinton/

http://www.cs.toronto.edu/~hinton/

http://www.iro.umontreal.ca/~bengioy/yoshua_en/index.html

http://cs.stanford.edu/people/ang/

http://cs.stanford.edu/people/ang/

http://cs.stanford.edu/~quocle/

http://cs.stanford.edu/~quocle/

http://people.idsia.ch/~juergen/



http://fastml.com/

http://fastml.com/

http://colah.github.io/

http://colah.github.io/

http://karpathy.github.io/



http://deeplearning.net/



©GoDataScience 16

DL Open Source Packages• Caffe http://caffe.berkeleyvision.org• CUDA Convnet https://code.google.com/p/cuda-convnet/• cuDNN https://developer.nvidia.com/cuDNN• Deeplearning4j http://deeplearning4j.org/• PyBrain http://pybrain.org/• PyLearn2 http://deeplearning.net/software/pylearn2/• SINGA http://singa.incubator.apache.org• TensorFlow http://tensorflow.org• Theano http://deeplearning.net/software/theano/• Torch http://torch.ch/• In fact, Google, IBM, Samsung, Microsoft and Baidu have open sourced their

machine learning frameworks all within the space of the last two weeks

http://caffe.berkeleyvision.org/



https://code.google.com/p/cuda-convnet/

https://code.google.com/p/cuda-convnet/

https://developer.nvidia.com/cuDNN

https://developer.nvidia.com/cuDNN

http://deeplearning4j.org/

http://deeplearning4j.org/

http://pybrain.org/

http://pybrain.org/

http://deeplearning.net/software/pylearn2/

http://deeplearning.net/software/pylearn2/

http://singa.incubator.apache.org/

http://singa.incubator.apache.org/

http://tensorflow.org/

http://tensorflow.org/

http://deeplearning.net/software/theano/

http://torch.ch/

http://torch.ch/

©GoDataScience 17

Deep Learning Companies• AlchemyAPI http://www.alchemyapi.com/• Clarifai http://www.clarifai.com/• Deepmind www.google.com• Ersatz Labs http://www.ersatzlabs.com/• Memkite http://memkite.com/• Nervana http://www.nervanasys.com/• Numenta http://numenta.org/• Nvidiahttps://developer.nvidia.com/deep-learning• Skymind http://www.skymind.io/• Vicarious http://vicarious.com/

http://www.alchemyapi.com/

http://www.alchemyapi.com/

http://www.clarifai.com/

http://www.clarifai.com/

http://www.google.com/

http://www.ersatzlabs.com/

http://www.ersatzlabs.com/

http://memkite.com/

http://www.nervanasys.com/

http://numenta.org/

https://developer.nvidia.com/deep-learning



http://www.skymind.io/

http://vicarious.com/

©GoDataScience 18

References – Deep Learning• Bengio, Yoshua et al – Deep Learning, An MIT Press book in

preparation http://goodfeli.github.io/dlbook/• Buduma, Nikhil – Fundamentals of Deep Learning, O’Reilly, 2015• Gibson, A and J. Patterson - Deep Learning: A Practitioner's Approach,

O’Reilly, 2015• Maters, Timothy, Deep Belief Nets in C++ and CUDA C, Vol. 1,

CreateSpace, 2015• Maters, Timothy, Deep Belief Nets in C++ and CUDA C, Vol. 2,

CreateSpace, 2015

http://goodfeli.github.io/dlbook/

©GoDataScience 19

III. Artificial Intelligence• Definition• Overview• History• Applications• Companies• People• Robotics• Opportunities• Threats• Predictions• References

©GoDataScience 20

AI - Definition“Every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it. Machines will solve the kinds of problems now reserved for humans, and improve themselves ”. Dartmouth Summer Research Project on A.I., 1956.

©GoDataScience 21

Artificial Intelligence Overview

• Agents that learn from and adapt to their environments while achieving goals• Similar to living organisms• Multimodal is goal• AGI - endgame

• New software/algorithms• Neural networks• Deep learning

• New hardware • GPU’s• Neuromorphic chips

• Cloud Enabled• Intelligence in the cloud• IaaS (Watson)• Cloud Robotics

©GoDataScience 22

The Bigger Picture

Universe ComputerScience AI

©GoDataScience 23

AI History I• 1940’s – First computers• 1950 – Turing Machine

• Turing, A.M., Computing Machinery and Intelligence, Mind 49: 433-460, 1950• 1951 – Minsky builds SNARC, a neural network at MIT• 1956 - Dartmouth Summer Research Project on A.I. • 1957 – Samuel drafts algos (Prinz) • 1959 - John McCarthy and Marvin Minsky founded the MIT AI Lab.• 1960’s - Ray Solomonoff lays the foundations of a mathematical theory of AI,

introducing universal Bayesian methods for inductive inference and prediction

©GoDataScience 24

AI History II• 1969 - Shakey the robot at Stanford• 1970s – AI Winter I• 1970s - Natural Language Processing (Symbolic) • 1979 – Music programmes by Kurzweil and Lucas • 1980 – First AAAI conference• 1981 – Connection Machine (parallel AI)• 1980s - Rule Based Expert Systems (Symbolic)• 1985 – Back propagation• 1987 – “The Society of Mind” by Marvin Minsky published• 1990s - AI Winter II (Narrow AI)• 1994 – First self-driving car road test – in Paris• 1997 - Deep Blue beats Gary Kasparov

©GoDataScience 25

AI History III• 2004 - DARPA introduces the DARPA Grand Challenge requiring

competitors to produce autonomous vehicles for prize money• 2007 - Checkers is solved by a team of researchers at the

University of Alberta• 2009 - Google builds self driving car• 2010s - Statistical Machine Learning, algorithms that learn from

raw data • 2011 - Watson beats Ken Jennings and Brad Rutter on Jeopardy• 2012+ Deep Learning (DL); Sub-Symbolic• 2013 - E.U. Human Brain Project (model brain by 2023) • 2014 – Human vision surpassed by DL systems at Google, Baidu,

Facebook• 2015 – Machine dreaming (Google and Facebook NN’s)

©GoDataScience 26

AI Applications• Finance

• Asset allocation• Algo trading

• Fraud detection • Cybersecurity• eCommerce • Search • Manufacturing• Medicine• Law• Business Analytics• Ad serving• Recommendation engines• Smart homes

• Robotics • Industry• Consumer• Space• Military

• UAV (cars, drones etc.)• Scientific discovery• Mathematical theorems• Route Planning• Virtual Assistants• Personalisation• Compose music• Write stories

©GoDataScience 27

AI Applications (cont)• Computer vision• Speech recognition• NLP• Translation• Call centres• Rescue operations• Policing• Military• Political• National security• Anything a human can do but faster and more accurate –

creating, reasoning, decision making, prediction• Google – introduced 60 DL products in last 2 years (Jeff Dean)

©GoDataScience 28

AI Applications - Examples• AI can do all these things already today:

• Translating an article from Chinese to English• Translating speech from Chinese to English, in real time• Identifying all the chairs/faces in an image• Transcribing a conversation at a party (with background noise)• Folding your laundry (robotics)• Proving new theorems (ATP)• Automatically replying to your email, and scheduling

©GoDataScience 29

Learning and doing - from watching videos• Researchers at the University of Maryland, funded by DARPA’s

Mathematics of Sensing, Exploitation and Execution (MSEE) program• System that enables robots to process visual data from a series of

“how to” cooking videos on YouTube - and then cook a meal

©GoDataScience 30

AI Performance evaluation

• Optimal: it is not possible to perform better• Checkers, Rubik’s cube, some poker

• Strong super-human: performs better than all humans• Chess, scrabble, question-answer

• Super-human: performs better than most humans• Backgammon, cars, crosswords

• Par-human: performs similarly to most humans• Go, Image recognition, OCR

• Sub-human: performs worse than most humans• Translation, speech recognition, handwriting

©GoDataScience 31

AI Corporations• IBM Watson• Google Deepmind etc.• Microsoft Project Adam • Facebook • Baidu• Yahoo!

©GoDataScience 32

AI Startups• Numenta • OpenCog • Vicarious • Clarafai • Sentient• Nurture • Wit.ai • Cortical.io• Viv.ai Number is growing rapidly (daily?)

©GoDataScience 33

Robotics - Embodied AI1. Industrial Robotics• Manufacturing (Baxter)• Warehousing (Amazon)• Police/Security• Military• Surgery• Drones (UAV’s)• Self-driving cars• Trains• Ships• Planes• Underwater

©GoDataScience 34

2. Consumer Robotics• Robots with friendly user interface that can understand user’s

emotions• Visual; facial emotions• Tone of voice

• Caretaking• Elderly• Young

• EmoSpark, Echo• Education• Home security• Housekeeping• Companionship• Artificial limbs• Exoskeletons

©GoDataScience 35

Robots & Robotics Companies• Sawyer (ReThink)• iCub (EU) • Asimo (Honda)• Nao (Aldebaran)• Pepper (Softbank) • Many (Google)• Roomba (iRobot) • Kiva (Amazon)• Many (KUKA)• Jibo (startup)• Milo (Robokind)• Oshbot (Fellows)• Valkyrie (NASA)• DURUS (SRI)

©GoDataScience 36

AI & Robotics Websites

• Jobs, News, Trade• Robotics Business review• AI Hub• AZoRobotics• Robohub• Robotics News• I-Programmer

http://www.roboticsbusinessreview.com/

http://aihub.net/

http://www.azorobotics.com/

http://robohub.org/

http://www.roboticsnews.co.uk/

http://www.i-programmer.info/news/105-artificial-intelligence.html

©GoDataScience 37

Opportunities• Free humans to pursue arts and sciences

• The Venus Project

• Solve deep challenges (political, economic, scientific, social)• Accelerate new discoveries in science, technology, medicine

(illness and aging)• Creation of new types of jobs• Increased efficiencies in every market space

• Industry 4.0 (steam, electric, digital, intelligence)

• Faster, cheaper, more accurate• Replace mundane, repetitive jobs• Human-Robot collaboration• A smarter planet

©GoDataScience 38

Threats• Unemployment due to automation

• Replace some jobs but create new ones?• What will these be?

• Widen the inequality gap• New economic paradigm needed• Basic Income Guarantee?

• Existential risk• AI Safety • FHI/FLI/CSER/MIRI

• Legal + Ethical issues• New laws• Machine rights• Personhood

©GoDataScience 39

AI Safety - Oversight

• BARA = British Automation and Robot Association• EU Robotics• RIA = Robotic Industries Association• IFR = International Federation of Robotics• ISO – Robotics

http://www.bara.org.uk/

http://www.eu-robotics.net/

http://www.robotics.org/



http://www.ifr.org/

http://www.sis.se/popup/iso/isotc184sc2/index.asp

©GoDataScience 40

Organisations - xRisk• FHI = Future of Humanity Institute• Oxford

• FLI = Future of Life Institute• MIT• $7million grants awarded in June

• MIRI = Machine Intelligence Research Institute• San Francisco

• CSER = Center for Science and Existential Risk• Cambridge

• AI Safety Facebook Group

https://www.facebook.com/groups/467062423469736/

©GoDataScience 41

Predictions*• More robots (exponential increase)• More automation (everywhere)

• Endgame is to automate all work• 50% will be automated by 2035

• Loosely autonomous agents (2015)• Semi-automomous agents (2020)• Fully autonomous agents (2025)• Cyborgs (has started – biohackers, implants)• Singularity (2029?) – smarter than us• Self-aware? (personhood)• Quantum computing

• Game changer• Quantum algorithms• Dwave

• Advances in science and medicine• Ethics (more debate)• Regulation (safety issues)

*Remembering that progress in technology follows variousexponentially increasing curves - see “The Singularity is Near”, by Ray Kurzweil.

©GoDataScience 43

References• Barrat, James, Our Final Invention, St. Martin's Griffin, 2014• Brynjolfsson, Erik and Andrew McAfee, The Second Machine

Age, W.W. Norton & Co., 2014• Ford, Martin, Rise of the Robots: Technology and the Threat of

a Jobless Future, Basic Books, 2015• Goertzel, Ben et al - Engineering General Intelligence, Part 1,

Atlantis Press, 2014• Hawkins, Jeff – On Intelligence, Owl, 2005• Kaku, Michio, The Future of the Mind, Doubleday, 2014• Kaplan, Jerry – Humans Need Not Apply, Yale University Press,

2015

©GoDataScience 44

References (cont)• Kurzweil, Ray, The Singularity is Near, Penguin Books, 2006• Kurzweil, Ray, How to Create a Mind, Penguin Books, 2013• Marcus, G. and J. Freeman (eds) - The Future of the Brain: Essays by the

World's Leading Neuroscientists, Princeton, 2014• Markoff, John – Machines of Loving Grace, Ecco Press, 2015• Nowak, Peter, Humans 3.0: The Upgrading of the Species, Lyons Press, 2015 • Russell and Norvig, Artificial Intelligence, A Modern Approach, Pearson,

2009• Shanahan, Murray – The Technological Singularity, MIT Press, 2015

Thanks for listening!!

www.godatascience.io@godatascience@pmzepto@russmiles

http://www.godatascience.io/