machine learning and pattern recognition *1cm basics fileother books d. barber, bayesian reasoning...

25
Machine Learning and Pattern Recognition Basics Fabio G. Cozman August 31, 2018

Upload: truonglien

Post on 20-Aug-2019

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Machine Learningand Pattern Recognition

Basics

Fabio G. Cozman

August 31, 2018

Page 2: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Overview

Two exams (P1 and P2), plus a fewassignments (average T).

Grading: (P1 + 2 P2 + T)/4.

Textbook: G. James, D. Witten, T. Hastie, R.Tibshirani. An Introduction to StatisticalLearning, Springer 2013.

Data: UCI repository, Kaggle.

Page 3: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Other books

D. Barber, Bayesian Reasoning and MachineLearning, Cambridge 2012.

C. M. Bishop. Pattern Recognition andMachine Learning, Springer 2006.

K. R. Murphy. Machine Learning: AProbabilistic Perspective, MIT Press, 2012.

Page 4: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Topics

Basic technology, kNN.Probability, naive Bayes; Bayesian networks.MLE; Bayesian estimation; EM; MCMC.Statistical tests; bootstrap; comparisonbetween methods.Regression (linear, lasso, etc).Decision trees, random forests.SVMs; boosting.Clustering (k-means, dendrograms; latentmodels).PCA, NMF; recommendation systems.MDPs and reinforcement learning.Neural networks; deep learning; deepreinforcement learning.

Page 5: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

What is machine learning?

T. Mitchell, Machine Learning, McGraw-Hill1997:

The field of machine learning isconcerned with the question of howto construct computer programs thatautomatically improve withexperience.

Comment: this is very general. A computermight learn by watching a human perform atask once. Or by running some sort of “geneticevolution” simulation. Or a computer mightlearn by collecting and analyzing data(statistical machine learning).

Page 6: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

What is statistical machine learning?

Barber:Machine Learning is the study ofdata-driven methods capable ofmimicking, understanding and aidinghuman and biological informationprocessing tasks.

James et al.:Statistical learning refers to a vast setof tools for understanding data.

Comment: this definition equates statisticallearning and the whole of Statistics. Are theydifferent?

Page 7: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

A definition

Statistical machine learning focuses on how tobuild computer programs that can improve bycollecting and analyzing data.

Today, there is a sea of data waiting to beanalyzed automatically, hence the currentsuccess of statistical machine learning.How is this effort related to other areas?

Artificial intelligence.Statistics.Econometrics, control theory, databases, etc.

Page 8: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

A bit of AI

Definition (??)Systems that think/act like humans (or at leastwith some degree of intelligence).Systems that think/act rationally.

Some history:Beginning in the 40s/50s: interest in solvingoperations research problems,Great expectations in the 60s/70s; great difficultiesin the 80s; some maturity after 1990.

Topics: Search and optimization; Knowledgerepresentation (certain, uncertain); Planning;Machine Learning; Vision, sensing, acting,robotics; Communication and language.

Page 9: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Digression on Turing...

Turing test proposed in 1950: the imitationgame. Many arguments; however, notnecessary, not sufficient.

But note Turing’s comments on learning:

Instead of trying to produce aprogramme to simulate the adultmind, why not rather try to produceone which simulates the child’s? Ifthis were then subjected to anappropriate course of education onewould obtain the adult brain. (...)The idea of a learning machine mayappear paradoxical to some readers.

Page 10: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Machine learning in AI

At first, probabilities were consideredinadequate for AI. Machine learning focused onanalogical learning, abduction and otherlogically motivated techniques.

Situation changed with the appearance ofprobabilistic and decision models; theavailability of data and the success of neuralnetworks learning from data.

Machine learning gradually became interwinedwith statistics.

Today, statistical machine learning pervadesmany areas of AI.

Page 11: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Statistical learning and Statistics

In many ways, statistical learning seems to bejust Statistics; however, it is worth noting thatthe former is interested in computer programs,and the latter is more interested in humanconsultants.

Also, statistical learning mostly focuses onbuilding some function that approximates asystem, so as to predict or to classify outcomes;there is less focus on modeling and interpretingthe model that captures reality — but there ismore and more overlap between areas.

Also, many differences are just in usingdifferent names for the same concepts.

Page 12: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Tibshirani: Glossary

Machine learning Statistics

network, graphs model

weights parameters

learning fitting

generalization test set performance

supervised learning regression/classification

unsupervised learning density estimation, clustering

large grant = $1,000,000 large grant= $50,000

nice place to have a meeting: nice place to have a meeting:

Snowbird, Utah, French Alps Las Vegas in August

1

Page 13: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Other connections

Most of supervised learning is classification,also the object of pattern recognition.

Most of reinforcement learning parallels oldefforts in control theory, with new ideas andtwists (and focus on computers).

Page 14: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

The success of statistical machine learning

Automated classification and clustering haveproduced tools that enhance businessintelligence, recommendation systems, usersupport systems, autonomous financial systems.

Natural language and text processing is heavilybased on statistical learning / deep learning (forinstance, Google’s translator, IBM’s Watson).

Computer vision is dominated by classificationand clustering techniques; recently there hasbeen success with deep learning techniques.

Robotics currently bases navigation andlocalization on statistical models that requirestatistical learning.

Page 15: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Read:

A. Halevy, P. Norvig, F. Pereira, Theunreasonable effectiveness of data, IEEEIntelligent Systems, 24(2):8–12, 2009.

Page 16: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Big success: translation

Based on statistical techniques, and now ondeep learning.

Old times: English to Russian to English...

The spirit is willing, but the flesh isweak.

The vodka is strong, but the steak islousy.

Current success: data-based.

Page 17: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Big success: translation

Based on statistical techniques, and now ondeep learning.

Old times: English to Russian to English...

The spirit is willing, but the flesh isweak.The vodka is strong, but the steak islousy.

Current success: data-based.

Page 18: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

Big success: translation

Based on statistical techniques, and now ondeep learning.

Old times: English to Russian to English...

The spirit is willing, but the flesh isweak.The vodka is strong, but the steak islousy.

Current success: data-based.

Page 19: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 20: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 21: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 22: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 23: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 24: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.

Page 25: Machine Learning and Pattern Recognition *1cm Basics fileOther books D. Barber, Bayesian Reasoning and Machine Learning, Cambridge 2012. C. M. Bishop. Pattern Recognition and Machine

But... English - Uzbeque

The spirit is willing, but the flesh is weak.

Ruh istakka tayyor bo’lib, go’shti hafta edi.

The spirit was ready for the meal, and the fleshwas weeks.

Ruh go’shtga tayyor edi, go’sht esa bir nechahafta edi.

The spirit was ready for meat, and the meatwas only a few weeks old.

Ruh tantana qilishga tayyor edi, va bu mushakfaqat haftalik keksalikda edi.

The spirit was ready to celebrate, and thismuscle was only a weekly old man.