introduction to the aa2 course -...

39
Course Information Advanced Machine Learning Practicalities and Next Lectures Introduction to the AA2 Course Davide Bacciu Dipartimento di Informatica Università di Pisa [email protected] Machine Learning: Neural Networks and Advanced Models (AA2)

Upload: truongdiep

Post on 12-Jul-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

Introduction to the AA2 Course

Davide Bacciu

Dipartimento di InformaticaUniversità di [email protected]

Machine Learning: Neural Networks and Advanced Models(AA2)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Objectives

Train machine learning (ML) specialists capable ofdesigning novel learning modelsdeveloping advanced applications using ML solutions

Address complex data domainsNoisy, hard-to-interpret, semantically rich information(natural language, images, videos)Non-vectorial relational information (sequences, trees,graphs)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Expected Outcome

Students completing the course are expected toGain in-depth knowledge of advanced machine learningtopicsUnderstand their theory and applicationsBe able to individually read, understand and discussresearch works in the field

The course is targeted atStudents specializing in

Machine learning and computational intelligenceData mining and information retrievalRoboticsBioinformatics, ...

Students seeking machine learning theses

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Prerequisites

Knowledge of machine learning fundamentalsHaving taken the AA1 course.....or discuss your ML skills with me

It will be of great help if you have knowledge ofAlgebra and calculusProbability and statistics

...and, above all, adisposition not to geteasily scared by math!

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Organization

The course is articulated in four partsRecurrent/recursive neural networksProbabilistic Learning and Graphical ModelsKernel methodsAdvanced Applications

Introduce learning models with an incremental approach: fromsequential data processing to complex structured domains

Guest seminars by researchers and (possibly) companies

Alessio Micheli (@di.unipi)Claudio Gallicchio (@di.unipi)Alexander Schulz(@uni-bielefeld)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Topics

Recurrent neural networksReservoir computing

Probabilistic Learning and Graphical ModelsHidden Markov modelsMarkov random fieldsLatent variable models

Non-parametric and kernel-based methodsUnsupervised learning for complex data

Learning in structured domains (sequences, trees andgraphs)Emerging topics and applications in machine learning

Deep learning, machine vision, ChemInformatics,BioInformatics, AAL

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Course Instructor

Davide BacciuEmail - [email protected]

Tel - 050 2212749Office - Room 367, Dipartimento di Informatica

Office hours - Tuesday 17-19 (email me!)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Course Schedule

Weekly Timetable:

Day Time RoomMonday 16-18 C1Thursday 16-18 C1

Talk now if you need to change course weekly schedule!

Course comprises 24 lecturesNot enough dates in the academic calendarWill need to accommodate some (2,3) extra dates

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Course Homepage

Reference Webpage on Didawiki:

http://didawiki.cli.di.unipi.it/doku.php/magistraleinformatica/aa2/start

Here you can findCourse informationLecture slidesArticles and course materials

You can subscribe to get RSS feeds on page updates

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

OverviewPractical Info

Reference Books

No official textbook

A standard neural networks reference book:

Simon O. Haykin, Neural Networks and Learning Machines,Pearson (2008)

Probabilistic learning (free pdf, with code):

David Barber, Bayesian Reasoning and Machine Learning,Cambridge University Press (2012)

Inference and learning (free pdf):

David J.C. MacKay, Information Theory, Inference, andLearning Algorithms, Cambridge University Press (2003)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Complex Data

BioChemistry: protein sequences and molecular graphs

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

High-throughput Bio-Medical Data

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Structured Documents and Relational Data

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Social Data

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Natural Language

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Machine Vision

Understanding if a man is inside/outside the car impacts sceneinterpretation

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Complex data

Addressing complex tasks in challenging applicationsNeed to design novel machine learning modelsNoisy, high-dimensional, semantically-rich data

Information is no longer a single atomic pieceNeed to take context into accountRelational data

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

The Course in One Word

The secret word is structures

Structured DataCompound information representing the relationshipsbetween its composing elementsDifferent degrees of complexity and expressivity (classesof structures)

Structure in Learning ModelsRepresent the model components and the information theyshare (context)Change model expressivity and the complexity of inferenceand learningMay follow the structure of data and can be dynamical

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Data (Flat)

Data is the set of available observations (facts) of theprocess we want to model

Need an appropriate representation to capture relevantinformationData structure defines its representation capabilities

Flat dataPopulation: Fixed-size vectors of featuresRepresent a set of relevant object propertiesPatient records, DNA micro-arrays, images, ...

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Data (Structured)

Structured dataPopulation: a set of structures of variable dimensionRepresent features as well as relationships between themSequences, trees, graphs

Examples: DNA sequences, language sentences, imagesegmentations, molecules, ...

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Learning Tasks (Flat)

Approximate general functions f from observations (x , f (x))where x and f (x) are vectors

Face recognitionx → face image (descriptor)f (x) → person identifier

Handwriting recognitionx → character image (descriptor)f (x) → character/digit

Patient diagnosisx → patient recordf (x) → disease classification

Spam detectionx → email bag of wordsf (x) → spam/no-spam classification

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Learning Tasks (Structured)

Approximate general trasductions f from observations (x , f (x))where at least x is structured data

Protein foldingx → amino-acids sequencef (x) → sequence of atoms’ 3D coordinates

Machine translationx → English sentence parse treef (x) → Italian sentence parse tree

Molecule characterizationx → molecular graphf (x) → toxicity prediction

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Learning Model Structure (I)

Study how varying the structure of a learning model influencesits computational capabilities and complexity

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Learning Model Structure (II)

In some cases the learning model structure conforms with thedata structure

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Ambient Assisted Living

Learning models for the analysis of sensor timeseriesEU FP7 RUBICON - Cognitive Robotic EcologiesEU FP7 DOREMI - Smart environment for empoweringelderly lifestyle

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Machine Learning for BioInformatics

Exploratory analysis of bio-medical dataPredictive models for personalized medicine

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Machine Learning for ChemInformatics

Learning to predict functional properties from chemicalstructures

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Machine Vision

Learning structured image representations

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

MotivationsStructuresApplications and Ongoing Projects

Machine Learning for Ethology

Pattern discovery from sensor data for endangered speciespreservation

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Exams

The (quick) way (there is no such thing as an easy way)Midterm Assign. - A short presentation on a work from a

reading list to be performed at midtermFinal Project - A written report on a topic of interest for the

course (and for you)Oral Presentation - A presentation of the final project

The alternative way (for working students, those not attendingclasses and those who fail the other way)Final Project - A written report on a topic of interest for the

course, as aboveOral Exam - A presentation of the final project plus

examination on the course program

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Midterm Assignment

Pick one article from a reading list and prepare a shortpresentation (20 minutes) to be given in front of the classThe presenter should

Answer reading-list’s questions on the articleInclude a mathematical derivation of a theoretical result orlearning algorithm in the paperBe able to answer my (and your collegues’) questions onthe presentation

TimelineReading list published mid-marchArticle bidding by end of marchPresentation during midterm exams (tentative date:13/04/2015)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Final Project (I)

Choose from a set of suggested topics or propose yourown topic of interest and prepare a report (6-10 pages)Timeline (quick way)

Suggested topics list: early-mayChoose project: strictly before the last lecture (late may)Report delivery: 7 days before the oral (strict)

Timeline (alternative way)Choose project: email me to arrange a topicReport delivery: By the standard exam date (appello)(strict)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Final Project (II)

Possible project typesSurvey Read at least three relevant and distinct

papers on a topic and write a report: not asimple summary, rather try to find connectionsbetween the works and highlight interestingopen problems

Original Propose a research project and develop it(with my help) as much as possible: musthave a substantial innovative component

Software Develop a well-written, tested andcommented software (with doc) implementinga non-trivial learning model and/or anapplication relevant for the course

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Oral Presentation (Quick Way)

Prepare a presentation (40 minutes) on the final projectand discuss it in front of me (and anybody interested)The presenter should

Summarize the ideas in the reportBe able to answer my questions on the presentationBe confident on the course topics: no mathematicalderivations asked, but questions on the concepts seen inthe course may pop-up

TimelineArrange a presentation date with (10 days in advance)Handle your report at least 7 days before presentationSend me your slides the day before the presentation

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Oral Exam (Alternative Way)

Prepare a presentation (40 minutes) on the final projectfollowing the same rules and timeline of the quick wayIn addition, candidates will be subject to an oral exam withquestions covering the course contents

Mathematical derivations and details of the modelsdiscussed during the lectures will be askedExamination will be performed jointly with the presentation

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

How to get past this course?

Grading (quick way)(GP + GO)

2+ GM

Midterm: GM ∈ [0,3] (0 means failure)Project: GP ∈ [0,30] (< 18 means failure)Oral: GO ∈ [0,30] (< 18 means failure)

Grading (alternative way)(GP + GO)

2Project: GP ∈ [0,30] (< 18 means failure)Oral: GO ∈ [0,32] (< 18 means failure)

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Upcoming..

Recurrent Neural Networks ModuleExtend connectionist paradigm with a new class of neuralmodels capable of representing the history of the input signalsin its internal state

TopicsIntroduction to Recurrent Neural NetworksProcessing history/context dependant input signalsLearning in Recurrent Neural NetworksRecurrent/recursive neural networks for structuresReservoir Computing for sequences

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Next Lecture

Introduction to Recurrent Neural NetworksNeural networks for history/context dependant taskRepresenting time in neural networks: explicit and implicitapproachElman networksApplication examples

Guest lecture by Alessio Micheli

Course InformationAdvanced Machine Learning

Practicalities and Next Lectures

ExamNext LecturesContacts

Contacts and Information

Course Didawiki

http://didawiki.cli.di.unipi.it/doku.php/magistraleinformatica/aa2/start

Before leaving please write down your email address for thecourse mailing list

Questions?