ba yes i an multi agent beamer

Upload: ashar-ahmad

Post on 04-Apr-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    1/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Bayesian Multi Agent Systems

    Ahmad AsharGroup 256 Modelling and Simulation

    December 17, 2012

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    2/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Table of contents

    1 Motivation and AimWhat is BayesianBayesian vs FrequentistAvoiding Over fitting

    Personal Motivation2 Applications

    3 MAS:A Brief OverviewDecision TheoryGame Theory

    4 A Bayesian FrameworkMarkov Decision Process for Single Agent

    Value Functions and Bellman Equations

    MDP for MultiAgent System in Reinforcement learning

    5

    ConclusionsAhmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    3/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    The questions

    Why Bayesian ?What is Bayesian ?

    Why is it an important paradigm ?

    Why am I studying this?

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    4/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    The Answers..

    What is Bayesian ?

    p(|D) =p(D|)p(0)

    == p(D|)p()d

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    M i i d Ai

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    5/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    Bayesian vs Frequentist

    Why Bayesian ?

    Figure: Human Evolution

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    M ti ti d Ai

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    6/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    Why use the Bayesian Paradigm

    Bayesian Marginalization avoids the problem of overfitting

    .. the phenomenon of over-fitting is really an unfortunateproperty of maximum likelihood and does not arise when wemarginalize over parameters in a Bayesian setting.

    ..It is a property of the marginal likelihood that itautomatically incorporates a trade-off between model fit and

    model complexity Above quotes courtesy Bishops book on Pattern Recognition

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    7/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    The details..

    Overfitting occurs when a statistical model describes randomerror or noise instead of the underlying relationship[Wikipedia]

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    8/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Avoiding Over fittingPersonal Motivation

    Why am I studying this?

    Interest in Statistical Machine Learning

    Bayesian Non-parametrics: possibly the hottest thing inMachine Learning today

    My interests in Gaussian Process Regression,Classification

    Research includes tools from1 Statistical Physics : Mean Field Approximations

    2 Data Analysis: Kernel Methods3 Neural Computation : ANN

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    9/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Applications

    Data Analysis - Google,Microsoft,Yahoo!

    Fraud Detection

    Geo-statistical applications

    Intelligent systems (Robots, Recommender sytems :Netflixprize)

    Bioinformatics, Cheminformatics, NLP, Phylogenetic trees

    Information Theory, Decision Theory

    Practically anything which has a large data set!

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    10/15

    Motivation and AimApplications

    MAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Decision TheoryGame Theory

    MAS :Decision Theory

    Decision Theory

    Maximizing Utility over various policies (series of steps)

    State Percept Action NewState

    Found by searching various possible states s

    Search problem Decision Theory problem

    Reward R : S S A R

    Distribution over actions p(a|s) for each s S where a A(s)

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    11/15

    ApplicationsMAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Decision TheoryGame Theory

    MAS :Game Theory

    Game Theory

    Framework for Co-operation and CommunicationTreat MAS as a game with agents as players

    Use mathematical results in co-operative games withincomplete information as strategies for agents

    Highly complex and advanced field of research

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    12/15

    ApplicationsMAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Markov Decision Process for Single AgentMDP for MultiAgent System in Reinforcement learning

    Bayesian Decision Making: Markov Decision Process

    State transition properties depend on the current state andaction

    State transition probabilities are multinomial distributions

    When the rewards or transition probabilities are unknown theproblem reduces to that of reinforcement learning

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    13/15

    ApplicationsMAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Markov Decision Process for Single AgentMDP for MultiAgent System in Reinforcement learning

    Value Functions and Bellman Equations

    Reward function:N

    t=0

    trt+1

    where > 0

    State value function

    Vp(s) = E[

    t=0

    trt+1|so = s]

    Bellman Equation

    aA(s)

    p(a|s)

    sS

    p(s|s, a)[r(s, a, s) + Vp(s)]

    State Action Value Function

    =

    p(s|s, a)[r(s, a, s) + Vp(s)]Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and Aim

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    14/15

    ApplicationsMAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Markov Decision Process for Single AgentMDP for MultiAgent System in Reinforcement learning

    A Bayesian Framework : MDP for MAS in RL

    Acting, Coordinating should be integrated

    One agents optimal policy affects the others

    Solution: Put Bayesian Priors over MAS optimal policySolution: Likelihood could be factorized

    Bayesian inference could be carried out, Often approximations

    Have to model some agents joint behaviour for Machine

    Learning of the parametersProblem with scaling :With more agents Parameters couldtend to

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems

    Motivation and AimA li i

  • 7/29/2019 Ba Yes i an Multi Agent Beamer

    15/15

    ApplicationsMAS:A Brief OverviewA Bayesian Framework

    Conclusions

    Conclusions

    MAS : Mixing of Decision Theory and Game theory

    Making MAS an extension of probabilistic single agents withincomplete information

    Using Bayesian Inference for optimal policy determination

    Challenge: To fully develop a detailed Bayesian MAS

    mathematically

    Ahmad Ashar Group 256 Modelling and Simulation Bayesian Multi Agent Systems