learning and evolution: lessons from the baldwin-effect georg theiner p747 complex adaptive systems...

31
Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th , 2003

Upload: roy-daniels

Post on 11-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Learning and Evolution:Lessons from the Baldwin-Effect

Georg Theiner

P747 Complex Adaptive Systems

March 11th, 2003

Page 2: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Outline

• A brief history of modern evolutionary biology

• What is the Baldwin Effect?

• Hinton & Nowlan's (1987) simulation

• JAVA-applet of BE

• The trade-offs between phenotypic plasticity and rigidity

• Subsequent studies

• Discussion

Page 3: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Lamarckian Evolution

• Published Philosophie Zoologique (1809) • Assumption: Change in the environment

causes changes in the needs of organisms living in that environment, which in turn causes changes in their behavior.

• Mechanisms of evolution– First Law: Use or disuse causes structures

(organs) to enlarge or shrink– Second Law: All such acquired changes are

heritable

• Example: long legs and webbed feet of wading birds, long neck of giraffe

Jean-Baptiste Lamarck

(1744-1829)

Page 4: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Darwinian Evolution

• Published The Origin of Species (1859)

• direct manipulation of one's genetic make-up impossible

• acquired characteristics are not directly passed on to offspring

• Mechanism of evolution:– Genetic variation in species through

random mutations – Natural selection operates on

phenotypes

Charles Darwin (1809-82)

Page 5: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Baldwinian Evolution

• Published "A New Factor in Evolution" (1896)

• Independently identified by Baldwin, Morgan, and Osborn in 1896

• New factor = phenotypic plasticity: the ability of an organism to adapt to its environment during its lifetime – Examples: ability to learn, to increase muscle

strength with exercise, to tan with exposure to sun

James Mark Baldwin

(1861-1934)

Page 6: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

The Baldwin Effect

• A cluster of effects emerging from an interaction between 2 adaptive processes: – genotypic evolution of population (global search)– individual organism's phenotypic flexibility (local

search)

• Concerned with benefits and costs of lifetime learning

• lifetime learning can alter the genetic composition of an evolving population

Page 7: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Hypothesized examples: – bird song (Simpson

1953)

– human language capacity (Pinker and Bloom 1990, Deacon 1997)

– consciousness, intelligence (Dennett 1991, 1995)

• learning capacity eventually becomes genetically encoded resembles Lamarckian sequence

• consistent with Darwinian mechanism for inheritance of traits

Page 8: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

The Baldwin Effect, Step 1

• Evolutionary value of learning: accelerates evolution of an adaptive trait – As a result of mutation, an organism becomes capable of

learning how to do X

– Learning how to do X increases an organism's fitness

– Creates new selective pressures: because selection is now also working on the ability to perform X.

– Since the successful X-er has greater reproductive success, eventually the population may consist entirely of individuals able to learn how to do X.

Page 9: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

The Baldwin Effect, Step 2

• Since learning can be costly, evolution favors rigid solutions in which acquiring X is part of an organism's genetic make-up (phenotypic rigidity) – Chance of reproductive success be proportional to how

quickly (reliably) X can be learnt

– New selective pressures cause competition between slow and fast learners

– Some individuals are innately better equipped for performing X, have reproductive advantage

– Eventually, capacity to X comes under direct genetic control = genetic assimilation, canalization of a trait (Waddington 1942)

Page 10: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Hinton & Nowlan Simulation (1987)

• Organism with neural net, 20 connections (phenes)

• 20 genes, one-to-one mapping on phenes • Each gene can have 3 alleles

– 0 = no connection – 1 = connection – ? = undetermined, learning

• one Good Phenotype: net works just in case all nodes are connected

• one Good Genotype: all 1's

Page 11: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

"Needle in a haystack"-fitness landscape

• Evolutionary search modeled by GA

• Population of 1000 organisms

• Each allele is randomly initialized – p = 0.5 for ? – p = 0.25 for 0 and 1

• performs no better than random

fitness

combination of alleles

Page 12: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Problem of passing on the good genome

• Even if good solution discovered, not easily passed on

• unless fit organism finds very-close-to-fit mate, good genome will be destroyed

• expected number of good (immediate) offspring < 1 – can be bypassed in artificial simulations using

elitism operator, asexual reproduction

Page 13: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

The importance of lifetime learning

• Augment evolutionary search with phenotypic plasticity

• Each organism performs 1000 learning trials during lifetime

• learning mechanism: random guess – if correct net is found, stop; else keep searching

• all phenes equally hard to learn • requires that organism recognizes the correct

solution

Page 14: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Determine next generation

• Use a version of Holland's GA (1975)

• Perform 1000 matings

• Selection algorithm: Roulette Wheel

• Select parents with probability proportional to fitness

• Fitness function F of an

individual A in a population i is F(A[i]) = 1 + [(G – g) / G] * (N – 1)

– G = number of allowed guesses

– g = number of guesses until solution found

– N = length of genotype

• in our case: 1 + (19n/1000)

Page 15: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Wheel is spun twice (2 parents) for each mating, single offspring is generated

• cross-over point for combining parental alleles is chosen randomly

• offspring inherit only genome, never learnt connection settings

• Model parameters are fine-tuned – typical genotype has about 10 connections

genetically determined (0's or 1's)

– about 2^10 learning trials

Page 16: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Results 1 • Phenotypic plasticity smoothes "needle in a

haystack" fitness landscape

• by allowing an organism to explore neighboring regions of phenotypic space

• no unlikely saltations necessary to climb fitness peak

Page 17: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Results 2 • if no phenotypic

plasticity, about 2^20 (~ 1 million) organisms have to be produced to succeed in search

• with learning, finding the correct net requires only 16 x 1000 organisms

• little selection pressure to fix all phenes genetically

Page 18: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Run with "Show all data" check-box to see frequency of 0's and ?'s

• Alter random number seed• Additional evolutionary operators

– mutation• chance (as specified in Advanced Options) that a given allele will be

flipped to either 0, 1, or ? (with equal p)

• maintain diversity, avoid local fitness maxima

– elitism• forces best individual of each population to be included unchanged

in next generation

JAVA-Simulation (Watson and Wiles 2001)

Page 19: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Alternative Selection Algorithms– Ranked Roulette Wheel

• slice of wheel is proportional to ranked fitness• minimizes real differences in fitness• less selection bias for top-fit individuals

– Tournament• randomly picks 2 individuals from population, chooses

fitter one with p = k (as set in Advanced Options)• runs much faster• preserves genetic diversity much longer

• Standard combinations for optimization algorithms– Standard roulette without elitism– tournament with elitism

Page 20: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Fundamental insight of BE

• Trade-offs between learning (plasticity) and instinct (rigidity)

Page 21: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

French and Messinger (1994)

• amount of plasticity and amount of benefit of learnt behavior is crucial to size of BE – having blue eyes vs. humming Middle C vs. winking

– x-axis: agent's normalized distance from Good Gene (number of bits differing by total number of bits)

– y-axis: probability of learning the Good Phene

Page 22: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• BE is significant only for a narrow window of plasticity

• if too low or too high, virtually no convergence towards Good Gene

Page 23: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Mayley (1996a, 1996b, 1997) • Possible selective disadvantage of learning:

Hiding Effect • phenotypic fitness differentials are compensated

by learning capacity • genetic differences are hidden from selection by

learning• trade-offs between Baldwin and Hiding effect

Page 24: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Discussion

• Unrealistic assumptions about fitness landscape– extremely rugged fitness landscape makes pure

evolutionary search very hard– How smooth are real search spaces?

• Unrealistic assumption about learning mechanism– instead e.g. use hillclimbing procedure for local

optimization– enhances BE only if learning procedure is not too

sophisticated, otherwise insufficient selective pressure for hard-wiring

Page 25: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Learning trials are "cheap" genetic experiments– but biological reality of those two search

strategies differs in many respects

• Unrealistic assumption about genome-phenome mapping– mapping could be one-to-many– genetic specification and successful guessing

of a trait are treated interchangeably– transformation of phenotype to genotype

(development) is trivialized

Page 26: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

• Do we need an explicit fitness function?– French & Messinger (1994): introduce spatial

dimension– consider 3 areas of plasticity: Good Phene =

more efficient metabolism, movement, reproduction

– world determines fitness of a given genotype

• Using simple models to understand complex phenomena– Controlled experiments are practically

unfeasible– How simple is too simple?

Page 27: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Selective Bibliography on BEA bibliography on BE (last update: 2001)

http://www.cs.bath.ac.uk/~jjb/web/baldwin.html

An online JAVA-simulation of BE

http://www.itee.uq.edu.au/~jwatson/bdemo/index.html

requires JAVA version 1.3.1 or greater

Ancel, L. (2000)

Undermining the Baldwin Expediting Effect: Does Phenotypic Plasticity Accelerate Evolution? Theoretical Population Biology, 58, 307-19.

http://www.santafe.edu/~ancel/PAPERS/TPB.pdf

Baldwin, J.M. (1896)

A New Factor in Evolution, American Naturalist, 30, 441-51.

http://www.santafe.edu/sfi/publications/Bookinforev/baldwin.html

Page 28: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Belew, R.K. (1990)

Evolution, Learning, and Culture: Computational Metaphors for Adaptive Search, Complex Systems, 4, 11-49.

Downes, S. (2003)

Baldwin Effects and Expansion of the Explanatory Repertoire in Evolutionary Biology, in: Weber, B., and Depew, D.J., (eds.), loc.cit.

French, R., and Messinger, A. (1994)

Genes, phenes and the Baldwin effect, in: Brooks, R., and Maes, P. (eds.), Artificial Life IV, MIT Press, 277-82.

http://www.santafe.edu/~amessing/baldwin.ps

Hinton, G.E., and Nowlan, S.J. (1987)

How Learning Can Guide Evolution, Complex Systems, 1, 495-502.

[reprinted in Mitchell, M., and Belew, R. (eds.), Adaptive Individuals in Evolving Populations: Models and Algorithms (1996)]

http://www-advancedgec.ge.uiuc.edu/papers/Chap 25 Adaptive Individuals.pdf

Page 29: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Jones, M., and Konstam, A. (1999) The Use of Genetic Algorithms and Neural Networks to Investigate the Baldwin Effect, in: Carroll, J., and Hiddad, H. et al. (eds.), Proceedings of the 1999 ACM Symposium on Applied Computing, 275-79.

Ku, K., and Mak, M. (1998) Empirical Analysis of the Factors that Affect the Baldwin Effect, in: Eiben, A.E., and Baeck, T. et al. (eds.), Parallel Problem Solving From Nature, Springer, 481-90.

Mayley, G. (1996a) The evolutionary cost of learning. In Maes, P., Mataric, M., Meyer, J-A., Pollack, J., and Wilson, S. (Eds), From Animals to Animats: Proceedings of the Fourth International Conference on Simulation of Adaptive Behaviour, 458-467, MIT Press.

http://www.cogs.susx.ac.uk/users/gilesm/sab96.ps

Page 30: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Mayley, G., (1996) Landscapes, Learning Costs and Genetic Assimilation, Evolution, Learning, and Instinct: 100 Years of the Baldwin Effect, a Special Issue of Evolutionary Computation, 4(3), 1996. http://www.cogs.susx.ac.uk/users/gilesm/ec.ps

P.Turney, D. Whitley and R. Anderson (eds), Evolution, Learning, and Instinct: 100 Years of the Baldwin Effect, Special Issue of Evolutionary Computation, 4(3), 1996

Check out Table of contents: http://alife.ccp14.ac.uk/baldwin/baldwin/toc.html

Editorial with a short history of BE: http://alife.ccp14.ac.uk/baldwin/baldwin/editorial.html

Page 31: Learning and Evolution: Lessons from the Baldwin-Effect Georg Theiner P747 Complex Adaptive Systems March 11 th, 2003

Turney, P. (1996) Myths and legends of the Baldwin effect, in: Fogarty, T., and Venturini, G. (eds.), Proceedings of the ICML-96, 135-42.

ftp://ai.iit.nrc.ca/pub/iit-papers/NRC-39220.pdf

Waddington, C.H. (1942) Canalization of Development and the Inheritance of Acquired Characteristics, Nature, 150, 563-65.

Weber, B., and Depew, D.J. (eds.), Evolution and Learning : the Baldwin Effect Reconsidered, MIT Press, 2003.

on-order by B-Main Lib