learning and evolution: lessons from the baldwin-effect georg theiner p747 complex adaptive systems...

Learning and Evolution:Lessons from the Baldwin-Effect

Georg Theiner

P747 Complex Adaptive Systems

March 11th, 2003

Outline

• A brief history of modern evolutionary biology

• What is the Baldwin Effect?

• Hinton & Nowlan's (1987) simulation

• JAVA-applet of BE

• The trade-offs between phenotypic plasticity and rigidity

• Subsequent studies

• Discussion

Lamarckian Evolution

• Published Philosophie Zoologique (1809) • Assumption: Change in the environment

causes changes in the needs of organisms living in that environment, which in turn causes changes in their behavior.

• Mechanisms of evolution– First Law: Use or disuse causes structures

(organs) to enlarge or shrink– Second Law: All such acquired changes are

heritable

• Example: long legs and webbed feet of wading birds, long neck of giraffe

Jean-Baptiste Lamarck

(1744-1829)

Darwinian Evolution

• Published The Origin of Species (1859)

• direct manipulation of one's genetic make-up impossible

• acquired characteristics are not directly passed on to offspring

• Mechanism of evolution:– Genetic variation in species through

random mutations – Natural selection operates on

phenotypes

Charles Darwin (1809-82)

Baldwinian Evolution

• Published "A New Factor in Evolution" (1896)

• Independently identified by Baldwin, Morgan, and Osborn in 1896

• New factor = phenotypic plasticity: the ability of an organism to adapt to its environment during its lifetime – Examples: ability to learn, to increase muscle

strength with exercise, to tan with exposure to sun

James Mark Baldwin

(1861-1934)

The Baldwin Effect

• A cluster of effects emerging from an interaction between 2 adaptive processes: – genotypic evolution of population (global search)– individual organism's phenotypic flexibility (local

search)

• Concerned with benefits and costs of lifetime learning

• lifetime learning can alter the genetic composition of an evolving population

• Hypothesized examples: – bird song (Simpson

1953)

– human language capacity (Pinker and Bloom 1990, Deacon 1997)

– consciousness, intelligence (Dennett 1991, 1995)

• learning capacity eventually becomes genetically encoded resembles Lamarckian sequence

• consistent with Darwinian mechanism for inheritance of traits

The Baldwin Effect, Step 1

• Evolutionary value of learning: accelerates evolution of an adaptive trait – As a result of mutation, an organism becomes capable of

learning how to do X

– Learning how to do X increases an organism's fitness

– Creates new selective pressures: because selection is now also working on the ability to perform X.

– Since the successful X-er has greater reproductive success, eventually the population may consist entirely of individuals able to learn how to do X.

The Baldwin Effect, Step 2

• Since learning can be costly, evolution favors rigid solutions in which acquiring X is part of an organism's genetic make-up (phenotypic rigidity) – Chance of reproductive success be proportional to how

quickly (reliably) X can be learnt

– New selective pressures cause competition between slow and fast learners

– Some individuals are innately better equipped for performing X, have reproductive advantage

– Eventually, capacity to X comes under direct genetic control = genetic assimilation, canalization of a trait (Waddington 1942)

Hinton & Nowlan Simulation (1987)

• Organism with neural net, 20 connections (phenes)

• 20 genes, one-to-one mapping on phenes • Each gene can have 3 alleles

– 0 = no connection – 1 = connection – ? = undetermined, learning

• one Good Phenotype: net works just in case all nodes are connected

• one Good Genotype: all 1's

"Needle in a haystack"-fitness landscape

• Evolutionary search modeled by GA

• Population of 1000 organisms

• Each allele is randomly initialized – p = 0.5 for ? – p = 0.25 for 0 and 1

• performs no better than random

fitness

combination of alleles

Problem of passing on the good genome

• Even if good solution discovered, not easily passed on

• unless fit organism finds very-close-to-fit mate, good genome will be destroyed

• expected number of good (immediate) offspring < 1 – can be bypassed in artificial simulations using

elitism operator, asexual reproduction

The importance of lifetime learning

• Augment evolutionary search with phenotypic plasticity

• Each organism performs 1000 learning trials during lifetime

• learning mechanism: random guess – if correct net is found, stop; else keep searching

• all phenes equally hard to learn • requires that organism recognizes the correct

solution

Determine next generation

• Use a version of Holland's GA (1975)

• Perform 1000 matings

• Selection algorithm: Roulette Wheel

• Select parents with probability proportional to fitness

• Fitness function F of an

individual A in a population i is F(A[i]) = 1 + [(G – g) / G] * (N – 1)

– G = number of allowed guesses

– g = number of guesses until solution found

– N = length of genotype

• in our case: 1 + (19n/1000)

• Wheel is spun twice (2 parents) for each mating, single offspring is generated

• cross-over point for combining parental alleles is chosen randomly

• offspring inherit only genome, never learnt connection settings

• Model parameters are fine-tuned – typical genotype has about 10 connections

genetically determined (0's or 1's)

– about 2^10 learning trials

Results 1 • Phenotypic plasticity smoothes "needle in a

haystack" fitness landscape

• by allowing an organism to explore neighboring regions of phenotypic space

• no unlikely saltations necessary to climb fitness peak

Results 2 • if no phenotypic

plasticity, about 2^20 (~ 1 million) organisms have to be produced to succeed in search

• with learning, finding the correct net requires only 16 x 1000 organisms

• little selection pressure to fix all phenes genetically

• Run with "Show all data" check-box to see frequency of 0's and ?'s

• Alter random number seed• Additional evolutionary operators

– mutation• chance (as specified in Advanced Options) that a given allele will be

flipped to either 0, 1, or ? (with equal p)

• maintain diversity, avoid local fitness maxima

– elitism• forces best individual of each population to be included unchanged

in next generation

JAVA-Simulation (Watson and Wiles 2001)

http://www.itee.uq.edu.au/~jwatson/bdemo/index.html

• Alternative Selection Algorithms– Ranked Roulette Wheel

• slice of wheel is proportional to ranked fitness• minimizes real differences in fitness• less selection bias for top-fit individuals

– Tournament• randomly picks 2 individuals from population, chooses

fitter one with p = k (as set in Advanced Options)• runs much faster• preserves genetic diversity much longer

• Standard combinations for optimization algorithms– Standard roulette without elitism– tournament with elitism

Fundamental insight of BE

• Trade-offs between learning (plasticity) and instinct (rigidity)

French and Messinger (1994)

• amount of plasticity and amount of benefit of learnt behavior is crucial to size of BE – having blue eyes vs. humming Middle C vs. winking

– x-axis: agent's normalized distance from Good Gene (number of bits differing by total number of bits)

– y-axis: probability of learning the Good Phene

• BE is significant only for a narrow window of plasticity

• if too low or too high, virtually no convergence towards Good Gene

Mayley (1996a, 1996b, 1997) • Possible selective disadvantage of learning:

Hiding Effect • phenotypic fitness differentials are compensated

by learning capacity • genetic differences are hidden from selection by

learning• trade-offs between Baldwin and Hiding effect

Discussion

• Unrealistic assumptions about fitness landscape– extremely rugged fitness landscape makes pure

evolutionary search very hard– How smooth are real search spaces?

• Unrealistic assumption about learning mechanism– instead e.g. use hillclimbing procedure for local

optimization– enhances BE only if learning procedure is not too

sophisticated, otherwise insufficient selective pressure for hard-wiring

• Learning trials are "cheap" genetic experiments– but biological reality of those two search

strategies differs in many respects

• Unrealistic assumption about genome-phenome mapping– mapping could be one-to-many– genetic specification and successful guessing

of a trait are treated interchangeably– transformation of phenotype to genotype

(development) is trivialized

• Do we need an explicit fitness function?– French & Messinger (1994): introduce spatial

dimension– consider 3 areas of plasticity: Good Phene =

more efficient metabolism, movement, reproduction

– world determines fitness of a given genotype

• Using simple models to understand complex phenomena– Controlled experiments are practically

unfeasible– How simple is too simple?

Selective Bibliography on BEA bibliography on BE (last update: 2001)

http://www.cs.bath.ac.uk/~jjb/web/baldwin.html

An online JAVA-simulation of BE

http://www.itee.uq.edu.au/~jwatson/bdemo/index.html

requires JAVA version 1.3.1 or greater

Ancel, L. (2000)

Undermining the Baldwin Expediting Effect: Does Phenotypic Plasticity Accelerate Evolution? Theoretical Population Biology, 58, 307-19.

http://www.santafe.edu/~ancel/PAPERS/TPB.pdf

Baldwin, J.M. (1896)

A New Factor in Evolution, American Naturalist, 30, 441-51.

http://www.santafe.edu/sfi/publications/Bookinforev/baldwin.html

Belew, R.K. (1990)

Evolution, Learning, and Culture: Computational Metaphors for Adaptive Search, Complex Systems, 4, 11-49.

Downes, S. (2003)

Baldwin Effects and Expansion of the Explanatory Repertoire in Evolutionary Biology, in: Weber, B., and Depew, D.J., (eds.), loc.cit.

French, R., and Messinger, A. (1994)

Genes, phenes and the Baldwin effect, in: Brooks, R., and Maes, P. (eds.), Artificial Life IV, MIT Press, 277-82.

http://www.santafe.edu/~amessing/baldwin.ps

Hinton, G.E., and Nowlan, S.J. (1987)

How Learning Can Guide Evolution, Complex Systems, 1, 495-502.

[reprinted in Mitchell, M., and Belew, R. (eds.), Adaptive Individuals in Evolving Populations: Models and Algorithms (1996)]

http://www-advancedgec.ge.uiuc.edu/papers/Chap 25 Adaptive Individuals.pdf

Jones, M., and Konstam, A. (1999) The Use of Genetic Algorithms and Neural Networks to Investigate the Baldwin Effect, in: Carroll, J., and Hiddad, H. et al. (eds.), Proceedings of the 1999 ACM Symposium on Applied Computing, 275-79.

Ku, K., and Mak, M. (1998) Empirical Analysis of the Factors that Affect the Baldwin Effect, in: Eiben, A.E., and Baeck, T. et al. (eds.), Parallel Problem Solving From Nature, Springer, 481-90.

Mayley, G. (1996a) The evolutionary cost of learning. In Maes, P., Mataric, M., Meyer, J-A., Pollack, J., and Wilson, S. (Eds), From Animals to Animats: Proceedings of the Fourth International Conference on Simulation of Adaptive Behaviour, 458-467, MIT Press.

http://www.cogs.susx.ac.uk/users/gilesm/sab96.ps

Mayley, G., (1996) Landscapes, Learning Costs and Genetic Assimilation, Evolution, Learning, and Instinct: 100 Years of the Baldwin Effect, a Special Issue of Evolutionary Computation, 4(3), 1996. http://www.cogs.susx.ac.uk/users/gilesm/ec.ps

P.Turney, D. Whitley and R. Anderson (eds), Evolution, Learning, and Instinct: 100 Years of the Baldwin Effect, Special Issue of Evolutionary Computation, 4(3), 1996

Check out Table of contents: http://alife.ccp14.ac.uk/baldwin/baldwin/toc.html

Editorial with a short history of BE: http://alife.ccp14.ac.uk/baldwin/baldwin/editorial.html

Turney, P. (1996) Myths and legends of the Baldwin effect, in: Fogarty, T., and Venturini, G. (eds.), Proceedings of the ICML-96, 135-42.

ftp://ai.iit.nrc.ca/pub/iit-papers/NRC-39220.pdf

Waddington, C.H. (1942) Canalization of Development and the Inheritance of Acquired Characteristics, Nature, 150, 563-65.

Weber, B., and Depew, D.J. (eds.), Evolution and Learning : the Baldwin Effect Reconsidered, MIT Press, 2003.

on-order by B-Main Lib

learning and evolution: lessons from the baldwin-effect georg theiner p747 complex adaptive systems...

Documents

baldwin effect

darwinian evolution

learning capacity

baldwinian evolution

successful x

capable of learning

genetic assimilation

genetic variation