cooperative behavior of artificial neural agents based on ......supervised learning : parameters...

21
WETICE 2008 - Workshop on Coordination Models and Applications (CoMA) Cooperative behavior of artificial neural agents based on evolutionary architectures Alessandro Londei, Piero Savastano, Marta Olivetti Belardinelli Interuniversity Center for Research on Cognitive Processing In Natural and Artificial Systems - ECONA, “Sapienza” University of Rome Email: [email protected]

Upload: others

Post on 21-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Cooperative behavior of artificial

neural agents based on evolutionary

architecturesAlessandro Londei, Piero Savastano, Marta Olivetti Belardinelli

Interuniversity Center for Research on Cognitive Processing

In Natural and Artificial Systems - ECONA, “Sapienza” University of Rome

Email: [email protected]

Page 2: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Empiricism vs. Innatism

Constructivism: Knowledge cannot be instructed by a teacher, it can only be constructed

by a learner (learning by environment)

Instructivism: Knowledge is instructed by a teacher (learning by a supervisor)

Empiricism

Nativism: Knowledge (skills or abilities) is hard-wired into the brain at birth.

Innatism

Certain cognitive modules are built-in at birth that allow to learn and acquire certain

skills as language (N. Chomsky, J. Fodor) and other cognitive functions

The human mind of a newborn is not a tabula rasa

Page 3: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Empiricism in Neural Networks

Supervised learning: parameters (synaptic weights) are modified until the response is correct

At the end of the learning phase, knowledge representation is given by weights distribution…

…and this is only true for the chosen neural architecture!

Stim

ulu

s

Response

Supervisor

Architecture plays a crucial role in the ability of a network to learn a specific behavior

Page 4: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Nativism in Neural Networks?

Evolutionary robotics: agents have a neural brain (fixed feed-forward architecture)

whose weights are described by an evolutionary process (genetic algorithm) [Nolfi 1994,

Belew 1992]S

tim

ulu

s

Response

Genetic coding

Fitness to the environment

• architecture is not considered as an evolutionary parameter

• phenotype is only given by synaptic weights

Low plausibility

No internal dynamical evolution

Absent learning phase (almost…)

Page 5: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Nativism in Neural Networks!

Topology and Weight Evolving Artificial Neural Networks (TWEANNs): optimization of

neural systems through augmenting topologies [Stanley 2002]

Drawback

One gene for each connection

Minimal network means weak robustness to failure

Page 6: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

An improved nativist approach

Genetic

populationfitness crossing-over mutations

New

population

Plasticity

selection

Fixed feed-forward

architecture

Architecture

selectionDynamical recurrent

architecture

Food

Page 7: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Genetic Coding for a Neural Architecture

Phenotype:

- 8 sensorial inputs for: directional food detection (3), directional obstacle detection (2),

food proximity (2), hunger proprioceptor (1)

- 2 motor outputs

Neural Network:

- 40 max excitatory/inhibitory neurons with

sigmoidal activation function

- Hebb (through time) learning rule

- Neural transmission delay

Genotype:

- 7-length slot per neuron

- labeling for connection structure

- redundancy (forbidden codes)

Page 8: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Neuron and Network Model

yn= ! x

n( ) =1

1" exp # $ xn"T( )( )

Neuron:

- Sigmoidal activation function

- Beta and offset described in genetic code

- Excitatory or inhibitory (from label)

!!

y

1t ! "

1( )

y

kt ! "

k( )

y

Nt ! "

N( )

Network:

- Connections given by labels similarity (Hamming distance)

- Distance defines time-delay

- PSP given by synaptic weight and delayed inputs

- Hebbian learning rule !w

nk= e"#t

$ yn

t( ) $ ykt " %

k( )

x

nt( ) = w

kn! y

kt " #

k( )k$S

%

Page 9: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Boolean Gates - AND

01

11

10

00

01

11

01

00

01

10

11 11 11

0010

Page 10: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Boolean Gates - XOR

10

11

00

01

10

11

10

01

10

110000

11 10

Page 11: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Artificial Neural Agentslif

etim

e

generations

Best agent fitness

Worst agent fitness

Average fitness

Environment: a square arena containing 4 agents and 10 incremental food (max 80).

Arena sides are sticky and dangerous for the agents. Food increases lifetime by 200

steps and sides wounds the agents (lifetime decreases by 4 at each step)

Fitness: is given by the sum of all agents lifetime.

fitness = LTk

k

!

Page 12: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Agents Behavior

Algorithm demonstration

Gen #1: casual behavior

Gen #200: sensitivity to food and obstacle

Gen #400: emergence of a strategy for area exploration

Gen #1000: global optimization of social behavior

Page 13: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Small World

Characteristic Path Length L(p)

typical separation between 2 vertices in the graph

Clustering Coefficient C(p)

cliquishness of a typical neighborhood

example C.Elegans: Lnorm=0.85, Cnorm=0.20

Watts Nature, 1998

Averaged over the first 10 best agents of each generation

Page 14: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Chaos

Rabbit Olfactory Bulb - Freeman IEEE

Trans. on Circ. and Syst., 1988

3D evolution of 3 inner neurons -

autonomous case

Page 15: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Lyapunov Exponent

Features - Chaos

! = limT"#

1

Tlog

!

$ T( )!

$ 0( )

Page 16: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Experimentum Crucis

Page 17: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Experimentum Crucis

Page 18: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Cooperation

Fitness takes into account the “global ability to survive”

fitness = LTk

k

! "1

2LT

k" LT

jk , j

!

Page 19: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Cooperation

Page 20: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Features - Cooperation

Agents behavior increases global lifetime by means of a greater modulation of the speed:

agents tend to stop after feeding and begin to search for food again after a “refractory”

transient

Difference between speed modulations is significant: F1,99=13.43, p<4·10-4

Improved agent models may be implemented in order to evidence put in evidence more

realistic cooperative behavior (communication and language emergence)

Page 21: Cooperative behavior of artificial neural agents based on ......Supervised learning : parameters (synaptic weights) are modified until the response is correct At the end of the learning

WETICE 2008 - Workshop on Coordination Models and Applications (CoMA)

Conclusion

“A multi-agent system is one composed of multiple interacting software

components known as agents, which are typically capable of co-operating to

solve problems that are beyond the abilities of any individual member”

• Cooperation and social abilities depend on the agent phenotype and the

environment the agents are embedded in

• This neural model allows an improvement of the neural networks world in order to

be a candidate for MA systems, in respect of a native and unsupervised approach

• A way to reveal and analyze the complex aspects of natural neural dynamics