Title: (1) Learning (2) The proximity principle and
1 (1) Learning (2) The proximity principle
and evolutionary learning
Ling 411 17
2Schedule of Presentations
Tu Apr 13 Th Apr 15 Tu Apr 20
Th Apr 22
3Operations in relational networks
REVIEW
- Relational networks are dynamic
- Activation moves along lines and through nodes
- Links have varying strengths
- A stronger link carries more activation, other
things being equal - All nodes operate on two principles
- Integration
- Of incoming activation
- Broadcasting
- To other nodes
4Operation of the Networkin terms of cortical
columns
Review
- The linguistic system operates as distributed
processing of multiple individual components - Nodes in an abstract model
- These nodes are implemented as cortical columns
- Columnar Functions
- Integration A column is activated if it receives
enough activation from other columns - Can be activated to varying degrees
- Can keep activation alive for a period of time
- Broadcasting An activated column transmits
activation to other columns - Exitatory contribution to higher level
- Inhibitory dampens competition at same level
5Additional operations Learning
- Links get stronger when they are successfully
used (Hebbian learning) - Learning consists of strengthening them
- Hebb 1948
- Threshold adjustment
- When a node is recruited its threshold increases
- Otherwise, nodes would be too easily satisfied
6Requirements that must be assumed(implied by the
Hebbian learning principle)
- Links get stronger when they are successfully
used (Hebbian learning) - Learning consists of strengthening them
- Prerequisites
- Initially, connection strengths are very weak
- Term Latent Links
- They must be accompanied by nodes
- Term Latent Nodes
- Latent nodes and latent connections must be
available for learning anything learnable - The Abundance Hypothesis
- Abundant latent links
- Abundant latent nodes
7Support for the abundance hypothesis
- Abundance is a property of biological systems
generally - Cf. Acorns falling from an oak tree
- Cf. A sea tortoise lays thousands of eggs
- Only a few will produce viable offspring
- Cf. Edelman silent synapses
- The great preponderance of cortical synapses are
silent (i.e., latent) - Electrical activity sent from a cell body to its
axon travels to thousands of axon branches, even
though only one or a few of them may lead to
downstream activation
8Learning The Basic Process
Latent nodes
Latent links
Dedicated nodes and links
9Learning The Basic Process
Latent nodes
Let these links get activated
10Learning The Basic Process
Latent nodes
Then these nodes will get activated
11Learning The Basic Process
That will activate these links
12Learning The Basic Process
This node gets enough activation to satisfy its
threshold
13Learning The Basic Process
This node is therefore recruited
These links now get strengthened and the nodes
threshold gets raised
B
A
14Learning The Basic Process
This node is now dedicated to function AB
AB
B
A
15Learning
Next time it gets activated it will send
activation on these links to next level
AB
B
A
16Learning more terms
AB
Child nodes Potential Actual
Parent nodes
B
A
17Learning Deductions from the basic process
- Learning is generally bottom-up.
- The knowledge structure as learned by the
cognitive network is hierarchical has multiple
layers - Hierarchy and proximity
- Logically adjacent levels in a hierarchy can be
expected to be locally adjacent - Excitatory connections are predominantly from one
layer of a hierarchy to the next - Higher levels will tend to have larger numbers of
nodes than lower levels
18Learning in cortical networksA Darwinian process
- A trial-and-error process
- Thousands of possibilities available
- The abundance hypothesis
- Strengthen those few that succeed
- Neural Darwinism (Edelman)
- The abundance hypothesis
- Needed to allow flexibility of learning
- Abundant latent nodes
- Must be present throughout cortex
- Abundant latent connections of a node
- Every node must have abundant latent links
19Learning Enhanced understanding
- This basic process is not the full story
- The nodes of this depiction
- Are they minicolumns, maxicolumns, or what?
- Most likely, a bundle of contiguous columns
- Perhaps usually a maxicolumn or hypercolumn
20Columns of different sizes
REVIEW
- Minicolumn
- Basic anatomically described unit
- 70-110 neurons (avg 75-80)
- Diameter barely more than that of pyramidal cell
body (30-50 µ) - Maxicolumn (term used by Mountcastle)
- Diameter 300-500 µ
- Bundle of 100 or more contiguous minicolumns
- Hypercolumn up to 1 mm diameter
- Can be long and narrow rather than cylindrical
- Bundle of contiguous maxicolumns
- Functional column
- Intermediate between minicolumn and maxicolumn
- A contiguous group of minicolumns
21Hypercolums Modules of maxicolumns
REVIEW
A homotypical area in the temporal lobe of a
macaque monkey
22Functional columns vis-à-vis minicolumns and
maxicolumns
- Maxicolumn
- About 100 minicolumns
- About 300-500 microns in diameter
- Functional column
- A group of one to several contiguous minicolumns
within a maxicolumn - Established during learning
- Initially it might be an entire maxicolumn
23Learning in a system with columns of different
sizes
- At early learning stage, maybe a whole
hypercolumn gets recruited - Later, maxicolumns for further distinctions
- Still later, functional columns as subcolumns
within maxicolumns - New term Supercolumn a group of minicolumns of
whatever size, hypercolumn, maxicolumn,
functional column - Links between supercolumns will thus consist of
multiple fibers
24Question on cortical columns
E-mail from Kelly Banneyer . I understand that
a minicolumn is the smallest unit and maxicolumns
are composed of minicolumns and functional
columns are intermediate in size while
hypercolumns are composed of several maxicolumns.
I wonder if there can exist a minicolumn or
functional column in the brain that is not part
of a larger type of column. For example, I know
that there exists hierarchical structure, but is
there maybe some concept so exact and unrelated
to anything else that a mini/functional column
exists that is not part of a maxicolumn?
25Functional columns in phonological recognitionA
hypothesis
REVIEW
- Demisyllable (e.g. /de-/) activates a maxicolumn
- Different functional columns within the
maxicolumn for syllables with this demisyllable - /ded/, /deb/, /det/, /dek/, /den/, /del/
26Functional columns in phonological recognitionA
hypothesis
REVIEW
de-
deb ded den
de- det del dek
A maxicolumn (ca. 100 minicolumns)
Divided into functional columns (Note that all
respond to /de-/)
27Phonological hypercolumns (a hypothesis)
REVIEW
- Maybe we have
- Hypercolumn of contiguous maxicolumns for /e/
- With maxicolumns for /de-/, /be-/, etc.
- Each such maxicolumn subdivided into functional
columns for different finals - /det/, /ded/, /den/, /deb/, /dem/. /dek/
- (N.B. This is just a hypothesis)
- Maybe someday soon well be able to test with
sensitive brain imaging
28Adjacent maxicolumns in phonological cortex?
REVIEW
de-
A module of contiguous maxicolumns
te-
be-
pe-
Hypercolum
Each of these maxicolumns is divided into
functional columns
ge-
ke-
Note that the entire module responds to -e-
29Adjacent maxicolumns in phonological cortex?
REVIEW
de-
te-
deb ded den
de- det del dek
A module of six contiguous maxicolumns
be-
pe-
ge-
ke-
The entire maxicolumn responds to de-
The entire module responds to -e-
30Revisit the diagram Each node of the diagram
represents a group of minicolumns a supercolumn
Latent super-columns
Bundles of latent links
Dedicated super-columns and links
31Learning The Basic Process
Let these links get activated
32Learning The Basic ProcessRefined view
Then these supercolumns get activated
33Learning The Basic ProcessRefined view
That will activate these links
34Learning Refined view
This supercolumn gets enough activation to
satisfy its threshold
35Learning Refined view
This super-column is recruited for function AB
AB
B
A
36LearningRefined view
Next time it gets activated it will send
activation on these links to next level
AB
B
A
37LearningRefined view
Can get subdivided for finer distinctions
AB
B
A
38A further enhancement
- Minicolumns within a supercolumn have mutual
horizontal excitatory connections - Therefore, some minicolumns can get activated
from their neighbors even if they dont receive
activation from outside
39Learning Refined view
AB
Hypercolumn composed of 3 maxicolumns Can get
subdivided for finer distinctions
B
A
40Learning refined view
If, later, C is activated along with A and B,
then maxicolumn ABC is recruited for ABC
ABC
AB
B
A
C
41Learning refined view
And the connection from C to ABC is strengthened
it is no longer latent
ABC
AB
B
A
C
42Learning phonological distinctionsA hypothesis
de-
te-
deb ded den
de- det del dek
1. In learning, this hypercolumn gets
established first, responding to -e-
be-
pe-
ge-
ke-
3. The maxicolumn gets divided into functional
columns
2. It gets subdivided into maxicolumns for
demisyllables
43Remaining problems lateral inhibition
- When a hypercolumn is first recruited, no lateral
inhibition among its internal subdivisions - Later, when finer distinctions are learned, they
get reinforced by lateral inhibition - Problem How does this work?
44Hypothesis applied to conceptual categories
REVIEW
- A whole maxicolumn gets activated for the
category - Example DRINKING-VESSEL
- Different functional columns within the
maxicolumn for subcategories - CUP, GLASS, etc.
- Adjacent maxicolumns for categories related to
DRINKING VESSEL - BOWL, JAR, etc.
45Locating FunctionsThe Proximity Principle
- Related functions tend to be in close proximity
- If very closely related, they tend to be
adjacent - Areas which integrate properties of different
subsystems (e.g., different sensory modalities)
tend to be in locations intermediate between
those subsystems
46Consequences of the Proximity Principle
- Nodes in close competition will tend to be
neighbors - And their mutual competition is preordained even
though the properties they are destined to
integrate will only be established through the
learning process - Therefore, inhibitory connections should exist
predominantly among nodes of the same
hierarchical level - The presence of their mutual inhibitory
connections could be genetically specified
47Learning and the Proximity Principle
- Start with the observation
- Related areas tend to be adjacent to each other
- Primary auditory and Wernickes area
- V1 and V2, etc.
- Wernickes area and lexical-conceptual
information angular gyrus, SMG, MTG - Thus we have the proximity principle
- Question Why How to explain?
48Two aspects of the proximity principle
- A node that integrates a combination of
properties of different subsystems can be
expected to lie in a location intermediate
between those subsystems - A node that integrates a combination of
properties of the same subsystem should be within
the same subsystem, and maximally close to the
properties it integrates
49How to Explain the Proximity Principle?
- Factors responsible for observations of proximity
in cortical structure - Economic necessity
- Genetic factors
- Experience provides details of localization
within the limits imposed by genetic factors
50 Proximity Economic necessity
- Question Could a given column be connected to
any other column anywhere in the cortex? - That would require a huge number of available
latent connections - Way more than are present
- Hence there are strict limits on intercolumn
connectivity - Therefore, proximity is necessary just for
economy of representation
51Limits on intercolumn connectivity
- Number of cortical minicolumns
- If 27 billion neurons in entire cortex
- If avg. 77 neurons per minicolumn
- Then 350 million minicolumns in the cortex
- Extent of available latent connections to other
columns - Perhaps 35,000 to 350,000
- Do the math..
- A given column has available latent connections
to between 1/1000 and 1/10000 of the other
columns in the cortex
52Locations of available latent connections
- Local
- Surrounding area
- Horizontal connections (grey matter)
- Intermediate
- Short-distance fibers in white matter
- For example from one gyrus to neighboring gyrus
- Long-distance
- Long-distance fiber bundles
- At ends, considerable branching
53The role of long-distance fibers
- Arcuate fasciculus
- Genetically determined
- Limits location of phonological recognition area
- Interhemispheric fibers
- Also genetically determined
- Wernickes area RH homolog of Ws area
- Brocas area RH homolog of Bs area
- Etc.
54Two Factors in Localization
- Genetic factors determine general area for a
particular type of knowledge - Within this general area the learning-based
proximity factors select a more narrowly defined
location - Thus the exact localization depends on experience
of the individual - When part of the system is damaged,
learning-based factors can take over and result
in an abnormal location for a function
plasticity
55Genetically determined proximity
- Genetically-determined proximity would have
developed over a long period of evolution - Many features are shared with other mammals
- This process could be called evolutionary
learning - According to standard evolutionary theory..
- A process of trial-and-error
- Trial
- Produce varieties
- Error
- Most varieties will not survive/reproduce
- The others the best among them are selected
- Other genetic factors supplement proximity
- Long-distance fiber bundles
56Some innate factors relating to localization
- Primary areas
- Long-distance fiber bundles
57Innate factors relating to primary areas
- Location
- Genetically determined locations
- But there are exceptions
- Malformation
- Damage
- Structure
- Genetically determined structures adapted to
sensory modality (they have to be where they are) - Heterotypical structures
- Found in primary areas
- Primary visual
- Primary auditory
58A Heterotypical (i.e., genetically built-in)
structureVisual motion perception
REVIEW
An area in the posterior bank of the superior
temporal sulcus of a macaque monkey (V-5) A
heterotpical area Albright et al. 1984
400-500 µ
59A Heterotypical structureAuditory areas in a
cats cortex
REVIEW
A1
AAF Anterior auditory field A1 Primary
auditory field PAF Posterior auditory
field VPAF Ventral posterior
auditory field
60Innate factors relating to localization
- The primary areas
- Long-distance fiber bundles
- Interhemispheric via corpus callosum
- Longitudinal from front to back
- Arcuate fasciculus is part of the superior
longitudinal fasciculus - They allow for exceptions to proximity
- Areas closely related yet not neighboring
61Applying the proximity principle
- For both types (genetic and experience-based) we
can make predictions of where various functions
are most likely to be located, based on the
proximity principle - Brocas area near the inferior precentral gyrus
- Wernickes area near the primary auditory area
- Such predictions are possible even in cases where
we dont know whether genetics or learning is
responsible - maybe both
62Implications of the proximity principle
- System level
- Functionally related subsystems will tend to be
close to one another - Neighboring subsystems will probably have related
functions - Cortical column level
- Nodes for similar functions should be physically
close to one another - Nodes that are physically close to one another
probably have similar functions - Therefore..
- Neighboring nodes are likely to be competitors
- They need to have mutually inhibitory connections
63Deriving location from proximity hypothesis
- The cortex has to provide for decoding speech
input - Speech input enters the cortex in the primary
auditory area - Results of the decoding (recognition of
syllables etc.) are represented in Wernickes
area - Why is Wernickes area where it is?
64Speech Recognition in the Left Hemisphere
Phonological Production
Phonological Recognition
Primary Auditory Area
Wernickes Area
65Exercise Location of Wernickes area
- Why is phonological recognition in the posterior
superior temporal gyrus? - Alternatives to consider
- Anterior to primary auditory cortex
- Advantage would be close to phonological
production - Inferior to primary auditory cortex
- (There are two reasons)
66Answer Location of Wernickes area
- Wernickes area pretty much has to be where it is
to take advantage of the arcuate fasciculus - The location of W.s area makes it close to
angular gyrus, likely area for noun lemmas
(morphemes and complex morphemes) - Also, close to SMG, presumed area for
phonological monitoring - (Why?
- Because it is adjacent to primary somatosensory
area)
67More exercises
- Explaining likely locations of morphemes
- verb morphemes in the frontal lobe
- noun morphemes in the angular gyrus and/or middle
temporal gyrus - The dorsal (where) pathway of visual perception
68Experience-based proximity
- Can be expected to be operative
- more at higher (more abstract) levels, less at
lower levels - for areas of knowledge that have developed too
recently for evolution to have played a role - Reading
- Writing
- Higher mathematics
- Physics, computer technology, etc.
69Innate features that support language
- Columnar structure
- Coding of frequencies in Heschls gyrus
- Arcuate fasciculus
- Interhemispheric connections (via corpus
callosum) e.g., connect Wernickes area with RH
homolog - Spread of myelination from primary areas to
successively higher levels - Left-hemisphere dominance for grammar etc.
70end