apparent position governs contour-element binding by the visual system

6
Apparent Position Governs Contour-Element Binding by the Visual System Author(s): Anthony Hayes Source: Proceedings: Biological Sciences, Vol. 267, No. 1450 (Jul. 7, 2000), pp. 1341-1345 Published by: The Royal Society Stable URL: http://www.jstor.org/stable/2665948 . Accessed: 12/06/2014 21:51 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. . The Royal Society is collaborating with JSTOR to digitize, preserve and extend access to Proceedings: Biological Sciences. http://www.jstor.org This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PM All use subject to JSTOR Terms and Conditions

Upload: anthony-hayes

Post on 18-Jan-2017

213 views

Category:

Documents


0 download

TRANSCRIPT

Apparent Position Governs Contour-Element Binding by the Visual SystemAuthor(s): Anthony HayesSource: Proceedings: Biological Sciences, Vol. 267, No. 1450 (Jul. 7, 2000), pp. 1341-1345Published by: The Royal SocietyStable URL: http://www.jstor.org/stable/2665948 .

Accessed: 12/06/2014 21:51

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .http://www.jstor.org/page/info/about/policies/terms.jsp

.JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range ofcontent in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new formsof scholarship. For more information about JSTOR, please contact [email protected].

.

The Royal Society is collaborating with JSTOR to digitize, preserve and extend access to Proceedings:Biological Sciences.

http://www.jstor.org

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions

ri THE ROYAL doi 10.1098/rspb.2000.1148

*@I SOCIETY

Apparent position governs contour-element binding by the visual system Anthony Hayes Department of Psychology, University of Melbourne, Parkville, Victoria 3052, Australia

An assumption inherent in many models of visual space is that the spatial coordinates of retinal cells implicitly give rise to the perceptual code for position. The results of the experiments reported here, in which it is shown that retinally non-veridical locations of contour elements are used by the visual system for contour-element binding, lend support to a different view. The visual system does not implicitly code position with reference to the labelled locations of retinal cells, but dynamically extracts spatial position from the aggregate result of local computations. These computations may include local spatial relation- ships between retinal cells, but are not confined to them; other computations, including position derived from local velocity cues, are combined to code the position of objects in the visual world.

Keywords: contour; Gestalt good continuation; motion-induced apparent displacement; perception

1. INTRODUCTION

The trajectory of a moving object may be roughly estimated from knowledge of its current (or recent past) position, together with a measure of its local velocity, or it may be inferred from samples of its position over some short period of time. Neuropsychological studies and adaptation evidence point to a dissociation in human vision between mechanisms that code local velocity and mechanisms that code position. Brain injury can result in an inability to perceive motion, but can spare an ability to perceive displacement (Zihl et al. 1983), and adaptation to a moving target (e.g. a waterfall) has as an after-effect the perception of motion, in reverse direction to the target, that lacks the perception of displacement (for review, see Mather et al. 1998). In most situations of real motion of objects there will be a consistent match between changes in position displacement and local motion, and it is reasonable to presume that the output of mechanisms sensitive to motion and the output of mechanisms sensitive to position will also be consistent with each other.

However, MacKay's (1958, 1961) demonstrations with stroboscopically lit, partially self-luminous, objects sug- gest that mechanisms sensitive to local velocity can give rise to estimates of position that do not correspond to estimates derived from mechanisms sensitive to position alone. Nijhawan (1994) in a rediscovery, and confirma- tion, of MacKay's effect showed that moving objects can give rise to dissonant estimates of position by the visual system moving objects were typically perceived as being ahead of their strobed retinal location. This dissonance in perception has been shown to be most probably the result of a reduced neural delay for moving stimuli (Whitney & Murakami 1998).

The results of experiments by De Valois & De Valois (1991) show that stationary objects that contain local- motion cues are perceived as being displaced in the direction of the motion vector. De Valois & De Valois demonstrated this perceived mispositioning by vignetting a drifting sinusoid by a stationary Gaussian; an apparent displacement of the vignette is induced. Figure 1 illustrates their finding. The motion-induced 'illusory'

displacement of each element of a vertically aligned triplet results in perceived misalignment of the triplet, demonstrating the, perhaps partial, independence of codes for velocity and codes for position.

A question that is prompted by motion-induced perceived displacement is whether the veridical (i.e. in this case retinal) position or the apparent position, of 'perceptually displaced' elements, is used by the visual system for the grouping of visual elements into perceptual wholes. A powerful example of perceptual grouping is provided by fragmented contours, which are usually seen despite fragmentation as continuous objects. David Field and I (Field et al. 1993) created a 'path' stimulus paradigm, which we used to demonstrate that a subset of an array of elements is readily perceived as a contour when the contour is defined only by alignment of the elements along a smooth curve (see figure 2a). This tendency of aligned fragments to group into a perceived whole was described earlier this century as the 'Gestalt law of good continuation' (Wertheimer 1938); one of a set of 'laws' that constituted an attempt to describe and categorize the rules of perceptual organization that allow us to perceive objects as whole entities, and not simply as isolated scraps of form and colour. Small off-axis displacements of fragments aligned along a smooth curve disrupt the 'good continuation' of the contour such that it is no longer perceived as an integrated whole (Field et al. 1993). This finding is illustrated in figure 2.

If contours are constructed of elements, where each element is a Gaussian-vignetted drifting sinusoid, that are aligned along the axis of a smooth curve, then the result shown in figure 1 predicts that the motion of the sinusoid will induce an apparent off-axis displacement of each element. This displacement will be similar to the physical off-axis displacement illustrated in figure 2b, and may, as a consequence, disrupt the perceptual salience of the contour. However, the prediction that arises from the view that the metric of visual space is derived from the positions of retinal cells a view that can be traced to the 'local sign' hypothesis of Lotze (1852) and is sub- sequently mapped in an orderly topographic fashion through the visual pathway, is that the positions of objects are coded according to their retinal coordinates. It can be

Proc. R. Soc. Lond. B (2000) 267, 1341-1345 1341 C) 2000 The Royal Society Received 9 December 1999 Accepted 28 January 2000

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions

1342 A. Hayes Visual system contour-element binding

(a) (b) Li

- > t2 ;\D5-

t3

1.0 - 1,-(c)

0.8 -

o.6 0.6-

042 -

p.s.a. 0.0 . . ..

I , I , I I -0.60 -0.50 -0.40 -0.30 -0.20

aligmnent (a)

Figure 1. Alignment of a triplet of Gaussian-vignetted, drifting sinusoids. (a) The three-element alignment stimulus array. The arrows indicate the direction of drift of the sinusoid (spatial frequency, 8 cycles deg- 1; wavelength, 7.5'; temporal frequency, 4.7 Hz; a of Gaussian vignette, 3.75'). The figure shows the approximate position of the elements when the appearance of the 'dynamic' triplet is vertical alignment. (b) Section through a Gaussian-vignetted sinusoid (broken line shows the shape of the Gaussian vignette) illustrating the position of a rightward-drifting sinusoid at 53 ms intervals. (c) Psychometric function for observer A.H. when the central element is a rightward-drifting Gaussian-vignetted sinusoid, and the top and bottom elements are leftward-drifting Gaussian-vignetted sinusoids. The point of subjective align- ment (p.s.a.) occurred when the central element was displaced 0.35 af from vertical alignment, in the opposite direction to the drift of the sinusoid (displacement is scaled to af of the Gaussian vignette). Each datum plots the result of 40 trials; the error bars show + 1 s.d.

argued that this metric is available regardless of appearance; compelling evidence comes from experi- ments where the size or position of objects is distorted perceptually, but the 'illusory' distortion is not reproduced in the control of action (Milner & Goodale 1995). Thus,

which position of the elements, their motion-induced apparent position or their retinal position, is used by the visual system in the assembly of contours (often thought of as a relatively 'high-level' task, e.g. Rock (1984)), and hence makes explicit the shape of objects?

2. METHODS AND RESULTS

Figure 2 presents examples of the stimulus arrays for a contour-detection experiment where the task of the obser- vers was to determine in each trial which of a sequentially presented pair of arrays contained a contour. One array was made up of 256 elements quasi-randomly arranged in position and orientation, the other was made up of 256 elements of which 244 were quasi-randomly arranged, and 12 formed a 'path' aligned along a smooth contour. The right panels of figure 2 present examples of the arrays; each array contains the aligned set of elements the path-that is shown in isolation in the left panels. The arrays were made up of 'dynamic' elements, all of which in the experimental condition were Gaussian- vignetted drifting sinusoids. The perceived displacement of the elements that is induced by the drift of the vignetted sinusoids is so dramatic that the array of 'dynamic' elements resembles a carpet of insects moving around the surface of the array in every direction. Indeed, a casual observer needs to view the array for some time to convince him/herself that the elements themselves are, in fact, stationary, and that it is only the sinusoid that drifts.

The results of the experiment, presented in figure 3, show that detection of contours constructed from vignetted drifting sinusoids improves when the contour elements are progressively misaligned, to some optimal displacement from alignment, in a direction opposite to the sinusoid's direction of motion. Contour-detection performance peaks when the elements are physically misaligned by more than a quarter of their visible width. Thus, when the visual system binds the elements to give rise to the perception of a contour, it uses the motion-induced 'illusory' position of elements of the contour-where the results of the align- ment experiment (figure 1) suggest that they appear to be-and not their retinal locations. The displacement of contour elements from physical alignment that is required for optimal contour detection, about 1.0 a for observer A.H., is larger than the displacement from physical align- ment required for a triplet of elements to be perceived as aligned, about 0.35 of (compare the results in figure 1 with those in figure 3; 'a' refers to the standard deviation of the Gaussian vignette).

3. DISCUSSION

The salience of contours, such as the ones formed by the stimuli used in the experiment reported here, illustrates the operation of an 'organizing principle' of the visual system that may be a consequence of relatively low-level binding operations that may occur in primary and secondary areas of the visual cortex (Spillmann & Werner 1996; Grossberg et al. 1997). Contour assembly, using fragments that are apparently displaced due to a local-motion signal, is achieved with contour fragments whose positions corre- spond to their apparent, local-motion induced, location,

Proc. R. Soc. Lond. B (2000)

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions

Visual system contour-element binding A. Hayes 1343

(b

Figure 2. Stimulus arrays from the contour-detection experiment. (a) Contour constructed from fully aligned elements. On the left, contour path alone; on the right, contour path embedded in background 'noise'. All elements of the path are aligned along a smooth curve. (b) Contour path constructed from non-aligned elements. All elements of the contour path are displaced away from the axis of the contour. The Gaussian-vignetted sinusoid of contour and background elements either drifted or counter- phased, in incoherent phase, in each of two conditions. In the experimental drifting-sinusoid condition, 12 contour elements were either all displaced in the direction opposite to the direction of drift of the sinusoid, or all displaced in the same direction as the drift. The direction of drift of the sinusoid of each contour element was randomly allocated; however, no more than two adjacent elements of the contour drifted in the same direction. The direction of drift of the sinusoids of the background elements was randomly allocated. In the counterphasing control condition the sinusoids of all elements-contour and background-counter- phased at the same temporal frequency. The demonstration shown in (b) corresponds to a condition where contour-elements are misaligned off the axis of a smooth curve. The off-axis displacement shown is 4o,; a level of misalignment from a smooth curve that renders the contour path utterly undetectable when embedded in background elements (right panel). In the experiment all elements, background and contour, were made up of drifting or counterphasing Gaussian-vignetted sinusoids-drift speed/ counterphase frequency, 4.7 Hz; display update, 37.5 Hz; screen refresh rate, 75 Hz; dimensions of the elements were as described in figure 1. All elements were constrained in position to a 16 x 16 grid. Background elements had random orientation and position, within the limitations imposed by the 16 x 16 grid. The array of 256 elements subtended 80 visual angle. A two-alternate forced-choice procedure was used in The experiments; the observer chose which of two sequentially presented arrays contained a contour. One presentation consisted of background 'noise' elements alone, the other consisted of 12 contour elements embedded in 'noise' elements, as shown in the right panels. The angular difference between each of the 12 elements aligned along a contour was + (40 + (0-12)) o Each presentation was a lO0s (half-height to half-height) raised Gaussian-ramped temporal window. Further details on the construction of the stimulus arrays have been published elsewhere (Field et at. 1993).

not with fragments whose positions correspond to their actual retinal location. The distortions in visual location that are induced by a local-motion signal cannot be explained with reference to high-level visual processing since the visual system's use of the displaced positions for contour assembly indicates that motion-induced apparent displacement precedes 'binding' operations. Because the measure used in the experiment-contour detection-is an indirect measure of visual alignment, the results of the experiment provide strong evidence that the visual system's spatial metric incorporates local-motion cues.

The findings illustrate the powerful influence of local- motion signals on position information, not only in deter- mining perceived position, but also in shaping the input to the mechanisms that link or bind elements into objects. These results add to and extend existing evidence for the non-separability of form and motion (e.g. Burr et al. 1986), and challenge a commonly accepted view of modular analysis by the visual system where different attributes, such as form and motion, are thought to be analysed through non-communicating independent modules.

Proc. R. Soc. Lond. B (2000)

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions

1344 A.Hayes Visualsystemcontour-elementbinding

(a)

A.H. L.M. 0.72 -

0.78-

0.68 - ; 0.74-

0 0.64- 0.70 -

-a- drift opposite -s- drift same 0.60

0 .6 6 - _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

(b)

0.84 A-l.0.84 LM

o0.80 -0.80-

0.76 - counterphase 0.76 -

0.0 1.0 2.0 0.0 1.0 2.0

alignment offset (a) alignment offset (a)

Figure 3. Results of contour-detection experiments. (a) Contour detection as a function of element offset from alignment with a smooth curve, where contour elements consist of Gaussian-vignetted drifting sinusoids. Offset dimension is scaled to the size of the elements cr of the Gaussian vignette. The results of two observers are plotted; each datum plots the percentage correct and + 1 s.e.m. for 1000 two-alternate forced-choice trials; all conditions were interleaved in blocks of 50 trials. Observer L.M. was naive to the aims and manipulations of the experiment. The round symbols show detection performance as a function of offset of elements from a smooth curve, where the offset is in the opposite direction to the direction of drift of the sinusoid (i.e. offset is counter to motion-induced displacement). The square symbols show detection performance where the offset is in the same direc- tion as the direction of drift of the sinusoid. These results indicate that the position of the elements that results in optimal contour-detection performance occurs when the elements are physically displaced from alignment along a smooth curve by about lo for observer A.H. and 1.5u for observer L.M. (b) Contour detection as a function of element offset using Gaussian-vignetted counterphasing (i.e. non-drifting) sinusoids. The symbols show detection performance as a function of offset of contour elements from a smooth curve. As expected, performance simply deteriorates monotonically as offset increases, since no apparent displace- ment is induced by the counterphasing sinusoids.

The findings presented here are not consistent with simple models of spatial localization that have as their basis the simple transference through the visual pathway of the retinotopic position of retinal cells. Nor are they consistent with simple, and naive, structural isomorphist speculations on cellular anatomy and the shape of contours. The findings are, however, consistent with the emerging view that the majority of receptive fields of cells in the geniculostriate pathway are space-time insepar- able, with many cells having spatio-temporally tuned receptive fields that will allow for summation along a velocity trajectory (DeAngelis et al. 1995; Burr & Ross 1986). They are also compatible with recent findings that suggest that visual space is dynamically remapped for image stabilization (e.g. Motter & Poggio 1990) and other findings that imply a plasticity in visual space prior to saccadic eye movements (e.g. Ross et al. 1997).

It may not be too fanciful to suggest that the metric for spatial vision is the dynamical result of a number of different low-level computations, and that the explana- tion for the 'illusion' of apparent displacement induced by a local-motion signal is that a normally useful computa- tion produces a misleading output when the visual system

is presented with a visual image that has unlikely spatio- temporal statistics.

This work was supported by a grant from the Australian Research Council. I thank S. Cropper, J. Ross, P. Smith, andJ. Wray for helpful comments on an early draft of the manuscript, and I thank D. Burr and an anonymous referee for helpful com- ments on the submitted draft. Some of the data presented here have been published in conference-abstract form (Hayes 1997).

REFERENCES

Burr, D. & Ross, J. 1986 Visual processing of motion. Trends Neurosci. 9, 304-307.

Burr, D. C., Ross, J. & Morrone, M. C. 1986 Seeing objects in motion. Proc. R. Soc. Lond. B 227, 249-265.

DeAngelis, G. C., Ohzawa, I. & Freeman, R. 1995 Receptive- field dynamics in the central visual pathways. Trends Neurosci. 18, 451-458.

De Valois, R. L. & De Valois, K. K. 1991 Vernier acuity with stationary moving Gabors. Vision Res. 31, 1619-1626.

Field, D. J., Hayes, A. & Hess, R. E 1993 Contour integration by the human visual system: evidence for a local 'association field'. Vision Res. 33, 173-193.

Proc. R. Soc. Lond. B (2000)

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions

Visual system contour-element binding A. Hayes 1345

Grossberg, S., Mingolla, E. & Ross, W. D. 1997 Visual brain and visual perception: how does the cortex do perceptual grouping? Trends Neurosci. 20, 106-111.

Hayes, A. 1997 Contour integration with stationary moving elements. Invest. Ophthal. Vis. Sci. 38, 74.

Lotze, R. H. 1852 Medicinische Psychologie, oder Psychologie der Seele (Leipzig) bk. II, 292-294; translated by D. Cantor 'Rudolf Hermann Lotze (1817-1881) on local signs in their relation to the perception of space, 1852'. In A source book in the history of psychology (ed. R. J. Herrnstein & E. G. Boring), pp. 135-140. Harvard University Press, 1965.

MacKay, D. M. 1958 Perceptual stability of a stroboscopically lit visual field containing self-luminous objects. Nature 181, 507-508.

MacKay, D. M. 1961 Interactive processes in visual perception. In Sensory communication (ed. W. A. Rosenblith), pp. 339-355. Cambridge, MA: MIT Press.

Mather, G., Verstraten, F. & Anstis, S. (eds) 1998 The motion aftereffect: a modern perspective. Cambridge MA: MIT Press.

Milner, A. D. & Goodale, M. A. 1995 The visual brain in action. Oxford University Press.

Motter, B. C. & Poggio, G. F. 1990 Dynamic stabilization of receptive fields of cortical neurons (VI) during fixation of gaze in the macaque. Exp. Brain Res. 83, 37-43.

Nijhawan, R. 1994 Motion extrapolation in catching. Nature 370, 256-257.

Rock, I. 1984 Perception. San Francisco, CA: W. H. Freeman & Co. Ross, J., Morrone, M. C. & Burr, D. C. 1997 Compression of

visual space before saccades. Nature 386, 598-601. Spillmann, L. & Werner, J. S. 1996 Long-range interactions in

visual perception. Trends Neurosci. 19, 428-434. Wertheimer, M. 1938 Laws of organization in perceptual forms.

London: Brace andJovanovich. Whitney, D. & Murakami, I. 1998 Latency difference, not

spatial extrapolation. Nature Neurosci. 1, 656-657. Zihl, J., Von Cramon, D. & Mai, N. 1983 Selective disturbance

of movement vision after bilateral brain damage. Brain 106, 313-340.

Proc. R. Soc. Lond. B (2000)

This content downloaded from 195.78.108.147 on Thu, 12 Jun 2014 21:51:50 PMAll use subject to JSTOR Terms and Conditions