1 Service Hospitalier Frédéric Joliot, CEA, Orsay and , 2 Laboratoire de Physiologie de la Perception et de l'Action, CNRS-Collège de France, Paris, France
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Little is known about the neural bases underlying the reconstruction of a 3-D percept from optic flow. In the present study we use functional magnetic resonance at 3 T to explore the cortical pathways involved in such a capacity. The perception of 3-D structure from motion contributes to the representation of object shape, but also requires a spatial integration of visual motion. Therefore we first summarize the results related to both object shape representation and optic flow processing as the basis for motivating and interpreting our experiments.
The Cortical Coding of Object Shape
It is now widely accepted that the cortical visual pathway is divided in two parallel streams, the so-called dorsal (from V1 to the parietal lobe) and ventral (from V1 to the inferotemporal cortex) pathways, that are relatively independent (Ungerleider and Mishkin, 1982). Functionally the dorsal pathway mediates the visual perception of the movement and location of objects relative to the observer (where), and of the observer's own movement and location. Goodale and Milner also suggested that the dorsal stream plays a specific role in object-oriented action (Goodale and Milner, 1992
). Conversely, the ventral stream seems to be mainly involved in object recognition and identification (what).
Both primate electrophysiology and human cortical imagery show that the coding of object shape from static monocular depth cues and binocular disparity is distributed in the dorsal and ventral visual pathways. This is in agreement with the view that information on object shape is necessary for recognizing objects, as well as for locating and manipulating them.
In man, the presentation of an object shape, as opposed to visual noise, induces an activity in the ventral pathway, particularly the fusiform gyrus (Malach et al., 1995; Martin et al., 1996
), even if the object is perceived as a nonsense object. Also, Schacter et al. found that an activity in inferior temporal regions was elicited during a decision task about possible versus impossible 3-D objects (Schacter et al., 1995
). A selectivity to object shape has also been repeatedly found in the infero temporal cortex of monkeys (Ungerleider and Haxby, 1994
).
As for the dorsal pathway, Faillenot et al. compared a task of matching object shapes, to a task of pointing toward the centre of objects, in humans (Faillenot et al., 1997). Their positron emission tomography (PET) results showed an increased activity level for the task of matching an object shape not only in ventral areas, but also in the posterior parietal cortex. Similarly, the monkey intraparietal sulcus (area AIP) contains neurons that are driven by hand manipulation and respond selectively to the 3-D geometrical shape of the manipulated objects (Murata et al., 1993
).
Optic Flow Processing and Extraction of 3-D Structure from Motion Parallax
Let us consider the case where the visual environment is made of rigid objects, and the relative motion between the object and observer is not a pure rotation around the observer's eye (in which case the optic flow is devoid of 3-D structure information). The variations of image velocity on the retina are then termed motion parallax. Psychophysical experiments and computational studies showed that 3-D structure can be recovered from motion parallax. In theory, this is possible if the retinal velocity is given (i) in a sufficient number of points (discrete approach) (Ullman, 1979) or (ii) within a retinal area large enough to compute the spatial derivatives of velocity (continuous approach) (Koenderink and van Doorn, 1975
; Longuet-Higgins and Prazdny, 1980
). These derivatives, also called the optic flow components, are computed at an intermediate stage.
The continuous model has been validated to a certain extent by electrophysiological investigations, since neurons sensitive specifically to one or several optic flow components have been found in the dorsal visual pathway of the primate. Such neurons are located in the dorsal part of area MST (Saito et al., 1986; Duffy and Wurtz, 1991
), as well as in parietal areas such as VIP (Bremmer et al., 1997
). Whether the neurons sensitive to optic flow components contribute to the coding of object shape, object motion or self-motion is still unclear. However, because they usually have large receptive fields and are also modulated by extraretinal signals of vestibular, tactile or proprioceptive origin (Andersen, 1987
), several authors have hypothesized that their activity is related to self-motion rather than object motion (Duffy and Wurtz, 1991
).
The visual motion area MT/V5 is also a major locus for optic flow processing (Newsome et al., 1989). Contrary to area MST, MT/V5 shows no tuning to the optic flow components (Orban et al., 1992
). Nevertheless the role of MT/V5 in the spatial integration of visual motion has been demonstrated. Firstly, its neurons seem capable of integrating and segmenting different retinal motion components (Stoner and Albright, 1992
). Secondly, unicellular responses in MT/V5 can be inhibited by the superposition ot two motion patterns moving in opposite directions (Snowden et al., 1991
; Qian and Andersen, 1994
), and this suppression is weaker if the two patterns have different binocular disparities (Bradley et al., 1995
). Thirdly, Bradley et al. found a correlation between perceived 3-D motion and unicellular activity in MT (Bradley et al., 1998
). Fourthly, the centersurround interactions demonstrated in MT/V5 neurons are possible mechanisms for spatial integration of retinal motion, which could subserve the specific sensitivity to the 3-D orientation of a moving plane in space (Xiao et al., 1997
).
In summary, the properties of area MT/V5 related to the processing of retinal motion differ from what is seen in, for example, area MST. Although it does not seem to follow the continuous model described above, MT/V5 probably contributes to the 3-D processing of optic flow through its capacity to integrate or segment different motion signals. Finally, the functional role of MT/V5 seems to be distinct from the role of MST or parietal areas, because its activity is not modulated by extraretinal signals but depends only on retinal motion (Wurtz et al., 1990).
Dorsal and/or Ventral Implication in the Processing of 3-D Structure from Motion?
Binocular disparity, which is a critical cue for the perception of 3-D shape, is coded in the dorsal pathway [in MT/V5 (Maunsell and van Essen, 1983) and in MST (Roy et al., 1992
)] and in the ventral pathway (Cowey and Porter, 1979
; Schiller, 1993
) of the monkey. The similarities between motion parallax and binocular disparity as depth cues are numerous, including psychophysical and computational aspects (Rogers and Graham, 1982
; Cornilleau-Pérès and Droulez, 1993
). Since both cues participate in the perception of 3-D object shape, the question arises whether motion parallax is also coded in both the ventral and dorsal visual pathways.
In addition, because of the large size of the receptive fields of neurons sensitive to optic flow components in MST and other parietal areas, it remains to be discovered whether these areas are involved in small-field analysis of 3-D shape from motion.
Given the physiological properties and functional roles of the ventral and dorsal visual pathways, a working hypothesis is that optic flows depicting little or no 3-D structure information (such as expansion/contraction) may be processed exclusively in the dorsal visual pathway, whereas a strong perception of 3-D shape from motion (such as during rotation in depth) might also involve the ventral pathway.
To address these questions in the human we used stimuli that consisted of random dot distributions moving coherently or not. Coherent motion stimuli could contain little or strong 3-D shape information. They contain no depth cue, except motion parallax. In addition, we compared random motion stimuli with static dots, so as to locate previously documented motion areas. Overall, two series of experiments were performed. The first series was designed to outline areas involved in the processing of motion and 3-D structure from motion. The second was designed to control for the influence of speed distribution and 3-D curvature in the areas delineated by the first series of experiments.
![]() |
Main Experiments |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
This investigation was approved by an Institutional Ethics Committee (CCPPRB, Paris).
Subjects
Nine normal volunteers (five men and four women) aged 2042 years were scanned, after giving their informed consent about the nature of the experiment. All subjects were healthy and had normal uncorrected vision. All but one were right handed, and three had a left ocular dominance (two right-handed subjects had a left ocular dominance).
Image Acquisition
Subjects were scanned in a 3 T whole-body MRI scanner (Bruker, Ettlingen, Germany) with BOLD contrast echo planar imaging (flip angle 90°, TE = 40 ms). Twenty slices covering the whole brain were acquired, roughly perpendicular to the axis of the brainstem. Voxel size was 4 x 4 x 5 mm. Each functional sequence consisted of 76 scans, with a repetition time of 3.05 s. We also performed a high-resolution T1 weighted (IR gradient echo) sequence in order to acquire accurate anatomical information.
Visual Stimulation
Subjects laid supine in the magnet. They wore glasses, consisting of a mirror angled at ~45° from their visual axes, to allow them to see the translucent screen located at the extremity of the magnet on head side. Viewing distance was 1.40 m. Stimuli were projected on the screen using an Eiki 5000 projector driven by a PC computer.
The visual stimuli consisted of 300 anti-aliased dots (6 pixels width) and a central fixation cross; both the dots and the cross were displayed in green over a black background. The stimuli covered a 16° diameter region, each dot subtending 0.27°. Global luminance was 1 cd/m², and subjects experienced no colour after effect.
In this experiment, four different stimuli were used (Fig. 1A):
|
Stimulus RM: moving dots with random direction and random displacement amplitude. Dot speed varied sinusoidally with time (frequency: 1 Hz). Direction and amplitude of motion were changed to new random values at each speed zero crossing. Maximal dot speed was 3.3°/s and mean speed over the block of stimulation was 1°/s.
Stimulus EX: moving dots in alternated expansion and contraction (frequency: 1 Hz). Dot speed ranged from 0 to 2.4°/s with a mean speed of 1°/s. Such a stimulus can be considered as coherent motion with little, or ambiguous, structure information. Indeed, a sphere moving back and forth would yield almost the same perceptual sensation as a plane moving the same way, so that it would not be possible to distinguish between both shapes. The same ambiguity would arise should motion be a rotation around the observer. The reason is that frontoparallel translation only can provide reliable structure information (Longuet-Higgins and Prazdny, 1980). At 16° visual angle, we found that the frontoparallel component of motion in the EX stimulus was in average 9.3% of its value in the SP3D stimulus described below. We hence considered structure information in the EX stimulus to be one-tenth that of structure information in the SP3D stimulus.
Stimulus SP3D: dots moved as if they belonged to a spherical surface S of radius 38 cm (15.1° visual angle) rotating around a frontoparallel axis tangent to S. We constructed the stimulus by projecting a 2-D random dot distribution onto the virtual 3-D surface, so that there was no density bias at the edges of the sphere. Then, the 3-D position of the dots on the oscillating sphere were back-projected onto the screen at each frame. This rotation yields a pattern of coherent motion, and is optimal for the non-ambiguous perception of structure from motion parallax (Cornilleau-Pérès and Droulez, 1994). The direction of the rotation axis was randomly chosen in the first occurrence of the stimulus, and then incremented by 45° on each of the next three occurrences. Each time the spherical surface reaches its extremal position or a frontoparallel position (i.e. four times per 3-D oscillation) the 2-D dot velocity passes through zero. Hence the frequency of the 2-D velocity is twice the frequency of the 3-D motion. Therefore, the frequency of the 3-D motion was chosen to be 0.5 Hz so that the 2-D dot velocity frequency was 1 Hz, as for other motion stimuli. Dot speed ranged from 0 to 4.7°/s, with a mean value of 1°/s.
In each stimulus image, dots were uniformly spread over the viewing window. The frame rate was 70 Hz. Dots that moved outside the viewing window between two successive frames were randomly repositioned within the window in the second frame. This was done under the constraints that (i) dot density was kept uniform over the window and (ii) the average dot flicker (number of dots appearing and disappearing in the window in each frame) was kept similar for the three motion stimuli (RM, EX, SP3D). Hence the number of dots inside the window, and therefore the luminance, were constant across the different stimuli. Also, the following parameters were equalized for the three motion conditions (RM, EX and SP3D):
Design
The volumes of functional images were acquired while subjects viewed two stimuli, hereafter denoted S1 and S2, alternately. S1 was first presented during 36.6 s (12 scans), then S2 and S1 were alternately displayed during 24.4 s (eight scans) each, for a total duration of 3 min 52 s (= 36.6 + 8 x 24.4 s). Each subject underwent three experiments:
The order of the three experiments was randomized across subjects. Subjects were instructed to lie still, fixate the central fixation cross and attend the visual stimulus. Sequences were separated by rest (no scanning) periods of 45 min. After completion of the scanning sessions, subjects were asked to report on their different visual and non-visual percepts while in the scanner.
Image Analysis
Data were pre-processed and analyzed using SPM96 (Friston et al., 1995). For each subject, all functional volumes were motion-corrected using sinc interpolation and normalized in the Talairach stereotactic system of coordinates. The normalization was done by using linear transformations to match the anatomical images with the Montreal Neurological Institute template. Functional volumes were then spatially smoothed with a 5 mm width Gaussian kernel. The voxel size of the normalized volumes was set to 3 x 3 x 3 mm. The first four scans of each experiment (a series of 76 scans), acquired during the transition to the steady state of the magnetic resonance signal, were discarded.
Data were analyzed both on a group and on a subject per subject basis. For subject and group analyses, the three experiments were modelled with separate blocs of covariates. For each experiment, the stimulation paradigm was modelled as a box-car function modified to take into account the hemodynamic function delay, rise and fall time. Low frequency trends were modelled as confounding effects with discrete cosine functions (cut-off period = 96 s = twice the paradigm period).
Modelled effects were fitted to the BOLD signal on a voxel per voxel basis using standard least-square estimation procedures in linear model theory. Voxels for which effects of interest accounted for a significant part of the signal variation were kept for further analysis (i.e. voxels surviving F-test, P < 0.001). For both individual and group analyses, the statistical parametric maps (SPM{Z}) corresponding to contrasts RMST, ±(EXRM) and ±(SP3DRM) were generated.
Additionally, the SPM{Z} corresponding to the (SP3DRM)(EXRM) contrast was also calculated for the group analysis. Since stimuli SP3D and EX were always presented in separate experiments, this contrast was not computed on an individual basis, because it may reveal nothing but random effects between sessions. In the group analysis, these effects should average across subjects, since the order of the experiments was randomized across subjects.
We performed the group analysis with an exploratory perspective. The type II error (risk of not detecting a region that is activated) was limited by choosing permissive thresholds at the voxel (Z intensity) and cluster size (spatial extent) levels. SPM{Z} for the group were thresholded at Z = 2.33 (P = 0.01), and clusters were kept for further analysis if (i) their volume was >190 mm3 and (ii) their corrected probability for the conjoint test (maximum intensity and spatial extent) was <0.5
Regions highlighted by the group analysis were subsequently tested on an individual basis. The threshold at the voxel level was chosen more conservative than for the group analysis (Z = 3.09, P = 0.001) and regions were taken into account if their volume reached at least 190 mm3 (P = 0.056).
This methodology allowed us to study the functional intersubject variability while limiting the risk of false negative results.
Distortion Correction and Anatomical Localization
At 3 T, the T2*-weighted EPI sequence yields some distortion relative to the T1-weighted anatomical images, which results, in the functional images, in a compression of the occipital lobe along the antero-posterior axis of the slices (i.e. in the phase encoding direction). This compression was estimated for each subject. For five subjects, the distortion was or could be corrected to <4 mm [using Jezzard and Balaban's correction algorithm (Jezzard and Balaban, 1995)]. For the other four subjects, the distortion was >4 mm (up to 13 mm along y and 7.5 mm along z) and could not be corrected. These later data were not used for anatomical localization in the occipital pole. Distortion of the posterior part of the brain was negligible in the 10 upper slices.
The approximate anatomical locations of activity foci for the group results were derived from their stereotactic coordinates (Talairach and Tournoux, 1988). To achieve a better precision in anatomical localization, the group analysis was repeated including only the five subjects with smallest distortion. Because no marked difference was observed between this analysis and the previous one, we present results obtained with all subjects included in the analysis.
Results
Activation Magnitude in Volume and Amplitude
Figure 2 indicates, for each subject and each contrast, the volume that corresponds to the number of suprathreshold voxels, denoted activity volume. The use of a lower threshold (P < 0.01 instead of P < 0.001) for individual analyses only slightly increased these volumes, indicating the robustness of the procedure. Figure 2
also indicates the median of the activity volume across subjects, and its value in the group analysis.
|
For other contrasts, results were more variable across subjects in terms of the activity volume rank. The general trend, however, is that the SP3DRM contrast yielded the strongest activity, while the RMSP3D contrast showed the weakest response.
Activation Localization
Figure 3 presents an overview of the results of the group analysis for the SP3DRM, EXRM and RMST contrasts, showing on the anatomical template the location of suprathreshold regions, for which Z scores are given in Table 1
. Figure 3C
shows the location of the large activity volume of the RMST contrast, while Figure 3A
and Figure 3B
highlight the similarities between the activity loci for contrasts SP3DRM and EXRM respectively.
|
|
|
Random Motion versus Static
The RMST contrast yielded similar results for all subjects, with activity volumes ranging between 3.5 and 39.5 cm3. This contrast operationally defines the regions, denoted motion areas in the following, involved in visual motion processing. These are the bilateral middle-temporal complex (V5+), the left SOG (lSOG; V3/V3A), the bilateral lingual and middle occipital gyri (V1/V2), and the ventral part of the occipito-temporal junction.
Bilateral Middle-temporal (V5+) Complex.
Using individual data, we located V5+ on both hemispheres for all subjects. A maximum deviation of ~20 mm was found on the location of this region in the Talairach coordinates, a figure consistent with Watson et al.'s findings (Watson et al., 1993). For subjects allowing precise anatomical localization, this region clearly followed the anterior occipital sulcus (ascending branch of the inferior temporal sulcus) at the lateral occipito-temporal junction.
Left Superior Occipital Gyrus (V3/V3A). The lSOG was found activated for seven subjects. This region was at the border between the dorsal cuneus (medial occipital gyrus) and the lateral occipital cortex, in the intra-occipital sulcus.
Bilateral Lingual and Middle Occipital Gyri (V1/V2). Individual analyses showed that the most occipital regions, as well as bilateral V5+ complex, were active for all subjects.
Ventral Part of the Occipito-temporal Junction.
This region was active for six subjects. The anatomical study of the ventral part of the brain is more difficult with a 3 T MR scanner, as distortion of T2* images can exceed the voxel size in this region. However, in subjects with anatomical images of negligible distortion, we could locate this region along the posterior part of the collateral sulcus, anterior to V1/V2 (see Figure 4, subject DC, z-coordinates = 9 and 3 mm).
|
EXRM and SP3DRM contrasts of the group analysis delineated regions in similar locations, with Z-scores ranging from 2.33 to 5.17 (Table 1). The occipito-temporal junction was only found for SP3DRM.
The Superior Occipital Gyrus (SOG).
Both group and individual analyses showed overlapping regions of activity in the dorsal SOG for RMST, EXRM and SP3DRM comparisons. The dorsal SOG was more active for SP3D compared with RM (Z = 5.32, pc = 0.002) and for EX compared with RM (Z = 4.17, pc = 0.219, uncorrected P < 104). Because this region was also clearly delineated as a motion area by the RMST contrast, it seems to be active for all motion stimuli, with enhanced activity for coherent motion as compared with random motion, this being significant for the SP3DRM contrast. In the group analysis, the activity was located mainly in the left hemisphere for RMST, but individual data failed to confirm a systematic left lateralization. The activity in the dorsal SOG for EXRM and SP3DRM was confirmed for five and six subjects respectively. It was usually located in the intra-occipital sulcus, reaching dorsally the parieto-occipital sulcus (e.g. Fig. 4, subjects MB and VH).
Although we could not differentiate V3 from V3A in our data, the anatomical and functional descriptions given by Tootell et al. (Tootell et al., 1997) suggest that this region might correspond to V3A, which presents a higher sensitivity to visual motion than V3.
The Parieto-occipital Junction (POJ).
In the group analysis, the parieto-occipital junction exhibited a higher BOLD signal consistently for coherent versus incoherent motion in both hemispheres (region POJ for EXRM and SP3DRM in Fig. 3A,B). As indicated in Table 1
, this activity was moderate (Zscore ranging between 4.05 and 4.47). In individual data (Table 2
), four subjects for EXRM and five subjects for SP3DRM presented suprathreshold activation in this region. For these subjects, the region was located anatomically at the junction between the superior parietal lobule and the SOG (see Fig. 4
, subject DC). It extends along the posterior part of the intraparietal sulcus and/or the intra-occipital sulcus, as the limit between those two sulci is usually impossible to delineate (Duvernoy, 1992
). Following Eidelberg and Galaburda (Eidelberg and Galaburda , 1984
) and Cheng et al. (Cheng et al., 1995
), the cortex in the intraparietal sulcus was assigned to the superior parietal lobule rather than to the inferior parietal lobule.
The Ventral Part of the Occipito-temporal Junction (OTJ).
In the ventral part of the temporo-occipital junction (region OTJ on Fig. 3A), the group analysis showed activity (Z = 4.48, pc = 0.071, uncorrected P < 105) for the SP3DRM contrast but no detectable activity for EXRM (Z < 2.33). Also, (SP3DRM) (EXRM) contrast was positive in this region (Z = 3.23, uncorrected P < 0.001).
From individual analyses, six of the nine subjects showed clear BOLD signal modulation for SP3DRM in the OTJ region. For our less distorted anatomical images, voxels activated in the SP3DRM contrast lie along the collateral sulcus, at the border between the fusiform and lingual gyri, close to the ventral part of the lingual motion areas as determined by the RMST contrast (see Fig. 4, subject DC, z-coordinates = 9 and 3 mm). When comparing EX to RM, only two subjects presented activity in this ventral region, confirming the negative result of the group analysis.
As the occipto-temporal junction had also shown very significant activity (Z = 5.42, pc = 0.002) for the RMST contrast, this region seems sensitive to the three motion patterns, possibly with higher activity for the SP3D stimulus.
The V5 Complex (V5+). From the group analysis, V5+ did not present any significant changes of signal in contrasts comparing motion stimuli (EXRM, SP3DRM or RMEX, RMSP3D). In individual analyses, EXRM and SP3DRM contrasts highlighted some activity in the vicinity of V5+ for three and four subjects respectively, but these regions were small, in terms of volumes and Z-score values, and located in various positions relative to V5+. For six of the nine subjects, the activity in V5+ itself tended to be slightly but non-significantly higher for the random motion pattern than for the coherent motion. Therefore, within the precision of our investigation, V5+ presented a similar BOLD activity level for all three motion patterns.
Opposite Contrast. The opposite contrasts between incoherent and coherent motion stimuli (RMEX and RMSP3D) yielded a locus of activity in the most posterior part of the right lingual gyrus. Such activity could be due to zero velocity in the centre of the image for coherent motions, possibly producing a differential activity in the part of V1/V2 corresponding to central vision (posterior part of the lingual gyrus). At the threshold used here, however, this effect remains small and can be seen only in a limited part of the retinotopic motion areas, not in the SOG nor in V5+.
![]() |
Control Experiments |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Materials and Methods
Four subjects (three female and one male, aged 2034 years) were scanned. Unless otherwise stated, the methods were similar to those used for the main experiments.
Image Acquisition
The voxel size of the functional images was 3.75 x 3.75 x 6 mm and each functional sequence consisted of 133 scans, with a repetition time of 2 s. These parameters are close to those used in the first series of experiments and should not introduce any bias in the interpretation of the results.
The distortion between functional and anatomical images could be corrected in all four subjects.
Visual Stimulation
We used six different stimuli, that presented the same global characteristics as the stimuli described for the main experiments. Their peculiarities are detailed as follows (Fig. 1B):
3-D stimuli: these stimuli were similar to SP3D except that (i) they represented a paraboloid, or a plane, rather than a spherical surface and (ii) the rotation axis was changed every 2 s, spanning eight directions during the 16 s block of presentation. Here we replaced the sphere by the paraboloid in order to simplify the programmation of stimuli and make possible the design of the complex control stimuli hereafter. Care was taken that our paraboloid represented a close approximation of the sphere within our stimulus window, and yielded a similar percept of a curved surface rotating in depth.
2-D pseudo-random stimuli (PRM): simultaneous presentation of eight surfaces rotating in different directions. These stimuli were similar to the 3-D stimuli, except that the dots were divided in eight sets which were all affected a given motion direction and sign (for each oscillation axis, as represented in Fig. 1B, two rotation directions of opposite sign can be defined). Hence all 3-D movements of the 3-D stimulus were presented simultaneously, as if eight transparent surfaces were rotating in depth around different rotation axes. Because of this large number of surfaces, these stimuli appeared as completely devoid of any 3-D shape information. Each of the 3-D stimuli, paraboloid and plane, had its corresponding 2-D pseudo-random stimulus obtained with the same underlying surface shape. Also, in order to maintain the same frequency of dot speed variation as in the 3-D stimuli, we changed the motion direction of the eight oscillating surfaces every 2 s.
2-D coherent stimuli: dots moved either in expansion/contraction (as in EX) or in clockwise/counterclockwise rotation (ROT) about an axis perpendicular to the screen plane and passing through the centre of the viewing area. Both movements yielded little or no information about the underlying 3-D shape.
As in the main experiments, all stimuli had the same average dot speed and frequency of variation of that speed. By construction, the 3-D stimuli and 2-D pseudo-random stimuli had the same speed distribution over one block.
Design
Stimuli were displayed in a pseudo-random order, following a block design, each block during 16 s (eight TR). In both experiments, each stimulus was presented four times. Experiments were repeated twice for each subject.
Specific analysis
A group analysis was performed on the data of the four subjects. The aims were, first, to confirm that the activity found in (SP3DRM) and (EXRM) contrasts was not merely due to differences in speed distribution, and second, to test whether areas found in (SP3DRM) were sensitive to the presence of coherent motion, curvature or 3-D structure information. Hence, SPM{Z} maps were generated to compare directly:
Investigations were bounded in boxes of 15 x 15 x 15 mm around the coordinates of maximum activity found in the (SP3DRM) contrast.
Results
Table 3 presents the results of the group analysis in the areas highlighted by the SP3D versus RM experiment. Although only the left OTJ was significantly activated in the (SP3DRM) contrast, we also explored the right OTJ, since this area might have suffered from distortions in the main experiments. Results are corrected for multiple comparison in these volumes.
|
The contrast between the paraboloid and the oscillating plane showed that most of the areas found with the SP3DRM contrast were actually specifically sensitive to curvature. In particular, the left and right OTJ presented a strong difference of BOLD signal when comparing the paraboloid to the plane (left: Z = 4.42, pc = 0.001; and right: Z = 4.49, pc < 0.001). Hence, we can interpret the activity of the OTJ in the (SP3DRM) as the OTJ being sensitive to the 3-D structure of the SP3D stimulus.
It should be noted that this sensitivity to curvature does not explain the activity of the SOG and POJ in the conjunction of both (planePRM) and (paraboloidPRM) contrasts.
Finally, the contrast between 3-D stimuli and coherent motion stimuli showed that all areas found in the SP3DRM contrast were on average more sensitive to 3-D stimuli than to other coherent motion stimuli (expansion and in-plane rotation). For the SOG, right POJ and OTJ, this can be explained partly by their sensitivity to the paraboloid curvature. However, this is not likely to be so for the left POJ, which was not found sensitive to the structure of the 3-D stimulus.
Summary
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
From both the experimental design and the activation results, it is unlikely that conclusions of this study are affected by eye movements. In the experimental protocol, subjects were instructed to carefully fixate the central fixation cross, and the sum of velocity vectors around the fixation cross was designed to be exactly zero for all our stimuli. [In the random motion stimulus, the direction of dot motion is randomized; in the expansion/contraction stimulus, velocities on each side of the image center are of similar magnitude but of opposite direction; in the sphere stimulus, the velocity pattern is that of an expansion/contraction pattern in the direction orthogonal to the rotation axis. As a result, the vector sum of dot velocities is null on average for all motion stimuli.] As for the data analyses, no activity was found in the prefrontal and frontal cortex, where areas involved in the generation of saccades are located (Kandel et al., 1991). Smooth pursuit or optokinetic nystagmus (OKN) may have differed across stimuli. However, if this were the case we would then expect a corresponding variation of the amount of corrective saccades to the fixation centre. As stated above, such a variation is not likely to have occurred in a systematic way in our experiments. Finally, we cannot rule out the possibility that the suppressive mechanisms associated with OKN and smooth pursuit might play a role here, particularly as our dorsal regions of activity are located near the posterior parietal regions known to be involved in these oculomotor activities.
Possible Influence of the Subject Attention on the Results
Passive vision might be the cause of the intersubject variability observed here and in previous studies of coherent motion processing (de Jong et al., 1994). This is suggested by an increasing number of cortical imagery experiments demonstrating the modulation of activity in cortical regions as a function of attention. More specifically, results in monkey studies as well as in human imagery work (Corbetta et al., 1991
) show that the neuronal activity in different visual cortical areas can be strongly dependent on the attended visual attribute. More precisely, in V5+ the level of activity is known to depend on spatial and featural attention (Beauchamp et al., 1997
; O'Craven et al., 1997
) but also on the attentional load (Rees et al., 1997
). Therefore, part of the interindividual variability observed here could be due to variations in the attentional state of our subjects and also to the attendent attribute (e.g. surface curvature, 3-D movement).
Given that the parietal lobe plays an important role in spatial attention and spatial representation [for a review see (Andersen, 1987)], the dorsal occipito-parietal activities detected by the present study for coherent versus incoherent motion comparisons might be due to attentional variations occurring at the onset of each coherent motion field. However, these activities are located systematically below the parietal foci highlighted by spatial attention shifts in Corbetta et al.'s study (Corbetta et al., 1995
).
Interestingly, the parieto-occipital cortex seems to play a prominent role in visual feature binding (Friedman-Hill et al., 1995). It may not be surprising, therefore, that this region is involved in the coding of coherent motion, which requires a spatial grouping of local motion signals.
Motion-sensitive Areas: Comparison with Previous Results
The results of our RM versus ST experiment are in close agreement with previous studies. In particular, the coordinates of V5+ are similar to what has been found in other functional magnetic resonance imaging (fMRI) or PET studies [for a review see (McCarthy et al., 1995); and see Figure 5
). Also, the other motion areas (V1/V2 and the SOG) delineated by the contrast between random motion and stationary dots correspond to regions found by others (Dupont et al., 1994
; McKeefry et al., 1997
).
|
We compared our results with three cortical imaging studies on coherent motion (de Jong et al., 1994; Cheng et al., 1995
; McKeefry et al., 1997
) and summarize the results in Table 4
. de Jong et al. used a coherent stimulus that consisted in an expansion flow of dots located on a horizontal plane (heading flow) which was alternated with an incoherent flow, obtained by randomizing the directions of dot velocities. Cheng et al. and McKeefry et al. used an array of dots moving coherently in one direction compared with dots moving each in a randomly selected direction. The main results are as follows.
|
3-D Structure from Motion and Motion Coherence: the Dorsal Pathway
The similarity of patterns between contrasts EXRM and SP3DRM shows that, despite the difference in their depth information contents, EX and SP3D stimuli are probably processed by similar cortical areas.
These areas, however, as detailed in the second Results section, present a gradation of activity depending on the type of motion pattern. Overall, in the dorsal areas (POJ and SOG), curved surfaces tend to elicit more activity than planes or 2-D coherent motion, which in turn elicit more activity than random motion.
As stated in the Introduction, the rotating surfaces and 2-D coherent stimuli (EX or ROT) may all induce a percept of a 3-D shape, but the percept is theoretically and psychophysically stronger for a curved surface rotating in depth. Therefore the results seem to demonstrate, in the two dorsal regions, a parallel between the BOLD signal and the perceptual strength of the stimulus in terms of 3-D shape.
Hence, although these regions do not show a clear-cut response to motion parallax or specific activity for one stimulus only, their graded activity makes them good candidates to participate in the processing of 3-D structure from motion.
3-D Structure from Motion: the Ventral Pathway
In contrast to the dorsal areas, the ventral part of the OTJ, as highlighted by the contrast SP3DRM, does not appear to be sensitive to coherent motion in general, but only to the rotation in depth of a curved surface. Indeed, in this area the curved surfaces yield a higher signal than all other coherent and random patterns of motion, which are at similar levels. Because of its location and specific activity, the role of the OTJ is likely to be different from dorsal areas.
The OTJ region is close to the latero-occipital region (LO) reported by Malach et al. (Malach et al., 1995) as being active when objects rather than random contrast distributions are presented. It should be noted, however, that the LO region is located on the lateral aspect of the fusiform gyrus, at coordinates (43 73 18), whereas the OTJ was found on its medial aspect, adjoining the lingual gyrus. In a PET study, Martin et al. (Martin et al., 1996
) confirmed that activity can be evoked in the fusiform gyrus by presenting either meaningful or nonsense 3-D objects, suggesting that part of this activity does not depend on stimulus semantic content. In their study, two main locations were found, one posterior and one anterior to the OTJ. Complementary to this, Schacter et al. (Schacter et al., 1995
) have proposed that inferior temporal and fusiform gyri activity could be specifically associated with manipulating structurally coherent 3-D objects but not with manipulating 2-D shapes.
The fact that the OTJ region is also sensitive to random motion, as compared with static dots, is not incompatible with the literature, as some authors found direction selective cells in area V4 of the monkey (Cheng et al., 1994). Moreover, Van Essen and Drury (Van Essen and Drury, 1997
) describe a region in the ventral part of the posterior OTJ (near areas VP and V4v) that would be involved in both motion and form processing. From anatomical and functional criteria, this area could correspond well to OTJ.
On the one hand, activation of the ventral pathway is commonly associated with experiments involving the perception of the structure of static objects. On the other hand, our results show that the OTJ is sensitive to the structure of a 3-D object only defined from motion. The conjunction of these results supports the involvement of the ventral visual pathway in the coding of the 3-D shape independently of depth cues (static or dynamic).
![]() |
Conclusion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Moreover, we propose that the lSOG, which shows a graded sensitivity to random motion, coherent motion and 3-D stimuli, is a locus of motion parallax processing. The POJ, which is not found to be sensitive to random motion but is activated by coherent motion and object curvature, might be involved in coding object shape and motion at a higher level. As for the ventral part of the OTJ, it appears to be involved in object complex structure reconstruction either from motion cues, as suggested by our results, or static cues, as suggested by other studies.
Finally, our results support the existence of a coding of 3-D shape from motion in both the dorsal and ventral visual pathways. Therefore, whether the coding of 3-D shape from motion is mediated by the projections of dorsal areas sensitive to motion coherence onto the ventral visual pathway becomes a critical question for exploring the mechanisms of perception of 3-D space and object shape.
![]() |
Notes |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Address correspondence to A.-L. Paradis, LPPA, Collège de France, 11 place Marcelin Berthelot, F-75005 Paris, France. Email: paradis{at}shfj.cea.fr.
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Beauchamp MS, Cox RW, DeYoe EA (1997) Graded effects of spatial and featural attention on human area MT, and associated motion processing areas. J Neurophysiol 78:516520.
Bradley DC, Qian N, Andersen RA (1995) Integration of motion and stereopsis in middle temporal cortical area of macaques. Nature 373:609611.[ISI][Medline]
Bradley DC, Chang GC, Andersen RA (1998) Encoding of three dimensional structure-from-motion by primate are MT neurons. Nature 392:714716.[ISI][Medline]
Braunstein ML, Andersen GJ (1984) Shape and depth perception from parallel projections of three-dimensional motion. J Exp Psychol Hum Percept Perform 10:749760[ISI][Medline]
Bremmer F, Duhamel JR, Ben Hamed S, Graf W (1997) The representation of movement in near extra-personal space in the macaque ventral intraparietal area (VIP). In: Parietal lobe contribution to orientation in 3D space (Thier P, Karnath H-O, eds), pp. 620630. Heidelberg: Springer Verlag.
Cheng A, Fujita H, Kanno I, Miura S, Tanaka K (1995) Human cortical regions activated by wide-field visual motion: an H2O15 PET study. J Neurophysiol 74:413427.
Cheng K, Hasegawa T, Saleem KS, Tanaka K (1994) Comparison of neuronal selectivity for stimulus speed, length, and contrast in the prestriate visual cortical areas V4 and M of the macaque monkey. J Neurophysiol 71:22692280.
Corbetta M, Miezin FM, Shulman GL, Petersen SE (1991) Selective and divided attention during visual discriminations of shape, color, and speed: functional anatomy by positron emission tomography. J Neurosci 11:23832402.[Abstract]
Corbetta M, Shulman GL, Miezin FM, Petersen SE (1995) Superior parietal cortex activation during spatial attention shifts and visual feature conjunction. Science 270:805.[Abstract]
Cornilleau-Pérès V, Droulez J (1993) Stereo-motion cooperation and the use of motion disparity in the visual perception of 3D structure. Percept Psychophys 54:223239.[ISI][Medline]
Cornilleau-Pérès V, Droulez J (1994) The visual perception of 3D shape from self-motion and object-motion. Vis Res 34:23312336.[ISI][Medline]
Cowey A, Porter J (1979) Brain damage of global stereopsis. Proc Roy Soc Lond B 4:399407.
de Jong BM, Shipp S, Skidmore B, Frackowiack RSJ, Zeki S (1994) The cerebral activity related to the visual perception of forward motion in depth. Brain 117:10391054.[Abstract]
Duffy CJ, Wurtz RH (1991) Sensitivity of MST neurons to optic flow stimuli. I. A continuum of response selectivity to large-field stimuli. J Neurophysiol 65:13291345.
Dupont P, Orban GA, De Bruyn B, Verbruggen A, Mortelmans L (1994) Many areas in the visual system respond to visual motion. J Neurophysiol 72:14201424.
Duvernoy HM (1992) Le cerveau humain. Surfaces, coupes sériées tridimensionnelles et IRM. Paris: Springer Verlag.
Eidelberg D, Galaburda AM (1984) Inferior parietal lobule: divergent architectonic asymmetries in the human brain. Arch Neurol 41: 843852.[Abstract]
Faillenot I, Toni I, Decety J, Grégoire M-C, Jeannerod M (1997) Visual pathways for object-oriented action and object recognition: functional anatomy with PET. Cereb Cortex 7:7785.[Abstract]
Friedman-Hill SR, Robertson LC, Treisman A (1995) Parietal contributions to visual feature binding: evidence from a patient with bilateral lesions. Science 269:853855.[ISI][Medline]
Friston KJ, Holmes AP, Poline J-B, Grasby BJ, Williams CR, Frith CD, Frackowiak RSJ, Turner R (1995) Analysis of MRI time series revisited. NeuroImage 2:4553.[ISI][Medline]
Goodale MA, Milner DA (1992) Separate visual pathways for perception and action. Trends Neurosci 15:2025.[ISI][Medline]
Jezzard P, Balaban RS (1995) Correction for geometric distortion in echo planar images from B0 field variations. Magn Reson Med 34:6573.[ISI][Medline]
Kandel ER, Schwartz JH, Jessell TM (1991) Principles of neural science, 3rd edn. Appleton & Lange.
Koenderink JJ, van Doorn AJ (1975) Invariant properties of the motion parallax field due to the movement of rigid bodies relative to an observer. Opt Acta 22:773791.[ISI]
Longuet-Higgins HC, Prazdny K (1980) The interpretation of a moving retinal image. Proc Roy Soc Lond B 208:385397.[ISI][Medline]
Malach R, Reppas JB, Benson RR, Kwong KK, Jiang H, Kennedy WA, Ledden PJ, Brady TJ, Rosen BR, Tootell RBH (1995) Object-related activity revealed by functional magnetic resonance imaging in human occipital cortex. Proc Natl Acad Sci USA 92:81358139.[Abstract]
Martin A, Wiggs CL, Ungerleider LG, Haxby JV (1996) Neural correlates of category-specific knowledge. Nature 379:649652.[ISI][Medline]
Maunsell JH, Van Essen DC (1983) Functional properties of neurons in middle temporal visual area of the macaque monkey. II. Binocular interactions and sensitivity to binocular disparity. J Neurophysiol 49:11481167.
McCarthy G, Spicer M, Adrignolo A, Luby M, Gore J, Truett A (1995) Brain activation associated with visual motion studied by functional magnetic resonance imaging in humans. Hum Brain Map 2:234243.
McKeefry DJ, Watson JDG, Frackowiack RSJ, Fong K, Zeki S (1997) The activity in human area V1/V2, V3 and V5 during the perception of coherent and incoherent motion. NeuroImage 5:112.[ISI][Medline]
Murata A, Gallese V, Kaseda M, Kunimoto S, Sakata H (1993) Hand manipulation-related neurons of the parietal cortex of the monkey: further analysis of selectivity in shape, size and orientation of objects for manipulation. Jpn J Physiol 43(Suppl. 2):S251.
Newsome WT, Britten KH, Movshon JA (1989) Neuronal correlates of a perceptual decision. Nature 341:5254.[ISI][Medline]
Norman JF, Lappin JS (1992) The detection of surface curvatures defined by optical motion. Percept Psychophys 51:386396.[ISI][Medline]
O'Craven KM, Rosen BR, Kwong KK, Treisman A, Savoy RL (1997) Voluntary attention modulates fMRI activity in human MT-MST. Neuron 18:591598.[ISI][Medline]
Orban GA, Lagae L, Verri A, Raiguel S, Xiao D, Maes H, Torre V (1992) First-order analysis of optic flow in monkey brain. Proc Natl Acad Sci USA 89:25952599.[Abstract]
Qian N, Andersen RA (1994) Transparent motion perception as detection of unbalanced motion signals. II. Physiology. J Neurosci 14: 73677380.[Abstract]
Rees G, Frith CD, Lavie N (1997) Modulating irrelevant motion perception by varying attentional load in an unrelated task. Science 278:16161619.
Regan D, Beverley KI (1979) Binocular and monocular stimuli for motion in depth: changing-disparity and changing-size feed the same motion in-depth-stage. Vision Res 19:13311342.[ISI][Medline]
Rogers BJ, Graham M (1979) Motion parallax as an independent cue for depth perception. Perception 8:125134.[ISI][Medline]
Rogers BJ, Graham ME (1982) Similarities between motion parallax and stereopsis in human depth perception. Vision Res 22:261270.[ISI][Medline]
Roy JP, Komatsu H, Wurtz RH (1992) Disparity sensitivity of neurons in monkey extrastriate area MST. J Neurosci 12:24782492.[Abstract]
Saito HA, Yukie M, Tanaka K, Hikosaka K, Fukada Y, Iwai E (1986) Integration of direction signals of image motion in the superior temporal sulcus of the macaque monkey. J Neurosci 6:145157.[Abstract]
Schacter DL, Reiman E, Uecker A, Polster MR, Yun LS, Cooper LA (1995) Brain regions associated with retrieval of coherent visual information. Nature 376:587590.[ISI][Medline]
Schiller PH (1993) The effects of V4 and middle temporal (MT) area lesions on visual performance in the rhesus monkey. Vis Neurosci 10:717746.[ISI][Medline]
Snowden RJ, Treue S, Erickson RG, Andersen RA (1991) The response of area MT and V1 neurons to transparent motion. J Neurosci 11: 27682785.[Abstract]
Stoner GR, Albright TD (1992) Neural correlates of perceptual motion coherence. Nature 358:412414.[ISI][Medline]
Talairach J, Tournoux P (1988) Co-planar stereotaxic atlas of the human brain. New York: Thieme.
Tootell RBH, Reppas JB, Kwong KK, Malach R, Born RT,Brady TJ, Rosen BR, Belliveau JW (1995) Functional analysis of human MT and related visual cortical areas using magnetic resonance imaging. J Neurosci 15:32153230.[Abstract]
Tootell RBH, Mendola JD, Hadjikhani NK, Ledden PJ, Liu AK, Reppas JB, Sereno MI, Dale AM (1997) Functional analysis of V3A and related areas in human visual cortex. J Neurosci 17:70607078.
Ullman S (1979) The interpretation of structure from motion. Proc Roy Soc Lond B 203:405426.[ISI][Medline]
Ungerleider LG, Haxby JG (1994) What and where in the human brain. Curr Opin Neurobiol 4:157165.[Medline]
Ungerleider LG, Mishkin M (1982) Two cortical visual systems. In: Analysis of visual behavior (Ingle DJ, Goodale MA, Mansfield RJW, eds), pp. 549586. Cambridge, MA: MIT Press.
Van Essen DC, Drury HA (1997) Structural and functional analyses of human cerebral cortex using a surface-based atlas. J Neurosci 17:7079102.
Watson JDG, Myers F, Frackowiack RSJ, Hajnal JV, Woods RP, Mazziotta JC, Shipp S, Zeki S (1993) Area V5 of the human brain: evidence from a combined study using positron emission tomography and magnetic resonance imaging. Cereb Cortex 3:7994.
Wurtz RH, Yamasaki DS, Duffy CJ, Roy JP (1990) Cold Spring Harbor Symposia on quantitative biology. Vol LV. Functional specialization for visual motion processing in primate cerebral cortex. Cols Spring Harbor, NJ: Cold Spring Harbor Laboratory Press.
Xiao DK, Marcar VL, Raiguel SE, Orban GA (1997) Selectivity of macaque MT/V5 neurons for surface orientation in depth specified by motion. Eur J Neurosci 9:956964.[ISI][Medline]
Zeki S, Watson JDG, Lueck CJ, Friston KJ, Kennard C, Frackowiack RSJ (1991) A direct demonstration of functionnal specialization in human visual cortex. J Neurosci 11:641649.[Abstract]
Zeki S, Watson JDG, Frackowiack RSJ (1993) Going beyond the information given: the relation of illusory visual motion to brain activity. Proc Roy Soc Lond B 252:215222.[ISI][Medline]