Electrophysiological Studies of Human Face Perception. III: Effects of Top-down Processing on Face-specific Potentials

Aina Puce1, Truett Allison and Gregory McCarthy2

Neuropsychology Laboratory, VA Medical Center, West Haven, CT 06516 and Departments of Neurosurgery and Neurology, Yale University School of Medicine, New Haven, CT 06510, USA


    Abstract
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
This is the last in a series of papers dealing with intracranial event-related potential (ERP) correlates of face perception. Here we describe the results of manipulations that may exert top-down influences on face recognition and face-specific ERPs, and the effects of cortical stimulation at face-specific sites. Ventral facespecific N200 was not evoked by affective stimuli; showed little or no habituation; was not affected by the familiarity or unfamiliarity of faces; showed no semantic priming; and was not affected by face-name learning or identification. P290 and N700 were affected by semantic priming and by face-name learning and identification. The early fraction of N700 and face-specific P350 exhibited significant habituation. About half of the AP350 sites exhibited semantic priming, whereas the VP350 and LP350 sites did not. Cortical stimulation evoked a transient inability to name familiar faces or evoked face-related hallucinations at two-thirds of facespecific N200 sites. These results are discussed in relation to human behavioral studies and monkey single-cell recordings. Discussion of results of all three papers concludes that: face-specific N200 reflects the operation of a module specialized for the perception of human faces; ventral and lateral occipitotemporal cortex are composed of a complex mosaic of functionally discrete patches of cortex of variable number, size and location; in ventral cortex there is a posterior-to-anterior trend in the location of patches in the order letter-strings, form, hands, objects, faces and face parts; P290 and N700 at face-specific N200 sites, and face-specific P350, are subject to top-down influences.


    Introduction
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
In the psychology of perception and in computer vision the concepts of `bottom-up' and `top-down' processing are commonly used. The distinction is that bottom-up processes are involved in the analysis of the incoming image, while top-down processes originate with stored models and information associated with them (Ullman, 1996Go). Neuropsychological and neuroimaging studies suggest that object recognition involves top-down modulation of earlier bottom-up visual processes (Humphreys et al., 1997Go). In previous papers of this series we described event-related potentials (ERPs) evoked by faces and other visual stimuli, and described the responsiveness of these ERPs to processes that were thought to be driven mainly by the perceptual properties of the stimuli (Allison et al., 1999Go; McCarthy et al., 1999Go).

In this paper we describe experiments designed to test the responsiveness of face-specific ERPs to manipulations that are regarded as top-down processes. Such processing engages the subject's prior knowledge, sets up expectancies or contexts, and imposes modifications and constraints on the manner in which neuronal computations are performed on incoming stimuli. We tested the responsiveness of face-specific ERPs to affective stimuli, habituation, familiar and unfamiliar faces, semantic priming, and face-name learning and identification. In some patients the effects of cortical stimulation on recognition of familiar faces were determined. As in the previous papers we will provide a rationale for each experiment in the Results section. A preliminary report of some of this work has appeared (Puce et al., 1997Go).


    Materials and Methods
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
General methods were described previously (Allison et al., 1999Go). The experiments of this study used stimuli of standard size and luminance as described previously, except that erotic, aversive and neutral stimuli had a luminance of 31 ± 4 cd/m2 and subtended 15.2° x 15.2° of visual angle. Additional details will be provided in the description of each experiment. Unless stated otherwise all faces were unfamiliar.


    Results
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
Are Face ERPs Due to Emotional Arousal?

In recordings of face-specific cells in monkeys the question arose whether faces provoke arousal or emotional reactions that might have evoked the increased spike discharge. Several studies found that this was not the case. Cells that responded well to faces responded minimally to various types of aversive auditory and tactile stimuli, and to aversive visual stimuli (e.g. a snake) that would be expected to evoke an emotional response (Perrett et al., 1982Go; Desimone et al., 1984Go; Leonard et al., 1985Go; Brothers and Ring, 1993Go). To evaluate this question in humans, patients viewed faces, and erotic (attractive semi-nude males), aversive (violent events) and neutral (landscapes) images. Because most highly aversive images involve human activity, 74% of the aversive stimuli contained faces or bodies (e.g. people jumping out of a burning building), and 90% of the erotic stimuli contained faces, but in both cases faces were secondary to the affect provoked by the images.

There were nine ventral face-specific N200 sites, five in the right and four in the left hemisphere. Representative recordings are shown in Figure 1A,BGo. Results for the right and left hemisphere were similar and are combined in Figure 1CEGo. The overall ANOVA for N200 amplitude was significant [F(df 3,24) = 24.8, P < 0.0001]. N200 amplitude to faces was significantly larger than to the erotic or aversive images (P < 0.008 in each case), and was significantly larger to erotic than to aversive images (P < 0.001). The overall ANOVA for N700 area under the curve (AUC) was significant (P < 0.05). N700 was larger to faces than to the other categories of stimuli. Thus N200 amplitude and N700 AUC were a function of the probability of faces in the stimulus set, not a function of the emotional valence of the stimuli. With one exception all face-specific N200 sites were recorded in females, hence we could not assess possible sex differences in ERP responsiveness to the erotic and aversive stimuli.



View larger version (36K):
[in this window]
[in a new window]
 
Figure 1.  ERPs at ventral face-specific N200 sites to faces and to erotic, aversive and neutral images. (A,B) Examples of recordings. (CE) Summaries of amplitude, latency and N700 AUC.

 
Habituation

Habituation, defined as a progressive decrease in response to repeated presentation of the same stimulus, is perhaps the most elementary and ubiquitous form of neuronal and behavioral plasticity (Groves and Thompson, 1970Go). Miller et al. (Miller et al., 1991Go) showed monkeys repeated presentations of common objects at 2 s intervals and found considerable habituation of most STS/IT cortex cells. Habituation has not been studied in face-specific cells, but Rolls et al. (Rolls et al., 1989Go) found that the response of face-specific cells to novel faces tended to decrease over the first few presentations as the face became familiar. To assess habituation of face-specific ERPs we used a design similar to that of Miller et al. (Miller et al., 1991Go). Patients viewed the same novel face presented eight times at intervals of 2 s, followed by eight presentations of a new face, and so on for a total of 40 sets of faces. No target stimuli were interspersed among the face stimuli.

There were 28 ventral face-specific N200 sites, 17 in the right and 11 in the left hemisphere. Results for the right and left hemisphere were similar and are combined in Figure 2Go. The overall ANOVA for N200 amplitude was significant [F(df 7,189) = 2.68, P < 0.01]. This effect was partly due to the significant (P < 0.003) decrease in N200 amplitude from the first to the second presentation of a face. When trial 2 was removed from analysis the ANOVA was marginally significant (P < 0.04), but there was no progressive decrease in N200 amplitude from the second to the eighth presentation of a face. Thus the evidence for decrement of N200 amplitude was equivocal. The overall ANOVA was not significant for P150 amplitude. The overall ANOVA for P290 amplitude was significant [F(df 7,189) = 6.19, P < 0.0001]. This effect was due mainly to the significant (P < 0.005) increase in P290 amplitude from the first to the second presentation of a face; there was no progressive increase in amplitude from the second to the eighth presentation of a face. These offsetting changes can be seen in a plot of the peak-topeak amplitude of N200 and P290, which shows a small increase in amplitude after the second presentation of a face (Fig. 2Go). The overall ANOVA for N200–P290 peak-to-peak amplitude was not significant. The overall ANOVA for N700 AUC was not significant across all face-specific N200 sites. However, analysis of the subset of sites that had a face-specific N700 revealed that the overall ANOVA for the early fraction of N700 AUC was significant [F(df 7,56) = 2.48, P < 0.03], and this activity showed considerable decrement (Fig. 2Go). The overall ANOVA for the late fraction was not significant. There were only three lateral face-specific N200 sites; no changes in P150, N200, P290 or N700 were seen.



View larger version (31K):
[in this window]
[in a new window]
 
Figure 2.  ERPs to eight presentations of the same face. N700 (E) denotes the early fraction of N700. Plots are normalized to the amplitude evoked by the first face.

 
There were eight face-specific VP350 and LP350 sites, four in the right and four in the left hemisphere. Results for the right and left hemisphere were similar and are combined in Figure 2Go. The overall ANOVA for VP350 and LP350 was significant [F(df 7,49) = 14.5, P < 0.0001]. There was a progressive decrease in AUC from the first to the eighth repetition of a face. There were seven face-specific AP350 sites, all in the right hemisphere. The overall ANOVA for AP350 AUC was significant [F(df 6,49) = 2.37, P < 0.04]. There was a progressive decrease in AP350 from the first to the eighth repetition of a face (Fig. 2Go).

In summary, ventral and lateral N200s exhibited a slight non-progressive response decrement, P290 exhibited a nonprogressive response increment, and P350 and the early fraction of N700 exhibited substantial progressive response decrements in response to repeated presentation of the same face.

Familiar and Unfamiliar Faces

In the steps leading to recognition of a familiar face it is likely that an earlier stage of face perception (`this is a face') is followed by a stage of recognition (`this is Ronald Reagan's face'). Most prosopagnosics recognize that they are viewing a face, and can identify the eyes and other face parts, but the parts do not `add up' to a recognizable face. Thus in the model of Bruce and Young (Bruce and Young, 1986Go) an initial `structural encoding' stage is followed by `face recognition unit' and `person identity node' stages. Because face-specific N200 is the first reliable sign of face-specific processing, we surmised that it reflects activity related to the structural encoding stage (Allison et al., 1994aGo). If this assumption is correct we would expect that familiarity or unfamiliarity of the face would not affect face-specific N200 amplitude or latency, but might affect later processing at N200 sites (e.g. P290) or at P350 sites. We tested this hypothesis by using a randomized set of 60 faces of famous persons (primarily politicians, movie and television stars) and 60 unfamiliar faces obtained from modeling agency books. The two sets of faces were matched for sex and luminance, and approximately matched for age and attractiveness. In the first run of this experiment human faces were task irrelevant and dog faces were targets.

There were 15 ventral face-specific N200 sites, 7 in the right and 8 in the left hemisphere. Representative recordings are shown in Figure 3A,BGo. Results for the right and left hemisphere were similar and are combined in Figure 3C,DGo. P150, N200, P290 and N700 showed no significant differences in amplitude, latency or AUC between familiar and unfamiliar faces. Familiar and unfamiliar faces did not evoke significantly different lateral face-specific N200s, or lateral or ventral P350s. There were nine face-specific AP350 sites, all in the right hemisphere. AP350 AUC was not significantly different for familiar and unfamiliar faces.



View larger version (39K):
[in this window]
[in a new window]
 
Figure 3.  ERPs at ventral face-specific N200 sites to familiar (e.g. Ronald Reagan) and unfamiliar faces. (A,B) Examples of recordings. (C,D) Summaries of amplitude and latency.

 
To verify that the familiar faces were recognized, this experiment was run a second time, immediately after the first run. The patient pressed one button to indicate a familiar (famous) face, and another to indicate an unfamiliar face; dog faces were not task relevant. None of the ERPs reviewed above were significantly different to familiar and unfamiliar faces, thus they were unaffected by face familiarity whether the human faces were task irrelevant or task relevant.

Semantic Priming

Semantic priming refers to the fact that recognition of a word or object of a particular category (e.g. animals) is better and faster when preceded by a stimulus of the same category (e.g. cat preceded by dog) than when preceded by a stimulus of a different category (e.g. cat preceded by pencil). Behavioral studies of face priming have been carried out to infer the processes involved in face recognition, or more generally to study implicit memory processes [reviewed by Young and Bruce (Young and Bruce, 1991Go), Bruce and Humphreys (Bruce and Humphreys, 1994Go) and Schacter and Buckner (Schacter and Buckner, 1998Go)]. Semantic priming is defined electrophysiologically as a decrement in ERP amplitude produced by a preceding associatively related stimulus (Nobre et al., 1994Go).

To assess the effects of semantic priming on face-specific ERPs, patients viewed pairs of stimuli consisting of the name of a famous person followed by a picture of a famous person. On a random half of the trials, the name matched the picture (e.g. the name Albert Einstein preceded a picture of Albert Einstein's face; Fig. 4Go) and thus the face was primed. On the remaining trials, the name did not match the face (e.g. the name Princess Diana preceded a picture of Albert Einstein's face) and thus the face was unprimed. Later testing verified that the famous faces and names were familiar to the patient.



View larger version (31K):
[in this window]
[in a new window]
 
Figure 4.  ERPs at ventral face-specific N200 sites in a semantic priming experiment. (A,B) Examples of recordings. (CF) Summaries of amplitude, latency, the early fraction of N700 AUC and the late fraction of N700 AUC.

 
There were 25 ventral face-specific N200 sites, 13 in the right and 12 in the left hemisphere. Representative recordings are shown in Figure 4A,BGo. Results for the right and left hemisphere were similar and are combined in Figure 4CFGo. (ERPs evoked by the names were also recorded as shown in Fig. 4A,BGo, but these stimuli generated negligible N200s at face-specific sites.) The overall ANOVA for N200 amplitude was significant [F(df 3,66) = 54.2, P < 0.0001] due to the negligible N200 to names, but N200 amplitude was not significantly different to primed and unprimed faces. The overall ANOVA for P290 amplitude was significant (P < 0.05), and P290 showed significant (P < 0.01) priming. N700 often consisted of early and late fractions (Allison et al., 1999Go, Fig. 5Go). To determine whether priming might affect them differentially we calculated AUCs in the latency range of the early (450–750 ms) and late (751–924 ms) fractions. The early fraction did not show a priming effect (Fig. 4EGo), but there was marginally significant (P < 0.05) priming of the late fraction (Fig. 4FGo).



View larger version (48K):
[in this window]
[in a new window]
 
Figure 5.  AP350 in a semantic priming experiment. (A,B) Examples of recordings. (C) AP350 AUC.

 
No significant priming of N200, P290 and N700 was seen at lateral face-specific N200 sites, nor was there significant priming of VP350 and LP350. There were eight face-specific AP350 sites, all in the right hemisphere. Representative recordings are shown in Figure 5A,BGo, and results are summarized in Figure 5CGo. Priming was seen at five sites (e.g. Fig. 5AGo) but not at the other three (e.g. Fig. 5BGo). When tested across all eight face-specific sites, the 22% reduction of the AP350 AUC by priming was not significant.

To summarize, N200, the early fraction of N700, VP350 and LP350 showed no evidence of semantic priming, but there was priming of P290, the late fraction of N700 and AP350 at some sites.

Face Identification

How do previously unfamiliar faces become familiar? Some template or mental representation of a new face must be stored and compared with later instances of the same face. As noted above, the model of Bruce and Young (Bruce and Young, 1986Go) posits a structural encoding stage, followed by activation of face recognition units, which in turn activate the person identity node. The model of Damasio et al. (Damasio et al., 1982Go) also proposes a three-stage mechanism beginning with a stage of template formation, followed by a stage of template matching, which provides the link to an activational stage `unlocked' by the matched template. These models have in common an early perceptual stage, indifferent to face familiarity, and later stages that determine familiarity. The simplest prediction would be that N200, which may reflect a structural encoding or template formation stage of face processing, would be insensitive to later identification of a face, whereas later face-specific ERPs might be. A paired-associate experiment consisting of three stages was run to test this prediction: (i) a learning stage in which 10 unfamiliar faces were paired with common names, (ii) a distractor stage in which the patient categorized 10 new faces as male or female by pressing one of two buttons, and (iii) an identification stage in which the subject reviewed the 10 faces learned in the first stage and indicated by button press whether the paired name was correct. A name–face pair had a 50% chance of being the original pairing.

There were 15 face-specific N200 sites, 8 in the right and 7 in the left hemisphere. Representative recordings are shown in Figure 6A,BGo. Results for the right and left hemisphere were similar and are combined in Figure 6CEGo. There were no significant effects on N200 amplitude and latency or P290 latency. P290 amplitude was significantly larger in the learning condition than in the other two conditions (P < 0.01 in each case). N700 AUC was significantly larger (P < 0.03) in the identification compared to the learning condition. There were too few P350s in this experiment for quantitative analysis. These results suggest that face-specific N200 is not involved in processes related to face learning and identification, whereas later processing at facespecific N200 sites is affected by (or involved in) face learning and identification.



View larger version (30K):
[in this window]
[in a new window]
 
Figure 6.  ERPs at face-specific N200 sites in a face identification experiment. (A,B) Examples of recordings. (CE) Summary of amplitude, latency and N700 AUC.

 
Face-specific ERPs were not recorded from the hippocampus in the experiments summarized previously (Allison et al., 1999Go; McCarthy et al., 1999Go). Instead, the hippocampus generated a large negative ERP to targets, as illustrated in Figure 7A,BGo and as described previously (McCarthy et al., 1989Go). Two-way ANOVAs (site x condition) for the AUC between 300 and 800 ms showed that the ERPs evoked by targets were larger than those evoked by the other stimulus categories in the right (P < 0.007) and left (P < 0.01) hemisphere. The AUCs for faces, scrambled faces and cars did not differ significantly in either hemisphere. However, in this experiment faces were irrelevant to the target detection task in which the patient was engaged. Because the face identification experiment explicitly involved learning and identifying face–name pairs, it is possible that hippocampal ERPs would differ between these stages. Recordings relevant to this issue are shown in Figure 7CFGo. Because the site of largest target-related activity is not necessarily the site of possible face-sensitive activity, Figure 7CFGo shows ERPs recorded from sites anterior and posterior to the site of maximum target-related ERPs. These sites spanned most of the anterior–posterior extent of the hippocampus. As in the recordings of Figure 7A,BGo, sites generating large intrahippocampal ERPs to targets were identified in a target detection task (Fig. 7C,DGo). Two-way ANOVAs (site x condition) showed that the ERPs evoked by targets were much larger than those evoked by the other stimulus categories in the right (P < 0.0001) and left (P < 0.01) hemisphere, demonstrating that the hippocampus of each hemisphere was functionally intact (Puce et al., 1989Go). ERPs in the face identification task are shown for the same sites in Figure 7E,FGo. Two-way ANOVAs showed that the main effects of stimulus type and electrode location were not significant for either hemisphere. Thus there was no evidence that the hippocampus was engaged differentially in the learning or identification stages of this experiment.



View larger version (26K):
[in this window]
[in a new window]
 
Figure 7.  Grand-averaged ERPs recorded within the hippocampus of patients with bilateral depth probes. (A,B) Recordings from the right and left hemisphere in six patients. Recordings in the third row from the top are the largest target ERPs recorded; the second and top rows show ERPs recorded 5 and 10 mm respectively anterior to the third-row sites; the bottom row shows ERPs recorded 5 mm posterior to the third-row sites. Butterflies were targets. (C,D) Recordings from the right and left hemisphere in a target detection task in seven patients. Intrahippocampal locations determined as in (A,B). Gray-and-black checkerboards were targets. (E,F) ERPs recorded in the face identification experiment from the same sites as in (C,D).

 
The results of this and the other ERP experiments are summarized in Table 1Go.


View this table:
[in this window]
[in a new window]
 
Table 1 Summary of effects of top-down manipulations on face-specific and related ERPs
 
Cortical Stimulation

Allison et al. (Allison et al., 1994aGo) found that electrical stimulation at ventral face-specific N200 sites produced a temporary inability to name famous or family faces that they had previously identified correctly. We have since carried out cortical stimulation in additional patients. This summary applies to all 12 cases. Two types of tests were carried out during stimulation with 5 s trains of 50 Hz, 0.2 ms duration, 2–10 mA constantcurrent bipolar pulses. (i) While viewing a white screen with a central fixation point and spatial markers the patient was asked to point to and describe any visual alterations. (ii) Cognitive tests included naming faces of famous individuals or family members that the patient rapidly identified during prior testing; naming common (flash-card) objects rapidly named during prior testing; and reading and completing simple sentences.

The most common perceptual alterations evoked by stimulation were white or colored phosphenes, always seen in the visual field contralateral to stimulation (e.g. Table 2Go, A6–7). Phosphenes often moved from the central toward the peripheral visual field during the 5 s of stimulation, and were typically evoked at sites posterior or medial to face-specific N200 sites. Of the 20 face-specific N200 sites stimulated, stimulation at four sites produced facial hallucinations. One patient had adjacent face-specific N200 sites on the lateral fusiform gyrus (Fig. 8Go). Cortical stimulation at these sites produced detailed imagery involving single or multiple faces (Table 2Go, A9–10, A10–11, A11–13). In another patient, stimulation of a face-specific N200 site evoked imagery of an eye which changed into a right profile view of a face during stimulation. In a third patient, stimulation of a face-specific N200 site evoked the image of a `blinking eye' in the visual field contralateral to stimulation. Stimulation of a face-specific AP350 site evoked the image of a small face with a large eye; during stimulation the eye moved laterally in the contralateral visual field.


View this table:
[in this window]
[in a new window]
 
Table 2 Results of cortical stimulation in a patient
 


View larger version (46K):
[in this window]
[in a new window]
 
Figure 8.  Locations of sites stimulated in the experiment summarized in Table 1Go. Interelectrode spacing was 5 mm.

 
Table 2Go also shows the results of naming of famous faces, objects, face parts and sentence reading in the same patient. Stimulation involving face-specific sites A10 and 11 produced clear deficits in face recognition with normal or near-normal object naming, face-parts naming and sentence reading. Despite the intrusive nature of the visual hallucinations experienced during cortical stimulation, the patient was able to read sentences and name objects when face-specific sites were stimulated. Stimulation of sites B15–16 on the inferior temporal gyrus (Fig. 8Go) produced clear deficits in recognizing faces and face parts. However, stimulation of these sites also produced deficits in sentence reading and distortions of any viewed image, suggesting a general perceptual deficit.

Overall, face recognition was tested at 16 ventral face-specific N200 sites; clear deficits were produced at seven sites and possible deficits at three sites. Thus some disruption of face recognition occurred at 63% of face-specific N200 sites tested, generally without an accompanying impairment of object naming or sentence reading. A deficit in face recognition was also seen at the one lateral face-specific N200 site tested. In contrast, definite or possible deficits of face recognition were produced at 30 of 139 (22%) of sites that were not face-specific; of these 30 sites, 11 were adjacent to face-specific sites. Thus stimulation of 19/139 (14%) of sites that were not face specific and not adjacent to face-specific sites produced deficits in face recognition. The difference in percentage of effects on face recognition at face-specific compared to other sites was significant ({chi}2 = 9.65, P < 0.002).


    Discussion
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
Is Face-specific N200 Evoked by Emotional Arousal?

The answer to this question is no. Erotic or aversive images evoked appreciable N200s at face-specific sites only to the extent that they also contained faces. However, we did not obtain enough recordings in this task to determine possible effects on P350. In a functional magnetic resonance imaging (fMRI) study Lange et al. (Lange et al., 1998Go) assessed activity in visual cortex to pleasant, neutral and unpleasant pictures. They found that pleasant and unpleasant pictures produced more activation of visual cortex, including the fusiform gyrus, than did neutral pictures. Our results suggest that this increased activation is tonic and non-specific, and has little or no effect on transient face-specific activity.

Habituation

N200 amplitude decreased to repeated presentation of faces (Fig. 2Go), which could be interpreted as evidence of habituation. P290 amplitude increased in amplitude, which could be interpreted as evidence of sensitization (Groves and Thompson, 1970Go). Three considerations argue against these interpretations. First, the changes in N200 and P290 amplitude were not progressive, compared with the progressive decrement seen in behavioral (Groves and Thompson, 1970Go) and electrophysiological (Miller et al., 1991Go) studies of habituation. Second, the N200 amplitude decrease was coupled with a corresponding P290 amplitude increase. Thus it is necessary to postulate a linked habituation of N200 and sensitization of P290, unlikely given the evidence that habituation and sensitization are independent processes (Groves and Thompson, 1970Go). Third, N200– P290 peak-to-peak amplitude did not change systematically with repeated face presentation (Fig. 2Go). The most parsimonious explanation of these results is that N200 and P290 amplitude remained unchanged and that a slow positive baseline shift was evoked by the second and later presentations of a face. In any case, the amplitude changes were small and indicate that habituation of N200 and sensitization of P290, if present, were minimal. By contrast, there were large and progressive decreases in the early fraction of N700 and in P350 (Fig. 2Go), demonstrating considerable habituation of this activity.

Familiar and Unfamiliar Faces

N200, P290, N700, VP350, LP350 and AP350 recorded at face-specific sites were not significantly different in amplitude, latency or AUC for familiar and unfamiliar faces (Fig. 3Go). This was the case whether the faces were task relevant or not. Thus there is no evidence that these neuronal processes are involved in the recognition of familiar faces. A mnemonic or associative role for AP350 would be consistent with the partial overlap of the anterior face area with entorhinal cortex, but this experiment did not provide evidence that AP350 is related to the identification of familiar faces.

Semantic Priming

There was no evidence of semantic priming of face-specific N200, VP350 or the early fraction of N700 (Fig. 4Go). However, P290 and the late fraction of N700 exhibited significant priming, suggesting top-down influences of semantic priming at later stages of processing at face-specific N200 sites. There was evidence of semantic priming at some face-specific AP350 sites (e.g. Fig. 5AGo). This result suggests that these face-specific AP350s reflect neuronal activity that is functionally analogous to word-specific P400s, which can be primed by prior exposure to a related word or by sentence context (Nobre et al., 1994Go). In both cases the neuronal activity is decreased by prior exposure to a semantically related stimulus, and may be involved in creating image-based representations of faces and words.

Face Identification

The results of this experiment (Fig. 6Go) suggest that N200 amplitude is unaffected during the learning, gender discrimination and identification stages of a face memory task. By contrast, P290 was significantly larger during the learning stage, while N700 was significantly larger during the identification stage, suggesting later stages of face processing similar to the `face recognition unit' stage in the model of Bruce and Young (Bruce and Young, 1986Go) and the `template matching' stage in the model of Damasio et al. (Damasio et al., 1982Go). If this inference is correct, later stages of face processing, sensitive to task demands, occur at the same cortical sites as the initial taskinsensitive processing reflected by face-specific N200. There was no evidence in this experiment that the hippocampus is preferentially involved in the learning and identification of faces and names.

Cortical Stimulation

Stimulation of face-specific N200 sites produced two types of face-related changes. The more common was a transient inability to name familiar faces. Such changes were seen at most facespecific N200 sites. At these sites patients did not report distortions of faces, rather they were unable to name familiar faces. These results suggest that stimulation of face-specific N200 sites does not disrupt face perception but instead disconnects the face representation from later face recognition and mnemonic processes. In the patient described by Allison et al. (Allison et al., 1994aGo) the face–name association was not completely abolished during stimulation; he identified the face at the ordinate level (`politician') but not at the unique level (he identified the face as `President Bush' rather than the state governor, whom he knew well). This type of disruption by cortical stimulation is perhaps a transient form of `associative' prosopagnosia. By contrast, stimulation of non-face-specific sites B15–16 (Fig. 8Go, Table 2Go) produced a deficit in face recognition associated with a general perceptual deficit (distortion of any viewed image). This type of disruption by cortical stimulation is perhaps a transient form of `apperceptive' prosopagnosia.

Less commonly, stimulation evoked transient hallucinations of faces, face parts or bodies. These changes were seen mainly at face-specific N200 or immediately adjacent sites, whereas stimulation of other sites mainly evoked phosphenes in the contralateral visual field, object-naming deficits or no changes. Stimulation of a middle fusiform AP350 site evoked the image of a small face with a large eye. Penfield and Perot (Penfield and Perot, 1963Go) stimulated a similarly located site on the fusiform gyrus and evoked the image of `a face in a picture'. Kanwisher and O'Craven (Kanwisher and O'Craven, 1998Go) found that imagining faces activated the fusiform face area. These results, together with the finding that stimulation of face-specific N200 sites can evoke face-related hallucinations, suggest that facerelated portions of the fusiform gyrus are involved in face imagery as well as face perception.


    General Discussion
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
Relative Responsiveness of Ventral Face-specific N200s

In the previous papers we described the responsiveness of ventral face-specific N200s to a number of stimulus categories. Here it will be useful to recapitulate these results to highlight trends in response amplitude and latency. In Figure 9AGo, N200 amplitudes are shown relative to the standard amplitude to grayscale, unfamiliar faces with eyes directed to the viewer. The other stimulus categories are plotted in decreasing order of magnitude. Relative latencies are plotted in the same order. Results for the right and left hemisphere were combined. Several trends are apparent. (i) All face and face-part stimuli evoke N200s that are 40% or more of the response to standard faces. (ii) There is a break in amplitude between all types of face stimuli and all types of non-face, non-body stimuli, with the latter having an amplitude of 16% or less of the response to standard faces. N200 amplitude to hands (26%) is transitional, suggesting either that hands activate face-specific cells slightly more than do other non-face stimuli, or that a subgroup of cells at face-specific N200 sites is hand sensitive. (iii) The only categories of stimuli that evoke N200s larger than those to standard faces are blurred faces, large faces and faces with eyes averted. (iv) Noses evoked the smallest N200 of any face stimulus and evoked the latest N200 of any stimulus. These results may be related to the fact that the nose is the least examined face part during free viewing of faces (Yarbus, 1967Go; Shepherd et al., 1981Go). (v) There is a dissociation between N200 amplitude and latency. The longest latencies are to lips, noses, and line-drawing faces, whereas most non-face stimuli evoke N200s whose latency is similar to that of standard faces. The simplest explanation of these results is that faces or face parts that are difficult to recognize engage the face processing reflected by N200 but require additional processing time.



View larger version (32K):
[in this window]
[in a new window]
 
Figure 9.  Responsiveness at ventral face-specific N200 sites to the categories of stimuli used in this study. Amplitude (A) and latency (B) normalized to the amplitude and latency to faces with the head and eyes directed at the viewer, and plotted in decreasing order of amplitude. Results for stimulus categories that evoked similar N200 amplitudes and latencies were combined as follows: Averted eyes = faces with eyes left, eyes up and eyes closed; Animal faces = cat and dog faces; Non-Cartesian = hyperbolic, radial, spiral and polar gratings; Cartesian = Cartesian vertical and horizontal gratings; Scrambled faces = scrambled and phase-scrambled faces.

 
Specificity of ERP Responsiveness to Faces

When Gross and colleagues first recorded from monkey STS/IT cells that responded best to hands or faces (Gross et al., 1969Go, 1972Go) there was scepticism whether the increased firing rate was hand or face specific, or whether it was due to more elementary stimulus features, e.g. stimuli with a similar frequency spectrum (Desimone, 1991Go; Gross, 1994Go). Many studies have since demonstrated that some STS/IT cells indeed respond much better to faces than to many categories of non-face stimuli [reviewed by Desimone (Desimone, 1991Go), Gross (Gross, 1992Go), Perrett et al. (Perrett et al., 1992Go), Logothetis and Scheinberg (Logothetis and Scheinberg, 1996) and Tanaka (Tanaka, 1996Go)].

Similarly, the conclusion that N200 and other ERPs are facespecific requires evidence that simpler explanations are not plausible. We made the following observations (many of them summarized in Fig. 9Go) at ventral face-specific N200 sites. (i) Scrambled faces that control for luminance (but not spatial frequency) evoked N200s that were 6% as large as the N200 evoked by faces. (ii) Phase-scrambled faces that control for luminance and spatial frequency evoked N200s that were 7% as large as the N200 evoked by faces. (iii) N200 amplitude was approximately size-invariant; it changed by a factor of two over a 32-fold change in face size. (iv) N200 amplitude was not significantly affected by removing the high-frequency or low-frequency portion of the face frequency spectrum. (v) Complex nonobjects such as hyperbolic gratings evoked N200s that were 9–18% as large as the N200 evoked by faces. (vi) Complex non-living objects such as cars evoked N200s that were 12% as large as the N200 evoked by faces. (vii) Living objects evoked N200s that were 2% (flowers), 5% (butterflies) and 26% (hands) as large as the N200 evoked by faces. (viii) Language-related stimuli evoked N200s that were 3% (nouns) and 14% (Arabic numbers) as large as the N200 evoked by faces. (ix) N200 amplitude was not a function of the emotional valence of the stimuli (Fig. 1Go). (x) Other face-specific ERPs were also much larger to faces than to non-face stimuli, but this activity was less frequently encountered and the results are correspondingly less conclusive. These results demonstrate that N200 and other face-specific ERPs reflect the activation of cells driven by the configuration of a human face and not to the incidental features or emotional valence of a face.

Imaging studies also provide evidence of specialized face processing. Kanwisher et al. (Kanwisher et al., 1997Go) reported that a region of the fusiform gyrus, primarily in the right hemisphere, was activated more by faces than by non-face stimuli in tasks that eliminated or minimized the contribution of visual attention, subordinate-level classification or processing of non-face body parts. McCarthy et al. (McCarthy et al., 1997Go) reasoned that faces may engender both face-specific and general object processing, and that face-specific processing might be revealed only if the general object system was occupied by concurrent object processing. They found that a large portion of the fusiform gyrus bilaterally was activated by faces viewed among non-objects, but that when viewed among objects faces primarily activated a focal right fusiform region. Thus both ERP and fMRI studies demonstrate face-specific processing regions located primarily in portions of the fusiform gyrus.

The Functional Architecture of the Ventral Object Recognition System

Since the seminal work of Mountcastle (Mountcastle, 1957Go) in somatosensory cortex it has been known that columns of cells extending vertically through the cortical layers, extending ~0.5 mm horizontally, and having similar response properties, exist in primary sensory cortex [reviewed by Mountcastle (Mountcastle, 1997Go)]. Less is known about columnar organization in association cortex, but in monkey STS/IT cortex there is a columnar organization 0.4–1 mm in diameter as inferred from single-unit recordings (Perrett et al., 1984Go; Fujita et al. 1992Go), and ~0.5 mm in diameter as determined by optical imaging (Wang et al., 1996Go, 1998Go). In the human ventral face area, face-specific columns could be intermixed randomly with letter-string-specific, object-specific or other columns, as illustrated schematically in Figure 10AGo. Superimposed is an outline of the electrode (2.2 mm diameter) used in our recordings, which would record from about 11 columns assuming a diameter of 0.5 mm and the packing density illustrated. Such an arrangement accounts well for the non-specific responsiveness seen at some sites. However, such a model cannot account for category-specific ERPs, which imply segregation of category-specific columns as illustrated in Figure 10BGo. An electrode (left) centered over a patch of face-specific columns would record a face-specific N200, an electrode (right) centered over a patch of letter-string-specific columns would record a letter-string-specific N200, and an electrode (lower left) located over a patch of object-specific columns would record an object-specific N200. An electrode that straddled such patches would record N200s evoked by two or more stimulus categories, as we often observed. This model implies that human extrastriate cortex is more differentiated into category-specific regions than is monkey STS/IT cortex, in which columns of cells responsive to faces may be intermixed with columns responsive to non-face stimuli (Tanaka, 1996Go), although patches of cortex with a higher concentration of face-specific cells are also found (Harries and Perrett, 1991Go). The data of Allison et al. (Allison et al., 1999Go) indicate that human face-responsive patches of cortex are on average 12–16 mm wide and 15–35 mm long, depending on whether the region is unitary or broken into two patches. Harries and Perrett (Harries and Perrett. 1991Go) found that monkey face-sensitive patches were 3–4 mm wide and 3–6 mm long, separated by regions less responsive to faces. The human and monkey values may be comparable if differences in brain size are taken into account.



View larger version (44K):
[in this window]
[in a new window]
 
Figure 10.  Models of the functional anatomy of visual occipitotemporal cortex. (A) Random mixture of face-specific, letter-string-specific, object-specific and other columns 0.5 mm in diameter, and the outline of an electrode (2.2 mm in diameter) of the size used in this study. (B) Segregated category-specific and other columns. (C,D) Ventral and lateral centroids of form (grating) N180 sites; face-specific, facepart-specific, object-specific and letter-string-specific N200 sites; and hand-specific N230 sites. The center of each icon is placed at its centroid in each hemisphere. The lateral view shows centroids for left and right hemisphere sites combined.

 
The centroids of activation of grating and category-specific sites are summarized in Figure 10C,DGo. The centroids of ventral letter-string-specific and grating sites were located in the same region of occipital cortex (Fig. 10CGo), suggesting that letter-string processing sites are carved developmentally out of form processing sites. More anterior is the centroid of hand-specific sites in the right hemisphere. However, there were only five such sites (McCarthy et al., 1999Go, Fig. 11), and the lack of left hemisphere sites may be adventitious. Object-specific sites were on average slightly posteromedial to face-specific sites, which were slightly lateral to face-part-specific sites. Thus there is a posterior-toanterior gradient of form-processing to face-processing sites. However, the maps from which the centroids were derived [e.g. (Allison et al., 1999Go), Fig. 6DGo] demonstrate considerable variability in the location of these sites. It is possible that the orderly progression of processing implied by Figure 10CGo would not be detectable in some hemispheres. In a given hemisphere a more accurate summary may be that ventral extrastriate cortex is composed of a complex mosaic of functionally discrete patches of cortex of variable location, number and size, with a trend toward the organization summarized in Figure 10CGo. A similar organization can be detected in fMRI studies of face and letterstring perception. Object-specific sites are on average posterior to face-specific sites, hence subtraction of activation related to general object processing reveals a more anterior face-specific region of the fusiform gyrus (Kanwisher et al., 1997Go; McCarthy et al., 1997Go). In the left hemisphere the centroid of letter-stringspecific N200s is near the occipitotemporal sulcus, which Puce et al. (Puce et al., 1996Go) found to be activated more by letter-strings than by faces or textures. Some face-specific N200 sites were within or posterior to color-sensitive regions (Lueck et al., 1989Go; Allison et al., 1993Go; Sakai et al., 1995Go; Kleinschmidt et al., 1996Go) V4 and V8 as defined anatomically (Clarke and Miklossy, 1990Go) or retinotopically (Sereno et al., 1995Go; De Yoe et al., 1996Go; Engel et al., 1997Go; Van Essen and Drury, 1997Go; Hadjikhani et al., 1998Go). The responsiveness of the posterior sites was not demonstrably different from that of the anterior sites [see the `Intrahemispheric Comparisons' section of Allison et al. (Allison et al., 1999Go)]. By contrast, monkey face-specific cells are located anterior to area V4 (Desimone, 1991Go, Fig. 1Go; Perrett et al., 1992Go, Fig. 4Go). These results suggest that face-processing sites are more widespread in human than in monkey extrastriate cortex.

Models of Object Recognition

The model of visual processing implied by Figure 10CGo differs from that of Farah (1990, 1994), who concluded that only two systems are needed for object recognition. One system uses holistic processing and is required for faces and used to a lesser extent for objects; the other uses feature-based processing and is required for words and used to a lesser extent for objects. Our results suggest (at least) four systems: one dedicated to face processing and detected electrophysiologically by face-specific N200s; one dedicated to word processing and detected by letter-string-specific N200s; a general object system detected by object-specific N200s; and a hand-specific system detected by hand-specific N230s. Some lesion studies (Newcombe et al., 1994Go; Rumiati et al., 1994Go; Moscovitch et al., 1997Go; De Renzi and di Pellegrino, 1998; Buxbaum et al., 1999Go) also suggest a general object recognition system distinct from the systems subserving face and word recognition.

Some sites generated N200s specific to one or more internal face parts (McCarthy et al., 1999Go, Fig. 8Go), which could be regarded as evidence for face-part-specific processing independent of face processing. For the time being it is parsimonious to view this activity as a variant of face-specific processing. We previously inferred that there may be a separate system dedicated to the perception of Arabic numbers (Allison et al., 1994bGo), but we encountered only three number-specific N200 sites, not enough to provide convincing evidence of such a system. A fMRI and behavioral study also suggests that letter and digit recognition depend on different neural substrates that become differentiated by experience (Polk and Farah, 1998Go).

It is likely that all humans have face-specific, and that all literate humans have letter-string-specific, patches of ventral extrastriate cortex. Furthermore, it is possible that the general object recognition system is itself not monolithic, and may become differentiated by experience. Carey and colleagues [reviewed by Carey and Diamond (Carey and Diamond, 1994Go)] concluded that years of experience are required to develop the perceptual expertise needed for face and dog encoding, a conclusion that might hold for other types of category-specific recognition as well. Newcombe et al. (Newcombe et al., 1994Go) described category-specific deficits in visual recognition, and Caramazza and Shelton (Caramazza and Shelton, 1998Go) argued for the presence of category-specific knowledge systems that develop evolutionarily or developmentally. Depending on experience and expertise, it is possible that an individual's mosaic of category-specific patches of cortex may be even more differentiated than proposed in Figure 10C,DGo. We have not had the opportunity to test a patient with category-specific expertise (e.g. a dog judge or experienced birdwatcher), but recordings in typical patients using a variety of object categories might be useful in testing this possibility. It is unlikely that the development of face-specific ERPs can be assessed by intracranial recordings. The youngest patient in this study was 10 years old, the age at which children perform in the normal adult range on face-encoding tasks (Carey and Diamond, 1994Go); indeed in this patient we recorded a face-specific N200 that was well within normal adult limits of amplitude and latency. However, scalprecorded ERPs may prove useful in tracking the development of face-sensitive ERPs (Taylor et al., 1997Go).

Face-specific N200 Reflects the Operation of a Face Module

We have used the term `face module' as shorthand to describe face-specific N200 sites (Allison et al., 1994aGo). However, `module' carries additional connotations; in particular it implies a neuronal population that responds to a preferred input in an automatic, mandatory fashion and carries out specific computations that are encapsulated and relatively immune to outside influence (Marr, 1976Go; Fodor, 1983Go).

The evidence reviewed in this and previous papers supports the conclusion that face-specific N200s are generated by a population of neurons that initially respond in a mandatory and largely invariant manner over a wide range of manipulations (Table 1Go). Thus N200 (i) is evoked in passive viewing tasks that do not require explicit learning or identification of faces; (ii) is unaffected by face familiarity; (iii) shows little or no habituation; (iv) is not semantically primed; (v) responds much more to faces than to other stimulus categories; (vi) and is recorded from specific regions of cortex. This activity is therefore modular as defined by Fodor (Fodor, 1983Go), except that we do not know to what extent N200 reflects the operation of an innate process, as Fodor suggests it should. Additional study of N200 may alter this conclusion, but the available evidence allows the conclusion that it reflects the operation of a module specialized for face perception. Moscovitch et al. (Moscovitch et al., 1997Go) studied a patient with a severe object agnosia but with normal face recognition, and also concluded that face recognition is modular. What is the function of the `face-specific N200 module'? N200 is sensitive to the configuration of a face but is insensitive to its familiarity. We propose that the operations reflected by N200 are the instantiation of a `structural encoding module' in the model of Bruce and Young (Bruce and Young, 1986Go) or a `template formation module' in the model of Damasio et al. (Damasio et al., 1982Go).

The results of the line-drawing (McCarthy et al., 1999Go, Fig. 1Go), priming (Fig. 4Go), and face identification (Fig. 6Go) experiments suggest that later activity at face-specific N200 sites (P290 and N700) may be subject to influences related to recognition processes (Table 1Go). These results lead to the conclusion that modularity can be limited in time as well as space. Modularity is conceptualized as a set of neurons that perform automatic, encapsulated operations on a preferred input. Our results suggest that initial operations performed by such neurons can be modular, whereas later operations of the same neurons can be influenced by top-down processes and are thus non-modular.

Limitations of These Studies

Electrogenesis of ERPs

A major gap in our knowledge of the ERPs described here is that their cellular basis is unknown. However, a plausible model is available from single-cell and ERP recordings in animals. In primary sensory cortex natural stimulation of receptors, or electrical stimulation of afferent pathways, evokes initial surface positive–negative potentials referred to in the older literature as the `primary evoked response' (Towe, 1966Go). The primary positivity is due to initial depolarization of layer 3–4 pyramidal cells at the level of the cell body, and a corresponding positive source potential in the apical dendrites. The primary negativity is due to later depolarization of the apical dendrites, either by direct synaptic excitation or by back-propagation from the axosomatic region (Schlag, 1973Go; Cruetzfeldt and Houchin, 1974; Wood and Allison, 1981Go). Monkey area TEO sends feedforward projections to layer 3–4 of area TE (Distler et al., 1993Go; Saleem et al., 1993Go), similar to the thalamocortical afferents to primary sensory cortex. The human ventral face area is probably homologous to area TE and thus probably receives feedforward projections to layer 3–4 from the homolog of monkey area TEO. These considerations suggest that P150 is analogous to the primary positivity, while N200 is analogous to the primary negativity. If this inference is correct, these potentials reflect initial phasic excitation of layer 3–4 pyramidal cells at facespecific N200 sites. N700 may reflect tonic excitation of the same cells, comparable to the tonic discharge of face-specific STS/IT cells during face perception (Oram and Perrett, 1992Go). The best test of this model would come from simultaneous ERP and single-unit recordings from face-sensitive patches in monkey STS/IT cortex.

Face-specific Processing in the Frontal Lobe

In monkey inferior prefrontal cortex some cells are face selective, with response properties similar to those in STS/IT cortex (Wilson et al., 1993Go; Ó Scalaidhe et al., 1997Go). We recorded small face-specific ERPs from a few frontal lobe sites (Allison et al., 1999Go). Our failure to find sites that generated large, obviously face-specific ERPs like those recorded from occipitotemporal cortex may be due to two factors. (i) Monkey face-selective cells are found mainly in a small area just lateral to the principal sulcus (Ó Scalaidhe et al., 1997Go). The human homolog of this area is unknown, but most of our frontal electrodes were posterior and superior to the region of inferior prefrontal cortex that would be expected to contain a homologous area. (ii) Only 5% of cells were face-selective within the face-selective area (Ó Scalaidhe et al., 1997Go), less than the 10–34% of face-selective cells found in selected areas of monkey STS/IT cortex (Perrett et al., 1982Go; Desimone et al., 1984Go). If the same ratio holds in humans, face-specific ERPs would be correspondingly smaller. Our preliminary results suggest that human prefrontal cortex contains face-specific patches of cortex, and encourage a more systematic search for such sites.

Relationship Between Scalp and Intracranial ERPs

Halgren et al. (Halgren et al., 1994Go) recorded an N130–P180– N240 sequence of potentials in white matter superior to the fusiform gyrus. At some sites these ERPs were face specific, and probably reflect polarity-inverted counterparts of the P150– N200–P290 sequence we recorded from the surface of the fusiform gyrus. In scalp recordings faces evoke a positivity that is largest at the vertex and has a latency of 150–200 ms (Grüsser et al., 1990Go; Bentin et al., 1996Go; George et al., 1996Go; Jeffreys, 1996Go; Schendan et al., 1998Go). The vertex positivity may be a polarity-inverted counterpart of N200, although it may also partly reflect the polarity-inverted counterpart of N170 (George et al., 1996Go). Simultaneous scalp and intracranial recordings will be necessary to determine the relationship among these facesensitive ERPs.

Attention

Imaging studies indicate that attention to faces increases face-related activation of the fusiform gyrus (Haxby et al., 1994Go; Clark et al., 1997Go; Wojciulik et al., 1998Go) [reviewed by McCarthy (McCarthy, 1999Go)]. Due to time constraints and the limited capacity of many patients to perform cognitive tasks, we have not carried out tests of the effects of attention on face-specific ERPs. In suitable patients such experiments will be required to address the important question of the effects of attention on face-specific and related ERPs.

Perception of Static and Dynamic Faces

Studies in monkeys and humans demonstrate that portions of the superior temporal sulcus and adjacent cortex are involved in analysis of direction of gaze, and eye, mouth, hand and body movement (Campbell et al., 1990Go; Perrett et al., 1992Go; Bonda et al., 1996Go; Oram and Perrett, 1996Go; Calvert et al., 1997Go; Puce et al., 1998Go). The stimuli used in the present study were static. We have begun to study the ERP responsiveness of occipitotemporal cortex to moving eyes and mouths; these results are being reported separately (Puce and Allison, 1999Go; Puce et al.,1999).


    Notes
 
We thank F. Favorini, J. Jasiorkowski, M. Jensen, M. Luby and K. McCarthy for assistance, and Dr A.C. Nobre for collaboration in the face identification experiment. This work was supported by the Veterans Administration and by NIMH grant MH-05286.

Address correspondence to Gregory McCarthy, Brain Imaging and Analysis Center, Box 3808, Duke University Medical Center, Durham, NC 27710, USA. Email: gregory.mccarthy{at}duke.edu.


    Footnotes
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
1 Current address: Brain Sciences Institute, Swinburne University of Technology, PO Box 218, Hawthorn, Victoria 3122, Australia Back

2 Current address: Brain Imaging and Analysis Center, Box 3808, Duke University Medical Center, Durham, NC 27710, USA Back


    References
 Top
 Footnotes
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 General Discussion
 References
 
Allison T, Begleiter A, McCarthy G, Roessler E, Nobre AC, Spencer DD (1993) Electrophysiological studies of color processing in human visual cortex. Electroenceph Clin Neurophysiol 88:343–355.[ISI][Medline]

Allison T, Ginter H, McCarthy G, Nobre A, Puce A, Luby M, Spencer DD (1994a) Face recognition in human extrastriate cortex. J Neurophysiol 71:821–825.[Abstract/Free Full Text]

Allison T, McCarthy G, Nobre A, Puce A, Belger, A (1994b) Human extrastriate visual cortex and the perception of faces, words, numbers, and colors. Cereb Cortex 5:544–554.

Allison T, Puce A, Spencer DD, McCarthy G (1999) Electrophysiological studies of human face perception. I: Potentials generated in occipitotemporal cortex by face and non-face stimuli. Cereb Cortex 9:415–430.[Abstract/Free Full Text]

Bentin S, Allison T, Puce A, Perez A, McCarthy G (1996) Electrophysiological studies of face perception in humans. J Cogn Neurosci 8:551–565.[ISI]

Bonda E, Petrides M, Ostry D, Evans A (1996) Specific involvement of human parietal systems and the amygdala in the perception of biological motion. J Neurosci 16:3737–3744.[Abstract/Free Full Text]

Brothers L, Ring R (1993) Mesial temporal neurons in the macaque monkey with responses selective for aspects of social stimuli. Behav Brain Res 57:53–61.[ISI][Medline]

Bruce V, Young A (1986) Understanding face recognition. Br J Psychol 77:305–327.[ISI][Medline]

Bruce V, Humphreys GW (1994) Recognizing objects and faces. Vis Cogn 1:141–180.

Buxbaum LJ, Glosser G, Coslett HB (1999) Impaired face and word recognition without object agnosia. Neuropsychologia 37:41–50.[ISI][Medline]

Calvert GA, Bullmore ET, Brammer MJ, Campbell R, Williams SCR, McGuire PK, Woodruff PWR, Iversen SD, David AS (1997) Activation of auditory cortex during silent lipreading. Science 276:593–596.[Abstract/Free Full Text]

Campbell R, Heywood CA, Cowey A, Regard M, Landis T (1990) Sensitivity to eye gaze in prosopagnosic patients and monkeys with superior temporal sulcus ablation. Neuropsychologia 28:1123–1142.[ISI][Medline]

Caramazza A, Shelton JR (1998) Domain-specific knowledge systems in the brain: the animate–inanimate distinction. J Cogn Neurosci 10:1–34.[Abstract]

Carey S, Diamond R (1994) Are faces perceived as configurations more by adults than by children? Vis Cogn 1:253–274.

Clark VP, Parasuraman R, Keil K, Kulansy R, Fannon S, Maisog JM, Ungerleider LG, Haxby JV (1997) Selective attention to face identity and color studied with fMRI. Hum Brain Map 5:293–297.[ISI]

Clarke S, Miklossy J (1990) Occipital cortex in man: organization of callosal connections, related myeloand cytoarchitecture, and putative boundaries of functional visual areas. J Comp Neurol 298:188–214.[ISI][Medline]

Creutzfeldt O, Houchin J (1974) Neuronal basis of EEG-waves. In: Handbook of electroencephalography and clinical neurophysiology (Redmond A, ed.), vol. 2C, pp. 5–55. Amsterdam: Elsevier.

Damasio AR, Damasio H, Van Hoesen GW (1982) Prosopagnosia: anatomic basis and behavioral mechanisms. Neurology 32:331–341.[Abstract]

De Renzi E, di Pelligrino G (1998) Prosopagnosia and alexia without object agnosia. Cortex 34:403–415.[ISI][Medline]

Desimone R (1991) Face-selective cells in the temporal cortex of monkeys. J Cogn Neurosci 3:1–8.[ISI]

Desimone R, Albright TD, Gross CG, Bruce CJ (1984) Stimulus selective properties of inferior temporal neurons in the macaque. J Neurosci 4:2051–2062.[Abstract]

De Yoe EA, Carman GJ, Bandettini P, Glickman S, Wieser J, Cox R, Miller D, Neitz J (1996) Mapping striate and extrastriate visual areas in human cerebral cortex. Proc Natl Acad Sci USA 93:2382–2386.[Abstract/Free Full Text]

Distler C, Boussaoud D, Desimone R, Ungerleider LG (1993) Cortical connections of inferior temporal area TEO in macaque monkeys. J Comp Neurol 334:125–150.[ISI][Medline]

Engel SA, Glover GH, Wandell BA (1997) Retinotopic organization in human visual cortex and the spatial precision of functional MRI. Cereb Cortex 7:181–192.[Abstract]

Farah MJ (1990) Visual agnosia. Cambridge, MA: MIT Press.

Farah MJ (1994) Specialization within visual object recognition: Clues from prosopagnosia and alexia. In: The neuropsychology of highlevel vision (Farah MJ, Ratcliff G, eds), pp. 133–146. Hillsdale, NJ: Lawrence Erlbaum.

Fodor J (1983) The modularity of mind. Cambridge, MA: MIT Press.

Fujita I, Tanaka K, Ito M, Cheng K (1992) Columns for visual features of objects in monkey inferotemporal cortex. Nature 360:343–346.[ISI][Medline]

George N, Evans J, Fiori N, Davidoff J, Renault B. (1996) Brain events related to normal and moderately scrambled faces. Cogn Brain Res 4:65–76.[ISI][Medline]

Gross CG (1992) Representation of visual stimuli in inferior temporal cortex. Phil Trans R Soc Lond B 335:3–10.[ISI][Medline]

Gross CG (1994) How inferior temporal cortex became a visual area. Cereb Cortex 5:455–469.

Gross CG, Bender DB, Rocha-Miranda CE (1969) Visual receptive fields of neurons in inferotemporal cortex of the monkey. Science 166:1303–1306.[ISI][Medline]

Gross CG, Rocha-Miranda CE, Bender DB (1972) Visual properties of neurons in inferior temporal cortex of the macaque. J Neurophysiol 35:96–111.[Free Full Text]

Groves PM, Thompson RF (1970) Habituation: a dual-process theory. Psychol Rev 77:419–450.[Medline]

Grüsser OJ, Kirchhoff N, Naumann A (1990) Brain mechanisms for recognition of faces, facial expression, and gestures: Neuropsychological and electroencephalographic studies in normals, brain-lesioned patients, and schizophrenics. In: Vision and the brain (Cohen B, Bodis-Wollner I, eds), pp. 165–193. New York: Raven Press.

Hadjikhani N, Liu AK, Dale AM, Cavanagh P, Tootell RBH (1998) Retinotopy and color sensitivity in human visual cortical area V8. Nature Neurosci 1:235–241.[ISI][Medline]

Halgren E, Baudena P, Heit G, Clarke M, Marinkovic K (1994) Spatiotemporal stages in face and word processing. I. Depth-recorded potentials in the human occipital, temporal and parietal lobes. J Physiol (Paris) 88:1–50.[ISI][Medline]

Harries MH, Perrett DI (1991) Visual processing of faces in temporal cortex: physiological evidence for modular organization and possible anatomical correlates. J Cogn Neurosci 3:9–24.[ISI]

Haxby JV, Horwitz B, Ungerleider LG, Maisog, JM, Pietrini P, Grady CL (1994) The functional organization of human extrastriate cortex: a PET–rCBF study of selective attention to faces and locations. J Neurosci 14:6336–6353.[Abstract]

Humphreys GW, Riddoch MJ, Price CJ (1997) Top-down processes in object identification: Evidence from experimental psychology, neuropsychology and functional anatomy. Phil Trans R Soc Lond B 352:1275–1282.[ISI][Medline]

Jeffreys DA (1996) Evoked potential studies of face and object processing. Vis Cogn 3:1–38.[ISI]

Kanwisher N, McDermott J, Chun MM (1997) The fusiform face area: a module in human extrastriate cortex specialized for face perception. J Neurosci 17:4302–4311.[Abstract/Free Full Text]

Kanwisher N, O'Craven KM (1998) Extrastriate activity during visual imagery is `stimulus' specific. Soc Neurosci Abstr 24:530.

Kleinschmidt A, Lee BB, Requardt M, Frahm J (1996) Functional mapping of color processing by magnetic resonance imaging of responses to selective Pand M-pathway stimulation. Exp Brain Res 110:279–288.[ISI][Medline]

Lange PJ, Bradley MM, Fitzsimmons JR, Cuthbert BN, Scott JD, Moulder B, Nangia V (1998) Emotional arousal and activation of the visual cortex: an fMRI analysis. Psychophysiology 35:199–210.[ISI][Medline]

Leonard CM, Rolls ET, Wilson FAW, Baylis GC (1985) Neurons in the amygdala of the monkey with responses selective for faces. Behav Brain Res 15:159–176.[ISI][Medline]

Logothetis NK, Sheinberg DL (1996) Visual object recognition. Annu Rev Neurosci 19:577–621.[ISI][Medline]

Lueck CJ, Zeki S, Friston KJ, Cope P, Cunningham VJ, Lammertsma AA, Kennard C, Frackowiak RSJ (1989) The colour centre in the cerebral cortex of man. Nature 340: 386–389.[ISI][Medline]

Marr D (1976) Early processing of visual information. Phil Trans R Soc Lond B 275:483–524.[ISI][Medline]

McCarthy G (1999) Physiological studies of face processing in humans. In: The cognitive neurosciences (Gazzaniga M, ed.). Cambridge, MA: MIT Press (in press).

McCarthy G, Wood CC, Williamson PD, Spencer DD (1989) Taskdependent field potentials in human hippocampal formation. J Neurosci 9:4253–4268.[Abstract]

McCarthy G, Puce A, Gore JC, Allison T (1997) Face-specific processing in the human fusiform gyrus. J Cogn Neurosci 9:605–610.[Abstract]

McCarthy G, Puce A, Belger A, Allison T (1999) Electrophysiological studies of human face perception. II: Response properties of facespecific potentials generated in occipitotemporal cortex. Cereb Cortex 9:431–444.[Abstract/Free Full Text]

Miller EK, Gochin PM, Gross CG (1991) Habituation-like decrease in the responses of neurons in inferior temporal cortex of the macaque. Vis Neurosci 7:357–362.[ISI][Medline]

Moscovitch M, Winocur G, Behrmann M (1997) What is special about face recognition? Nineteen experiments on a person with visual object agnosia and dyslexia but normal face recognition. J Cogn Neurosci 9:555–604.[Abstract]

Mountcastle VB (1957) Modality and topographic properties of single neurons of cat's somatic sensory cortex. J Neurophysiol 20:408–434.[Free Full Text]

Mountcastle VB (1997) The columnar organization of the neocortex. Brain 120:701–722.[Abstract]

Newcombe F, Mehta Z, de Haan EHF (1994) Category specificity in visual recognition. In: The neuropsychology of high-level vision (Farah MJ, Ratcliff G, eds), pp. 103–132. Hillsdale, NJ: Lawrence Erlbaum.

Nobre AC, Allison T, McCarthy G (1994) Word recognition in the human inferior temporal lobe. Nature 372:260–263.[ISI][Medline]

Oram MW, Perrett DI (1992) Time course of neural responses discriminating different views of the face and head. J Neurophysiol 68:70–84.[Abstract/Free Full Text]

Oram MW, Perrett DI (1996) Integration of form and motion in the anterior superior temporal polysensory area (STPa) of the macaque monkey. J Neurophysiol 76:109–129.[Abstract/Free Full Text]

Ó Scalaidhe SP, Wilson FAW, Goldman-Rakic PS (1997) Areal segregation of face-processing neurons in prefrontal cortex. Science 278: 1135–1138.[Abstract/Free Full Text]

Penfield W, Perot P (1963) The brain's record of auditory and visual experience. Brain 86:595–696.[ISI]

Perrett DI, Rolls ET, Caan W (1982) Visual neurones responsive to faces in the monkey temporal cortex. Exp Brain Res 47:329–342.[ISI][Medline]

Perrett DI, Smith PAJ, Potter DD, Mistlin AJ, Head AS, Milner AD, Jeeves MA (1984) Neurones responsive to faces in the temporal cortex: studies of functional organization, sensitivity to identity and relation to perception. Human Neurobiol 3:197–208.[ISI][Medline]

Perrett DI, Hietanen JK, Oram MW, Benson PJ (1992) Organization and functions of cells responsive to faces in the temporal cortex. Phil Trans R Soc Lond B 335:23–30.[ISI][Medline]

Polk TA, Farah MJ (1998) The neural development and organization of letter recognition: evidence from functional neuroimaging, computational modeling, and behavioral studies. Proc Natl Acad Sci USA 95:847–852.[Abstract/Free Full Text]

Puce A, Allison T (1999) Differential processing of moving and static faces in temporal cortex. NeuroImage (in press).

Puce A, Kalnins RM, Berkovic SF, Donnan GA, Bladin PF (1989) Limbic P3 potentials, seizure localization, and surgical pathology in temporal lobe epilepsy. Ann Neurol 26:377–385.[ISI][Medline]

Puce A, Allison T, Asgari M, Gore JC, McCarthy G (1996) Differential sensitivity of human visual cortex to faces, letterstrings, and textures: a functional magnetic resonance imaging study. J Neurosci 16: 5205–5215.[Abstract/Free Full Text]

Puce A, McCarthy G, Allison T (1997) Changes in face-specific intracranial ERP amplitude as a function of repeated stimulus presentation. Soc Neurosci Abstr 23:2065.

Puce A, Allison T, Bentin S, Gore JC, McCarthy G (1998) Temporal cortex activation in humans viewing eye and mouth movements. J Neurosci 18:2188–2199.[Abstract/Free Full Text]

Puce A, Smith A, Allison T (1999) ERPs evoked by viewing facial movements. Cogn Neuropsychol (in press).

Rolls ET, Baylis GC, Hasselmo ME, Nalwa V (1989) The effect of learning on the face selective responses of neurons in the cortex of the superior temporal sulcus of the monkey. Exp Brain Res 76:153–164.[ISI][Medline]

Rumiati RI, Humphreys GW, Riddoch JM, Bateman A (1994) Visual object agnosia without prosopagnosia or alexia: evidence for hierarchical theories of visual recognition. Vis Cogn 2:181–225.

Sakai K, Watanabe E, Onodera Y, Uchida I, Kato H, Yamamoto E, Koizumi H, Miyashita Y (1995) Functional mapping of the human colour centre with echo-planar magnetic resonance imaging. Proc R Soc Lond B 261:89–98.[ISI][Medline]

Saleem KS, Tanaka K, Rockland KS (1993) Specific and columnar projection from area TEO to TE in the macaque inferotemporal cortex. Cereb Cortex 3:454–464.[Abstract]

Schacter DL, Buckner RL (1998) Priming and the brain. Neuron 20:185–195.[ISI][Medline]

Schendan HE, Ganis G, Kutas M (1998) Neurophysiological evidence for visual perceptual categorization of words and faces within 150 ms. Psychophysiology 35:240–251.[ISI][Medline]

Schlag J (1973) Generation of brain evoked potentials. In: Bio-electric recording techniques (Thompson RF, Patterson MM, eds), pp. 5–55. Amsterdam: Elsevier.

Sereno MI, Dale AM, Reppas JB, Kwong KK, Belliveau JW, Brady TJ, Rosen BR, Tootell RBH (1995) Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science 268:889–893.[ISI][Medline]

Shepherd JW, Davies GM, Ellis HD (1981) Studies of cue saliency. In: Perceiving and remembering faces (Davies G, Ellis H, Shepherd J, eds), pp. 105–131. London: Academic Press.

Tanaka K (1996) Inferotemporal cortex and object vision. Annu Rev Neurosci 19:109–139.[ISI][Medline]

Taylor MJ, Edmonds G, Allison T, McCarthy G (1997) Neurophysiological indices of the development of face perception. Soc Neurosci Abstr 23:2112.

Towe AL (1966) On the nature of the primary evoked response. Exp Neurol 15:113–139.[ISI][Medline]

Ullman S (1996) High-level vision: object recognition and visual cognition. Cambridge, MA: MIT Press.

Van Essen DC, Drury HA (1997) Structural and functional analyses of human cerebral cortex using a surface-based atlas. J Neurosci 17:7079–7102.[Abstract/Free Full Text]

Wang G, Tanaka K, Tanifuji M (1996) Optical imaging of functional organization in the monkey inferotemporal cortex. Science 272: 1665–1668.[Abstract]

Wang G, Tanifuji M, Tanaka K (1998) Functional architecture in monkey inferotemporal cortex revealed by in vivo optical imaging. Neurosci Res 32:33–46.[ISI][Medline]

Wilson FAW, Ò Scalaidhe SP, Goldman-Rakic PS (1993) Dissociation of object and spatial processing domains in primate prefrontal cortex. Science 260:1955–1958.[ISI][Medline]

Wojciulik E, Kanwisher N, Driver J (1998) Covert visual attention modulates face-specific activity in the human fusiform gyrus: fMRI study. J Neurophysiol 79:1574–1578.[Abstract/Free Full Text]

Wood CC, Allison T (1981) Interpretation of evoked potentials: a neurophysiological perspective. Can J Psychol 35:113–135.[ISI][Medline]

Yarbus AL (1967) Eye movements and vision. Plenum: New York.

Young AW, Bruce V (1991) Perceptual categories and the computation of `grandmother'. Eur J Cogn Psychol 3:5–49.