The bottom-up and top-down processing of faces in the human occipitotemporal cortex

  1. Xiaoxu Fan
  2. Fan Wang
  3. Hanyu Shao
  4. Peng Zhang
  5. Sheng He  Is a corresponding author
  1. Institute of Biophysics, Chinese Academy of Sciences, China
  2. University of Chinese Academy of Sciences, China
  3. University of Minnesota, United States
7 figures, 1 video, 2 tables and 4 additional files

Figures

Figure 1 with 4 supplements
Face-selective areas identified by fMRI localizer and face-evoked MEG source activation displayed on an inflated right hemisphere of a typical subject.

(A) Face-selective statistical map (faces>objects) showing four face-selective regions (rOFA, rpFFA, raFFA and rpSTS). (B) Face-evoked MEG source activation patterns represented as LCMV value maps at different time points (120-160 ms) after the stimulus onset. LCMV values represent signal power normalized by noise.

Figure 1—figure supplement 1
Face-selective areas identified by fMRI localizer and face-evoked MEG source activation displayed on an inflated right hemisphere of typical subject one.

(Left) Face-selective statistical map (faces>objects) showing face-selective regions (OFA, pFFA, aFFA and pSTS). (Right) Face-evoked MEG source activation patterns represented as LCMV value maps at different time points (120-160 ms) after the stimulus onset. LCMV values represent signal power normalized by noise. Results of these four subjects were shown here because of their relatively clear fMRI defined face-selective areas.

Figure 1—figure supplement 2
Face-selective areas identified by fMRI localizer and face-evoked MEG source activation displayed on an inflated right hemisphere of typical subject two.

(Left) Face-selective statistical map (faces>objects) showing face-selective regions (OFA, pFFA, aFFA and pSTS). (Right) Face-evoked MEG source activation patterns represented as LCMV value maps at different time points (120-160 ms) after the stimulus onset. LCMV values represent signal power normalized by noise. Results of these four subjects were shown here because of their relatively clear fMRI defined face-selective areas.

Figure 1—figure supplement 3
Face-selective areas identified by fMRI localizer and face-evoked MEG source activation displayed on an inflated right hemisphere of typical subject three.

(Left) Face-selective statistical map (faces>objects) showing face-selective regions (OFA, pFFA, aFFA and pSTS). (Right) Face-evoked MEG source activation patterns represented as LCMV value maps at different time points (120-160 ms) after the stimulus onset. LCMV values represent signal power normalized by noise. Results of these four subjects were shown here because of their relatively clear fMRI defined face-selective areas.

Figure 1—figure supplement 4
Face-selective areas identified by fMRI localizer and face-evoked MEG source activation displayed on an inflated right hemisphere of typical subject four.

(Left) Face-selective statistical map (faces>objects) showing face-selective regions (OFA, pFFA, aFFA and pSTS). (Right) Face-evoked MEG source activation patterns represented as LCMV value maps at different time points (120-160 ms) after the stimulus onset. LCMV values represent signal power normalized by noise. Results of these four subjects were shown here because of their relatively clear fMRI defined face-selective areas.

Figure 2 with 1 supplement
Temporal response characteristics of face-selective ROIs.

(A) The time courses of face (solid line) and object (dotted line) induced responses averaged across subjects, for the seven face-selective ROIs. Shaded area means SEM. The green bar indicates significant difference between face and object. Significance was assessed by cluster-based permutation test (cluster-defining threshold p<0.05, significance level p<0.05) for each ROI. (B) The peak latency averaged across subjects for each ROI (mean ± SEM). The peak latency of raFFA is significantly later than rpFFA (t11 = 3.21, p=0.025, Bonferroni corrected) (C) The mean peak latencies for the face-selective ROIs were shown on inflated cortical surfaces of both hemispheres at corresponding locations.

Figure 2—figure supplement 1
Temporal response characteristics of face-selective ROIs for unfamiliar faces.

(A) The time courses of unfamiliar face (solid line) and object (dotted line) induced responses averaged across subjects, for the seven face-selective ROIs. Shaded area means SEM. (B) The peak latency averaged across subjects for each ROI (mean± SEM).

Temporal response characteristics and granger causality analysis for face-selective ROIs during perception of Mooney and normal faces.

(A) Normal and Mooney face images. (B) The peak latency averaged across subjects for each face-selective ROI (mean ± SEM). Mooney faces elicited a response with significantly longer latency in rOFA than normal faces (paired t test, t23 = 4.009, p=0.001). (C) Time courses averaged across subjects for bilateral OFA and pFFA. Gray line is OFA and red line is pFFA. Shaded areas denote SEM. The circles above time courses represent peak latencies of individual subjects. rOFA was engaged significantly later than rpFFA when processing Mooney faces (Paired permutation test p=0.02. Bonferroni corrected). (D) Granger causality analysis performed within a series of 50 ms time windows. Arrows represent statistically significant causal effects (p<0.05, FDR corrected, F test. See Materials and methods for details).

Temporal response characteristics for face-selective ROIs in response to distorted face.

(A) Example stimuli and averaged time courses for each face-selective ROI. The green horizontal bar indicates significant difference between distorted face and object (cluster-defining threshold p<0.01, corrected significance level p<0.05). (B) Peak latency averaged across subjects for each ROI. The peak latency of raFFA is significant later than rpFFA (paired t test, p=0.019, t8 = 2.92).

Temporal response characteristics for face-selective ROIs in response to contextual cues.

(A) Example stimuli. (B) Time courses averaged across subjects for each condition. For each ROI, Blue horizontal bars indicate significant difference between degraded faces with relevant body cues and objects, and red horizontal bars indicate significant difference between degraded faces with irrelevant body cues and objects (cluster-defining threshold p < 0.05, corrected significance level p < 0.05). (C) The peak latency averaged across subjects for each face-selective ROI (mean± SEM).

Author response image 1
Author response image 2

Videos

Video 1
MEG activation of a typical subject.

Tables

Table 1
Number of subjects showing fMRI defined face-selective areas.
Experiment 1Experiment 2Experiment 3Experiment 4
famous faceUnfamiliar face
IOFA13/139/925/269/913/14
IpFFA13/139/926/269/914/14
IpSTS13/139/918/269/911/14
rOFA13/139/926/269/914/14
rpFFA13/139/926/269/914/14
raFFA12/139/918/269/912/14
rpSTS13/139/923/269/914/14
Table 2
Number of subjects used in peak latency analysis.
Experiment 1Experiment 2Experiment 3Experiment 4
famous faceunfamiliar facenormal faceMooney facedistorted facecontaxtual cues defined face
IOFA13/139/924/2624/269/913/14
IpFFA12/139/925/2625/269/912/14
IpSTS11/138/9---
rOFA13/139/924/2626/269/913/14
rpFFA13/139/924/2625/269/913/14
raFFA12/138/918/2615/269/910/14
rpSTS12/137/9----

Additional files

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Xiaoxu Fan
  2. Fan Wang
  3. Hanyu Shao
  4. Peng Zhang
  5. Sheng He
(2020)
The bottom-up and top-down processing of faces in the human occipitotemporal cortex
eLife 9:e48764.
https://doi.org/10.7554/eLife.48764