The pupillary light response as a physiological index of aphantasia, sensory and phenomenological imagery strength

  1. Lachlan Kay
  2. Rebecca Keogh (corresponding author)
  3. Thomas Andrillon
  4. Joel Pearson
  1. School of Psychology, University of New South Wales, Australia
  2. School of Psychological Sciences, Macquarie University, Australia
  3. Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, France

Abstract

The pupillary light response is an important automatic physiological response which optimizes light reaching the retina. Recent work has shown that the pupil also adjusts in response to illusory brightness and a range of cognitive functions; however, it remains unclear what exactly drives these endogenous changes. Here, we show that the imagery pupillary light response correlates with objective measures of sensory imagery strength. Further, the trial-by-trial phenomenological vividness of visual imagery is tracked by the imagery pupillary light response. We also demonstrate that a group of individuals without visual imagery (aphantasia) do not show any significant evidence of an imagery pupillary light response; however, they do show perceptual pupil light responses and pupil dilation with larger cognitive load. Our results provide evidence that the pupillary light response indexes the sensory strength of visual imagery. This work also provides the first physiological validation of aphantasia.

Editor's evaluation

This is a rigorous study of the relation between the vividness of visual imagery and the pupillary light response that can result from it. It provides evidence for the absence of imagery in individuals that self-report as aphantasic. The results will likely be of interest to researchers in a range of disciplines such as psychology, neuroscience and philosophy.

https://doi.org/10.7554/eLife.72484.sa0

Introduction

Our pupil’s ability to change size is an important physiological response that adjusts the amount of light hitting the retina to optimize vision and protect the retina. Pupils constrict in response to brightness whereas they dilate in response to dark conditions (known as the pupillary light response or reflex); while these responses are related, they are considered to be driven by different neural pathways (see Mathôt, 2018 for a review). These involuntary pupil responses were once thought to be driven only by afferent visual stimulation, or automatic activation from emotional responses (Bradley et al., 2008; Partala and Surakka, 2003); however, recent studies suggest that pupil size is sensitive to higher order perceptual and cognitive processes. For example, pupil size depends on the subjective interpretation of equiluminant stimuli: greyscale images of the sun elicit greater pupil constriction than those of the moon (Binda et al., 2013b). The target of covert visual attention can drive pupillary light responses (Binda et al., 2013a), as can visual working memory content (Zokaei et al., 2019; but see Blom et al., 2016). Further, evidence suggests that it might be mental imagery that is driving some of these cognitively induced pupil responses (Laeng and Sulutvedt, 2014), and recent work has shown that there are pupillary light responses even when reading or listening to words conveying some level of brightness (Mathôt et al., 2017). Hence, it remains unknown if the variations in pupil response to equiluminant stimuli are due to high-level semantic content or low-level visual imagery.

Visual imagery is considered a useful and often essential tool in many aspects of cognition. It plays an important role in the retrieval of items from short- and long-term memory (Pearson, 2019), visual working memory (Keogh and Pearson, 2011; Keogh and Pearson, 2014; Pearson and Keogh, 2019), acquisition of language (Just et al., 2004), and spatial navigation (Sack et al., 2005; Guariglia and Pizzamiglio, 2007). It is also used for simulating both past and potential future events (Schacter et al., 2012; Schacter and Madore, 2016), the latter often as a form of self-motivation for goal attainment (Szpunar et al., 2007). As essential to cognition as it might appear, large individual differences exist in visual imagery and its vividness. Some people report imagery as so vivid it feels almost like perception, while a small percentage of otherwise healthy people seemingly do not have the capacity for visual imagery at all – they report that when they think about how an object looks, there is no sensory-like experience of it whatsoever (Galton, 1880). This condition has been recently termed ‘aphantasia’ (Zeman et al., 2015); it can be congenital, persisting throughout one’s lifetime (Zeman et al., 2015), or acquired (Zeman et al., 2010), and it is associated with a range of differences in general cognition (Dawes et al., 2020; Keogh et al., 2021a; Keogh and Pearson, 2021), including dampened fear responses to imagined scary scenarios (Wicken et al., 2021). The existence of aphantasia has also been established using objective techniques that measure the low-level sensory elements of imagery (Keogh and Pearson, 2018).

The rationale of the current study was to accurately and objectively utilize individual differences in mental imagery (both in the general population and aphantasia) to provide strong evidence that it is the sensory strength and subjective vividness of imagery that drives the cognitive pupillary light response. A similar rationale has previously been used to link the vividness and objective sensory strength of imagery to behavioural or neurological measures (Bergmann et al., 2016; Shine et al., 2015; Wassell et al., 2015). If imagery plays a causal role in endogenous pupil size changes, then individual differences in imagery should be reflected in these measures.

Here, we utilized both subjective and objective measures of visual imagery ability and show that, within the same individual, greater pupillary light responses during imagery are associated with reports of stronger and more vivid imagery. We then used this task to compare imagery strength between individuals and test the veracity of the self-reported lack of imagery in aphantasia. We show that while aphantasic individuals display pupil contraction to perceptual brightness and dilation with effort (cognitive load), they do not show any evidence of pupil change in response to attempts at imagery – providing the first objective physiological evidence confirming the existence of aphantasia.

Results

The imagery pupillary light response in the general population

In the pupillometry imagery task (based on Laeng and Sulutvedt, 2014; see Figure 1A), participants who reported having visual imagery were presented with one or four ‘Bright’ or ‘Dark’ triangles for 5 s (see Figure 1—figure supplement 1 for images used). Following this they viewed a blank screen for 8 s (which allowed any after-images to fade) and were then instructed to imagine the prior image/s for 6 s, after which they rated the vividness of their imagery from 1 to 4. Pupils showed a clear pupillary light response to perceptual images (Figure 1B; perception section; a significant effect of perceptual luminance, F(1, 41) = 190.02, p < 0.001). This trend was mirrored in the imagery period, showing a significant main effect of imagery luminance (Figure 1B, box insets: imagery section; F(1, 41) = 67.42, p < 0.0001), indicating that imagery also demonstrates a pupillary light response. Post hoc analysis using the Bonferroni correction for multiple comparisons found that for both Set-Size-One and Set-Size-Four, the pupil size in the Dark condition was significantly greater than in the Bright condition during imagery (p < 0.001 and p < 0.05, respectively, see Figure 1C). There was no main effect of set size during perception, F(1, 42) = 2.67, p = 0.11. However, there was a significant main effect of set size during imagery, F(1, 41) = 6.48, p = 0.015, with less constriction/more dilation for Set-Size-Four (when averaged across the brightness conditions). This is consistent with previous studies suggesting that pupil size is influenced by cognitive load (Kahneman and Beatty, 1966; Laeng et al., 2011; van der Wel and van Steenbergen, 2018). Post hoc analysis also demonstrated that in the Bright condition, Set-Size-Four resulted in significantly more pupil dilation during imagery than Set-Size-One (p = 0.001). However, in the Dark condition pupil dilation during Set-Size-Four imagery was not significantly different to Set-Size-One (p = 0.266).

Figure 1 with 3 supplements
Pupillary response task schematic and eye-tracker results for the general population.

(A) Pupillometry imagery experiment timeline. Each trial began with the presentation of a white fixation cross at the centre of a grey screen (baseline screen) for 1 s. An image was then presented at the centre of this grey screen for 5 s (either one or four triangles of varying brightness, see Figure 1—figure supplement 1 for illustrations of all stimuli). Participants were instructed to focus on the stimuli during this time and memorize their size, orientation, and level of brightness. Next, a black screen with a white fixation cross was presented for 8 s, allowing the perceived after-image to completely fade and pupils to dilate back to equivalent resting levels. The grey baseline screen was then presented again for 6 s. During this time, participants were cued (via two auditory beeps) to actively start imagining the stimuli observed previously during that trial, while maintaining focus on the fixation cross. These beeps were presented 1 s into the grey screen period, leaving 5 s of imagery time. Lastly, participants were prompted to report the vividness of their imagery during those previous 5 s on a scale of 1–4 (1 being ‘not vivid at all – no shape appeared in imagery’; 4 being ‘very vivid – almost like seeing it’) via key response. (B) Mean pupil size waveforms for the general population, presented as mm change from baseline. Left panel: data averaged across the course of a trial for Bright (red lines) and Dark (blue lines) conditions for the general population. Right panels: Set-Size-One and Set-Size-Four conditions are shown separately during the imagery period (i.e. pupil size from seconds 15 to 20). Shaded error bands represent ± the standard error of the mean (SEM). (C) Mean pupil size change from baseline during imagery (i.e. averaged from seconds 15 to 20 of trials) of Bright (red bars) and Dark stimuli (blue bars). (D) Pupil-difference scores (difference in pupil size during imagery between bright and dark conditions) as a function of subjective vividness ratings for Set-Size-One and Set-Size-Four conditions. Each data point represents one participant. Error bars indicate ± SEM, calculated across participants. *p < 0.05, ***p < 0.0001.

Figure 1—source code 1

R code for LME analysis of vividness ratings.

Source code file 1 provides the R code for the LME used to analyse the vividness data in Figure 1D.

https://cdn.elifesciences.org/articles/72484/elife-72484-fig1-code1-v2.zip
Figure 1—source data 1

Source data for Figure 1C.

https://cdn.elifesciences.org/articles/72484/elife-72484-fig1-data1-v2.csv
Figure 1—source data 2

Source data for Figure 1D.

https://cdn.elifesciences.org/articles/72484/elife-72484-fig1-data2-v2.csv
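
As a reading aid, the trial timeline in Figure 1A can be written out as a small Python sketch that recovers the imagery window (seconds 15 to 20) from the phase durations and computes a mean pupil change from baseline. This is not the authors' analysis code: the phase names, the function names, and the use of the first second of the trial as the baseline window are assumptions made for illustration.

```python
from itertools import accumulate

# Phase durations in seconds, taken from the Figure 1A description.
# The phase names are hypothetical labels chosen for this sketch.
PHASES = [
    ("fixation baseline", 1.0),
    ("stimulus (perception)", 5.0),
    ("black screen (after-image fade)", 8.0),
    ("grey screen before imagery cue", 1.0),
    ("imagery", 5.0),
]


def phase_windows(phases):
    """Return {name: (onset, offset)} in seconds from trial start."""
    offsets = list(accumulate(duration for _, duration in phases))
    onsets = [0.0] + offsets[:-1]
    return {name: (on, off) for (name, _), on, off in zip(phases, onsets, offsets)}


def mean_change_from_baseline(samples, window, baseline_window):
    """Mean pupil size in `window` minus mean pupil size in `baseline_window`.

    `samples` is a list of (time_s, pupil_mm) pairs and both windows are
    half-open intervals [onset, offset). Which interval serves as the
    baseline is an assumption of this sketch.
    """
    def window_mean(lo, hi):
        values = [pupil for t, pupil in samples if lo <= t < hi]
        return sum(values) / len(values)

    return window_mean(*window) - window_mean(*baseline_window)


WINDOWS = phase_windows(PHASES)
```

Summing the durations places the imagery cue at 15 s and the end of the trial at 20 s, which matches the averaging window quoted in the caption.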

Prior behavioural work suggests we have reasonable metacognition of visual imagery, that is, we are able to estimate the strength of imagery on a trial-by-trial basis (Pearson et al., 2011; Rademaker and Pearson, 2012). Here, we compared pupil responses to the trial-by-trial ratings of vividness. Pupil-difference scores are shown as a function of intraindividual vividness ratings for Set-Size-One and Set-Size-Four (see Figure 1D). A 2 × 4 linear mixed-effects analysis (2 (set size: 1, 4) × 4 (vividness rating: 1, 2, 3, 4)) demonstrated there was a significant effect of vividness (χ²(3) = 49.54, p = 1.004 × 10⁻¹⁰), with a larger pupillary light response for more vivid imagery trials (for both set sizes, see Figure 1D and fixed effects estimates in Supplementary file 1). These data demonstrate that the pupillary light response tracks the phenomenological vividness of visual imagery from moment to moment.

If the sensory strength of imagery is indeed driving the imagery pupillary light response, then the degree to which this response occurs should be related to independent objective measures of imagery strength in each individual. To assess this, we utilized the binocular rivalry method (Pearson, 2014; Pearson et al., 2008), which allows the objective assessment of the sensory strength of imagery, without relying on any subjective reports (Chang and Pearson, 2018). This is achieved by measuring the degree to which an individual’s imagery biases subsequent binocular rivalry perception. We compared pupil-difference scores (imagery of dark stimuli–bright stimuli, such that larger scores indicate a larger pupillary light response) with imagery strength measured using the binocular rivalry paradigm, in which higher priming scores indicate stronger imagery (Figure 2A; Pearson et al., 2008; Pearson et al., 2011). Within the general population, the degree of pupil change in the Set-Size-One condition correlated positively with imagery strength, using Pearson’s correlation coefficient (rp(41) = 0.62, p < 0.0001, see Figure 2B: green circles and green trendline). The Set-Size-Four pupil data set violated normality (Shapiro–Wilk test, p = 0.003); therefore, Spearman’s correlation coefficient was used to assess its relationship with binocular rivalry priming. A significant positive correlation was found between Set-Size-Four pupil-difference scores and binocular rivalry priming (rs(41) = 0.46, p = 0.002, see Figure 2C: green circles and green trendline). This provides further evidence that the sensory strength of imagery content is driving the imagery pupillary light response.
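
The comparison described above can be illustrated with a short, dependency-free Python sketch: a pupil-difference score (dark minus bright) per participant, then a Pearson correlation, or a Spearman correlation computed as Pearson on tie-averaged ranks when normality is in doubt. The function names are hypothetical and this is not the authors' code; it also omits the p-values and the Shapiro–Wilk check reported in the text.

```python
import math


def pupil_difference(dark_mm, bright_mm):
    """Dark minus bright, so a larger score means a larger light response."""
    return dark_mm - bright_mm


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_r(xs, ys):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))
```

Because Spearman's coefficient depends only on rank order, it is less sensitive to the non-normal distribution noted for the Set-Size-Four scores.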

Figure 2 with 8 supplements
Binocular rivalry task schematic and correlational results.

(A) Example of an imagery trial for the binocular rivalry paradigm. Participants were cued to imagine either a red or green Gabor pattern prior to binocular rivalry with the letter ‘R’ or ‘G’ (750 ms). Participants then imagined the image for 6 s, after which they were presented with the binocular rivalry display (750 ms) and were asked to indicate which image was dominant. Trials where participants reported seeing the pattern they were cued to imagine as dominant were denoted as ‘primed’ trials. The number of primed trials divided by the total number of trials (excluding mock trials and mixed percepts) was used to calculate a percent primed score for each participant. (B) Correlation between visual imagery strength, as measured by the pupillary response task (pupil-difference score: difference between bright and dark conditions) and visual imagery strength as measured by the binocular rivalry task. Set-Size-One (left) and Set-Size-Four (right) conditions are shown. Scatterplots show the general population (green circles and green trendline) and aphantasic individuals (yellow triangles and yellow trendline) data. Correlation coefficients refer to the general population only (green trendline). All data points represent one participant.
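
The percent primed score described in (A) can be sketched in a few lines of Python. The trial representation (a dict with `cue`, `report`, and `is_mock` keys) is a hypothetical format chosen for illustration, not the format of the authors' data files.

```python
def percent_primed(trials):
    """Percentage of valid rivalry trials in which the cued pattern was dominant.

    Each trial is a dict with hypothetical keys:
      "cue"     -> "R" or "G", the pattern the participant was cued to imagine
      "report"  -> "R", "G", or "mixed", the reported dominant percept
      "is_mock" -> True for mock rivalry trials
    Mock trials and mixed percepts are excluded from the denominator,
    as described in the caption.
    """
    valid = [t for t in trials if not t["is_mock"] and t["report"] != "mixed"]
    if not valid:
        raise ValueError("no valid trials to score")
    primed = sum(1 for t in valid if t["report"] == t["cue"])
    return 100.0 * primed / len(valid)
```

A score near 50% indicates no bias from imagery, since the cued and uncued patterns are then reported as dominant about equally often.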

Aphantasia and the imagery pupillary light response

Our results indicate that the strength of the content of imagery drives the imagery pupillary light response in participants who experience visual imagery. The involuntary nature of this response provides a valuable objective measure of imagery strength. Accordingly, we sought to utilize this finding to test the veracity of a condition called aphantasia: if these individuals truly lack visual imagery, they should not show a pupillary light response to imagined images. However, if aphantasic individuals do show an imagery-based pupillary light response, one might interpret this as a form of imagery existing, but below threshold for conscious phenomenological awareness. We ran this same study in 18 aphantasic participants and compared their performance to that of the general population. These participants had contacted the lab reporting their lack of visual imagery and asked to participate in our research. They were also unaware of the goals and hypotheses of the current study. Aphantasia was confirmed in these individuals using self-report questionnaires (Vividness of Visual Imagery Questionnaire [VVIQ] score <32) and by means of our binocular rivalry priming method (priming <65%), based on cut-off points used in previous research (Keogh and Pearson, 2018).
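
The two cut-offs quoted above can be expressed as a simple check. The sketch below assumes that both criteria had to be met together, which the text implies but does not state outright; the constant and function names are hypothetical.

```python
VVIQ_CUTOFF = 32        # VVIQ scores strictly below this meet the questionnaire criterion
PRIMING_CUTOFF = 65.0   # percent primed strictly below this meets the rivalry criterion


def meets_aphantasia_criteria(vviq_score, percent_primed):
    """True when both cut-offs are satisfied (conjunction assumed for illustration)."""
    return vviq_score < VVIQ_CUTOFF and percent_primed < PRIMING_CUTOFF
```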

Here, we again found a strong effect of stimulus luminance in the perceptual phase of the task for the aphantasic participants (Figure 3A: perception section; F(1, 17) = 81.18, p < 0.001), reflecting a functional pupillary light response. However, we found no significant effect of luminance on pupil size during imagery (Figure 3A, box insets: imagery section; F(1, 17) = 0.193, p = 0.67), and Figure 3B shows the lack of pupil diameter change for bright stimuli (red bars) and dark stimuli (blue bars). Similarly to the general population, there was no main effect of set size during perception, F(1, 17) = 1.92, p = 0.18; interestingly, however, there was a significant main effect of set size during imagery, F(1, 17) = 6.185, p = 0.02, with greater pupil diameters for Set-Size-Four compared to Set-Size-One (when averaged across the brightness conditions). This suggests that the aphantasic participants were actively engaging in the imagery task and exerting greater cognitive effort for the larger set size (van der Wel and van Steenbergen, 2018). In the Set-Size-One condition, 61.11% (11/18) of the aphantasic individuals had difference scores that were lower than or equal to 0, compared with 9.5% (4/42) of the general population (see Figure 2B). To confirm this absence of an imagery effect in the aphantasia population, we compared the pupil-difference score obtained when comparing the bright and dark conditions for the control and aphantasia groups, and computed a Bayes Factor (H₀: score = 0; H₁: score ≠ 0; see Materials and methods). Controls showed very strong evidence for H₁ (BF₁₀ > 10¹⁰; Bayesian one-sample t-test), whereas the aphantasia population showed evidence for the null effect (BF₀₁ = 3.180). A direct comparison between the control and aphantasia groups using a Bayesian repeated measures analysis of variance (ANOVA; see Materials and methods) showed very strong evidence for an effect of group (BF₁₀ > 10⁶). Finally, and as expected, pupil-difference scores (imagery of dark stimuli–bright stimuli) did not significantly predict imagery strength (measured using the binocular rivalry paradigm) for the aphantasic population (Figure 2B: yellow triangles; Set-Size-One: rp(17) = 0.20, p = 0.44; Set-Size-Four: rp(17) = −0.08, p = 0.76). It should be noted that we could not perform an analysis on the vividness data in the same way as was done with the general population (Figure 1D) as the aphantasic individuals did not have any variation in their vividness ratings, reflecting their lack of subjective visual imagery (see Figure 3—figure supplement 1).

Figure 3 with 5 supplements
Pupillary response eye-tracker results for the aphantasic population.

(A) Mean pupil size waveforms over time. Left panel: data averaged across the course of a trial for Bright (red lines) and Dark (blue lines) conditions for the aphantasic population. Right panels: Set-Size-One and Set-Size-Four conditions are shown separately during the imagery period. (B) Mean pupil size change from baseline during imagery (i.e. averaged from seconds 15 to 20 of trials) of Bright (red bars) and Dark stimuli (blue bars). Error bars indicate ± standard error of the mean (SEM), calculated across participants. *p < 0.05.

Age disparities between the groups are a potential confounding variable. This factor is of particular importance because the sensitivity of the pupillary light response, as well as maximum pupillary constriction velocity and acceleration, are thought to decline with age, beginning at 40–50 years old (Fotiou et al., 2007; Lobato-Rincón et al., 2014). However, trial time-course pupil waveforms are very similar for both general and aphantasic populations (Figures 1B and 3A, respectively). Both groups exhibited similar levels of pupil change during the perception phase of the task. Furthermore, a two-way ANCOVA was run on pupil-difference scores between general population and aphantasic groups with age as a covariate. Levene’s test and normality checks were carried out and the assumptions were met. We found a significant difference in pupil-difference score (F(1, 57) = 4.763, p = 0.033) between the groups when accounting for age. This provides evidence that decreased pupil responsiveness with age was not driving the observed effects.

Another possible explanation of our findings could be that the passive viewing of the perceptual images, lingering visual persistence and sluggish pupil responses could be driving our results. If this is the case, we would expect that pupil diameter during the perception of the images should correlate with pupil size during imagery for the corresponding images. Further, the pupillary light reflex during perception should be more pronounced in the control than the aphantasic populations. To investigate this possible alternative explanation of our data we first assessed the correlations between pupil diameter during perception of bright and dark images for Set-Size-One and -Four and their corresponding imagery conditions (control participants only). We found there were no significant correlations between any of the perception and imagery conditions, or the difference scores for set size one and four (all p > 0.40, see Figure 2—figure supplement 1). This lack of a correlation suggests that those individuals who have the largest pupillary light response while viewing the images, do not also have the greatest imagery driven pupillary light responses, making it unlikely that the pupil response while seeing the image is driving the mental imagery pupillary response. Next, we assessed whether the aphantasic individuals demonstrated any significant difference in their pupil responses to perceptual stimuli by running a 2 (image: bright and dark) × 2 (set size: 1 and 4) × 2 (group: aphantasic and controls) repeated measures ANOVA on the pupil diameter during the 5-s perceptual period of the task (see Figure 1A for task timeline). There was no main effect of imagery group F(1, 58) = 1.15, p = 0.29 and no significant interactions between imagery groups and any other factor (all p > 0.22, see Figure 3—figure supplement 2). 
These findings suggest that the observed pupil responses during the imagery period of the task are unlikely to be a carry-over effect of the previous sensory response to perceived images.

Pupil size has been shown to depend on eye position (Drewes et al., 2014; Gagl et al., 2011) and the preparation to make a saccade to an upcoming image (Jainta et al., 2011; Mathôt et al., 2015; Wang et al., 2018). Pupil modulation and eye position are also both controlled by largely overlapping circuitry (Wang and Munoz, 2018). It could then be the case that group differences in eye position or saccades (either while viewing or imagining the triangles) may explain our data. To assess how eye movements and position (eccentricity) might be related to our findings, we analysed both eye position and saccades made while viewing and imagining the images, to see if these differed as a function of group. Eccentricity was extracted using the ‘saccades’ package in R (von der Malsburg, 2015) and saccades were detected using a velocity-based algorithm (Engbert and Kliegl, 2003) using the same R package. There were no significant differences between the groups for the number of saccades made during perception or imagery of the stimuli (see Figure 3—figure supplement 4 and Figure 3—figure supplement 5). There were also no differences in mean eccentricity values when comparing the two groups (Figure 3—figure supplement 3), and no correlation between eccentricity and the pupillary light response (Figure 2—figure supplement 3), binocular priming (Figure 2—figure supplement 4), or vividness ratings (Figure 2—figure supplement 5), suggesting that differences in fixation or eye movements between the two groups are unlikely to drive the observed group differences in the mental imagery pupillary light response.
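
For readers unfamiliar with velocity-based saccade detection, the following Python sketch shows the general idea in the spirit of Engbert and Kliegl (2003): smooth the gaze velocity, estimate its spread with a median-based statistic, and flag runs of samples whose velocity falls outside an elliptical threshold. It is a simplified illustration, not the implementation in the ‘saccades’ R package used in the study; the function names and the default values `lam=6.0` and `min_samples=3` are assumptions of this sketch.

```python
import math
from statistics import median


def velocities(positions, dt):
    """Smoothed velocity over a 5-sample window; the two edge samples stay 0."""
    v = [0.0] * len(positions)
    for n in range(2, len(positions) - 2):
        v[n] = (positions[n + 2] + positions[n + 1]
                - positions[n - 1] - positions[n - 2]) / (6.0 * dt)
    return v


def robust_sd(v):
    """Median-based spread estimate, less affected by the saccades themselves."""
    spread = median([x * x for x in v]) - median(v) ** 2
    return math.sqrt(max(spread, 0.0))


def detect_saccades(xs, ys, dt, lam=6.0, min_samples=3):
    """Return (first_index, last_index) pairs for candidate saccades.

    xs, ys: gaze position samples (e.g. degrees); dt: sample interval in seconds.
    A sample is flagged when its velocity lies outside the ellipse with radii
    lam * robust_sd in x and y; runs shorter than min_samples are discarded.
    """
    vx, vy = velocities(xs, dt), velocities(ys, dt)
    eta_x, eta_y = lam * robust_sd(vx), lam * robust_sd(vy)
    if eta_x == 0 or eta_y == 0:
        raise ValueError("velocity spread is zero; cannot set a threshold")
    above = [(a / eta_x) ** 2 + (b / eta_y) ** 2 > 1.0 for a, b in zip(vx, vy)]
    saccades, start = [], None
    # A trailing False closes any run that reaches the end of the recording.
    for i, flag in enumerate(above + [False]):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            if i - start >= min_samples:
                saccades.append((start, i - 1))
            start = None
    return saccades
```

Counting the returned intervals per trial phase gives a saccade count that can then be compared between groups.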

Taken together, these data from the general population and aphantasic individuals suggest that it is the content and ability to form vivid visual images, not the voluntary attempt to do so or the semantic content, that is driving the imagery pupillary light response, providing the first evidence that these pupil changes are due to the sensory strength of imagery content and are not driven by higher-level semantic content.

Discussion

Our results provide novel evidence that our pupils respond to the vividness and strength of a visual image being held in mind: the stronger and more vivid that image, the greater the pupillary light response. Our data provide the first evidence linking the pupil response to strength and vividness of imagery, not only between individuals, but also within an individual as imagery vividness fluctuates from moment to moment (Dijkstra et al., 2017; Pearson et al., 2011; Rademaker and Pearson, 2012). Finally, we show that, as a group, there is no evidence of this pupil response in individuals without mental imagery (aphantasia).

How might the content of mental imagery be driving the pupillary light response? One interpretation of these findings is that this imagery pupillary response is a by-product of the top-down modulation of midbrain-level visual circuitry (pretectal olivary nucleus, superior colliculus; Joshi and Gold, 2020), which occurs when imagining vividly, resulting in these regions interpreting this modulation as coming from external or afferent stimuli, and responding accordingly (Larsen and Waters, 2018; Schwalm and Rosales Jubal, 2017). In this case, the pupil would be responding to imagined luminance in much the same way that it responds to retina-based light sources. This is consistent with current data and models proposing shared mechanisms between visual imagery and perception (Dijkstra et al., 2017; Dijkstra et al., 2019; Ganis et al., 2004; Naselaris et al., 2015; Xie et al., 2020) and the idea that visual imagery functions much like a weak version of afferent perception (Pearson, 2019), supporting the idea that the stronger or more vivid an individual’s imagery is, the more ‘perception like’ their imagery is.

An alternative mechanistic account might be that pupil diameter is encoded along with the original visual information (for example, a bright object), and hence is replayed during memory decoding to form the mental image. This would be in a similar manner to theories proposing a functional role of eye movements during imagery generation from memory (Wang et al., 2020). It will be up to future work to uncover the exact mechanistic account of imagery-induced pupil changes.

Here we also provide the first objective physiological evidence of an extreme lack of visual imagery in aphantasic individuals. Aphantasia has largely been defined using subjective means (Dawes et al., 2020; Jacobs et al., 2018; Pounder et al., 2018; Zeman et al., 2015, but see Keogh and Pearson, 2018). Accordingly, people have remained sceptical about its true nature and possible psychogenic basis (de Vito and Bartolomeo, 2016). Our data demonstrate that using a non-visual strategy (no imagery in aphantasia) to think about bright and dark objects does not induce a pupillary light response. These data simultaneously provide strong evidence linking the pupillary light response to mental imagery, as well as supporting the behavioural work showing that aphantasic individuals indeed lack visual sensory imagery (Keogh and Pearson, 2018). Because the pupillary light response is involuntary (Bouffard, 2019), we can consider these findings as an unbiased neurophysiological measure of aphantasia. Not only do these data show that pupillary light response can be an objective index of imagery strength in studies of imagery in general populations, our data also provide a new low-cost objective measure for aphantasia that is uniquely based on a physiological mechanism and not reliant on self-report.

Could a lack of active engagement during imagery explain the aphantasia results? Put another way, are such participants refusing to imagine (de Vito and Bartolomeo, 2016)? We think this is highly unlikely as pupil size did increase as a function of set size for aphantasic individuals when attempting imagery, as has previously been shown in the general population, demonstrating the typical relationship between cognitive effort or arousal and pupil dilation (Kahneman and Beatty, 1966; van der Wel and van Steenbergen, 2018). This demonstrates active task engagement, suggesting that aphantasic individuals were not simply ‘refusing’ to actively participate in the task due to demand characteristics or a belief that they are unable to imagine (de Vito and Bartolomeo, 2016).

Further, we ran Bayesian one-sample t-tests on the binocular rivalry and pupillary light response difference scores (see Figure 2), comparing participants’ performance to chance to see if there was any evidence they were performing significantly below chance. We found no significant evidence of below-chance performance for either group on either the binocular rivalry or pupillometry imagery tasks (see Figure 2—figure supplement 7). Taken together with the set-size pupillary effect we observed in our aphantasic participants, it seems unlikely that our aphantasic individuals were not engaging in the tasks. However, we cannot fully rule out this possibility. Further, there was no significant evidence of an abnormal pupillary response in our aphantasic cohort when viewing images, thus it is likely the lack of an imagery pupillary light response is due to their lack of visual imagery. The set-size effect also reveals that regardless of what strategy aphantasic participants are implementing (e.g. propositional, spatial, language-like) to recall information about the shapes, they require greater cognitive effort to simultaneously maintain a larger number of shapes in their mind.

One limitation of our study is that we did not include catch trials in our pupillometry task, that is, we did not include trials where we asked participants to report on what image they had been asked to imagine. We did, however, include catch trials in our binocular rivalry task by presenting mock binocular rivalry trials. If aphantasic participants are showing a response bias, we would expect to see a reduction in priming on these mock trials when compared to the control population, which we did not find (see Figure 2—figure supplement 6). Adding catch trials to future experiments, in addition to set-size manipulations, may help to further confirm participant engagement. However, adding a simultaneous memory component to the task may lead some subjects to use a non-visual imagery strategy and, as such, to a reduction or dilution of the pupillary light response (see Pearson and Keogh, 2019). Future studies of visual imagery, and even more importantly those investigating aphantasia, should aim to include appropriate positive controls that allow for the identification of task engagement even when an individual doesn’t have visual imagery. This will allow researchers to exclude the alternative explanation that those individuals who do not show evidence of imagery are simply refusing to imagine or not completing the task correctly.

Another possible explanation of our results is that perceptual pupillary light responses are lingering throughout each trial and driving the observed imagery pupil response. If this is the case, then pupil responses during perceptual viewing and imagery should be correlated; however, we did not find any such correlations (see Figure 2—figure supplement 1). Further, when directly comparing the perceptual pupil responses between the general population and aphantasic individuals, there was no main effect of group or interaction between group and stimulus brightness or set size (see Figure 3—figure supplement 2). This demonstrates that there is no significant difference in the perceptual pupillary responses between the two groups, making it unlikely that aphantasic individuals’ lack of an imagery pupillary response is due to a lack of perceptual response. Finally, we also asked participants if they perceived any after-images during the imagery period, and any participants who reported that they did were excluded from the study. Taken together, these results suggest that it is unlikely that the pupillary response to perceptually viewing the images is driving our observed imagery pupillary responses, and the lack thereof in the aphantasic individuals. Instead, it appears the pupillary light response during the visual imagery period reflects the wilful generation of imagery in the mind’s eye of those who experience visual imagery. This is further substantiated by the strength of visual imagery (measured using the binocular rivalry paradigm) correlating with the imagery pupillary light reflex, but not the perceptual pupillary light reflex (see Figure 2B and Figure 2—figure supplement 2).

We also found that in the imagery task, higher within-trial reports of vividness are reflected by greater pupillary light responses (within-subjects effects; see Figure 1D). This indicates that participants were able to accurately evaluate the vividness of individual episodes of imagery in comparison to other episodes on previous trials. However, participants’ average vividness ratings did not correlate with their pupil-difference scores; that is, participants who gave higher vividness ratings on average did not necessarily have increased pupil light responses in response to imagery (between-subjects effects; see Figure 1—figure supplement 2). Participants’ scores on the VVIQ also did not correlate with pupil-difference scores (between-subjects effects; see Figure 1—figure supplement 3). This suggests that participants might have difficulty accurately reporting the strength of their sensory visual imagery on an absolute scale (i.e. from ‘no image’ to ‘as vivid as perception’). It brings into question the reliability of these subjective measures of imagery and highlights the utility of objective or online (i.e. in-task), less trait-like measures when studying visual imagery.

Recent studies have shown pupil size is also modulated by the content of visual working memory (Blom et al., 2016; Hustá et al., 2019; Zokaei et al., 2019). It is interesting to note here that previous work has shown that imagery has been implicated as one mnemonic that can be used to retain information in mind during visual working memory tasks (Albers et al., 2013; Keogh and Pearson, 2011; Keogh and Pearson, 2014; Keogh and Pearson, 2017). This highlights the possibility that it is imagery, being used as a mnemonic strategy, that is driving the pupillary light response observed in visual working memory experiments (Pearson and Keogh, 2019). Although many participants report using a visual imagery strategy during these tasks, some participants report using a non-visual imagery strategy when remembering visual information, and recent work demonstrates that aphantasic individuals can perform traditional visual working memory tasks just as well as control populations (Keogh et al., 2021b). Measuring the pupillary light response in aphantasic individuals, and those who report not using an imagery strategy, while performing classic visual working memory tasks may help to further elucidate these differences in cognitive strategy use in a more objective manner.

One limitation that is important to note here is that our aphantasic sample was relatively small (18 participants) due to the relative rarity of this condition. Further, our two samples were not age matched, which may have affected our results; however, given that there was no difference between the two groups in the perceptual pupillary light response, we think this is unlikely to be driving our findings. Future studies should aim to replicate and extend these findings with a larger group of aphantasic individuals and age-matched controls.

To conclude, the present study demonstrates that the pupillary light response can be used as a physiological index of individual differences in the sensory and phenomenological strength of visual imagery, including the lack of visual imagery – aphantasia. Combining this measure with the binocular rivalry paradigm, in preference to subjective alternatives, will increase the reliability and objectivity of imagery test batteries and may lead to the development of more congenial theories of the mind’s eye.

Materials and methods

Participants

Fifty-six psychology students with a mean age of 19.8 years (range 18–31, 27 females) were recruited for the study and participated for course credit. We aimed to obtain analysable data from a minimum of 40 participants, which should be a large enough sample to identify a strong positive correlation between pupil dilation and imagery, which is what we would expect if imagery content were driving the previously observed imagery pupillary light response (G*Power: effect size = 0.5, α = 0.05, power (1 − β) = 0.95). Fourteen of these participants were excluded from data analysis for not meeting a priori criteria (see Exclusion criteria), leaving 42 participants in the final general population sample.

The aphantasic individuals come from a rare population and for this reason we did not run a specific power analysis but aimed to collect a minimum of 15 participants. We had nineteen aphantasic individuals agree to participate in the study with a mean age of 35.8 years (aged 18–54, 12 females). One of these individuals was excluded from data analysis for not meeting a priori criteria (see Exclusion criteria), leaving 18 in the final sample. These participants had all contacted the lab regarding their aphantasia and asked to participate in our research. They were all reimbursed $20 AUD per hour for their participation. All participants had normal or corrected to normal vision (i.e. glasses or contacts). Both experiments were approved by the UNSW Human Research Ethics Advisory Panel (HREAP-C 3182).

Apparatus

Stimuli in all experiments were presented on an LCD display monitor (Dell UltraSharp U2419H) with a 60 Hz refresh rate and a 1920 × 1080 resolution. Luminance values of all stimuli were measured using a Konica Minolta chroma meter (CS-100A). Participants placed their chin on a chin rest throughout the experiment to maintain fixation at a distance of 57 cm from the monitor and to limit head movements. The tasks were performed in a blackened room to eliminate any possible fluctuations in ambient light.

In the pupillary response task, pupil sizes and eye movements were recorded using head mounted eye-tracking glasses (Pupil, Pupil Labs GmbH, Berlin, Germany) (Kassner et al., 2014). Pupil diameter of participants’ right eye was continuously sampled at 200 Hz throughout the task. A pupil detection 3D algorithm locates the dark pupil in the infrared illuminated eye camera image, thus recording capabilities are not compromised by an absence of room lighting. Pupil diameter is then scaled to millimetres (mm) based on mean anthropomorphic eyeball diameter and corrected for perspective. The algorithm does not depend on corneal reflection, and is compatible with users who wear contact lenses and most eyeglasses (Kassner et al., 2014).

A second camera mounted on the glasses continuously recorded participants’ field of view. Footage from this camera was subsequently assessed to ensure fixation on the computer monitor was maintained throughout the task. The experiment was designed using MATLAB (version R2017b). ZeroMQ plug-ins were used for cross-communication between eye-tracking and stimulus presentation platforms (Akgul, 2013). Pupil data were recorded with Pupil Capture v.1.10.20 (Pupil Labs) installed on an ASUS (GL502V) PC (Windows 10).

In the binocular rivalry task, participants wore red-green anaglyph glasses to ensure rivalrous stimuli were presented to left and right eyes in isolation. Responses of 1, 2, or 3 on a keyboard were used by participants to indicate which image dominated their perception during binocular rivalry (1 for green; 3 for red; 2 for perceptually mixed green and red).

Stimuli

For the pupillary response task, 32 achromatic shape stimuli were created for participants to perceive and then later imagine in their absence, across 32 trials. The stimuli were evenly divided based on a 2 × 2 factorial design, belonging to one of two luminance conditions (‘Bright’ or ‘Dark’) and one of two set-size conditions (‘Set-Size-One’ or ‘Set-Size-Four’). Shapes belonging to the Bright condition were either white with a luminance of 117 cd/m2 or light grey with a luminance of 65 cd/m2. Shapes in the Dark condition were black (1 cd/m2) or dark grey (9 cd/m2). Set-Size-One stimuli consisted of a single equilateral triangle with 12.5 cm sides, subtending 12.5° of visual angle. Set-Size-Four stimuli consisted of an arrangement of four smaller equilateral triangles with a total surface area and luminance equal to that of the corresponding Set-Size-One triangles (see Figure 1—figure supplement 1 for illustration of all stimuli). Stimuli were also uniquely orientated at either 0°, 90°, 180°, or 270° (e.g. four Set-Size-One black triangles, each with a different orientation; see Figure 1—figure supplement 1 for examples of all possible shape orientations). This ensured that all 32 stimuli were unique and that participants were encoding information about a new stimulus on each trial, therefore avoiding the use of long-term memory. Set-Size-Four stimuli therefore subtended either 10.8° or 18.9° of visual angle depending on their orientation.
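The stimulus geometry described above can be checked with a short calculation. The following Python sketch is illustrative only and is not the authors' MATLAB stimulus code: the function and variable names are hypothetical, and the 57 cm viewing distance is taken from the Apparatus section. It computes the visual angle of a 12.5 cm side and enumerates the luminance, set-size, and orientation combinations.

```python
import math
from itertools import product

def visual_angle_deg(size_cm, distance_cm):
    """Visual angle (degrees) subtended by an extent of size_cm viewed at distance_cm."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))

# Values reported in the text; the names below are illustrative assumptions.
VIEWING_DISTANCE_CM = 57.0
LUMINANCES_CD_M2 = {"Bright": (117, 65), "Dark": (1, 9)}
SET_SIZES = (1, 4)
ORIENTATIONS_DEG = (0, 90, 180, 270)

# One stimulus per combination of luminance level, set size, and orientation.
stimuli = [
    {"condition": cond, "luminance": lum, "set_size": n, "orientation": ori}
    for cond, lums in LUMINANCES_CD_M2.items()
    for lum, n, ori in product(lums, SET_SIZES, ORIENTATIONS_DEG)
]

side_angle = visual_angle_deg(12.5, VIEWING_DISTANCE_CM)
print(f"{len(stimuli)} stimuli; 12.5 cm side subtends {side_angle:.1f} degrees")
```

At 57 cm, 1 cm on the screen corresponds to roughly 1° of visual angle, which is why the 12.5 cm side and the reported 12.5° agree.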

All stimuli were presented on a grey background screen with a luminance of 26 cd/m2. This same level of background luminance was used during measurement of baseline and imagery phases. A fixation cross on a black background with a luminance of 1 cd/m2 was presented during the resting phase of each trial. All stimuli were created in MATLAB, using the Psychophysics Toolbox 3 extensions (Brainard, 1997).

For the binocular rivalry task, sinusoidal luminance modulated Gabor patterns were used as rivalrous stimuli: vertical-green (CIE chromaticity coordinates: x = 0.275, y = 0.590) and horizontal-red (CIE chromaticity coordinates: x = 0.492, y = 0.372), both with a mean luminance of 8.35 cd/m2 and 7.1° of visual angle. In each trial, both patterns were presented at the same time around a fixation point at the centre of a black background screen. Mock rivalry stimuli (a single Gabor pattern spatially divided into half vertical-green and half horizontal-red) were used on 12.5% of trials to measure the influence of decisional bias or lack of attention to the task. More details on the binocular rivalry task can be found in Keogh and Pearson, 2014.

All participants also completed the VVIQ (Marks, 1973) to obtain a self-report measure of trait imagery vividness (see Figure 2—figure supplement 8 for VVIQ data for general population and aphantasic individuals).

Procedure

Pupillometry imagery experiment timeline

Each trial began with the presentation of a white fixation cross at the centre of a grey screen (baseline) for 1 s. An image was then presented at the centre of this grey screen for 5 s (either one or four triangles of varying brightness, see Figure 1—figure supplement 1 for illustrations of all stimuli). Participants were instructed to focus on the stimulus during this time and memorize its size, orientation, and level of brightness. Next, a black screen with a white fixation cross was presented for 8 s, allowing the perceived after-image to completely fade and pupils to dilate back to equivalent resting levels. The grey baseline screen was then presented again for 6 s. During this time, participants were cued (via two auditory beeps) to actively imagine the stimulus observed previously during that trial. Lastly, participants were prompted to report the vividness of their imagery during those previous 5 s on a scale of 1–4 (1 being ‘not vivid at all – no shape appeared in imagery’; 4 being ‘very vivid – almost like seeing it’) via key response.
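The trial timeline above can be expressed as a small lookup of phase windows, which is convenient when slicing a pupil trace recorded at the 200 Hz rate given in the Apparatus section. This is a minimal sketch rather than the authors' implementation: the phase names and the `phase_windows` helper are hypothetical, and the vividness-rating phase is left out because its duration is not stated.

```python
SAMPLE_RATE_HZ = 200  # eye-camera sampling rate reported in the Apparatus section

# (phase name, duration in seconds), in presentation order; names are illustrative.
TRIAL_PHASES = [
    ("baseline", 1.0),    # white fixation cross on grey screen
    ("perception", 5.0),  # stimulus shown on grey screen
    ("rest", 8.0),        # black screen with fixation cross
    ("imagery", 6.0),     # grey screen, imagery cued by two beeps
]

def phase_windows(phases, sample_rate_hz):
    """Map each phase to its (start, stop) sample indices within a trial (stop exclusive)."""
    windows = {}
    onset_s = 0.0
    for name, duration_s in phases:
        start = int(round(onset_s * sample_rate_hz))
        stop = int(round((onset_s + duration_s) * sample_rate_hz))
        windows[name] = (start, stop)
        onset_s += duration_s
    return windows

windows = phase_windows(TRIAL_PHASES, SAMPLE_RATE_HZ)
for name, (start, stop) in windows.items():
    print(f"{name:<10} samples {start}-{stop - 1}")
```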

Binocular rivalry paradigm

Participants were cued to imagine either a red or green Gabor pattern prior to binocular rivalry with the letter ‘R’ or ‘G’ (750 ms). Participants then imagined the image for 6 s, after which they were presented with the binocular rivalry display (750 ms) and were asked to indicate which image was dominant (see Figure 2). Trials where participants reported seeing the pattern they were cued to imagine as dominant were denoted as ‘primed’ trials. Participants completed 2 blocks of 48 trials, resulting in a total of 96 trials (84 real and 12 mock trials). The number of primed trials divided by the total number of trials (excluding mock trials and mixed percepts) was used to calculate a percent primed score for each participant. Mock trial priming was calculated by scoring each mock trial as 0% (reporting the mock trial as the opposite colour to that cued), 50% (reporting the mock trial as mixed), or 100% (reporting the mock trial as the same as the cued image) (Figure 2—figure supplement 6). These values were then averaged to give a priming value, where 50% indicates no bias, values above 50% indicate a bias towards reporting the mock trials as being the same as the imagined image, and values below 50% indicate a bias towards reporting the opposite image to that which was imagined.
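The two scores described above can be sketched in a few lines of Python. The trial representation here (tuples of cue, response, and a mock flag, with string labels instead of the 1/2/3 key codes) and the function names are assumptions made for illustration; only the scoring rules come from the text.

```python
def percent_primed(trials):
    """Percent of real, non-mixed trials on which the reported percept matched the cue.

    trials: iterable of (cue, response, is_mock) with cue in {"R", "G"} and
    response in {"R", "G", "mixed"} (hypothetical encoding).
    """
    valid = [(cue, resp) for cue, resp, is_mock in trials
             if not is_mock and resp != "mixed"]
    if not valid:
        raise ValueError("no valid rivalry trials to score")
    primed = sum(1 for cue, resp in valid if cue == resp)
    return 100.0 * primed / len(valid)

def mock_priming(trials):
    """Mean mock-trial score: 100 if response matches cue, 50 if mixed, 0 if opposite."""
    scores = []
    for cue, resp, is_mock in trials:
        if not is_mock:
            continue
        if resp == "mixed":
            scores.append(50.0)
        elif resp == cue:
            scores.append(100.0)
        else:
            scores.append(0.0)
    if not scores:
        raise ValueError("no mock trials to score")
    return sum(scores) / len(scores)

# Toy data for illustration only (not from the study).
example_trials = [
    ("R", "R", False), ("G", "R", False), ("G", "G", False), ("R", "mixed", False),
    ("R", "mixed", True), ("G", "G", True),
]
print(percent_primed(example_trials), mock_priming(example_trials))
```

In the toy data, two of the three scorable real trials are primed, and the two mock trials average to 75%, which would indicate a bias towards the cued image.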

Exclusion criteria

Of the 56 participants recruited for the general population sample, 14 in total were excluded from data analysis due to not meeting a priori criteria.

Pupillary response task exclusions: eight participants were excluded because more than 50% of their pupil data points were below the pupil detection algorithm confidence value of 0.6, provided by the Pupil Capture system. This cut-off point was derived prior to data collection and is the recommended cut-off point for obtaining accurate pupil size data (Pupil Labs). Three participants were excluded due to reporting (during systematic post-task questioning) seeing after-images of the shape stimuli for longer than the 8 s black screen presentation (i.e. seeing after-images during the imagery phase of trials), because pupil size is known to be influenced by the induced compensatory light perception of an after-image (Tsujimura et al., 2003).

Binocular rivalry task exclusions: three participants were excluded due to having mock rivalry priming >66.67% (more than one incorrect response on the mock trials), which indicated either an influence of decisional bias or lack of attention to the task. An a priori cut-off point of scoring both below 65% priming on the binocular rivalry task and below 32 on the VVIQ was used to exclude participants who potentially did not have visual imagery (i.e. may be aphantasic). No participants fell below this combined cut-off point; thus, none were excluded on this basis.

Of the 19 participants recruited for the aphantasic population, 1 was excluded from data analysis because more than 50% of their pupil data points were below the pupil detection algorithm confidence value of 0.6, given by the Pupil Capture system. All participants scored below both of the a priori cut-off points of 32 on the VVIQ and 65% on the binocular rivalry task, therefore, no participants were excluded due to this criterion.
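As a compact restatement of the general-population criteria above, the following Python sketch applies each rule to one participant's summary values. The function name, argument names, and reason strings are hypothetical; the thresholds are those reported in the text. For the aphantasic sample the combined VVIQ and priming criterion works in the opposite direction (participants were expected to fall below both cut-offs), so this sketch does not apply to that group as written.

```python
CONFIDENCE_THRESHOLD = 0.6          # Pupil Capture detection confidence cut-off
MAX_LOW_CONFIDENCE_FRACTION = 0.5   # exclude if more than 50% of samples fall below it
MOCK_PRIMING_CUTOFF = 66.67         # percent; above this suggests decisional bias
PRIMING_CUTOFF = 65.0               # percent primed in binocular rivalry
VVIQ_CUTOFF = 32                    # VVIQ total score

def exclusion_reasons(confidences, saw_afterimages, mock_priming_pct,
                      percent_primed, vviq):
    """Return the list of exclusion criteria met by a general-population participant."""
    reasons = []
    low_fraction = (sum(1 for c in confidences if c < CONFIDENCE_THRESHOLD)
                    / len(confidences))
    if low_fraction > MAX_LOW_CONFIDENCE_FRACTION:
        reasons.append("low pupil-detection confidence")
    if saw_afterimages:
        reasons.append("after-images during imagery")
    if mock_priming_pct > MOCK_PRIMING_CUTOFF:
        reasons.append("mock rivalry bias")
    if percent_primed < PRIMING_CUTOFF and vviq < VVIQ_CUTOFF:
        reasons.append("possible aphantasia")
    return reasons

# Toy values for illustration only.
print(exclusion_reasons([0.9, 0.8, 0.5, 0.95], False, 50.0, 70.0, 60))
print(exclusion_reasons([0.1, 0.2, 0.9], False, 75.0, 70.0, 60))
```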

Data analysis

For the pupillary response task, cubic spline interpolation was used to estimate pupil diameter during periods where subjects’ pupils were occluded due to blinking (in accordance with Mathôt et al., 2013). Artefacts in the pupil data were then smoothed using a moving average Hanning window (Kret and Sjak-Shie, 2019). Individual trials in which mean pupil diameter while passively viewing the grey baseline screen was lower than 2 mm or higher than 8 mm were excluded (N(total trials from whole sample) = 8) as values outside this range are unnatural pupil sizes and were clear outliers based on inspection of participants’ pupil-baseline histograms (Mathôt et al., 2018). Trials were averaged to form condition-specific pupil diameter waveforms to represent change in pupil size over time. Mean pupil diameter values during imagery in each trial were baseline corrected using a within-trial baseline subtraction approach (Mathôt et al., 2018) (i.e. by subtracting the mean pupil diameter during the 0.5 s prior to stimulus perception onset) to account for temporal shifts in pupil size across the experimental session due to fatigue (Morad et al., 2000). A two-way repeated measures ANOVA was used to compare Dark and Bright means during perception and imagery within both set-size conditions. ‘Pupil-difference’ scores were calculated by subtracting Dark condition means from Bright condition means of the corresponding set size for comparison with binocular rivalry percent primed scores. Pupil-difference scores were also separated based on the discrete within-trial vividness ratings to assess metacognition and whether pupil size changes in response to imagery were reflective of subjects’ own experience of vividness of visual imagery.
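The trial-level steps described above (baseline-range exclusion, within-trial baseline subtraction, and the pupil-difference score) can be sketched as follows. This is an illustrative Python outline, not the authors' analysis code: the dictionary keys and function names are hypothetical, the sign convention (imagery mean minus baseline mean) is an assumption consistent with a baseline subtraction approach, and blink interpolation and smoothing are assumed to have been applied already.

```python
from statistics import mean

MIN_BASELINE_MM = 2.0  # trials with a mean baseline below this are dropped
MAX_BASELINE_MM = 8.0  # trials with a mean baseline above this are dropped

def corrected_imagery_means(trials):
    """Baseline-corrected imagery means (mm) for trials with a plausible baseline.

    Each trial is a dict with hypothetical keys:
      'baseline_mm': mean pupil diameter in the 0.5 s before stimulus onset
      'imagery_mm':  mean pupil diameter during the imagery period
    """
    return [
        t["imagery_mm"] - t["baseline_mm"]  # assumed sign: imagery minus baseline
        for t in trials
        if MIN_BASELINE_MM <= t["baseline_mm"] <= MAX_BASELINE_MM
    ]

def pupil_difference(bright_trials, dark_trials):
    """Bright-condition mean minus Dark-condition mean, for one set size."""
    return (mean(corrected_imagery_means(bright_trials))
            - mean(corrected_imagery_means(dark_trials)))

# Toy values for illustration only (not data from the study).
bright = [
    {"baseline_mm": 4.0, "imagery_mm": 3.8},
    {"baseline_mm": 4.2, "imagery_mm": 4.1},
    {"baseline_mm": 9.1, "imagery_mm": 9.0},  # dropped: baseline outside 2-8 mm
]
dark = [
    {"baseline_mm": 4.0, "imagery_mm": 4.1},
    {"baseline_mm": 3.9, "imagery_mm": 4.1},
]
print(round(pupil_difference(bright, dark), 3))
```

Under this convention, a more negative score means the pupil was smaller when imagining Bright stimuli than when imagining Dark stimuli.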

In the binocular rivalry task, trials where participants reported seeing the pattern they were cued to imagine as dominant in the subsequent binocular rivalry display were denoted as ‘primed’ trials. The number of primed trials divided by the total number of trials (excluding mock trials and mixed percepts) was used to calculate a percent primed score for each participant. Participants’ percent primed scores in binocular rivalry were correlated with their pupil-difference scores (both Set-Size-One and Set-Size-Four) to assess potential for the pupillary response task to measure individual variability in visual imagery strength.

The LMEs were run in R (R Development Core Team, 2018) using the lme4 package, and ANOVAs and ANCOVAs were run in SPSS v.25.0 (IBM Corp. Released, 2017). For the linear mixed-effects models, set size (1 or 4) and vividness ratings (1, 2, 3, and 4) were entered into the model as fixed effects. As random effects, intercepts for subjects were entered into the model. p values were obtained by likelihood ratio tests of the full model with vividness included vs. the model without vividness included.
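The likelihood ratio test mentioned above compares the log-likelihoods of the two nested models. The Python sketch below shows only the arithmetic and is not a substitute for the lme4 analysis: it assumes the models differ by a single parameter (for example, vividness entered as one numeric slope), in which case the test statistic is referred to a chi-square distribution with one degree of freedom. If vividness were treated as a four-level factor, the degrees of freedom would differ and this shortcut would not apply. The log-likelihood values are made up for illustration.

```python
import math

def likelihood_ratio_test_df1(loglik_full, loglik_reduced):
    """Likelihood ratio test for nested models differing by ONE parameter (df = 1).

    Returns (statistic, p_value). For df = 1 the chi-square survival function
    equals erfc(sqrt(x / 2)), so no external statistics library is needed.
    """
    statistic = max(2.0 * (loglik_full - loglik_reduced), 0.0)
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value

# Hypothetical log-likelihoods: full model (with vividness) vs reduced model (without).
stat, p = likelihood_ratio_test_df1(-100.0, -102.0)
print(f"chi-square(1) = {stat:.2f}, p = {p:.4f}")
```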

Bayesian statistics were used to determine whether null findings can be interpreted as evidence for an absence of effect (Dienes, 2014). We used Bayesian repeated measures ANOVA (within-subject effect: set size; between-subject effect: group) to compare the control and aphantasia groups, as well as Bayesian one-sample t-tests to compare each group with H0, defined as the absence of effect. All Bayesian analyses were performed with JASP (Version 0.10.2).

Data availability

Figure 1 - Source Data 1 & 2, Figure 2 - Source Data 3, and Figure 3 - Source Data 4 contain the numerical data used to generate the figures.

References

  1. Akgul F (2013) ZeroMQ: Use ZeroMQ and Learn How to Apply Different Message Patterns. Packt Publishing.
  2. Bouffard MA (2019) The Pupil. CONTINUUM (Minneapolis, Minn.) 25:1194–1214. https://doi.org/10.1212/CON.0000000000000771
  3. Fotiou DF, Brozou CG, Tsiptsios DJ, Fotiou A, Kabitsi A, Nakou M, Giantselidis C, Goula A (2007) Effect of Age on Pupillary Light Reflex: Evaluation of Pupil Mobility for Clinical Practice and Research. Electromyography and Clinical Neurophysiology.
  4. Guariglia C, Pizzamiglio L (2007) The Role of Imagery in Navigation: Neuropsychological Evidence. In: Mast F, Jäncke L, editors. Spatial Processing in Navigation, Imagery and Perception. Boston, MA: Springer. pp. 17–28. https://doi.org/10.1007/978-0-387-71978-8
  5. IBM Corp. Released (2017) IBM SPSS Statistics version 25.0. IBM Corp, Armonk, NY.
  6. Jacobs C, Schwarzkopf DS, Silvanto J (2018) Visual working memory performance in aphantasia. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior 105:61–73. https://doi.org/10.1016/j.cortex.2017.10.014
  7. Kassner M, Patera W, Bulling A (2014) Pupil: An open source platform for pervasive eye tracking and mobile gaze-based interaction. UbiComp 2014 - Adjunct Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing. https://doi.org/10.1145/2638728.2641695
  8. Keogh R, Pearson J (2018) The blind mind: No sensory visual imagery in aphantasia. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior 105:53–60. https://doi.org/10.1016/j.cortex.2017.10.012
  9. R Development Core Team (2018) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
  10. Zeman A, Dewar M, Della Sala S (2015) Lives without imagery - Congenital aphantasia. Cortex; a Journal Devoted to the Study of the Nervous System and Behavior 73:378–380. https://doi.org/10.1016/j.cortex.2015.05.019

Decision letter

  1. John T Serences
    Reviewing Editor; University of California, San Diego, United States
  2. Chris I Baker
    Senior Editor; National Institute of Mental Health, National Institutes of Health, United States
  3. Martin Rolfs
    Reviewer; Humboldt Universität zu Berlin, Germany
  4. Jesse Breedlove
    Reviewer; UMN, United States

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "The eyes have it: The pupillary light response as a physiological index of aphantasia, sensory and phenomenological imagery strength" for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Chris Baker as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Martin Rolfs (Reviewer #2); Jesse Breedlove (Reviewer #3).

We all thought that the paper was interesting and that it addresses a timely topic with a novel approach. There is a lot to like here in terms of learning about mental imagery but also about people who seem to lack it. That said, you'll see that the reviewers brought up some substantive points, and, after consultation, we think that a revision might be possible to address these points but that it will require more analyses and most likely more data.

In consultation, we focused primarily on two issues: eye movements and demand characteristics.

With respect to eye movements, there are a number of reasons why we think further analyses are crucial:

1. Pupil size (in particular, constriction velocity, maximum constriction, and mean pupil change) depends on stimulus eccentricity (e.g., https://doi.org/10.1016/j.visres.2020.03.008).

2. Measured pupil size depends on eye position, and vice versa. The origin of these effects is measurement error related to video-based eye tracking, as this dependency is seen even for artificial, fixed-size pupils (e.g., https://doi.org/10.3758/s13428-011-0109-5; https://doi.org/10.1371/journal.pone.0111197).

3. There is now plenty of evidence that saccade preparation alters pupil size (https://doi.org/10.1037/a0038653; https://doi.org/10.3389/fnhum.2011.00097; https://doi.org/10.5334/joc.33; https://doi.org/10.1111/ejn.12883) and that both are controlled by largely-overlapping circuitry (e.g., https://doi.org/10.1073/pnas.1809668115).

4. Saccades greatly alter content in visual short-term memory (e.g., https://doi.org/10.1037/xlm0000338; reviewed in https://doi.org/10.1080/13506285.2020.1764156).

5. Eye movements can be correlated with imagery (e.g., https://doi.org/10.1162/jocn.1997.9.1.27); comparison between groups might thus provide additional indicators of imagery.

As a consequence, a comparison of eye position and saccade statistics is very important.

With respect to demand characteristics, R3 has some detailed suggestions, and we all feel that a revision of the design would potentially yield significant gains. It doesn't seem technically difficult to collect the data, but we also understand that collecting any data in person these days can be challenging. However, at the very least we think you should report on the mock trials in the binocular rivalry task (to show there wasn't a difference between groups in what you did check for, a BR bias), pull back on your claims accordingly, and provide a discussion of this limitation to the results.

Again, please see the reviews for more details, but we hope that you will find the comments helpful in deciding next steps for the paper.

Reviewer #1 (Recommendations for the authors):

1. Please analyze and report eye movement parameters in each experiment and whether they correlate with vividness of imagery. Please also clarify where observers were required to look during each phase of the experiments.

2. Please provide evidence that the correlations reported in Figure 1D persist at the level of individuals.

3. Please revise this Discussion section to clarify that such tests would always have to be combined with positive tests that show the commitment of participants to the task instructions.

Reviewer #2 (Recommendations for the authors):

My largest (and maybe only) major concern is the possibility that aphantasic subjects were not attempting to imagine during both the main experiment and the BR task. This concern is somewhat exacerbated by the fact that subjects reached out to the lab to participate, making risk for demand characteristics high. The increase in pupil size in the aphantasic group in the set-size-4 condition is encouraging but this increase could also represent a number of other things. While I am hard pressed to come up with a way to ever eliminate this potential fully, I do think a bit more could be done to strengthen the argument against lack of participation and so I'm wary of the claim that this has been fully "ruled out" (line 322). My suggestions are as follows:

– Most ideally, there would be an addition of a probe task following the imagery period that would require subjects to report on the objects that they were supposed to remember and imagine (for example, like the one in Zokaei 2019, which the authors cited). Both groups should be able to perform fairly well since, as the authors point out, aphantasic individuals can perform visual working memory tasks just as well as the general population. This would serve as a better indicator of effort and attention and require that subjects at the very least engage with the visual features of the stimuli (maybe they would have to adjust a probe's brightness until it matched the imagined one or adjust an angle to match that of the triangle, etc.).

– I also appreciate the use of a more objective measure of imagery vividness with the binocular rivalry task, but this suffers from the same issue. Aphantasic subjects could be not attempting to imagine or have an unintentional bias pushing them away from indicating the priming effect. The latter can be addressed using catch/mock trials as the authors did use. However if my math is correct, there were 24 trials total with only 3 mock trials. So while they did exclude subjects for making more than 1 incorrect response to mock trials, that still means that subjects could have shown a bias on 33% of the trials and still be included in the study. This might account for some of the findings if there were more aphantasic than control subjects who incorrectly answered a mock trial. My suggestion is then to increase the number and type of mock trials (all green and all red in addition to mixed) and (or at the very least) report the missed mock trials for each group.

– I would also suggest estimating the null hypothesis and significance bound through random sampling of the experiment under the null to show that the aphantasia subjects did not perform significantly worse than chance in Figure 2b.

Lines 124 – 127 "These data provide novel evidence individuals can reliably evaluate the comparative vividness of single episodes of imagery. Further, these data demonstrate that the pupillary light response also tracks the phenomenological vividness of visual imagery from moment to moment." These two lines are circular. The authors are using changes in pupil size as evidence that subjects can reliably report their vividness and then using subject reports as evidence that pupil size is a good indicator of vividness. Suggest rewriting /rewording

Around line 256: the authors state there is no correlation between perception and imagery conditions. I initially found this confusing because if imagery is like faint vision, the pupil change during imagery of an object should mimic the pupil change during viewing of the same object. The point I believe they were making is that there is no correlation when all subjects are pooled together. I think this would be clearer if the authors first pointed out that there is a correlation between perception and imagery, but in the general population only (if true).

Line 312 – I believe the authors might be conflating psychogenic and fabrication and suggest that they revise the discussion to be clearer on this. The results speak to the latter but not so much the former. While psychogenic implies there might not be a traceable physical cause, it doesn't necessitate that the patient is acting intentionally and they likely experience it as real (for example, in non-epileptic seizures the brain is in fact not seizing, but the patient often isn't consciously convulsing and truly believes they are having a seizure). The lack of experience of imagery could lead to a lack of pupillary light response. A psychogenic source would not be inconsistent with the findings since the authors are not claiming to identify what is blocking or restricting the visual experience of imagery, just that there is a link between that experience and a physiological response. I suggest removing the claim that the study rules out a psychogenic source to aphantasia.

The timing diagram in Figure 1A seems off. The caption and methods say that the black screen rest was presented for 8s, but only 7s exist between the 2nd and 3rd dotted lines, and this doesn't match Figure 3. The reason for using a black screen for rest was also not clear. Together, these made the large dip in pupil size before imagery a bit confusing at first.

https://doi.org/10.7554/eLife.72484.sa1

Author response

We all thought that the paper was interesting and that it addresses a timely topic with a novel approach. There is a lot to like here in terms of learning about mental imagery but also about people who seem to lack it. That said, you'll see that the reviewers brought up some substantive points, and, after consultation, we think that a revision might be possible to address these points but that it will require more analyses and most likely more data.

In consultation, we focused primarily on two issues: eye movements and demand characteristics.

With respect to eye movements, there are a number of reasons why we think further analyses are crucial:

1. Pupil size (in particular, constriction velocity, maximum constriction, and mean pupil change) depends on stimulus eccentricity (e.g., https://doi.org/10.1016/j.visres.2020.03.008).

2. Measured pupil size depends on eye position, and vice versa. The origin of these effects is measurement error related to video-based eye tracking, as this dependency is seen even for artificial, fixed-size pupils (e.g., https://doi.org/10.3758/s13428-011-0109-5; https://doi.org/10.1371/journal.pone.0111197).

3. There is now plenty of evidence that saccade preparation alters pupil size (https://doi.org/10.1037/a0038653; https://doi.org/10.3389/fnhum.2011.00097; https://doi.org/10.5334/joc.33; https://doi.org/10.1111/ejn.12883) and that both are controlled by largely-overlapping circuitry (e.g., https://doi.org/10.1073/pnas.1809668115).

4. Saccades greatly alter content in visual short-term memory (e.g., https://doi.org/10.1037/xlm0000338; reviewed in https://doi.org/10.1080/13506285.2020.1764156).

5. Eye movements can be correlated with imagery (e.g., https://doi.org/10.1162/jocn.1997.9.1.27); comparison between groups might thus provide additional indicators of imagery.

As a consequence, a comparison of eye position and saccade statistics is very important.

We thank the reviewers and editor for their detailed and thoughtful points regarding eye movements and pupil diameter. We think the points raised are fair, and we have added a supplementary analysis of the eccentricity and saccade data to the manuscript. The eccentricity data show that participants mostly maintained fixation throughout the experiment, and there was no significant difference between the groups in their average eccentricity values (see Figure 2 —figure supplement 2). There were also no significant correlations between eccentricity and the pupillary light response for either group during the imagery period (see Figure 3 —figure supplement 3). We also assessed whether the number of saccades during perception and imagery differed across groups or as a function of either set size or luminance. We found no consistent differences in the number of saccades across the two groups for these variables (see Figure 2 —figure supplements 3 and 4). Taken together, we believe these results make it unlikely that our pupil diameter findings are driven by differences in eccentricity/fixation or saccades between the two groups, and we thank the Reviewers for helping us address this important potential alternative explanation of our data. We believe that showing that eye movements are unlikely to be driving the observed imaginary pupillary light reflex has strengthened our paper.

With respect to demand characteristics, R3 has some detailed suggestions, and we all feel that a revision of the design would potentially yield significant gains. It doesn't seem technically difficult to collect the data, but we also understand that collecting any data in person these days can be challenging. However, at the very least we think you should report on the mock trials in the binocular rivalry task (to show there wasn't a difference between groups in what you did check for, a BR bias),

Thank you for these suggestions. We will endeavour to incorporate them into future studies we run. Due to the current testing environment, however, it will take a long time to collect enough data to run a full new study as suggested. We have added the mock trial data to the supplementary material (Figure 3 —figure supplement 5), along with additional information regarding these trials (see response to point 3 from Reviewer #3).

pull back on your claims accordingly, and provide a discussion of this limitation to the results.

We have toned down our claims and added a discussion of the limitations of the study; see the response to point 2 from Reviewer #3.

Reviewer #1 (Recommendations for the authors):

1. Please analyze and report eye movement parameters in each experiment and whether they correlate with vividness of imagery. Please also clarify where observers were required to look during each phase of the experiments.

Participants were instructed to maintain fixation on the fixation cross throughout the experiment, which has been clarified in Figure 1’s legend. Following Reviewer 2’s suggestion, we have also added the analysis of eccentricity and saccade data to the supplementary materials (Figure 2 —figure supplement 2 and 3, Figure 3 —figure supplement 3, see also above response to general points). We found no consistent differences between the groups, making it unlikely that differences in eye position are driving the lack of a pupillary light response in the aphantasic population.

2. Please provide evidence that the correlations reported in Figure 1D persist at the level of individuals.

We are not sure we understand the Reviewer’s comment correctly. We do not report correlations for Figure 1D, but the results of a 2 x 4 linear mixed-effects analysis. This model included subject identity as a random effect (see Methods), and therefore the effects reported were computed at the subject level. In the text, we report effects that are significant at the level of the sample. This does not exclude the possibility of inter-individual differences, but we are not sure how interpretable a single-subject analysis is in the current study.

3. Please revise this Discussion section to clarify that such tests would always have to be combined with positive tests that show the commitment of participants to the task instructions.

We have now added the following to the discussion regarding the importance of including these commitment controls in imagery studies:

“Future studies of visual imagery, and even more importantly when investigating aphantasia, should aim to include appropriate controls that allow for the identification of task engagement even when an individual doesn’t have visual imagery. This will allow researchers to exclude the alternate explanation that those individuals who do not show evidence of imagery are not just refusing to imagine or not completing the task correctly.”

Reviewer #2 (Recommendations for the authors):

My largest (and maybe only) major concern is the possibility that aphantasic subjects were not attempting to imagine during both the main experiment and the BR task. This concern is somewhat exacerbated by the fact that subjects reached out to the lab to participate, making risk for demand characteristics high. The increase in pupil size in the aphantasic group in the set-size-4 condition is encouraging but this increase could also represent a number of other things. While I am hard pressed to come up with a way to ever eliminate this potential fully, I do think a bit more could be done to strengthen the argument against lack of participation and so I'm wary of the claim that this has been fully "ruled out" (line 322).

We have re-worded parts of the discussion to tone down such claims:

“This demonstrates active task engagement suggesting that aphantasic individuals were most likely not simply ‘refusing’ to actively participate in the task due to demand characteristics or a belief that they are unable to imagine (de Vito and Bartolomeo, 2016)."

“…However, we cannot fully rule out this possibility. Further, there was no evidence of an abnormal pupillary response in our aphantasic cohort when viewing images, thus it is likely the lack of an imaginary pupillary light response is due to their self-reported lack of visual imagery.”

My suggestions are as follows:

– Most ideally, there would be an addition of a probe task following the imagery period that would require subjects to report on the objects that they were supposed to remember and imagine (for example, like the one in Zokaei 2019 which the authors cited). Both groups should be able to perform fairly well since, as the authors point out, aphantasic individuals can perform visual working memory tasks just as well as the general population. This would serve as a better indicator of effort and attention and require that subjects at the very least engage with the visual features of the stimuli (maybe they would have to adjust a probe's brightness until it matched the imagined one or adjust an angle to match that of the triangle, etc.).

Thank you for this excellent suggestion, which could be implemented in follow-up studies. However, as mentioned above, the current situation makes the planning of new experiments extremely uncertain. In addition, we did not find evidence suggesting aphantasic participants did not engage in the task. In fact, the modulation of pupil size by stimulus complexity suggests that these individuals engaged in the task, at least sufficiently for this effect to emerge. We agree that the methodology can be improved, and we are thankful for the Reviewer’s suggestion. But we do think our conclusions are warranted by the data at hand.

We have now further clarified our reasoning and better outlined the limitations of our study in the Discussion section. We wanted the participants to focus on holding the image in their mind and creating the most vivid image they were able to. Having them rate their vividness reinforces the imagery component of the task. If we had asked participants to remember the items instead, it is possible that some participants may have imagined the images as a mnemonic strategy. However, it is also possible that they may have changed the type of strategy they used to remember the items, which might not have involved imagery. This, in and of itself, is interesting; however, it was outside the scope of the current study. In addition, if multiple participants were not using visual imagery to remember the images, this may have diluted the imaginary pupillary light response, and replicating and extending this finding was central to the research question of this study. We have added a limitations and future directions section to our discussion that speaks to these points:

“One limitation to our study is that we did not include catch trials in our pupillometry task, i.e. we did not include trials where we asked participants to report on what image they had been asked to imagine. We did however include catch trials in our binocular rivalry task by presenting mock binocular rivalry trials. If aphantasic participants are showing a response bias we would expect to see a reduction in priming on these mock trials when compared to the control population, which we did not find (see Figure 3 —figure supplement 5). Adding catch trials to future experiments, in addition to set-size manipulations, may help to further confirm participant engagement. However, adding a simultaneous memory component to the task may lead some subjects to use a non-visual imagery strategy and, as such, to a reduction or dilution of the pupillary light response (see Pearson and Keogh (2019)). Future studies of visual imagery, and even more importantly when investigating aphantasia, should aim to include appropriate positive controls that allow for the identification of task engagement even when an individual doesn’t have visual imagery. This will allow researchers to exclude the alternate explanation that those individuals who do not show evidence of imagery are not just refusing to imagine or not completing the task correctly.”

– I also appreciate the use of a more objective measure of imagery vividness with the binocular rivalry task, but this suffers from the same issue. Aphantasic subjects could be not attempting to imagine or have an unintentional bias pushing them away from indicating the priming effect. The latter can be addressed using catch/mock trials as the authors did use. However if my math is correct, there were 24 trials total with only 3 mock trials. So while they did exclude subjects for making more than 1 incorrect response to mock trials, that still means that subjects could have shown a bias on 33% of the trials and still be included in the study. This might account for some of the findings if there were more aphantasic than control subjects who incorrectly answered a mock trial. My suggestion is then to increase the number and type of mock trials (all green and all red in addition to mixed) and (or at the very least) report the missed mock trials for each group.

We take the Reviewer’s point seriously. However, we note that it is standard for mock or catch trials to represent a minority of the total number of trials. In fact, overly frequent mock trials could make the subjects aware of the existence of the mock trials and fundamentally alter the results. In addition, we think our interpretation of the data (that the absence of priming is due to a lack of imagery, in accordance with individuals’ self-reports) is more parsimonious than hypothesising the existence of an unintentional bias.

To give readers the clearest account of our data, we have now included the data for mock trials for both the controls and undergraduate students in the supplementary material in addition to VVIQ scores (Figure 3 —figure supplement 5 and Figure 2 —figure supplement 6). We have also further clarified how mock trials were calculated in the procedure. Upon re-reading the manuscript, we realised we did not include the number of trials participants completed in the binocular rivalry task; this has now been updated in the procedure section (84 real and 12 mock trials). The mock trials we use have a bespoke zig-zag walk border between the red and green patterns, and thus are not exactly the same on each presentation, and appear slightly more red or green. We hope this explanation helps to clarify the mock trial data.

“Participants completed 2 blocks of 48 trials, resulting in a total of 96 trials (84 real and 12 mock trials). The number of primed trials divided by the total number of trials (excluding mock trials and mixed percepts) was used to calculate a percent primed score for each participant. Mock trial priming was calculated by giving a value to each mock trial of either 0% (reporting the catch trial as the opposite colour to that primed), 50% (reporting the catch trial as being mixed) or 100% (reporting the catch trial to be the same as the cued image) (Figure 3 —figure supplement 5). These values are then averaged to get a priming value where 50% indicates no bias, higher values indicate a bias towards reporting the mock trials as being the same as the imagined image, and values below 50% indicate a bias towards reporting the opposite image to that which was imagined.”
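The scoring rule described in this passage can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' analysis code: the response labels ("same", "opposite", "mixed") and the function names are hypothetical, chosen only to mirror the wording of the procedure.

```python
from statistics import mean

# Hypothetical response labels; the manuscript does not specify a data format.
# "same" = reported the cued (imagined) image, "opposite" = reported the other
# image, "mixed" = reported a mixed percept.
MOCK_SCORES = {"opposite": 0.0, "mixed": 50.0, "same": 100.0}


def percent_primed(real_responses):
    """Percent of real trials reported as the cued image.

    Mixed percepts are excluded from the denominator, as in the procedure;
    mock trials are assumed to have been removed already.
    """
    valid = [r for r in real_responses if r != "mixed"]
    if not valid:
        raise ValueError("no valid (non-mixed) real trials")
    return 100.0 * sum(r == "same" for r in valid) / len(valid)


def mock_priming(mock_responses):
    """Average mock-trial score: 50 means no bias, above 50 a bias towards
    the cued image, below 50 a bias towards the opposite image."""
    return mean(MOCK_SCORES[r] for r in mock_responses)


# Usage with made-up responses for one participant.
real = ["same", "same", "opposite", "mixed"]
mock = ["same", "opposite", "mixed", "mixed"]
print(percent_primed(real), mock_priming(mock))
```

Keeping the two scores in separate functions reflects the procedure, in which mock trials are excluded from the percent primed score and summarised on their own scale.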

– I would also suggest estimating the null hypothesis and significance bound through random sampling of the experiment under the null to show that the aphantasia subjects did not perform significantly worse than chance in Figure 2b.

We have now run one-sample Bayesian t-tests to assess the evidence for aphantasic individuals performing significantly below chance in both the imaginary pupillary light response and binocular rivalry tasks (figure 1D). For both groups, there was no significant evidence that priming scores were lower than chance (comparing scores to 50%: Aphantasic individuals BF = .162, Controls BF = .012). Similarly, one-sample t-tests found no significant evidence that either group’s pupil difference scores were lower than chance (comparing to 0: Aphantasic individuals SS1 BF = .860, Aphantasic individuals SS4 BF = .187, Controls SS1 BF = .091, Controls SS4 BF = .050). These analyses have been added to figure 2B as Figure supplement 6 and the following has been added to the discussion:

“We ran Bayesian one-sample t-tests on the binocular rivalry and pupillary light response difference scores (see figure 2), comparing performance to chance to see if there was any evidence that either group was performing significantly below chance. We found no significant evidence of below chance performance for either group on either the binocular rivalry or pupillometry imagery tasks (see Figure 2 —figure supplement 6). Taken together with the set-size pupillary effect we observed in our aphantasic participants, it seems unlikely that our aphantasic individuals were not engaging in the tasks.”
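For comparison, the random-sampling approach the reviewer suggested could be sketched as below. This is not the analysis reported in the response (which used Bayesian one-sample t-tests). It is a minimal Monte Carlo illustration that assumes each participant reports the primed percept with probability 0.5 on each of 84 real trials and ignores mixed percepts. The group size of 18, the 5% bound and the function names are hypothetical choices made for illustration.

```python
import random


def simulate_null_group_means(n_subjects, n_trials, n_sims=2000,
                              p_chance=0.5, seed=0):
    """Simulate group-mean percent-primed scores under the null hypothesis
    that every response is primed with probability p_chance."""
    rng = random.Random(seed)
    means = []
    for _ in range(n_sims):
        subject_scores = [
            100.0 * sum(rng.random() < p_chance for _ in range(n_trials)) / n_trials
            for _ in range(n_subjects)
        ]
        means.append(sum(subject_scores) / n_subjects)
    return sorted(means)


def lower_bound(sorted_means, alpha=0.05):
    """One-sided lower significance bound: the alpha quantile of the
    simulated null distribution of group means."""
    return sorted_means[int(alpha * len(sorted_means))]


# Hypothetical group of 18 participants with 84 real trials each.
null_means = simulate_null_group_means(n_subjects=18, n_trials=84, n_sims=500)
print(lower_bound(null_means))
```

An observed group mean falling below the printed bound would count as below-chance performance at the chosen one-sided level under these assumptions.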

Lines 124 – 127 "These data provide novel evidence individuals can reliably evaluate the comparative vividness of single episodes of imagery. Further, these data demonstrate that the pupillary light response also tracks the phenomenological vividness of visual imagery from moment to moment." These two lines are circular. The authors are using changes in pupil size as evidence that subjects can reliably report their vividness and then using subject reports as evidence that pupil size is a good indicator of vividness. Suggest rewriting/rewording.

We have now removed the first line from this paragraph (see manuscript).

Around line 256: the authors state there is no correlation between perception and imagery conditions. I initially found this confusing because if imagery is like faint vision, the pupil change during imagery of an object should mimic the pupil change during viewing of the same object. The point I believe they were making is that there is no correlation when all subjects are pooled together. I think this would be clearer if the authors first pointed out that there is a correlation between perception and imagery, but in the general population only (if true).

The reviewer is correct that there is no correlation at the group level between pupil responses during perception and imagery, specifically when looking at the control population. Having re-read this section, we can see the source of the confusion, because the point of the study is to show that imagery acts like weak perception by showing a pupillary light response. To clarify, it was important to run this analysis because pupil responses during imagery might simply passively reflect the amount the pupils responded during the perception phase, rather than reflecting the effortful generation and maintenance of mental images. Specifically, imagery does act like perception in that it produces a pupillary light reflex; however, this analysis shows that this effect is not just due to lingering pupil responses to the previously seen images. We have added further clarification of this point to the manuscript, and we hope this distinction is now more readily understandable from the added text:

“Another possible explanation of our findings could be that the passive viewing of the images, and lingering visual persistence and sluggish pupil responses could be driving our results. If this is the case, we would expect that pupil diameter during the perception of the images should correlate with pupil size during imagery for the corresponding images. Further, the pupillary light reflex during perception should be more pronounced in the control than the aphantasic populations. To investigate this possible alternative explanation of our data we first assessed the correlations between pupil diameter during perception of bright and dark images for set size one and four and their corresponding imagery conditions (control participants only). We found there were no significant correlations between any of the perception and imagery conditions, or the difference scores for set size one and four (all p >.40, see Figure 2 —figure supplement 1). This lack of a correlation suggests that those individuals who have the largest pupillary light response while viewing the images, do not also have the greatest imagery driven pupillary light responses, making it unlikely that the pupil response while seeing the image is driving the mental imagery pupillary response.”

Line 312 – I believe the authors might be conflating psychogenic and fabrication and suggest that they revise the discussion to be clearer on this. The results speak to the latter but not so much the former. While psychogenic implies there might not be a traceable physical cause, it doesn't necessitate that the patient is acting intentionally and they likely experience it as real (for example, in non-epileptic seizures the brain is in fact not seizing, but the patient often isn't consciously convulsing and truly believes they are having a seizure). The lack of experience of imagery could lead to a lack of pupillary light response. A psychogenic source would not be inconsistent with the findings since the authors are not claiming to identify what is blocking or restricting the visual experience of imagery, just that there is a link between that experience and a physiological response. I suggest removing the claim that the study rules out a psychogenic source to aphantasia.

We thank the reviewer for their thoughtful response. We have now removed this claim from the discussion.

The timing diagram in Figure 1A seems off. The caption and methods say that the black screen rest was presented for 8s, but only 7s exist between the 2nd and 3rd dotted lines, and this doesn't match Figure 3. The reason for using a black screen for rest was also not clear. Together, these made the large dip in pupil size before imagery a bit confusing at first.

Thank you for noting this error in Figure 1A. This has now been amended to correctly show an 8s black screen rest period. We used a black screen for the rest period firstly to accurately replicate the experimental design used in Laeng and Sulutvedt, 2014 (The Eye Pupil Adjusts to Imaginary Light). The black screen also serves as a wash-out period for after-images caused by stimuli during the perception phase of the trial. Additionally, it brings the pupils to a similar diameter at the beginning of the imagery component of the task.

https://doi.org/10.7554/eLife.72484.sa2

Article and author information

Author details

  1. Lachlan Kay

    School of Psychology, University of New South Wales, Sydney, Australia
    Contribution
    Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft
    Contributed equally with
    Rebecca Keogh
    Competing interests
    No competing interests declared
  2. Rebecca Keogh

    1. School of Psychology, University of New South Wales, Sydney, Australia
    2. School of Psychological Sciences, Macquarie University, Sydney, Australia
    Contribution
    Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Supervision, Visualization, Writing - review and editing
    Contributed equally with
    Lachlan Kay
    For correspondence
    rebeccalkeogh@gmail.com
    Competing interests
    No competing interests declared
    ORCID: 0000-0003-4814-433X
  3. Thomas Andrillon

    1. School of Psychology, University of New South Wales, Sydney, Australia
    2. Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, Paris, France
    Contribution
    Methodology, Software, Supervision, Writing - review and editing
    Competing interests
    No competing interests declared
    ORCID: 0000-0003-2794-8494
  4. Joel Pearson

    School of Psychology, University of New South Wales, Sydney, Australia
    Contribution
    Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing - review and editing
    Competing interests
    No competing interests declared
    ORCID: 0000-0003-3704-5037

Funding

National Health and Medical Research Council (APP1024800)

  • Joel Pearson

National Health and Medical Research Council (APP1046198)

  • Joel Pearson

National Health and Medical Research Council (APP1085404)

  • Joel Pearson

National Health and Medical Research Council (APP1049596)

  • Joel Pearson

Australian Research Council (DP140101560)

  • Joel Pearson

Human Frontier Science Program (LT000362/2018-L)

  • Thomas Andrillon

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.

Ethics

Informed written consent was obtained from all participants to participate in the experiment and to publish their anonymized data in a journal article. Both experiments were approved by the UNSW Human Research Ethics Advisory Panel (HREAP-C 3182).

Senior Editor

  1. Chris I Baker, National Institute of Mental Health, National Institutes of Health, United States

Reviewing Editor

  1. John T Serences, University of California, San Diego, United States

Reviewers

  1. Martin Rolfs, Humboldt Universität zu Berlin, Germany
  2. Jesse Breedlove, UMN, United States

Publication history

  1. Received: July 26, 2021
  2. Preprint posted: September 3, 2021
  3. Accepted: March 30, 2022
  4. Accepted Manuscript published: March 31, 2022 (version 1)
  5. Version of Record published: April 19, 2022 (version 2)

Copyright

© 2022, Kay et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Lachlan Kay, Rebecca Keogh, Thomas Andrillon, Joel Pearson (2022)
The pupillary light response as a physiological index of aphantasia, sensory and phenomenological imagery strength
eLife 11:e72484.
https://doi.org/10.7554/eLife.72484
