Cortical excitability controls the strength of mental imagery

Abstract
Introduction
Results
Discussion
Materials and methods
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

Mental imagery provides an essential simulation tool for remembering the past and planning the future, with its strength affecting both cognition and mental health. Research suggests that neural activity spanning prefrontal, parietal, temporal, and visual areas supports the generation of mental images. Exactly how this network controls the strength of visual imagery remains unknown. Here, brain imaging and transcranial magnetic phosphene data show that lower resting activity and excitability levels in early visual cortex (V1-V3) predict stronger sensory imagery. Further, electrically decreasing visual cortex excitability using tDCS increases imagery strength, demonstrating a causative role of visual cortex excitability in controlling visual imagery. Together, these data suggest a neurophysiological mechanism of cortical excitability involved in controlling the strength of mental images.

Introduction

Visual imagery - the ability to ‘see with the mind’s eye’ - is ubiquitous in daily life for many people; however, the strength and vividness with which people are able to imagine varies substantially from one individual to another. Due to its highly personal nature, the study of visual imagery has historically relied on self-report measures and had long been relegated to the shadows of scientific inquiry. However, with the advent of fMRI and new analysis techniques like decoding, as well as new advances in behavioral and psychophysical experiments, this is quickly changing (Pearson, 2014; Pearson, 2019).

To date, much of the research in the field of visual imagery has focused on the similarities between visual imagery and perception, due to a long-ranging debate around whether visual imagery can be depictive and/or pictorial, referred to as the ‘imagery debate’ (Pearson and Kosslyn, 2015a). Research has shown that a large network of occipital, parietal, and frontal areas are involved when imagining (Pearson et al., 2015b; Dijkstra et al., 2019), with recent studies providing evidence that visual imagery content is tied to early visual cortex, indicating that imagery-related processing overlaps with that of perception. For example, research using fMRI has demonstrated that BOLD activity in early visual cortex increases when individuals imagine, and the content of visual imagery can be decoded from early visual cortex, as well as being cross-decoded from perception (Albers et al., 2013; Thirion et al., 2006; Cui et al., 2007). Additionally, recent work has shown that trial-by-trial self-rated vividness of visual imagery during an imagery task correlated with the neural overlap between perception and imagery (Dijkstra et al., 2017). Brain stimulation research has similarly investigated whether the early visual cortex is involved during visual imagery with findings demonstrating that, like motor imagery, visual cortex excitability increases during imagery (Cattaneo et al., 2011; Sparing et al., 2002).

It is now well accepted that visual imagery can indeed be pictorial/depictive in nature and involves representations in low-level visual cortex (Pearson and Kosslyn, 2015a). However, there are also large individual differences in the reported vividness of imagery across the general population. Some report imagery so vivid it is akin to seeing the image, while others report no experience of visual imagery at all, a new special population referred to as aphantasia (Zeman et al., 2015). These large individual differences exist in both subjective reports (Galton, 1883), and in objective measures of imagery strength (Keogh and Pearson, 2018). Little research has investigated exactly what drives these large individual differences. One study reported that the vividness of visual imagery correlates positively with BOLD activity changes in visual cortex during an imagery task (Cui et al., 2007). Another study found a correlation between imagery vividness and the similarity of BOLD responses for perception and imagery in early visual cortex (Lee et al., 2012). A recent study found that trial-by-trial differences in imagery vividness were also related to the similarity of BOLD responses between imagery and perception (Dijkstra et al., 2017).

Taken together, these studies suggest that visual cortex is linked to the subjective vividness of visual imagery. However, they do not provide information about why some individuals are better at recruiting the early visual cortex to create stronger more vivid images. Work in synesthesia and migraines has found evidence that the neural excitability of early visual cortex relates to the experience of involuntary forms of visual imagery (Terhune et al., 2015a; Terhune et al., 2011; Gunaydin et al., 2006). Specifically, these previous studies have shown that individuals who experience grapheme-colour synesthesia, or auras prior to the onset of migraines, have heightened visual cortical excitability measured by TMS phosphene thresholds (Terhune et al., 2015a; Terhune et al., 2011; Gunaydin et al., 2006). It is known that the excitability of visual cortex varies substantially across individuals, and as such may be a candidate for driving some of the observed interindividual differences in visual imagery strength.

Here, we investigated whether cortical excitability might also be linked to the individual differences that exist in the strength of voluntarily produced visual imagery. We used a multi-method approach (fMRI, TMS, and tDCS, see Materials and methods for measures of cortical excitability) to assess the potential contributions of resting levels of cortical excitability in the visual imagery network as a critical physiological precondition, which influences the strength of visual imagery.

Measuring visual imagery strength

To measure mental imagery strength, we utilized the binocular rivalry imagery paradigm (see Figure 1), which has been shown to reliably measure the sensory strength of mental imagery through its impact on subsequent binocular rivalry perception (Pearson, 2014). Previous work has demonstrated that when someone imagines a pattern or is shown a weak perceptual version of a pattern, they are more likely to see that image in a subsequent brief binocular rivalry display (see Pearson et al., 2015b for review of methods). Longer periods of imagery generation, or weak perceptual presentation, increase the probability of perceptual priming of subsequent rivalry. For this reason, the degree of imagery priming has been taken as a measure of the sensory strength of mental imagery. Importantly, this measure of imagery is directly sensory; while it is related to subjective reports of imagery vividness, it is not a direct proxy for subjective reports of imagery vividness, and findings regarding their relationship across individuals have been mixed (see Figure 1—figure supplement 1A and Pearson et al., 2011; Bergmann et al., 2016a). This measure of imagery strength has been shown to be both retinotopic location and spatial orientation specific (Bergmann et al., 2016a; Pearson et al., 2008a), is reliable when assessed over days or weeks (see Figure 1—figure supplement 2 and Bergmann et al., 2016a), is contingent on the imagery generation period (therefore not due to any rivalry control) and can be dissociated from visual attention (Pearson et al., 2008a). This measure of imagery is advantageous in that it allows us to avoid the prior limitations of subjective introspections and reports.

Figure 1 with 2 supplements see all

Download asset Open asset

Timeline of the basic imagery experiment.

Participants were cued to imagine a red-horizontal or a green-vertical Gabor patch for 6–7 s by the letter R or G (respectively). Following this, they were presented with a brief binocular rivalry display (750 ms) and asked to indicate which image was dominant. In the behavioral experiments with the brain-imaging sample and in three of the tDCS experiments, a rating of subjective vividness of the imagery also preceded the binocular rivalry display.

Results

Visual cortex and visual imagery strength

Correlations between visual cortex excitability and visual imagery strength: exploratory fMRI analysis

First an exploratory analysis was run to see if there was any relationship between cortex physiology and imagery strength. To do this, we looked at fMRI data and assessed a sample of 31 participants resting-state fMRI data (these participants form part of a sample that has previously been reported on in Bergmann et al., 2016a; Bergmann et al., 2016b); however, these previous analyses were structural rather than functional). We related this data set to each individual’s imagery strength determined using the binocular rivalry method (% primed, see Figure 1). Using a whole-brain surface-based group analysis (see Methods), we found that the normalized mean fMRI intensity of clusters in the visual cortex showed a negative relationship with imagery strength, while frontal cortex clusters showed positive relationships (multiple comparison-corrected; see Figure 2, left column; and Supplementary file 1 Supplementary Table S1 –S3). This pattern of results was also present in a second set of resting-state fMRI data that was acquired from the sample (see Figure 2, right column). We also assessed the relationship between retinotopically defined early visual cortices V1-V3 and the adjacent occipito-parietal areas (defined by the Desikan–Killiany atlas). We found significant negative correlations with normalized mean fMRI intensity levels in V1-V3 and lateral occipital cortex and imagery strength (See Figure 2—figure supplement 1).

Figure 2 with 3 supplements see all

Download asset Open asset

Surface-based whole brain analysis of data from two different fMRI resting-state measurements: negative associations with imagery strength in the occipital cortex.

Two columns on the left: results of the main resting-state fMRI data set with a TR of 2 s (TR2). Two columns on the right: results of an additional resting-state fMRI data set with a TR of 1 s (TR1); in those participants with which both measurements were conducted, about half were done on the same day. In the other half, the two measurements were conducted on different days. The two upper rows show the uncorrected (positive and negative) relationships with imagery as heatmaps. The two lower rows show the corrected clusters that had a *negative* association with individual imagery strength at a cluster-wise probability threshold (CWP) of p<0.05 (also see Supplementary file 1 - Supplementary Table S1). The two hemispheres are shown from the back, with the lateral view in the upper and the medial view in the lower panel. Multiple comparison correction was done using Monte Carlo Null-Z simulation (mc-z). No smoothing of the functional mean intensity data was applied. In line with the correlation analyses using normalised fMRI mean intensity of atlas- and retinotopically defined areas, only fMRI mean intensity clusters in the back of the brain, where early visual and lateral occipital cortex are located, showed negative associations with imagery strength (% primed). The fMRI measurement with a TR = 2 s has a better signal-to-noise ratio, as longer TR increase T2* tissue contrast (e.g. see Hashemi et al., 2010); in addition, the larger voxel size of the TR1 measurement (3.28 × 3.28×5 mm³) also means that they are more likely to pick up signals from other tissue (e.g. white matter), thereby increasing the contributions of biophysical noise. Both of this likely weakens the observed correlations with behavior; this might explain why none of the relationships with the brain areas using retinotopic mapping and the Desikan-Killiany atlas survived multiple comparison correction in the ROI-based approach (all p>0.05). Despite this, the clusters from the two different measurements in the surface-based group analysis show striking similarities; while the clusters in the TR1 measurement are smaller and sparser, their location in early visual and lateral occipital cortex are strongly overlapping with those found in the TR2 measurement. Further analyses showed that these similarities were not driven by the group that completed the measurements on the same day (analysis not shown).

Correlations between visual cortex excitability and visual imagery strength: TMS

To further substantiate our observations and circumvent other potential confounds that might influence the fMRI data (see appendices for discussion of these limitations), we next utilized a different methodology that measures cortical excitability: transcranial magnetically induced phosphenes. A new sample of 32 participants performed an automated phosphene threshold (PT) procedure using transcranial magnetic stimulation (TMS) over early visual cortex. Visual phosphenes are weak hallucinations caused by TMS applied to visual cortex. The magnetic strength needed to induce a phosphene is a reliable and non-invasive method to measure cortical excitability (see Materials and methods section for explanation of phosphene thresholds and cortex excitability). In line with the normalized mean fMRI intensity data, we found a significant negative correlation between imagery strength and visual cortex excitability (data shows inverse phosphene threshold (100-PT) for ease of visualizing data as PT’s are negatively correlated with cortical excitability: r = −0.43, p=0.0127; Figure 3A). In other words, individuals with lower visual cortex excitability exhibited stronger imagery. Importantly, we also tested the phosphene threshold retest reliability for our paradigm over 2 days and found it was a very reliable measure (r_s = 0.75, p<0.001; see Figure 1—figure supplement 2), as was our measure of imagery strength re-test reliability of tDCS experiments imagery strength: r_s = 0.51, p<0.001; see Figure 1—figure supplement 2).

Figure 3

Download asset Open asset

Scatterplots for TMS phosphene thresholds and mock rivalry data.

(A) Correlation between the inverse phosphene threshold and imagery strength. Individuals with lower cortical excitability in visual cortex tended to have stronger imagery. (B) Correlation between mock priming scores and real binocular rivalry priming for participants in the fMRI (circles) and TMS (squares) study. There was no significant association between perceptual priming in real and mock trials for the fMRI or TMS data. In the scatterplots (A & B), each data point indicates the value of one participant; the bivariate correlation coefficients are included with their respective significance levels.

Figure 3—source data 1 TMS inverse phosphene correlation data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig3-data1-v1.csv
Download elife-50232-fig3-data1-v1.csv

To assess possible effects of a decisional bias, mock rivalry trials were included in all tests of imagery strength (Pearson et al., 2008a; Bergmann et al., 2016a; Keogh and Pearson, 2011; Keogh and Pearson, 2014)(see Materials and methods). We found no correlation between real binocular rivalry and ‘mock priming’ (fMRI (circles r_s = −0.03, p=0.89 and TMS (squares) r_s = −0.01, p=0.97, see Figure 3B). These data suggest it is unlikely that the relationship between imagery strength and physiology is due to demand characteristics or decisional bias.

Manipulating visual cortex excitability using tDCS

The data suggest that the excitability of the visual cortex might influence the strength of visual imagery, as participants with lower visual cortex activity tended to have stronger visual imagery and vice versa. However, these data do not speak to the causal role of early visual cortex in creating strong mental images. If the association between imagery strength and visual cortex activity is causal, manipulating visual cortex excitability should likewise modulate imagery strength.

To assess this hypothesis, we utilized non-invasive transcranial direct current stimulation (tDCS), which can increase or decrease cortical excitability depending on electrode polarity and position (see Filmer et al., 2014 for review, and methods for evidence of tDCS modulating visual cortex excitability). Broadly speaking, when the cathode is placed over the cortex, when averaging across participants, the underlying cortical excitability is decreased, whereas the anode increases excitability (although the polarity specific effects can be influenced by multiple methodological and inter-individual differences Strube et al., 2016; Filmer et al., 2019a; Batsikadze et al., 2013; Monte-Silva et al., 2013). Sixteen new participants underwent both anodal and cathodal stimulations of visual cortex (see Figure 4B for electrode montage) on 2 separate days (separated by at least 24 hr). On each day, participants completed six blocks of the imagery task, two before tDCS, two during tDCS and two post-tDCS (see Figure 4—figure supplement 1A for experimental timeline). To assess the effect of tDCS on imagery strength, we calculated the percent change in priming for each participant from baseline (on each day, see Materials and methods for percent change calculation details) such that positive numbers indicate increases in imagery strength and negative ones indicate decreases.

Figure 4 with 3 supplements see all

Download asset Open asset

Visual cortex stimulation data.

(A) Effect of visual cortex stimulation on imagery strength at 1mA. The top image shows the tDCS montage, with the active electrode over Oz and the reference electrode on the supraorbital area. The bottom image shows the effect of cathodal (decreases excitability, blue dots represent each participant’s data) and anodal (increases excitability, red dots represent each individual participant’s data) stimulation averaged across all tDCS stimulation blocks (D1, D2, P1, and P2). (B) Effect of visual cortex stimulation on imagery strength at 1.5mA. Top: the tDCS montage with the active electrode over Oz and the reference electrode on the right cheek. Bottom: the effect of cathodal (blue dots, decrease excitability) and anodal (red dots, increase excitability) stimulation averaged across all blocks during and after tDCS stimulation (D1, D2, P1, and P2). Each data point represents a single participant. Imagery strength increases in the cathodal stimulation condition (blue), when neural excitability is reduced. (C) Effect of visual cortex stimulation on imagery strength at 1.5mA. Top: the tDCS montage with the active electrode over Oz and the reference electrode on the right cheek. Bottom: The left bar shows the relative change in imagery strength for cathodal stimulation (blue bar, blue dots represent individual participants data), the middle bar shows the relative change in imagery strength for anodal stimulation (red bar, red dots represent individual participants data), while the right bar shows the change in imagery strength for sham stimulation (grey bar, grey dots represent individual participants data). All error bars show ± SEMs and stars (*) indicate a significant effect of tDCS polarity.

Figure 4—source data 1 1mA occipital tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig4-data1-v1.csv
Download elife-50232-fig4-data1-v1.csv
Figure 4—source data 2 1.5mA occipital tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig4-data2-v1.csv
Download elife-50232-fig4-data2-v1.csv
Figure 4—source data 3 1.5mA occipital and sham tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig4-data3-v1.csv
Download elife-50232-fig4-data3-v1.csv
Figure 4—source data 4 1.5mA occipital TMS + tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig4-data4-v1.csv
Download elife-50232-fig4-data4-v1.csv

Figure 4A shows relative imagery priming percent change scores averaged across all stimulation blocks with 1mA of tDCS stimulation (data per block can be seen in Figure 4—figure supplement 1C). Linear mixed-effects analysis were computed for all following tDCS experiments. This analysis was run with a 2 (tDCS polarity: cathodal and anodal), x 4 (block: D1, D2, P1, P2 – see Figure 4—figure supplement 1A for timeline and Figure 4—figure supplement 1C for data for each block) x 2 (order of stimulation: cathodal on the first or second day) design. When fitting a linear mixed model, the effect of tDCS polarity was not significant (χ²(1)=2.99, p=0.084).

The non-significant results from the first tDCS experiment may be due to the stimulation intensity of 1 mA being too low to produce any effect - many tDCS studies use an intensity ranging from 1.5-2mA (for example see Jacobson et al., 2012). To investigate whether the lack of a significant result with 1mA was due to the low stimulation intensity, we ran a second tDCS study with a higher intensity of 1.5mA (see Materials and methods) and both cathodal (blue bars and dots) and anodal (red bars and dots) stimulation conditions. Additionally, to ensure we were not also stimulating the prefrontal cortex, the supraorbital placement of the reference electrode was moved to the cheek (Figure 4B). A linear mixed-effects analysis was run with a 2 (tDCS polarity: cathodal and anodal), x 4 (block: D1, D2, P1, P2 – see Figure 4—figure supplement 1A for timeline and Figure 4—figure supplement 1E for data for each block) x 2 (order of stimulation: cathodal on the first or second day) design. The effect of tDCS polarity was significant χ²(1)=15.85, p=6.86e⁻⁰⁵. The changes were in line with the correlational data for resting levels of visual cortex excitability and activity (see Figures 2 and 3), such that imagery strength increased when visual cortex excitability was decreased (cathodal stimulation, see Figure 4B), while the opposite was true of increasing visual cortex excitability (anodal stimulation).

It is likely that the change from 1mA to 1.5mA allowed us to observe the modulatory effects of tDCS; however, it also might be that the change in montage had an influence (i.e. location of reference electrode). Further, it may be the case that there are either fatigue or practice effects on this visual imagery task, that is perhaps participants just get better/worse on this task due to doing multiple sessions. For this reason, a third experiment was run to assess the effects of fatigue/practice and the change of reference location. This study was identical to the above study with the inclusion of a sham condition where the tDCS machine shut off after 30 s of stimulation. A linear mixed-effects analysis was run with a 3 (tDCS polarity: cathodal, anodal and sham), x 4 (block: D1, D2, P1, P2 – see Figure 4—figure supplement 2A for timeline and Figure 4—figure supplement 2C for data for each block) x 3 (order of stimulation: cathodal on first, second or third day) design. The effect of tDCS polarity was again significant χ²(2)=21.66, p=1.98e⁻⁰⁵. These data indicate that cathodal stimulation results in increased imagery strength (see Figure 4C), and this is unlikely to be a practice effect, as sham stimulation results in decreases in imagery strength. Additionally, previous work using the same binocular rivalry paradigm has demonstrated no increases in visual imagery strength after multiple days of training (Pearson et al., 2011). Taken together, these data suggest that cathodal stimulation leads to increases in imagery strength due to decreased visual cortex excitability, and these changes cannot be explained as a learning effect due to performing multiple sessions of the imagery task.

Although other studies have provided evidence that tDCS does change the excitability of the visual cortex (see Antal et al., 2003 for example), we wanted to ensure that our specific stimulation paradigm was indeed modulating visual cortex excitability. We ran a separate control study comparing TMS-phosphene thresholds before and after the same tDCS paradigm (1.5mA, active electrode on Oz and reference on the cheek, see Figure 4B, all subjects received both anodal and cathodal stimulations across separate days; see Materials and methods for further details). If our cathodal stimulation is decreasing visual cortex excitability, greater TMS power output would be required to elicit phosphenes post-cathodal stimulation, whereas post-anodal stimulation we would predict the opposite effect. A linear mixed-effects analysis was run with a 2 (tDCS polarity: cathodal and anodal), x 2 (block: Pre tDCS and Post tDCS) x 2 (order of stimulation: cathodal on the first or second day) design. We found that phosphene thresholds measured immediately after anodal stimulation decreased, whereas after cathodal stimulation phosphene thresholds increased (significant effect of tDCS polarity (χ²(1)=4.32, p=0.038, see Figure 4—figure supplement 3)). These findings show that our stimulation paradigm changes cortical excitability in the expected direction, that is cathodal stimulation decreases cortical excitability, whereas anodal stimulation increases activity.

Summary of visual cortex excitability and visual imagery strength

In two separate experiments, we found that resting levels of early visual cortex excitability/activity negatively predicted visual imagery strength (fMRI and TMS, Figures 2 and 3). We were also able to causally alter visual imagery strength in two separate tDCS experiments. Specifically, decreasing visual cortex excitability (using cathodal stimulation 1.5mA) increased imagery strength (see Figure 4B and C).

Our data suggest that visual cortex excitability plays a causal role in modulating imagery strength, but how exactly does excitability influence imagery strength? One hypothesis is that hyperexcitability might act as a source of noise in visual cortex that limits the availability or sensitivity of neuronal response to top-down imagery signals, resulting in weaker image-simulations. This hypothesis is supported by behavioral work showing that both imagery and visual working memory can be disrupted by the passive presence of uniform bottom-up afferent visual stimulation (Keogh and Pearson, 2011; Keogh and Pearson, 2014), known to increase neural depolarization in primary visual cortex (Kinoshita and Komatsu, 2001). However, the strength of the top-down imagery-signals arriving at visual cortex should also play a role in governing imagery strength, as activity in a brain network including prefrontal areas supports mental image generation (Pearson et al., 2015b). Therefore, we next assessed the role that prefrontal cortex activity plays in shaping visual imagery strength.

Correlations between frontal cortex excitability and imagery strength

As mentioned previously, the exploratory, multiple comparison-corrected whole-brain surface-based analysis of the mean fMRI intensity levels at rest revealed relationships with clusters in both visual and frontal cortex (see Figure 2 and Figure 2—figure supplement 2 and Supplementary Table S2). Most of the significantly positive frontal clusters were located in superior frontal cortex. Additionally, using a ROI-based approach, normalized mean fMRI intensity levels in two frontal areas also showed positive relationships with imagery strength: superior frontal cortex (r = 0.41, p=0.022) and area parsopercularis (r = 0.38, p=0.033; ROIs defined by the Desikan–Killiany atlas). However, these relationships did not survive multiple comparison correction (both p>0.05).

Manipulating prefrontal cortex excitability using tDCS

To explore the theoretical role of frontal cortex in imagery generation and maintenance further, we next sought to evaluate the effect of modulating neural excitability in prefrontal cortex using tDCS during image generation. The active electrode was placed between F3 and Fz (left frontal cortex), and the reference electrode on the right cheek (Figure 5A for montage). Participants completed both cathodal and anodal conditions (1.5mA) over 2 separate days. A linear mixed-effects analysis was run with a 2 (tDCS polarity: cathodal and anodal), x 4 (block: D1, D2, P1, P2 – see Figure 4—figure supplement 1A for timeline and Figure 4—figure supplement 1G for data for each block) x 2 (order of stimulation: cathodal on the first or second day) design. The effect of tDCS polarity was significant (χ²(1)=6.86, p=0.009, see Figure 5A). Interestingly, in contrast to the visual cortex, where decreasing excitability led to stronger imagery, we found the opposite pattern for frontal areas (see Figure 5A).

Figure 5

Download asset Open asset

Data for prefrontal cortex stimulation.

(A) Effect of left prefrontal (pFC) cortex stimulation on imagery strength at 1.5mA. The left image shows the tDCS montage, with the active electrode between Fz and F3 and the reference electrode on the right cheek. The right image shows the effect of cathodal (decrease excitability, blue dots represent each participant’s difference score) and anodal (increase excitability, red dots represent each individual participant’s difference score) stimulation averaged across all blocks during and after tDCS stimulation (D1, D2, P1, and P2). Imagery strength can be seen to increase with anodal stimulation. (B) Effect of joint electrical stimulation of prefrontal cortex and visual cortex. The left image shows brain areas targeted in the final tDCS study. Data shows non-significant effects of cathodal occipital + anodal pFC stimulation (blue bars, blue dots represent individual participants data), anodal occipital + cathodal pFC stimulation (red bars, red dots represents individual participants data) and sham stimulation (grey bars grey dots represent individual participants data). All error bars show ± SEMs and stars (*) indicate a significant effect of tDCS polarity.

Figure 5—source data 1 1.5mA Prefrontal tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig5-data1-v1.csv
Download elife-50232-fig5-data1-v1.csv
Figure 5—source data 2 1.5mA combined tDCS data.: https://cdn.elifesciences.org/articles/50232/elife-50232-fig5-data2-v1.csv
Download elife-50232-fig5-data2-v1.csv

Taken together, these fMRI and tDCS data provide evidence that the cortical excitability of prefrontal cortex also plays a role in governing the sensory strength of visual imagery.

The joint role of visual and frontal cortex activity in visual imagery strength: fMRI

Beyond the individual roles of prefrontal and visual cortex in forming mental images, evidence suggests that both areas can act together as part of an imagery network (Østby et al., 2012; Schlegel et al., 2013). Hence, we combined the whole-brain normalized mean fMRI intensity scores from the two areas (frontal and visual) and related their ratio to imagery strength. We found that the ratio of V1 to superior frontal activity predicted the strength of visual imagery (Spearman rank: r_s = −0.53, p=0.002). This effect also held when controlling for the Euklidean distance between the two areas (partial Spearman rank: r_s = −0.54, p=0.002). Hence, participants with both comparatively lower levels of visual cortex normalized mean intensity and higher frontal levels had stronger imagery.

To assess the possibility that cortical connectivity might be driving this fronto-occipital excitability relationship, we analyzed the individual functional connectivity of the same two areas for each participant, that is, the degree to which the BOLD signals in each area correlate over time. The functional connectivity did not significantly predict imagery strength (r = −0.24, p=0.19). This suggests that the combination of highly active frontal areas and low visual cortex excitability might present an optimal precondition for strong imagery creation, irrespective of the temporal coupling of their activity.

The joint role of visual and frontal cortex activity in visual imagery strength: tDCS

To further investigate the possibility that optimal imagery strength occurs with a combination of low levels of excitability in visual cortex and high levels in prefrontal cortex, a new tDCS experiment was run where both prefrontal and visual cortex were simultaneously stimulated during imagery generation using the same blocked design as in all previous tDCS experiments (1.5mA). There were 3 conditions in this study, the first condition aimed to increase prefrontal (anodal) and decrease visual cortex (cathodal) excitability (blue dots in Figure 5B), the second condition decreased prefrontal (cathodal) and increased visual cortex (anodal) excitability (red dots in Figure 5B), and the third condition was a sham condition where the tDCS machine shut off after 30 s (grey dots in Figure 5B). A linear mixed-effects analysis was run with a 3 (tDCS polarity: cathodal, anodal and sham), x 4 (block: D1, D2, P1, P2 – see Figure 4—figure supplement 2A for timeline and Figure 4—figure supplement 2E for data for each block) x 3 (order of stimulation: cathodal on first, second or third day) design. However, the effect of tDCS polarity was not significant (tDCS χ²(2)=2.70, p=0.26), see Figure 5B).

Summary of cortical excitabilities effect on imagery strength

In summary, visual cortex excitability reliably correlated negatively with the strength of visual imagery using both fMRI and TMS as measurement tools (Figures 2 and 3). Modulating visual cortex excitability also altered the strength of visual imagery (Figure 4B & C, see Table 1 for a summary of all tDCS experiments). Specifically decreasing visual cortex excitability led to increased visual imagery strength. There was also evidence that altering prefrontal cortex excitability modulates visual imagery strength, but in the opposite pattern to visual cortex – increasing prefrontal cortex excitability led to increased imagery strength (see Figure 5A). However, combining stimulation of the frontal and occipital cortex had no reliable effect on modulating visual imagery strength.

Table 1

Summary of montage, intensity, duration, and significance of each tDCS experiment.

Experiment #	Montage (EEG Coordinates)	Intensity + duration	Notes	Significant
1 Occipital	Active: Inion (Oz) Reference: Supraorbital (Fpz)	1 mA 15 min	Tested effect on imagery strength	✗
2 Occipital	Active: Inion (Oz) Reference: Right Cheek	1.5 mA 15 min	Tested effect on imagery strength	✓
3 Occipital	Active: Inion (Oz) Reference: Right Cheek	1.5 mA 15 min	Tested effect on imagery strength (additional sham condition)	✓
4 Prefrontal	Active: Between F3-Fz Reference: Right Cheek	1.5 mA 15 min	Tested effect on imagery strength	✓
5 Occipital + Prefrontal	Active: Inion (Oz) Active: Between F3-Fz	1.5 mA 15 min	Tested effect on imagery strength	✗
Additional control Occipital	Active: Inion (Oz) Reference: Right Cheek	1.5 mA 15 min	Tested effect on phosphene threshold	✓

Discussion

Perhaps as far back as Plato, but overtly since the 1880s philosophers, scientists and the general populace have wondered why the human imagination differs so profoundly from one individual to the next. This question has recently gained fresh notability and attention with the introduction and classification of the term aphantasia to describe individuals who self-report no visual imagery at all (Zeman et al., 2015; Keogh and Pearson, 2018). Here, we provide evidence that pre-existing levels of neural excitability and spontaneous resting activity in visual cortex can influence the strength of mental representations as measured using the binocular rivalry paradigm. Our data indicate that participants with lower excitability in visual cortex have stronger sensory imagery. Furthermore, we provide causative evidence, using tDCS over visual cortex, that altering neural excitability in these areas can modulate imagery strength. Prefrontal cortex excitability also played a role in controlling the strength of visual imagery, however, in the opposite direction to visual cortex excitability.

It should be noted that while previous work has demonstrated that this measure of imagery strength can be separated from feature-based attention (Pearson et al., 2008a), we cannot explicitly rule out the possibility that tDCS was improving other cognitive mechanisms that are involved in this task such as sustained or selective attention. Visual imagery has also been shown to have multiple features such as strength, vividness, capacity and precision (Keogh and Pearson, 2017; Pearson et al., 2011; Bergmann et al., 2016a) . Here, we found that the strength of imagery was improved by cathodal stimulation of the early visual cortex and anodal of the prefrontal cortex. However, there was no evidence of visual or prefrontal cortex stimulation in altering the vividness of visual imagery, despite imagery strength correlating with these ‘online’ vividness measures. Additionally, our current studies also only used imagery of red and green Gabor patches, which have features particularly suited to early visual cortex and likely recruit this region during imagery. More complex imagery, such as imagery of faces, may rely less on representations in early visual cortex and more so on representations and excitability further upstream of the cortex, such as the fusiform face area in the case of face imagery. Future research should assess what qualities of visual imagery are, and are not, altered by stimulation of early visual cortex, and prefrontal cortex to further elucidate the neural mechanisms underlying individual differences in visual imagery. It may be the case that all forms of visual imagery are improved by prefrontal cortex stimulation, due to an increase in the strength of top-down signals, whereas only images that require activation of low-level sensory features, such as color or orientation, will be influenced by the excitability of early visual cortex.

Another limitation to our study is that although the majority of participants in our tDCS experiments showed the same pattern of results (larger increases in imagery strength in the cathodal vs anodal condition for occipital stimulation, and vice versa for prefrontal), there were some participants who showed the opposite pattern. It is important to note that while tDCS has been shown to modulate visual cortex excitability in numerous studies as well as our control experiment, there are large interindividual differences in the amount of the modulation that occurs for each individual (Chew et al., 2015; López-Alonso et al., 2014) and sometimes the direction of these excitability changes (Strube et al., 2016). Recent research suggests that cortical morphology (Filmer et al., 2019a; Laakso et al., 2019; Laakso et al., 2015), proportion of neuro-modulators (Filmer et al., 2019b) and brain state (Bergmann, 2018) can all influence how well tDCS is able to modulate brain activity and behaviour.

Over the last 30 years, empirical work has demonstrated many commonalities between imagery and visual perception (see Pearson et al., 2015b; Dijkstra et al., 2019 for a review). However, the two experiences have clear phenomenological differences between them. Our findings suggest a possible dissociation between mental imagery and visual perception in regards to cortical excitability’s role in shaping externally versus internally driven visual representations. Previous work has demonstrated that perceptual sensitivity is associated with higher levels of visual cortex excitability (Ding et al., 2016; Antal et al., 2001; Kraft et al., 2010; Reinhart et al., 2016), whereas our results suggest the opposite for mental imagery; stronger imagery is associated with lower visual excitability. Interestingly, some studies of visual perception have found that reducing visual cortex excitability can improve performance on more complex perceptual tasks, such as discrimination and object tracking (Antal et al., 2004a; Waterston and Pack, 2010). It may be the case that, although both perception and imagery recruit visual cortex, the optimal visual cortex conditions for task performance vary as a function of task demands. Considering these results in terms of differences in signal-to-noise ratios (Miniussi et al., 2013) may help to explain the contrasting findings in perceptual tasks, as well as the results from our current imagery experiments.

Neural activity can be thought of as a combination of signal related activity and neural noise. The signal is often defined as the number, or proportion of neurons that code for a specific stimulus, with higher firing rates resulting in a stronger signal. Noise can be thought of as the activity of all other non-signal related neural activity. A higher signal-to-noise ratio will generally result in better performance on behavioral tasks. Increasing cortical excitability can potentially be a source of noise, through increasing the likelihood of all neurons in the stimulation region to fire, whereas decreasing cortical excitability reduces the likelihood of neurons to fire, reducing neural noise.

In very basic detection tasks, an injection of noise into the visual cortex may result in better detection by pushing a subthreshold signal over a given threshold, leading to a higher proportion of signal relative to noise. Conversely, in a discrimination task where two or more potential outcomes exist, it may be the case that adding random noise to the signal will enhance both (anticipatory) stimulus-related representations in a non-selective manner. Decreasing cortical excitability, on the other hand, reduces the likelihood of neurons to fire, which will decrease both anticipatory signals and neural noise, potentially resulting in better performance as only one of the two representations reach supra-threshold.

Decreasing visual cortex excitability during imagery may make it harder to induce an action potential in visual cortex neurons, through lowering membrane potentials. However, this also reduces random noise, which may result in a better signal-to-noise ratio and as such stronger imagery. Another possibility is that when tasks rely more heavily on top-down signals, such as in an imagery task, reduction in sensory noise might allow for better communication among neurons in the visual cortex. Furthermore, it is also possible that tDCS stimulation has a tendency to selectively reduce non-imagery related signals: tDCS appears to have a larger impact on the more superficial cortical layers than on the deeper cortical layers, as the superficial layers are closer to the current source (Komarov et al., 2019). Interestingly, recent research suggests that imagery-related signals are predominantly found in deep cortical layers (Bergmann et al., 2019). As a consequence, tDCS may attenuate signals in the mid- and superficial layers more than those in the deep layers, thereby causing a relative advantage of deep-layer imagery-related signals over the ones arising in the other layers.

The findings that visual cortex excitability is negatively related to imagery strength could hence be explained by hyperexcitability acting as a source of noise, which, when reduced, leads to less neural noise in the visual cortex resulting in a higher signal-to-noise ratio and thus stronger imagery. A signal-to-noise ratio explanation also aligns well with our findings related to prefrontal cortex excitability. Greater excitability of frontal cortex may allow for amplification of the top-down signal, either through boosting firing, or shaping of neuronal population activity. Increased top-down signals might also allow for a greater inhibition of non-signal related neural noise further down the cortical hierarchy, resulting in a higher signal-to-noise ratio in the visual cortex.

This signal-to-noise hypothesis of visual imagery is in line with findings from related research. A study on grapheme-color synesthesia found that - contradictory to our results - synesthetes had enhanced resting-state visual cortex excitability (measured using phosphene thresholds). However, they also found that synesthetic experience could be enhanced by reducing visual excitability via tDCS (Terhune et al., 2015a). These seemingly contradictory results were thought to be due to two different mechanisms. The authors suggested that a hyperexcitable visual cortex during brain development may be what leads to individuals developing synesthesia in the first place; in adulthood, however, decreasing visual cortex excitability might lead to increased signal-to-noise in the visual cortex, thereby enhancing the synesthetic experience (Terhune et al., 2015a). In addition to this, other research indicates that the expectation of a visual stimulus leads to a stimulus template in visual cortex, with reduced activity in V1 and improved stimulus decoding by pattern classifiers (Kok et al., 2013). Similarly, reduced early visual cortex activity increases the likelihood of visual hallucinations in a subsequent detection task (Pajani et al., 2015). The convergence of these data appears to indicate that ‘background’ neural noise in sensory cortices may play an important role in modulating the strength of mental representations.

Despite much evidence for the involvement of the prefrontal cortex and visual cortex working in concert during visual imagery, we found that while manipulating either prefrontal or visual cortex excitability in isolation could induce increases in imagery strength, simultaneous stimulation of visual and prefrontal cortices had no effect on visual imagery. One possible explanation for these results is that modulating activity in two regions of the brain is too much of a change and has an overall disruptive effect on imagery formation. However, neither of the stimulation conditions resulted in significant reductions as compared to sham stimulation, making this explanation unlikely.

There also exists large variability in prefrontal cortex anatomy and tDCS effectiveness, with recent research showing that the thickness of left prefrontal cortex correlated with behavioral changes from anodal (but not cathodal) stimulation (Filmer et al., 2019a). It might be that these large variations play a role in our null findings; however, we did find that isolated stimulation to prefrontal cortex modulated imagery strength, making this another unlikely explanation of these null results. A plausible explanation may be that during simultaneous stimulation of visual and prefrontal cortex, other regions were modulated as well, inducing the null effect (Bikson et al., 2010), or that this montage leads to smaller current densities and changes in excitability in both visual and prefrontal cortex. For example, a previous study found evidence that the distance between electrodes alters the stimulation effects when other stimulation parameters are kept consistent (Moliadze et al., 2010). It might be the case that due to the spacing of the electrodes the current density may have been reduced, as the reference electrode was further from the active electrode as compared to our studies with significant results (Supraorbital vs cheek in significant studies). Our first study also resulted in non-significant results, which may be due to a lower intensity of stimulation (1mA vs 1.5mA in significant studies). However, this may also have been driven in part by the placement of the reference electrode. It seems possible that in our case, the montage we used to stimulate prefrontal and occipital cortex simultaneously may not have been sufficient to alter cortical excitability in these two regions, resulting in no significant changes in visual imagery strength.

Our findings do conflict with some previous research on visual imagery. For example, one study found that applying 1 Hz TMS to area BA 17 (primary visual cortex), slowed responses in a task where individuals had to imagine stripes (or were perceptually shown stripes) and answer questions about these images (Kosslyn, 1999). Although these chronometry type experiments are very common in early visual imagery research and were important in advancing the field as a whole, they do not provide any information about the quality or sensory representational nature of the images held in mind. Slower reaction times on both the perception and imagery task may be due to a general slowing of cognitive performance or visual scanning, rather than reflecting any change in the quality of the visual images created in the mind. Previous work has also found positive correlations between BOLD activity in the visual cortex and the vividness of visual imagery questionnaire (Cui et al., 2007; Amedi et al., 2005). Additionally, some TMS studies have found that during visual imagery, visual cortex excitability increases (Cattaneo et al., 2011; Sparing et al., 2002). These findings at first may seem incompatible with our results; however, these studies measure event-related neural changes, rather than comparing changes in task performance due to modulation of neural activity, or assessing how the resting levels of visual cortex activity influence task performance. It may very well be the case that on average in our tasks neural activity increases with imagery, and perhaps those with the lowest levels of resting activity have the largest changes in neural activity. For example, to calculate BOLD changes a baseline of ‘resting-state’ activity is used. It may be that participants with initially low visual cortex excitability are able to increase visual cortex activity more-so than those with higher levels, and this could potentially explain the larger BOLD changes for individuals with stronger visual imagery.

It is possible that the observed effects of cortical excitability may be driven by individual differences in inhibitory and excitatory neurotransmitter concentrations. Numerous studies have investigated what neurotransmitters modulate cortical excitability with GABA and Glutamate being implicated in controlling inhibition and excitability, respectively. While the relationship between GABA and cortical excitability/activity is more ambiguous (Terhune et al., 2015b; Boillat et al., 2020), the concentration of glutamate in the early visual cortex has been shown to correlate positively with visual cortex excitability (measured by phosphene thresholds) in both normal and synesthetic participants (Terhune et al., 2015b). There is also evidence for a strong link between BOLD-fMRI activity and glutamate concentration: using a combined fMRI-MRS approach where BOLD-fMRI activity and glutamate signals were recorded simultaneously, researchers found that the time courses of fMRI-BOLD activity and Glutamate concentration were strongly correlated (Ip et al., 2017). Evidence of such a relationship at a between-subject level is missing but seems plausible. If this is the case, then the observed relationships of our neural measures and visual imagery may (at least partly) be due to individual differences in the concentration of glutamate in visual cortex: a lower level of glutamate in the visual cortex might result in less excitatory neuronal noise, thereby increasing the signal-to-noise ratio of top-down signals that govern the generation of internal images in the visual cortex.

A plethora of imagery research has demonstrated evoked and content specific BOLD responses in early and later visual cortex when individuals form a mental image (for reviews of this work see:Pearson et al., 2015b; Dijkstra et al., 2019). Here, however, we took a different approach by examining the individual variation in brain physiology that might form the preconditions for strong or weak imagery. This endeavor required a non-event related design. Interestingly, such non-event related designs utilizing inter-individual differences are now commonly used to mechanistically link human cognition and brain function or anatomy (Kanai and Rees, 2011). Our results add to this growing body of research, which demonstrates that pre-existing brain activity parameters can fundamentally influence mental performance.

Our observations may also have clinical applications: In many mental disorders, imagery can become uncontrollable and traumatic. On the other hand, mental imagery can also be harnessed specifically to treat these disorders (Pearson et al., 2015b). Interestingly, disorders that involve visual hallucinations such as schizophrenia and Parkinson’s disease are both associated with stronger and/or more vivid mental imagery (Shine et al., 2015; Sack et al., 2005). It has recently been suggested that the balance between top-down and bottom-up information processing may be a crucial factor in the development of psychosis, with psychosis prone individuals displaying a shift in information processing towards top-down influences over bottom-up sensory input (Teufel et al., 2015). Our data indicate that it may be possible to treat symptomatic visual mental content by reducing its strength via non-intrusively manipulating cortical excitability. Alternatively, we may be able to ‘surgically’ boost mental image simulations specifically during imagery-based treatments, resulting in better treatment outcomes. Further research on longer lasting stimulation protocols, and the individual differences in response to brain stimulation is needed to assess its therapeutic potential.

In conclusion our data demonstrates that visual cortical excitability, as well as prefrontal excitability, appears to play a role in governing the strength of an individual’s visual imagery strength providing a potential explanation for the large variation in visual imagery that exists within the general population and providing a promising new tool for altering the strength of visual imagery.

Exclusion criteria	Exp 1	Exp 2	Exp 3	Exp 4	Exp 5	Exp 6	Total
Mock priming			1	2	2		5
Low priming	2	2		1	3		8
Mixed percept’s	3	2	1				6
Attrition						1	1
Impedance		9		4	1	2	16
Incorrect buttons			1		3		4
Technical issues			3			2	5
Phosphenes						6	6
Total	5	13	6	7	9	11	51

Share this article

Cite this article

Timeline of the basic imagery experiment.

Surface-based whole brain analysis of data from two different fMRI resting-state measurements: negative associations with imagery strength in the occipital cortex.

Scatterplots for TMS phosphene thresholds and mock rivalry data.

Figure 3—source data 1

Visual cortex stimulation data.

Figure 4—source data 1

Figure 4—source data 2

Figure 4—source data 3

Figure 4—source data 4

Data for prefrontal cortex stimulation.

Figure 5—source data 1

Figure 5—source data 2

Summary of montage, intensity, duration, and significance of each tDCS experiment.

Exclusion criteria for tDCS experiments.

Number of participants excluded per exclusion criteria for tDCS experiments.

Author details

Rebecca Keogh

Contribution

Contributed equally with

For correspondence

Competing interests

Johanna Bergmann

Contribution

Contributed equally with

Competing interests

Joel Pearson

Contribution

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism

Further reading