Interneuron-specific gamma synchronization indexes cue uncertainty and prediction errors in lateral prefrontal and anterior cingulate cortex

Abstract
Introduction
Results
Discussion
Materials and methods
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

Inhibitory interneurons are believed to realize critical gating functions in cortical circuits, but it has been difficult to ascertain the content of gated information for well-characterized interneurons in primate cortex. Here, we address this question by characterizing putative interneurons in primate prefrontal and anterior cingulate cortex while monkeys engaged in attention demanding reversal learning. We find that subclasses of narrow spiking neurons have a relative suppressive effect on the local circuit indicating they are inhibitory interneurons. One of these interneuron subclasses showed prominent firing rate modulations and (35–45 Hz) gamma synchronous spiking during periods of uncertainty in both, lateral prefrontal cortex (LPFC) and anterior cingulate cortex (ACC). In LPFC, this interneuron subclass activated when the uncertainty of attention cues was resolved during flexible learning, whereas in ACC it fired and gamma-synchronized when outcomes were uncertain and prediction errors were high during learning. Computational modeling of this interneuron-specific gamma band activity in simple circuit motifs suggests it could reflect a soft winner-take-all gating of information having high degree of uncertainty. Together, these findings elucidate an electrophysiologically characterized interneuron subclass in the primate, that forms gamma synchronous networks in two different areas when resolving uncertainty during adaptive goal-directed behavior.

Introduction

Inhibitory interneurons in prefrontal cortex are frequently reported to be altered in neuropsychiatric diseases with debilitating consequences for cognitive functioning. Groups of fast spiking interneurons with basket cell or chandelier morphologies have consistently been found to be abnormal in individuals with schizophrenia and linked to dysfunctional working memory and reduced control of attention (Dienel and Lewis, 2019). Altered functioning of a non-fast spiking interneuron class is linked to reduced GABAergic tone in individuals with severe major depression (Levinson et al., 2010; Fee et al., 2017). These findings suggest that the circuit functions of different subtypes of interneurons in prefrontal cortices are important to regulate specific aspects of cognitive and affective functioning.

But it has remained a challenge to identify how individual interneuron subtypes support specific cognitive or affective functions in the nonhuman primate. For rodent prefrontal and anterior cingulate cortices, cells with distinguishable functions express differentially cholecystokinin (CCK), parvalbumin (PV), or somatostatin (SOM), amongst others (Roux and Buzsáki, 2015; Cardin, 2018). Prefrontal CCK expressing basket cells have been shown to impose inhibition that is required during the choice epoch, but not during the delay epoch of a working memory task (Nguyen et al., 2020). In contrast, retention of visual information during working memory delays has been shown to require activation specifically of PV+ expressing fast spiking interneurons (Lagler et al., 2016; Kamigaki and Dan, 2017; Nguyen et al., 2020). In the same prefrontal circuits, the PV+ neurons have also been associated with attentional orienting (Kim et al., 2016), shifting of attentional sets and response strategies during reward learning (Cho et al., 2015; Canetta et al., 2016; Cho et al., 2020), and with spatial reward choices (Lagler et al., 2016), among other functions (Pinto and Dan, 2015). Distinct from PV+, the group of somatostatin expressing neurons (SOM+) have been shown to be necessary during the initial encoding phase of a working memory task but not during the delay (Abbas et al., 2018), and in anterior cingulate cortex they activate specifically during the approach of reward sites (Kvitsiani et al., 2013; Urban-Ciecko and Barth, 2016). Taken together, these findings illustrate that rodent prefrontal cortex interneurons expressing PV, SOM, or CCK fulfill separable, unique roles at different processing stages during goal-directed task performance (Pinto and Dan, 2015; Lagler et al., 2016).

The rich insights into cell-specific circuit functions in rodent prefrontal cortices stand in stark contrast to the limited empirical data from primate prefrontal cortex. While there are recent advances using optogenetic tools for use in primates (Acker et al., 2016; Dimidschstein et al., 2016; Gong et al., 2020), most existing knowledge about cell-specific circuit functions are indirectly inferred from studies that distinguish only one group of putative interneurons that show narrow action potential spike width. Compared to broad spiking neurons the group of narrow spiking, putative interneurons in lateral prefrontal cortex have been found to more likely encode categorical information during working memory delays (Diester and Nieder, 2008), show stronger stimulus onset responses during cognitive control tasks (Johnston et al., 2009), stronger attentional modulation (Thiele et al., 2016), more location-specific encoding of task rules (Johnston et al., 2009), stronger reduction of firing selectivity for task irrelevant stimulus features (Hussar and Pasternak, 2009), stronger encoding of errors and loss (Shen et al., 2015; Sajad et al., 2019), more likely encoding of outcome history (Kawai et al., 2019), and stronger encoding of feature-specific reward prediction errors (Oemisch et al., 2019), amongst other unique firing characteristics (Constantinidis and Goldman-Rakic, 2002; Ardid et al., 2015; Rich and Wallis, 2017; Voloh and Womelsdorf, 2018; Torres-Gomez et al., 2020).

These summarized findings suggest that there are subtypes of narrow spiking neurons that are particularly important to regulate prefrontal circuit functions. But it is unclear whether these narrow spiking neurons are inhibitory interneurons and to which interneuron subclass they belong. Comparisons of protein expression with action potential spike width have shown for prefrontal cortex that > 95% of all PV+ and ~ 87% of all SOM + interneurons show narrow spike width (Ghaderi et al., 2018; Torres-Gomez et al., 2020), while narrow spikes are also known to occur in ~20% of VIP interneurons (Torres-Gomez et al., 2020) among other GABAergic neurons (Krimer et al., 2005; Zaitsev et al., 2009), and (at least in primate motor cortex) in a subgroup of pyramidal cells (Soares et al., 2017). In addition, electrophysiological characterization has shown at least three different types of firing patterns in narrow spiking neurons of monkeys during attention demanding tasks (Ardid et al., 2015; Dasilva et al., 2019; Trainito et al., 2019). Taken together, these insights raise the possibility that spike width and electrophysiology will allow identifying the interneuron subtypes that are particularly important for prefrontal cortex functions.

Here, we investigated this possibility by recording narrow spiking cells in nonhuman primate prefrontal and cingulate cortex during an attention demanding reversal learning task. We found that in both areas three narrow spiking neuron classes are well distinguished and show a suppressive influence on the local circuit activity compared to broad spiking neurons, supporting labeling them as inhibitory interneurons. Among these interneurons the same sub-type showed significant functional correlations in both ACC and LPFC, firing stronger to reward predictive cues when their predictability is still learned during the reversal (in LPFC), and firing stronger to outcomes when they are most unexpected during reversal (in ACC). Notably, in both, ACC and LPFC, these functions were evident in 35–45 Hz gamma rhythmic synchronization to the local field potential in the same interneuron subclass.

Results

We used a color-based reversal paradigm that required subjects to learn which of two colors were rewarded as described previously (Oemisch et al., 2019). The rewarded color reversed every ~30–40 trials. Two different colors were assigned to stimuli appearing randomly left and right to a central fixation point (Figure 1A). During the task the color information was presented independently from the up-/downward- direction of motion of the stimuli. The up-/downward direction instructed the saccade direction that animals had to show to a Go event in order to receive reward. Motion was thus the cue for an overt choice (with saccadic eye movements), while color was the cue for covert selective attention. Color was shown either before (as Feature-1) or after the motion onset (as Feature-2) (Figure 1B). Both animals took on average 7/7 (monkey H/K) trials to reach criterion performance, that is, they learned which color was rewarded within seven trials (Figure 1C). The asymptotic performance accuracy was 83/86% for monkey’s H/K (see Materials and methods).

Figure 1 with 2 supplements see all

Download asset Open asset

Task paradigm and cell classification.

(A) Trials required animals to covertly attend one of two peripheral stimuli until a dimming (Go-event) instructed to make a saccade in the direction of the motion of the attended stimulus. During the trial, the two stimuli were initially static black/white and then either were colored first or started motion first. Following this feature 1 Onset the other feature (Feature two on) was added 0.5–0.9 s later. (B) The task reversed the color (red or green) that was rewarded over at least 30 trials. (C) Two monkeys learned through trial-and-error the reward-associated color as evident in increased accuracy choosing the rewarded stimulus (*y-axis*) over trials since reversal (*x-axis*). (D) Recorded areas (details in Figure 1—figure supplement 1). (E) *Top*: Average normalized action potential waveforms of recorded neurons were narrow (*red*) or broad (*blue*). *Bottom*: Inferred hyperpolarization ratio and repolarization duration distinguishes neurons. (F) Average spike-triggered multiunit modulation for narrow and broad spiking neurons (Errors are SE’s). Spiking neuron and MUA were from different electrodes. The bottom panel zooms into the ±20 ms around the spike time and shows the difference between neuron classes (in green). (G) The histogram of post-to-pre spike AUC ratios for narrow (*red*) and broad (*blue*) spiking neurons. (H) Average ratio of post- to pre-spike triggered MUA for narrow and broad cell classes in ACC (*left*) and in LPFC (*right*). Values < 0 indicate reduced post- versus pre-spike MUA modulation. Error bars are SE.

Characterizing narrow spiking neurons as inhibitory interneurons

During reversal performance, we recorded the activity of 329 single neurons in LPFC areas 46/9 and anterior area 8 (monkey H/K: 172/157) and 397 single neurons in dorsal ACC area 24 (monkey H/K: 213/184) (Figure 1D, Figure 1—figure supplement 1). The average action potential waveform shape of recorded neurons distinguished neurons with broad and narrow spikes similar to previous studies in LPFC and ACC (Gregoriou et al., 2012; Ardid et al., 2015; Westendorff et al., 2016; Dasilva et al., 2019; Oemisch et al., 2019; Figure 1E). Prior biophysical modeling has shown that the extracellular action potential waveform shape, including its duration, is directly related to transmembrane currents and the intracellularly measurable action potential shape and duration (Gold et al., 2006; Bean, 2007; Gold et al., 2007; Buzsáki et al., 2012). Based on this knowledge we quantified the extracellularly recorded spike duration of the inferred hyperpolarization rates and their inferred time-of-repolarizations (see Materials and methods, Figure 1—figure supplement 2A,B). These measures split narrow and broad spiking neurons into a bimodal distribution (calibrated Hartigan’s dip test for bimodality, p<0.001), which was better fit with two than one gaussian (Figure 1E, Bayesian information criterion for two and one gaussian fit: 4.0450, 4.8784, where a lower value indicates a better model). We found in LPFC 21% neurons had narrow spikes (n = 259 broad, n = 70 narrow cells) and in ACC 17% of neurons had narrow action potentials (n = 331 broad, n = 66 narrow cells).

To assess the excitatory or inhibitory identity of the broad and narrow spiking neuron classes (B- and N-type neurons), we estimated the power of multi-unit activity (MUA) in its vicinity (at different electrodes than the spiking neuron) around the time of spiking for each cell and tested how this spike-triggered MUA-power changed before versus after the cell fired a spike (see Materials and methods). This approach expects for an excitatory neuron to spike concomitant with neurons in the local population reflected in a symmetric rise and fall of MUA before and after its spike. In contrast, inhibitory neurons are expected to spike when MUA rises, but when the spike occurs, the spike should contribute to suppress the local MUA activity, which should be reflected in a faster drop in MUA activity after the spike occurred (Oemisch et al., 2015). We found that B-type cells showed on average a symmetric pre- to post- spike triggered MUA activity modulation indicative of excitatory participation with local activity (Figure 1F). In contrast, spikes of N-type cells were followed by a faster drop of MUA activity indicating an inhibitory influence on MUA (Figure 1F). The excitatory and inhibitory effects on local MUA activity were consistent across the population and significantly distinguished B- and N-type neurons (Figure 1G; MUA modulation index: [(post MUA_spike - pre MUA_spike)/pre MUA_spike] for B- vs N-type cells, Wilcoxon test, p=0.001). This distinction was evident in ACC and in LPFC (Figure 1H; for the N-type the MUA modulation index was different from zero, Wilcoxon test, in ACC, p<0.001, and in LPFC, p=0.03; for B-type cells the difference was not sign.). These findings suggest narrow spiking cells contain mostly inhibitory interneurons (see Discussion).

Putative interneurons in prefrontal cortex index choices when choice probability is low

To discern how B- and N- type neurons encoded the learning of the rewarded color during reversal, we analyzed neuronal response modulation around color onset, which instructed animals to covertly shift attention to the stimulus with the reward predicting color. In addition to this color cue (acting as attention cue), we also analyzed activity around the motion onset that served as action cue. Its direction of motion indicated the saccade direction the animal had to elicit for receiving reward. This action cue could happen either 0.5–0.9 s. before or 0.5–0.9 s. after the color cue. Many neurons in LPFC selectively increased their firing to the color attention cue with no apparent modulation to the motion action cue (n = 71 cells with firing increases to the color but not motion cue) (for examples: Figure 2A,B). These neurons increased firing to the color onset when it was the first, or the second feature that was presented, but did not respond to the motion onset when it was shown as first or second feature (for more examples, Figure 2—figure supplement 1).

Figure 2 with 4 supplements see all

Download asset Open asset

Firing rate modulation of narrow and broad spiking neurons to the color cue correlate with choice probability.

(A, B) Spike rasters for example neurons around the onset of feature-1 and feature-2 when feature-1 was color (magenta) or motion (green). Both neurons responded stronger to the color than the motion onset irrespective of whether it was shown as first or as second feature during a trial. (C) Narrow spiking neurons (red) in LPFC respond to the color onset when it occurred as feature-2 (*upper panel*), or as feature-1 (*bottom panel*). (D) Same as c for the ACC shows no or weak feature onset responses. (E) Firing rates of narrow spiking neurons (red) in LPFC correlate with the choice probability of the to be chosen stimulus (*left*). The average Rate x Choice Probability correlation in LPFC was significantly larger in narrow than in broad spiking neurons (*right*). (F) Same as e for ACC shows no significant correlations with choice probability. Source data 1 Correlation data and script for ploting panels E, and F.

We found that N-type neurons in LPFC change transiently their firing to the attention cue when it occurred either early or late relative to the action cue (significant increase within 25–275 ms post-cue for Feature 1 and within 50–250 ms post-cue for Feature 2, p<0.05 randomization statistics, n = 21 N-type cells with increases and seven with decreases to the color cue, Figure 2C). This attention cue-specific increase was absent in B-type neurons in LPFC (n.s., randomization statistics, n = 44 B-type cells with increases and n = 35 with decreases to the color cue, Figure 2C). In contrast to LPFC, ACC N- and B-type neurons did not show an on-response to the color cue (n = 36/6 B- and N- type cells with increases, respectively, and n = 31/12 B- and N- type cells with decreased firing, respectively, to the color cue, the total cell number included in this analysis for the B- and N- type was n = 216/50, respectively) (Figure 2D).

The N-type-specific response to the attention cue might carry information about the rewarded stimulus color or the rewarded stimulus location. We found that the proportion of neurons whose firing rate significantly distinguished rewarded and nonrewarded colors sharply increased for N-type cells after the onset of the color cue in LPFC proportion of color selective responses within 0–0.5 s. after cue, 18%; n = 10 of 54 N-type cells, randomization test p<0.05 within [175 575] ms after cue onset, but not in ACC (cells with significant information: 6%; n = 3 of 50 N-type cells, ns., randomization test within [300 700] ms after cue onset) (Figure 2—figure supplement 2A,B). Similar to the selectivity for the rewarded stimulus color N-type cells in LPFC (but not in ACC) showed significant encoding of the right versus left location of the rewarded stimulus (in LPFC: 22% with reward location information; n = 12 of 54 N-type cells, randomization test p<0.05 within [200 500] ms after cue onset; in ACC: 10% with reward location information; n = 5 of 50 N-type cells, n.s. randomization test) (Figure 2—figure supplement 2C,D).

The color-specific firing increase and the encoding of the rewarded color by N-type neurons in LPFC suggest they support reversal learning performance. We tested this by correlating their firing rates around the color cue onset with the trial-by-trial variation of the choice probability for choosing the stimulus with the rewarded color. Choice probability, p(choice), was calculated with a reinforcement learning model that learned to optimize choices based on reward prediction errors (see Equation 3 in Materials and methods and Oemisch et al., 2019). Choice probability was low (near ~0.5) early during learning and rose after each reversal to reach a plateau after around ~10 trials (Figure 1C, for example blocks, Figure 2—figure supplement 3A). We found that during the post-color onset time period 17% (n = 20 of 120) of B-type cells and 27% (n = 11 of 41) of N-type cells in LPFC significantly correlated their firing with p(choice), which was larger than expected by chance (binomial test B-type cells: p<0.001; N-type cells: p<0.001). On average, N-type cells in LPFC showed positive correlations (Pearson r = 0.068, Wilcoxon rank test, p=0.011), while B-type neurons showed on average no correlation (Wilcoxon rank test, p=0.20) (Figure 2E). The positive p(choice) correlations of N-type neurons in LPFC grew following color onset and remained significant for 0.7 s following color onset (N = 41 N-type neurons, randomization test, p<0.05 from 0 to 0.7 s post-cue, Figure 2E). N-type neurons in LPFC of both monkeys showed a similar pattern of response to the attention cue and positive correlation of firing rate with p(choice) (Figure 2—figure supplement 4A–C). Compared to LPFC, significantly less N-type cells in ACC correlated their firing with choice probability (6%, n = 2 of 33 in ACC, versus 27% in LPFC, X²-test for prop. difference, X²-stat = 5.45, p=0.019) and showed no p(choice) correlations over time (Wilcoxon rank test, p=0.49, n.s., Figure 2F).

Putative interneurons in anterior cingulate cortex index high reward prediction errors

Choice probabilities (p(choice)) increase during reversal learning when reward prediction errors (RPEs) of outcomes decrease, which was evident in an anticorrelation of (p(choice)) and RPE of r = −0.928 in our task (Figure 2—figure supplement 3A,B) with lower p(choice) (near ~0.5) and high RPE over multiple trials early in the reversal learning blocks when the animals adjusted to the newly rewarded color (Figure 2—figure supplement 3E,F). Prior studies have shown that RPEs are prevalently encoded in the ACC (Kennerley et al., 2011; Oemisch et al., 2019). We therefore reasoned that RPEs might preferentially be encoded by narrow spiking putative interneurons. First, we analyzed N- and B-type cell responses to the reward. In both, LPFC and ACC, N- and B-type cells on average increased firing after the reward onset (p<0.05, randomization test, n = 26 of 54 and 18 of 188 B- type cells with increases, respectively, and n = 14 of 54 N- type and 5 of 188 B-type cells with decreased firing in LPFC, and n = 30 of 50 N-type and 13 of 216 B- type cells with increases, respectively, and n = 19 of 50 and 8 of 216 B-type cells with decreased firing in ACC). However, the N- and B-type responses to the reward were not significantly different in ACC or LPFC (ns., randomization test, Figure 3A,B). We estimated trial-by-trial RPEs with the same reinforcement learning model that also provided p(choice) for the previous analysis. RPE is calculated as the difference of received outcomes R and expected value V of the chosen stimulus (see Materials and methods). We found that on average 23% of LPFC and 35% of ACC neurons showed significant firing rate correlations with RPE in the post-outcome epoch with only moderately and non-significantly more N-type than B-type neurons having significant rate-RPE correlations (n = 9 N-type neurons, n = 31 B-type neurons, X²-test; p=0.64 for LPFC; n = 15 N-type neurons, n = 47 B-type neurons, X²-test; p=0.83 for ACC; Figure 3C,D). However, time-resolved analysis of the strength of the average correlations revealed a significant positive firing x RPE correlation in the 0.2–0.6 s after reward onset for ACC N-type neurons, which was absent in LPFC (ACC, n = 43 N-type neurons, randomization test p<0.05; LPFC: n = 31 N-type neurons, no time bin with sign.; Figure 3E,F). In ACC, the positive correlation of N-type neurons firing rate and RPE was evident in both monkeys (Figure 2—figure supplement 4D).

Figure 3

Download asset Open asset

Firing rate modulation to trial outcomes correlate with reward prediction errors.

(**A, B**) Narrow (red) and broad spiking neurons (blue) in LPFC (A) and ACC (B) on average activate to the reward outcome. (**C, D**) Proportion of narrow and broad spiking neurons in LPFC (C) and ACC (D) with significant firing rate X reward prediction error correlations in the [0 0.75] s after trial outcomes were received. (**E, F**) Time course of firing rate X reward prediction error correlations for narrow and broad spiking neurons in LPFC (E) and ACC (F) around the time of reward onset. Horizontal bar denotes time with significant correlations. Source data 1 Correlation data and script for ploting panels E, and F.

Classification of neural subtypes of putative interneurons

We next asked whether the narrow spiking, putative interneurons whose firing indexed relatively lower p(choice) in LPFC and relatively higher RPE in ACC are from the same electrophysiological cell type, or e-type (Markram et al., 2015; Gouwens et al., 2019). Prior studies have distinguished different narrow spiking e-types using the cells’ spike train pattern and spike waveform duration (Ardid et al., 2015; Dasilva et al., 2019; Trainito et al., 2019; Banaie Boroujeni et al., 2020b). We followed this approach using a cluster analysis to distinguish e-types based on spike waveform duration parameters (inferred hyperpolarization rate and time to 25% repolarization, Figure 1—figure supplement 2A,B), on whether their spike trains showed regular or variable interspike intervals (local variability ‘LV’, Figure 1—figure supplement 2D), or more or less variable firing relative to their mean interspike interval (coefficient of variation ‘CV’, Figure 1—figure supplement 2C). LV and CV are moderately correlated (r = 0.26, Figure 1—figure supplement 2E), with LV indexing the local similarity of adjacent interspike intervals, while CV is more reflective of the global variance of higher and lower firing periods (Shinomoto et al., 2009). We ran the k-means clustering algorithm on neurons in ACC and LPFC using variables mentioned above and their firing rate (details in Materials and methods). Clustering resulted in eight e-types (Figure 4A–C). Cluster boundaries were highly reliable (Figure 4—figure supplement 1). Moreover, the assignment of a cell to its class was statistically consistent, and reliably evident for cells from each monkey independently (Figure 4—figure supplement 2). Narrow spiking neurons fell into three e-types. The first narrow spiking N1 e-type (n = 18, 13% of narrow spiking neurons) showed high firing rates and highly regular spike trains (low LVs, mean LV 0.47, SE 0.05). The second N2 e-type (n = 27, 20% of narrow spiking neurons) showed on average Poisson spike train variability (LVs around 1) and the narrowest waveforms, and the N3 e-type (n = 91, 67% of all narrow spiking neurons) showed intermediate narrow waveform duration and regular firing (LV’s < 1, mean LV 0.84, SE 0.02) (Figure 4C). Neurons within an e-type showed similar feature characteristics irrespective of whether they were from ACC or LPFC. For example, N3 e-type neurons from ACC and in LPFC were indistinguishable in their firing and action potential characteristics (LV_{ACC / LPFC} = 0.79/0.88, ranksum-test, p=0.06; CV_{ACC / LPFC} = 1.19/1.31, ranksum-test, p=0.07; Firing Rate_ACC/LPFC = 4.41/4.29, ranksum-test p=0.71; action potential repolarization time (hyperpolarization rate)_{ACC / PFC} = 0.18 sec. (97 s.⁻¹)/0.17 s. (93 s.⁻¹)).

Figure 4 with 2 supplements see all

Download asset Open asset

Clustering of *e-type* sub-classes of cells using their spike width, firing variability and rate.

(A) Dendrogram of cluster distances for neuron classes with broad spikes (five subclasses, *blue*), and narrower spikes (three subclasses, *orange* and *red*). (B) For each e-type (*x-axis*) the average LV, CV and firing rate. The rightmost point shows the average for all *e-types* combined. (C) Illustration of the average spike waveform, spiketrain raster example, and Local Variability (LV, *upper* histograms) for each clustered *e-type*. The bottom grey LV histogram includes all recorded cells to allow comparison of *e-type* specific distribution. (D) The average post- to pre- spike MUA modulation (*y-axis*) for neurons of the different *e-types*. Values below 0 reflect reduced multiunit firing after the neuron fires a spike compared to before the spike, indicating a relative suppressive relationship. Only the N3 *etype* showed a systematically reduced post-spike MUA modulation. MUA were always recorded from other electrodes nearby the spiking neuron. Source data 2 Data and script used for clustering (panel A) and data used for plotting panels B, and C.

Beyond the narrow spiking classes, spiketrains and LV distributions showed five broad spiking neuron e-types. The B1-B5 e-types varied from irregular burst firing in e-types B2, B3 and B4 (LV >1, class B2 mean LV 1.20, SE 0.02, class B3 mean LV 0.93, SE 0.02, class B4 mean 1.24, SE 0.03), regular firing in B1 (LV <1, class B1 mean LV 0.75, SE 0.02) to regular non-Poisson firing in B5 (LV >1, class B5 mean LV 1.68, SE 0.02) (number and % of broad spiking cells: B1: 109 (18%), B2: 103 (17%), B3: 94 (16%), B4: 146 (25%), B5: 138 (23%)) (Figure 4B,C). LV values > 1 indicate bursty firing patterns which is supported by a positive correlation of the LV of neurons with their probability to fire bursts defined as spikes occurring ≤5 ms apart (r = 0.44, p<0.001, Figure 1—figure supplement 2F). We next calculated the post- to pre- spike-triggered MUA modulation ratio for each of the e-types. Across all e-types only the spike-triggered MUA modulation ratio for the N3 e-type was different from zero (p<0.05, FDR-corrected) (Figure 4D). Comparison between cell classes showed that the spike-triggered MUA modulation ratio for the N3 e-type differed significantly from the B4 (p=0.02) and B5 (p=0.03) e-types.

The same interneuron subclass indexes P(choice) in LPFC and RPE in ACC

The distinct e-types allowed testing how they correlated their firing with choice probability and with RPE. We found that the only e-type with a significant average correlation of firing and choice probability during the cue period was the N3 e-type in LPFC (r = 0.08, Kruskal Wallis test, p=0.04; randomization test difference to zero, Tukey-Kramer multiple comparison corrected, p<0.05; Figure 5A,B). Consistent with this correlation, neurons of the N3 e-type in LPFC also significantly increased firing to the color cue, irrespective of whether the color cue appeared early or later in the trial (p<0.05 during 0.04–0.2 s after feature two onset, and p<0.05 during 0.175–0.225 s after feature one onset, Figure 5—figure supplement 1). The on-average positive correlation of firing rate and p(choice) was also evident in an example N3 e-type cell (Figure 5—figure supplement 2A–C). There was no other e-type in LPFC and in ACC showing significant correlations with choice probability. In LPFC, a linear classifier trained on multiclass p(choice) values was able to label N3 e-type neurons based on their p(choice) values with an accuracy of 31% (Figure 5—figure supplement 3A).

Figure 5 with 3 supplements see all

Download asset Open asset

*E-type*-specific correlations with choice probability and reward prediction error in LPFC and ACC.

(**A, B**) Firing Rate X Choice Probability correlations for neurons of each *e-type* subclass in LPFC (A) and ACC (B). Only the N3 *e-type* neurons in LPFC show significant correlations. (**C, D**) Firing Rate X Reward Prediction Error correlations for neurons of each *e-type* subclass in LPFC (C) and ACC (D). The N3 *e-type* neurons in ACC show significant positive correlations, and the B3 *e-type* shows negative firing rate x RPE correlations. Grey shading denotes significance at p<0.05 (multiple comparison corrected). Error bars are SE’s. Source data 1 Correlation data and script for ploting panels A-D.

Similar to the N3 e-type in LPFC, in ACC it was the N3 e-type that was the only narrow spiking subclass with a significant functional firing rate correlation with reward prediction errors (RPE) (n = 30 neurons; r = 0.09, Kruskal Wallis test, p=0.01, randomization test for sign. difference to zero, Tukey-Kramer multiple comparison corrected p<0.05, Figure 5C,D). The only other e-type with a significant firing rate x RPE correlation was the B4 class which fired stronger with lower RPE’s (n = 18 neurons; r = −0.08, Kruskal Wallis test, p=0.01, randomization test for sign. difference to zero, multiple comparison corrected p<0.05). There was no subtype-specific RPE correlation in LPFC (Figure 5C,D). The average positive correlation of firing rate and RPE was also evident in example ACC N3 e-type cells (Figure 5—figure supplement 2D–F). In ACC, a linear classifier trained on multiclass RPE values was able to label N3 e-type neurons from their RPE value with an accuracy of 34% (Figure 5—figure supplement 3B).

Narrow spiking neurons synchronize to theta, beta, and gamma band network rhythms

Prior experimental studies have suggested that interneurons have unique relationships to oscillatory activity (Puig et al., 2008; Cardin et al., 2009; Sohal et al., 2009; Vinck et al., 2013; Womelsdorf et al., 2014a; Chen et al., 2017; Voloh and Womelsdorf, 2018; Shin and Moore, 2019; Banaie Boroujeni et al., 2020b; Onorato et al., 2020), raising the possibility that the N3 e-type neurons realize their functional contributions to p(choice) and RPE processing also through neuronal synchronization. To discern this, we first inspected the spike-triggered LFP averages (STAs) of neurons and found that STAs of many N3 e-type neurons showed oscillatory sidelobes in the 10–30 Hz range (Figure 6A). We quantified this phase synchrony by calculating the spike-LFP pairwise phase consistency (PPC) and extracting statistically significant peaks in the PPC spectrum (Vinck et al., 2012; Banaie Boroujeni et al., 2020a), which confirmed the presence of significant synchrony peaks across theta/alpha, beta and low gamma frequency ranges (Figure 6B). The density of spike-LFP synchrony peaks, measured as the proportion of neurons that show reliable PPC peaks (see Materials and methods), showed a high prevalence of 15–30 Hz beta synchrony for broad spiking neurons in both, ACC and LPFC, a peak of ~5–12 Hz synchrony that was unique to ACC, and a high prevalence of 35–45 Hz gamma synchronization in narrow spiking cells (but not in broad spiking cells) in both areas (Figure 6C; Voloh et al., 2020). The synchrony peak densities of the N3 e-type neurons mimicked this overall pattern by showing beta to gamma band synchrony peak densities in LPFC and a 5–12 Hz theta/alpha and a gamma synchrony in ACC (Figure 6C) (for peak densities of other e-types, see Figure 6—figure supplement 1).

Figure 6 with 1 supplement see all

Download asset Open asset

Spike-LFP phase synchronization.

(A) Average spike-triggered local field potential fluctuations of nine N3 *e-type* neurons showing a transient LFP oscillations from 5 Hz up to ~30 Hz. Black vertical line is the time of the spike. The red lines denote the LFP after adaptive spike artifact removal (raw traces in gray). (B) Peak normalized pairwise phase consistency for each spike-LFP pair (*y-axis*) rank ordered according to the frequency (*x-axis*) with peak PPC. (C) Proportion of sign. peaks of spike-LFP synchronization for neurons in LPFC (*left*) and ACC (*right*) for narrow and broad spiking neurons (*upper rows*) and for the N3 *e-type* neurons (*bottom row*).

Interneuron-specific gamma synchronization following cues in LPFC and outcomes in ACC

The overall synchrony patterns leave open whether the synchrony is task modulated or conveys information about choices and prediction errors. We addressed these questions by calculating spike-LFP phase synchronization time-resolved around the color cue onset (for LPFC) and around reward onset (for ACC) separately for trials with high and low choice probabilities (for LPFC) and high and low reward prediction errors (for ACC). We found in LPFC that the N3 e-type neurons showed a sharp increase in 35–45 Hz gamma band synchrony shortly after the color cue is presented and choice probabilities were low (i.e. when the animals were uncertain which stimulus is rewarded), while broad spiking neurons did not show gamma synchrony (Figure 7A–C) (N3 e-type vs broad spiking cell difference in gamma synchrony in the 0–700 ms after color cue onset: p<0.05, randomization test, multiple comparison corrected). When choice probabilities are high, N3 e-type neurons and broad spiking neurons in LPFC showed significant increases of 20–35 Hz beta-band synchronization (Figure 7D,E) with N3 e-type neurons synchronizing significantly stronger to beta than broad spiking neuron types (Figure 7F) (p<0.05 randomization test, multiple comparison corrected). These effects were restricted to the color cue period. LPFC broad spiking neurons and N3 e-type neurons did not show spike-LFP synchronization after the reward onset in low or high RPE trials (Figure 7—figure supplement 1A–D). Moreover, the gamma synchrony when p(choice) was low was not found in other narrow spiking or broad spiking e-types with the LPFC N3 e-type showing stronger gamma synchrony than broad spiking classes in the low p(choice) trials (p=0.02, Tukey-Kramer multiple comparison corrected) (Figure 7—figure supplement 1E–F). There was no difference in 35–45 Hz gamma synchrony of other cell classes in LPFC in the 0–0.7 s after reward onset in the high or low RPE trials, or around the (0.7 s) color onset in the high p(choice) trials (Figure 7—figure supplement 1E–H, see Figure 7—figure supplement 2A for time-frequency maps for all cell classes around cue onset).

Figure 7 with 4 supplements see all

Download asset Open asset

Spike-LFP phase synchronization in LPFC around the color onset for trials with low and high choice probability.

(A) Spike-LFP pairwise phase consistency for broad spiking neurons in LPFC around the time of the color onset (*x-axis*) for trials with the 50% lowest choice probabilities. (B) Same as (A) for neurons of the N3 *e-type*. Black contour line denotes statistically significant increased phase synchrony relative to the pre-color onset period. (C) Statistical comparison of spike-LFP synchrony for N3 *e-type* neurons (orange) versus broad spiking neurons (blue) for low choice probability trials in LPFC. Synchrony is normalized by the pre-color onset synchrony. Gray shading denotes p<0.05 significant differences of broad and N3 type neurons. (*D,E,F*) Same format as (*A,B,C*) but for the 50% of trials with the highest choice probability. Source data 3 Coherence data and script for ploting panels A-F.

In ACC, the N3 e-type neurons synchronized in a 35–42 Hz gamma band following the reward onset when RPE’s were high (i.e. when outcomes were unexpected), which was weaker and emerged later when RPEs were low, and which was absent in broad spiking neurons (Figure 8). In contrast to this gamma synchronization at high RPE, low RPE trials triggered increased spike-LFP synchronization at a ~ 6–14 Hz theta/alpha frequency in the N3 e-type neurons (Figure 8C). The increase of 6–14 Hz synchrony was significantly stronger in the N3 e-type than in broad spiking neurons in the 0 to 0.7 s post reward onset period (Figure 8F). These gamma and theta band effects of the N3 e-type neurons in ACC were restricted to the reward period, that is, they were absent in the color cue period for trials with high or low p(choice) (Figure 7—figure supplement 3A–D). Comparison to the other e-types showed that the N3 e-type significantly stronger gamma synchronized in the reward period when RPEs were high (p=0.04, Tukey-Kramer, multiple comparison corrected) (Figure 7—figure supplement 3E). Other e-type classes did not differ in their spike-LFP synchronization in this 35–45 Hz gamma band in low or high RPE trials with the exception of the B2 class in ACC that synchronized in high RPE trials at a higher >50 Hz gamma band (Figure 7—figure supplement 3E–H, see Figure 7—figure supplement 2B for time-frequency maps for all cell classes around reward onset).

Figure 8

Download asset Open asset

Spike-LFP phase synchronization in ACC during outcome processing for trials with low and high reward prediction errors.

(A) Spike-LFP pairwise phase consistency for broad spiking neurons in ACC around reward onset (*x-axis*) for trials with the 50% lowest reward prediction errors. (B) Same as (A) for neurons of the N3 *e-type*. Black contour line denotes statistically significant increased phase synchrony relative to the pre-reward period. (C) Statistical comparison of the spike-LFP synchrony (normalized by the pre-reward synchrony) for N3 *e-type* neurons (orange) versus broad spiking neurons (blue) in ACC for trials ending in low reward prediction errors. Gray shading denotes frequencies with p<0.05 significant differences of broad spiking versus N3 *e-type* neurons. (*D,E,F*) Same format as (*A,B,C*) but for the 50% of trials with the highest high reward prediction error outcomes. Source data 3 Coherence data and script for ploting panels A-F.

The spike-LFP synchronization results in PFC and in ACC were unchanged when the average reward onset aligned LFP, or the average color-cue aligned LFP was subtracted prior to the analysis, which controls for a possible influence of lower frequency evoked potentials (Figure 7—figure supplement 4).

Circuits model of interneuron-specific switches between gamma and beta or theta synchronization

The previous results showed that neurons of the N3 e-type engaged in a transient ~35–45 Hz gamma band synchronization during trials that were characterized by uncertainty. In LPFC gamma synchronization was evident when expected stimulus values were uncertain (reflected in low p(choice)), and in ACC gamma synchronization emerged when reward outcomes were uncertain (reflected in high RPE). In contrast, there was no gamma-band synchrony when choice probabilities were certain and reward outcomes predictable. In these trials, N3 e-type neurons rather showed beta synchronization to the cue (in LPFC), or theta band synchronization to the reward onset (in ACC). These findings indicate that oscillatory activity signatures inform us about the possible circuit motifs underlying uncertainty-related related computations. These computations are formally described in the reinforcement learning framework allowing us to propose a linkage of specific computations to oscillatory activity signatures and their putative circuits as proposed in the Dynamic Circuits Motif framework (Womelsdorf et al., 2014b).

To show the feasibility of this approach we devised two circuit models that reproduces the gamma band activity signatures in LPFC and ACC using populations of inhibitory cells modeled to correspond to N3 e-type cells (for modeling details, see Appendix 1). First, we modeled a putative LPFC circuit. Here, N3 e-type neurons showed gamma synchronization when p(choice) was low which happens in trials in which the values of the two available objects are similar and the choice among them is difficult (see Equation 3 in Materials and methods). We predicted in this situation gamma synchronization of the N3 e-type reflects resolving competition among inputs from similarly active, pyramidal cell populations encoding the expected values of the two objects. To test whether this scenario is plausible, we conceptualized and then simulated a circuit which modelled the activity of an N3 e-type neuron population that we presumed to be PV+ fast spiking basket cells (see Discussion) activated by two excitatory pyramidal cell populations (Es) whose activity scales with the value of the stimuli (Figure 9A). Such an E-I network can synchronize by way of mutual inhibition at beta or gamma frequencies depending on the total amount of drive the network receives (Wang and Buzsáki, 1996; White et al., 1998; Tiesinga and José, 2000). When both stimuli have similar values and the choice probability is relatively low, the drive to the network is high and it synchronizes in the gamma band. In contrast, when one of the objects has a value that is much larger than the other which results in high choice probabilities for that stimulus, it results in a net level of drive that makes the network synchronize in the beta band. We observed such a switch from gamma to beta frequencies in N3 e-type interneurons in LPFC when the choice probabilities changed from low to high (Figure 7). In order to show that such gamma-to-beta switch can indeed follow from such a E-I network as a function of the diversity of inputs we ran simulations in a firing rate E-I model (Keeley et al., 2017), described in detail in Appendix 1, which reproduces the gamma-beta switch (Figure 9—figure supplement 1). The network model simulations suggest that the N3 e-type inhibition in LPFC after color-cue onset might accomplish two functions. It leads to a normalization that transforms the object value into a choice probability (a soft winner-take-all gating of values, see Equation 3 in Materials and methods) and its gamma synchrony indexes resolving strong competition when similar excitatory drive originates from different sources (Figure 9A).

Figure 9 with 2 supplements see all

Download asset Open asset

Hypothetical link of the observed gamma band synchronization of the N3 e-type to circuit motifs and their putative functional correlate.

(A) The N3 e-type in LPFC synchronized at gamma when p(choice) was relatively low and at beta frequencies otherwise. The switch from gamma to beta synchronization can be parsimoniously reproduced in a circuit model with an interneuron (I) population receiving inputs from two excitatory (E) populations. When the input is diverse (similar p(choice)) a simulated circuit shows gamma activity (left) while when one excitatory population dominates it engages in beta synchronization (simulation details in Appendix 1). This activity signature could correspond at the functional level to choosing among similar valued stimuli (left) versus choosing stimuli with different values (bottom row). (B) In ACC the N3 e-type synchronized at gamma when the prediction error was large and at theta frequencies otherwise. The switch from gamma to theta synchronization can parsimoniously be reproduced in a circuit model with two I populations having different time constants and reciprocally connected to an E population. When the faster spiking I1 population is activated stronger, either directly from an external source, putatively by disinhibition of another interneuron population, the network synchronizes at gamma while otherwise the I2 neurons population imposes slower theta rhythmic synchrony to the network (simulation details in Appendix 1). Bottom: The activity states were functionally linked to those trials when outcomes mismatched expectations (high RPE) or matched the expected outcomes (low RPE).

Secondly, we conceptualized and simulated a circuit model that reproduces the oscillatory findings in ACC where the N3 e-type neurons gamma-synchronized when outcomes were unexpected (high RPE) but synchronized in the theta band otherwise (low RPE). Such a gamma/theta switch is different to the gamma/beta switch seen in LPFC (see above). A parsimonious circuit realizing such a switch uses two separate interneuron populations (Is) that inhibit a common group of pyramidal cells (Es): A fast interneuron (I1) presumed to be PV+, corresponding to the N3 e-type (see Discussion), and a slower interneuron population (I2) (Figure 9B). When both are reciprocally connected with an excitatory population (E), an oscillatory regime emerges whose frequency varies depending on which interneuron population receives more excitatory drive (details in Appendix 1). When the I1 population receives stronger drive, gamma frequency synchronization dominates the network, while a relatively stronger drive to the I2 population causes neurons in the network to switch to slower, theta band synchronization. We documented this gamma/theta switching result in simulations of firing rate neurons in detail in the Appendix 1. The activity signatures of this E-I-I model resembles the empirical activity signatures. The theta synchronous activity that reflects the activity of I2 neurons corresponds to low RPE trials, in which a reward R is received and the value V of the chosen stimulus was relatively high (a high V and a large R, the RPE is computed as = R-V) (see Equation 1 in Materials and methods) (Watabe-Uchida et al., 2017). In contrast, the gamma synchronous state that emerged with larger drive to the I1 neurons in the model corresponds to high RPE trials, in which a reward R is received, but the value V of the chosen stimulus was relatively low. This circuit motif is plausible when one assumes that the I1 neuron population is disinhibited when the chosen stimulus value is low. Such a disinhibition can be achieved by lowering the drive to I2 cells (which may require high values to be activated), or by assuming a separate disinhibitory circuit (for details see Appendix 1). In summary, the E-I-I motif reproduces the switch of gamma to theta synchronization we observed in ACC N3 e-type neurons. At the functional level, the circuit suggests that the emergence of gamma activity in this network indexes the detection of a mismatch between the received reward (as one source of excitation) and the chosen stimulus value (as another source of excitation) (Figure 9B).

The described circuits provide proofs-of-concept that the synchronization patterns we observed in the N3 e-type interneurons in ACC and LPFC during periods of uncertain values and outcomes can originate from biologically realistic circuits. The results justify future studies generating and testing quantitative predictions that can be derived from these circuit motifs.

Discussion

We found that narrow spiking neurons in the medial and lateral prefrontal cortex of macaques cause a fast drop of local multiunit activity indicative of inhibitory interneurons. These putative interneurons in LPFC showed increased firing rates to the color-cue onset, encoded the rewarded color and correlated their rates with the choice probabilities, while in ACC their firing correlated with reward prediction errors during the processing of the reward outcome. These functional signatures were specifically linked to a putative interneuron subtype (N3) that showed intermediate narrow action potential waveforms and more regular firing patterns than expected from a Poisson process (LVs of N3 e-type neurons: 0.84). Moreover, this putative interneuron (N3) e-type engaged in prominent event-triggered 35–45 Hz gamma band synchronization in each of the recorded brain areas. In LPFC, the N3 e-type synchronized at gamma to the cue when choice probabilities were low and uncertain, and in ACC the N3 e-type synchronized at gamma to the reward onset when the RPE was high and the reward outcome was unexpected. Thus, the same e-type showed functional firing correlations and gamma synchrony in LPFC and in ACC during periods of uncertainty about cues and outcomes, respectively. Taken together, these findings point to a special role of the same type of interneuron in LPFC and in ACC to realize their area specific functional contribution to the color-based reversal learning task. This interpretation highlights several aspects of interneuron specific circuit functions.

Characterizing narrow spiking interneurons in vivo

The first implication of our findings is that narrow spiking neurons can be reliably subdivided in three subtypes based on their electrophysiological firing profiles. Distinguishing three narrow spiking neurons in vivo during complex task performance is a significant step forward to complement previous electrophysiological distinctions of three interneuron types in-vitro (Zaitsev et al., 2009; Torres-Gomez et al., 2020) or in vivo (Ardid et al., 2015; Dasilva et al., 2019; Shin and Moore, 2019; Banaie Boroujeni et al., 2020b), and complementing the finer-grained electrophysiological characterization of ‘e-types’ in-vitro that has been achieved with a rich battery of current injection patterns that are difficult to apply in the awake and behaving primate (Markram et al., 2004; Monyer and Markram, 2004; Medalla et al., 2017; Gouwens et al., 2019). This in vitro ‘e-typing’ has distinguished eleven (Markram et al., 2015) or thirteen (Gouwens et al., 2019) distinct interneuron e-types in rodent somatosensory and mouse visual cortex, respectively. In the visual cortex, these classes entailed six fast spiking subclasses showing variably transient, sustained or pause-delay response patterns (Gouwens et al., 2019). Notably, the fast spiking interneuron classes in that study were characterized by a low coefficient of variation (CV), low bursting reflective of a low Local Variability (LV), and a feature-importance analysis showed that the narrow action potential width and firing rate of these neurons were most diagnostic for separating the fast spiking from other neuron classes (Figure 2i, S9, and S14 in Gouwens et al., 2019). Our study used these diagnostic metrics (LV, CV, AP width and rate) directly for the clustering because we do not have the current injection responses available and distinguished three interneurons in the monkey compared to six fast spiking interneuron e-types in the mouse study. These results illustrate that our three interneuron e-types will encompass further subclasses that future studies should aim to distinguish in order to narrow the gap between the in vivo e-types that we and others report in the monkey, and the in-vitro e-types in the rodents that are more easily mapped onto specific molecular, morphological and genetic make-ups (Markram et al., 2015; Gouwens et al., 2019). As a caveat, this mapping of cell types between species might also reveal cell classes and unique cell class characteristics in nonhuman primate cortices that are not similarly evident in rodents as recently demonstrated in a cross-species study of non-fast spiking gamma rhythmic neurons in early visual cortex that were exclusively evident in the primate and not in mice (Onorato et al., 2020).

With regard to the specific interneuron e-types we believe that the N3 e-type that showed functional correlations in two areas encompasses mostly parvalbumin PV+ expressing neurons, because of their narrow spikes, regular inter-spike intervals and their propensity to synchronize at gamma, which resemble the regular firing and gamma synchrony described for PV+ cells in the rodent (Cardin et al., 2009; Tiesinga, 2012; Stark et al., 2013; Amilhon et al., 2015; Chen et al., 2017; Gouwens et al., 2019). Moreover, similar to the N3 e-type responses to the attention cue, rodent dorsomedial frontal PV+ neurons systematically activate to preparatory cues while somatostatin neurons respond significantly less (Pinto and Dan, 2015). However, PV+ neurons are heterogeneous and entail Chandelier cells and variably sized basket cells (Markram et al., 2004; Markram et al., 2015; Gouwens et al., 2019). It might therefore be an important observation that the N3 e-type was distinguished from other narrow spiking neurons by having a lower firing rate and an intermediate-narrow action potential shape as opposed to the narrowest waveform and highest firing rates that N1 e-types showed. The proposed tentative suggestion that N3 e-type neurons will be mostly PV+ cells also entails for the primate brain that they would not be part of calretinin (CR+) or calbindin (CB+) expressing cells as their expression profiles do not apparently overlap (Dombrowski et al., 2001; Medalla and Barbas, 2009; Raghanti et al., 2010; Torres-Gomez et al., 2020).

What is the circuit role of the N3 interneuron e-type?

Assuming that N3 e-type neurons are partly PV+ neurons, we speculate that this translates into gamma rhythmic inhibition of local circuit pyramidal cells close to their soma where they impose output gain control (Tiesinga et al., 2004; Bartos et al., 2007; Womelsdorf et al., 2014b; Tremblay et al., 2016). In our task, such local inhibition was linked to how uncertain the expected values of stimuli were (reflected in low choice probabilities) or how unexpected reward outcomes were (reflected in high RPE’s). These conditions are periods that require a behavioral adaptation for which N3 e-type mediated inhibition could be instrumental. For example, in LPFC pyramidal cells that encoded the rewarded color in trials prior to the un-cued reversal become irrelevant when the reversal links reward to the alternative color and hence need to be suppressed during the reversal. This suppression of neurons encoding the previously relevant but now irrelevant color might be realized through activation of the N3 e-type neuron. Similarly, the N3 e-type activation in ACC reflects a rise in inhibition when an unexpected outcome (high RPE) is detected. This activation might therefore facilitate the updating of value expectations to reduce future prediction errors (Sutton and Barto, 2018; Oemisch et al., 2019).

The described, putative functions of N3 e-type activity provide direct suggestions on how they might contribute to transform inputs to outputs in a neural circuit. To understand this process, we devised and simulated circuit models of the activity signatures of inhibitory cells for the LPFC and the ACC (Figure 9, Appendix 1). For LPFC, we devised an E-E-I circuit where the interneuron (I) population synchronized at gamma when the excitatory drive of two E-cell populations was similar (Appendix 1, Figure 9—figure supplement 1). This situation mimics the situation when the values of two objects are similar, resulting in a low choice probability. According to this circuit, the function of I cells that putatively correspond to the N3 e-type neurons in LPFC is twofold. They normalize the activity of the excitatory cells, and they are instrumental in gating the activity of one over the other excitatory cell population when there is competition among them. Such competition arises specifically when choice probabilities are low because the low p(choice) indicates that the expected values of the stimuli to choose from are similar which makes a choice difficult. We therefore speculate that the putative circuit function of the N3 e-type cells in LPFC is the gating of competing excitatory inputs (Figure 9A).

For ACC, we devised an E-I-I circuit where the population of the N3 e-type putatively corresponded to one population of fast spiking inhibitory neurons (I1) that synchronized to gamma when receiving stronger excitatory drive than another population of slower inhibitory neurons (I2) (Figure 9—figure supplement 1B). The enhanced excitation of the I1 over the I2 population was modeled to correspond to trials with high RPE, which occurred when a reward (R) was received but the expected value (V) of the chosen stimulus was relatively low (a large RPE defined as the difference of R-V). In this situation, a stronger excitatory drive and consequently a gamma synchronous activity, could follow from disinhibiting the I1 population. Such a disinhibition could originate from reduced inhibition from the I2 cells in trials with low stimulus value, or it could originate from disinhibition from other neurons. These scenarios deserve explicit testing in future studies (for further discussion, see Appendix 1). They gain plausibility from anatomical studies that report that a large proportion of connections to interneurons go to disinhibitory interneurons that express calretinin and are distinct from the fast-spiking PV+ neurons that more likely entail the N3 e-type neurons (Medalla and Barbas, 2009; Medalla and Barbas, 2010). In summary, the proposed circuit model for the ACC suggests that the N3 e-type neurons activate when there is a mismatch of reward and chosen value. Activation of the N3 e-type neurons may thus be a (bio-) marker that predictions need to be updated to improve future performance.

We acknowledge that the proposed circuit models represent merely a proof-of-concept that says that the neuronal activities can originate in reasonable and previously described E-I motifs. They are not full biophysical implementations of the actual reversal learning task and entail finer predictions that await quantitative testing in future studies. They motivate combined electrophysiological and optogenetic studies in the primate to clarify cell-type-specific circuit functions during higher cognitive operations.

Interneuron-specific gamma synchronization: Comparison to previous studies

Two major findings of our study pertain to spike-LFP gamma band synchronization. First, we found that N3 e-type neurons showed an event-triggered synchrony increase in the same 35–45 Hz gamma frequency band in both LPFC and ACC when there was uncertainty about the correct choice (low p(choice) or about the outcomes (high RPE) [see Figures 7C and 8F]). Synchronization of the N3 e-type switched from a gamma frequency to the beta frequency in LPFC when the choices became more certain, and to the theta frequency in ACC when outcomes became more certain. An intrinsic propensity for generating gamma rhythmic activity through, for example GABA_aergic time constant, is well described for PV+ interneurons (Wang and Buzsáki, 1996; Bartos et al., 2007; Womelsdorf et al., 2014b; Chen et al., 2017) and is a documented activity signature even at moderate excitatory feedforward drive that might be more typical for prefrontal cortices than earlier visual cortices (Cardin et al., 2009; Vinck et al., 2013; Shin and Moore, 2019; Onorato et al., 2020).

Our findings provide strong empirical evidence that narrow spiking interneurons are the main carriers of gamma rhythmic activity in nonhuman primate prefrontal cortex during cue and outcomes processing (Whittington et al., 2000; Hasenstaub et al., 2005; Bartos et al., 2007; Hasenstaub et al., 2016; Chen et al., 2017; Shin and Moore, 2019). This conclusion resonates well with rodent studies that document how interneurons in infra-/peri-limbic and cingulate cortex engage in gamma synchrony (Fujisawa and Buzsáki, 2011; Cho et al., 2015).

The second major implication of the gamma synchronous N3 e-type neurons is that gamma band synchrony was associated with task epochs in which neural circuits realize a circuit function that can be considered to be ‘area specific’. In LPFC, the gamma increase was triggered by the color-cue onset of two peripherally presented stimuli that instructed covertly shifting attention. Our circuit model (Figure 9A) illustrates that cue related gamma was restricted to periods when object values were similar, and the animal still learned which object is most reward predictive. The control of learning what is relevant during cognitively demanding tasks is a key function of the LPFC, suggesting that gamma activity emerges when this key function is called upon (Miller and Cohen, 2001; Szczepanski and Knight, 2014; Cho et al., 2020). A similar scenario holds for the ACC whose central function is often considered to monitor and evaluate task performance and detect when outcomes should trigger a change in behavioral strategies (Shenhav et al., 2013; Heilbronner and Hayden, 2016; Alexander and Brown, 2019; Fouragnan et al., 2019). In ACC, the gamma increase was triggered by an unexpected, rewarded outcome (high RPE). Thus, the N3 e-type specific gamma band signature occurred specifically in those trials with conflicting stimulus values requiring behavioral control to reduce the prediction errors through future performance (Figure 9A). Considering this ACC finding together with the LPFC finding suggests that gamma activity of N3 e-type neurons indexes a key function of these brain areas, supporting recent causal evidence from rodent optogenetics (Cho et al., 2020).

Consistent with the proposed importance of interneurons for area-specific key functions prior studies have documented the functional importance of inhibition in these circuits. Blocking inhibition with GABA antagonists like bicuculline not only renders fast spiking interneurons nonselective during working memory tasks but abolishes the spatial tuning of regular spiking (excitatory) cells during working memory tasks in monkeys (Sawaguchi et al., 1989; Rao et al., 2000), disturbs accuracy in attention tasks (Paine et al., 2011) and reduces set shifting flexibility by enhancing perseveration (Enomoto et al., 2011). Similarly, abnormally enhancing GABAa levels via muscimol impairs working memory and set shifting behavior (Rich and Shapiro, 2007; Urban et al., 2014) and can result in either maladaptive impulsive behaviors (Paine et al., 2015), and when applied in anterior cingulate cortex to perseveration (Amiez et al., 2006). Thus, altered medial and lateral prefrontal cortex inhibition is closely linked to an inability to adjust attentional strategies given unexpected outcomes. This evidence supports our studies suggestion of the importance of inhibitory neuron involvement in resolving uncertainties during adaptive behaviors.

Taken together, our interneuron-specific findings in primate LPFC and ACC stress the importance of interneurons to influence circuit activity beyond a mere balancing of excitation. Multiple theoretical accounts have stressed that some types of interneurons ‘control information flow’ (Fishell and Kepecs, 2020), by imposing important filters for synaptic inputs to an area and gain-control the output from that area (Akam and Kullmann, 2010; Kepecs and Fishell, 2014; Womelsdorf et al., 2014b; Roux and Buzsáki, 2015; Cardin, 2018). Testing these important circuit functions of interneurons has so far been largely limited to studies using molecular tools. Our study addresses this limitation by characterizing putative interneurons, delineating their suppressive effects on the circuit and highlighting their functional activation during reversal learning. The observed interneuron-specific, gamma synchronous coding of choice probabilities and prediction errors lends strong support to study cell-type-specific circuit mechanisms of higher cognitive functions.

Materials and methods

All animal care and experimental protocols were approved by the York University Council on Animal Care (ethics protocol 2015–15 R2) and were in accordance with the Canadian Council on Animal Care guidelines.

Share this article

Cite this article

Task paradigm and cell classification.

Firing rate modulation of narrow and broad spiking neurons to the color cue correlate with choice probability.

Firing rate modulation to trial outcomes correlate with reward prediction errors.

Clustering of e-type sub-classes of cells using their spike width, firing variability and rate.

E-type-specific correlations with choice probability and reward prediction error in LPFC and ACC.

Spike-LFP phase synchronization.

Spike-LFP phase synchronization in LPFC around the color onset for trials with low and high choice probability.

Spike-LFP phase synchronization in ACC during outcome processing for trials with low and high reward prediction errors.

Hypothetical link of the observed gamma band synchronization of the N3 e-type to circuit motifs and their putative functional correlate.

Author details

Kianoush Banaie Boroujeni

Contribution

For correspondence

Competing interests

Paul Tiesinga

Contribution

Competing interests

Thilo Womelsdorf

Contribution

For correspondence

Competing interests

Citations by DOI

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organism