Speech-induced suppression and vocal feedback sensitivity in human cortex

Muge Ozker; Leyao Yu; Patricia Dugan; Werner Doyle; Daniel Friedman; Orrin Devinsky; Adeen Flinker

doi:10.7554/eLife.94198.1

eLife assessment

The manuscript describes human intracranial neural recordings in the auditory cortex during speech production, showing that the effects of delayed auditory feedback correlate with the degree of underlying speech-induced suppression. This is an important finding, as previous work has suggested that speech suppression and feedback sensitivity often do not co-localize and may be distinct processes, in contrast with findings in non-human primates where there is a strong correlation. The strength of the evidence is solid, with appropriate experimental methods, data, and analysis, though some additional analysis would strengthen comparisons with past work.

https://doi.org/10.7554/eLife.94198.1.sa2

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

solid: Methods, data and analyses broadly support the claims with only minor weaknesses

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Across the animal kingdom, neural responses in the auditory cortex are suppressed during vocalization, and humans are no exception. A common hypothesis is that suppression increases sensitivity to auditory feedback, enabling the detection of vocalization errors. This hypothesis has been previously confirmed in non-human primates, however a direct link between auditory suppression and sensitivity in human speech monitoring remains elusive. To address this issue, we obtained intracranial electroencephalography (iEEG) recordings from 35 neurosurgical participants during speech production. We first characterized the detailed topography of auditory suppression, which varied across superior temporal gyrus (STG). Next, we performed a delayed auditory feedback (DAF) task to determine whether the suppressed sites were also sensitive to auditory feedback alterations. Indeed, overlapping sites showed enhanced responses to feedback, indicating sensitivity. Importantly, there was a strong correlation between the degree of auditory suppression and feedback sensitivity, suggesting suppression might be a key mechanism that underlies speech monitoring. Further, we found that when participants produced speech with simultaneous auditory feedback, posterior STG was selectively activated if participants were engaged in a DAF paradigm, suggesting that increased attentional load can modulate auditory feedback sensitivity.

Introduction

A major question in neuroscience is how do animals distinguish between stimuli originating from the environment and those produced by their own actions. Sensorimotor circuits share a common mechanism across the animal kingdom in which sensory responses to self-generated motor actions are suppressed. It is commonly hypothesized that suppressing responses to predicted self-generated stimuli increases sensitivity of the sensory system to external stimuli. (Poulet and Hedwig 2002, Poulet and Hedwig 2006, Crapse and Sommer 2008, E.Vonholstan, Glenn et al. 2011, Schneider and Mooney 2018). Furthermore, it enables detection and correction of motor errors by providing a template of the predicted sensory outcome to compare with the actual sensory outcome. In the domain of speech, this mechanism is described in models which suggest that neural responses in the auditory cortex are suppressed during speech production. When there is a mismatch between the predicted auditory outcome and the actual auditory feedback, responses in the auditory regions are enhanced to encode the mismatch and inform vocal-motor regions to correct vocalization (Hickok, Houde et al. 2011, Houde and Nagarajan 2011, Tourville and Guenther 2011).

A common experimental strategy to generate mismatch between the predicted auditory outcome and the actual auditory feedback is to perturb auditory feedback during speech production. Auditory feedback perturbations are usually applied either by delaying auditory feedback (DAF), which disrupts speech fluency (Lee 1950, Fairbanks 1955, Stuart, Kalinowski et al. 2002), or by shifting voice pitch and formants, which result in compensatory vocal changes in the opposite direction of the shift (Houde and Jordan 1998, Jones and Munhall 2000, Niziolek and Guenther 2013). Numerous electrophysiological and neuroimaging studies investigated neural responses during speech production both in the absence and presence of auditory feedback perturbations. In support of speech production models, these studies have repeatedly reported suppressed responses in auditory cortex during speaking compared with passive listening to speech (Numminen, Salmelin et al. 1999, Wise, Greene et al. 1999, Curio, Neuloh et al. 2000, Houde, Nagarajan et al. 2002, Christoffels, Formisano et al. 2007, Ford, Roach et al. 2010, Niziolek, Nagarajan et al. 2013), as well as enhanced responses when auditory feedback was perturbed indicating sensitivity to auditory feedback (Tourville, Reilly et al. 2008, Behroozmand, Karvelis et al. 2009, Chang, Niziolek et al. 2013, Greenlee, Behroozmand et al. 2013, Kort, Nagarajan et al. 2014, Behroozmand, Shebek et al. 2015, Ozker, Doyle et al. 2022). However, it is not clear whether the same or distinct neural populations in the auditory cortex show speech-induced suppression and sensitivity to auditory feedback.

While auditory responses are largely suppressed during speech production, detailed investigations using neurosurgical recordings revealed that the degree of suppression was variable across cortical sites, and auditory cortex also exhibited non-suppressed and enhanced responses (albeit less common) (Creutzfeldt and Ojemann 1989, Flinker, Chang et al. 2010, Greenlee, Jackson et al. 2011), mirroring results from non-human primate studies using single unit recordings (Eliades and Wang 2003, Eliades and Wang 2008). In the same non-human primate study, it was reported that neurons that were suppressed during vocalization showed increased activity when auditory feedback was perturbed (Eliades and Wang 2008). Based on this finding, we predicted that if speech-induced suppression enables detection and correction of speech errors, suppressed auditory sites should be sensitive to auditory feedback, thus exhibit enhanced neural responses to feedback perturbations. Alternatively, if suppression and speech monitoring are unrelated processes, then suppressed sites should be distinct from the ones that are sensitive to auditory feedback.

The level of attention during speech monitoring can vary depending on the speech task. During normal speech production, speech monitoring does not require a conscious effort, however it is a controlled, attentional process during an auditory feedback perturbation task (Hashimoto and Sakai 2003). It is well known that selective attention enhances auditory responses and improves speech perception under noisy listening conditions or when multiple speech streams are present (Mesgarani and Chang 2012, Golumbic, Ding et al. 2013). We predicted that increased attention to auditory feedback under adverse speaking conditions, such as during an auditory feedback perturbation task, should increase feedback sensitivity and elicit larger responses in the auditory cortex compared to normal speech production.

To summarize, in this study we aimed to test the hypothesis that speech-induced suppression increases sensitivity to auditory feedback in human neurophysiological recordings. We predicted that auditory sites showing speech induced suppression would elicit enhanced responses to auditory feedback perturbations. Further, we aimed to investigate the role of attention in auditory feedback sensitivity by comparing auditory responses during an auditory feedback perturbation task compared with normal speech production.

To address these aims, we used iEEG recordings in neurosurgical participants, which offers a level of spatial detail and temporal precision that would not be possible to achieve using non-invasive techniques. We first identified the sites that show auditory suppression during speech production, and then employed a DAF paradigm to test whether the same sites show sensitivity to perturbed feedback. Our results revealed that overlapping sites in the STG exhibited both speech-induced auditory suppression and sensitivity to auditory feedback with a strong correlation between the two measures, supporting the hypothesis that auditory suppression predicts sensitivity to speech errors in humans. Further, we showed that auditory responses in the posterior STG are enhanced in a DAF task compared to normal speech production, even for trials in which participants receive simultaneous auditory feedback (no-delay condition). This result suggests that increased attention during an auditory feedback perturbation task can modulate auditory feedback sensitivity and posterior STG is a critical region for this attentional modulation.

Materials and Methods

Participant Information

All experimental procedures were approved by the New York University School of Medicine Institutional Review Board. 35 neurosurgical epilepsy patients (19 females, mean age: 31, 23 left, 9 right and 3 bilateral hemisphere coverage) implanted with subdural and depth electrodes provided informed consent to participate in the research protocol. Electrode implantation and location were guided solely by clinical requirements. 3 patients were consented separately for higher density clinical grid implantation, which provided denser sampling of underlying cortex.

Intracranial Electroencephalography (iEEG) Recording

iEEG was recorded from implanted subdural platinum-iridium electrodes embedded in flexible silicon sheets (2.3_Jmm diameter exposed surface, 8 x 8 grid arrays and 4 to 12 contact linear strips, 10_Jmm center-to-center spacing, Ad-Tech Medical Instrument, Racine, WI) and penetrating depth electrodes (1.1_Jmm diameter, 5-10_Jmm center-to-center spacing 1 x 8 or 1 x 12 contacts, Ad-Tech Medical Instrument, Racine, WI). 3 participants consented to a research hybrid grid implanted which included 64 additional electrodes between the standard clinical contacts (16 × 8 grid with sixty-four 2 mm macro contacts at 8 x 8 orientation and sixty-four 1 mm micro contacts in between, providing 10 mm center-to-center spacing between macro contacts and 5 mm center-to-center spacing between micro/macro contacts, PMT corporation, Chanassen, MN). Recordings were made using one of two amplifier types: NicoletOne amplifier (Natus Neurologics, Middleton, WI), bandpass filtered from 0.16-250_JHz and digitized at 512_JHz. Neuroworks Quantum Amplifier (Natus Biomedical, Appleton, WI) recorded at 2048 Hz, bandpass filtered at 0.01 to 682.67 Hz and then downsampled to 512 Hz. A two-contact subdural strip facing toward the skull near the craniotomy site was used as a reference for recording and a similar two-contact strip screwed to the skull was used for the instrument ground. iEEG and experimental signals (trigger pulses that mark the appearance of visual stimuli on the screen, microphone signal from speech recordings and feedback voice signal) were acquired simultaneously by the EEG amplifier in order to provide a fully synchronized dataset.

Experimental Design

Experiment 1: Auditory word repetition (AWR)

35 participants performed the experiment. Stimuli consisted of 50 items (nouns) taken from the revised Snodgrass and Vanderwart object pictorial set (e.g. “drum’, “hat”, “pencil”) (Rossion and Pourtois 2004, Shum, Fanda et al. 2020). Auditory words presented randomly (2 repetitions) through speakers. Participants were instructed to listen to the presented words and repeat them out loud at each trial.

Experiment 2: Visual word reading (VWR)

The same 35 participants performed the experiment. Stimuli consisted of the same 50 words used in Experiment 1, however visually presented as text stimuli on the screen in a random order (2 repetitions). Participants were instructed to read the presented word out loud at each trial.

Experiment 3: Delayed auditory feedback (DAF)

A subgroup of 14 participants performed this experiment. Stimuli consisted of 10 different 3-syllable words visually presented as text stimuli on the screen (e.g. “envelope”, “umbrella”, “violin”). Participants were instructed to read the presented word out loud at each trial. As participants spoke, their voices were recorded using the laptop’s internal microphone, delayed at 4 different amounts (no-delay, 50, 100, 200ms) using custom script (MATLAB, Psychtoolbox-3) and played back to them through earphones. Trials, which consisted of different stimulus-delay combinations, were presented randomly (3 to 8 repetitions). Behavioral and neural data from the DAF experiment were used in a previous publication from our group (Ozker, Doyle et al. 2022).

Experiment 4: Visual word reading with auditory feedback (VWR-AF)

A subgroup of 4 participants performed an additional visual word reading experiment, in which they were presented with the word stimuli as in Experiment 3 and heard their simultaneous (no-delay) voice feedback through earphones.

Statistical Analysis

Electrodes were examined for speech related activity defined as significant high gamma broadband responses. Unpaired t-tests were performed to compare responses to a baseline for each electrode and multiple comparisons were corrected using the false discovery rate (FDR) method (q=0.05). Electrodes that showed significant response increase (p < 10⁻⁴) either before (−0.5 to 0 s) or after speech onset (0 to 0.5 s) with respect to a baseline period (−1 to −0.6 s) and at the same time had a large signal-to-noise ratio (μ/σ > 0.7) during either of these time windows were selected. Electrode selection was first performed for each task separately, then electrodes that were commonly selected were further analyzed. For the analysis of the DAF experiment, one-way ANOVA was calculated using the average neural response as a dependent variable and feedback delay as a factor to assess the statistical significance of response enhancement in a single electrode.

Experiment Setup

Participants were tested while resting in their hospital bed in the epilepsy-monitoring unit. Visual stimuli were presented on a laptop screen positioned at a comfortable distance from the participant. Auditory stimuli were presented through speakers in the AWR and VWR experiments and through earphones (Bed Phones On-Ear Sleep Headphones Generation 3) in the DAF and in the VWR-AF experiment. Participants were instructed to speak at a normal voice level and sidetone volume was adjusted to a comfortable level at the beginning of the DAF experiment. DAF and VWR-AF experiments were performed consecutively and sidetone volume was kept the same in the two experiments. Participants’ voice was recorded using an external microphone (Zoom H1 Handy Recorder). A TTL pulse marking the onset of a stimulus, the microphone signal (what the participant spoke) and the feedback voice signal (what the participant heard) were fed in to the EEG amplifier as an auxiliary input in order to acquire them in sync with EEG samples. Sound files recorded by the external microphone were used for voice intensity analysis. Average voice intensity for each trial was calculated in dB using the ‘Intensity’ object in Praat software (Boersma 2001).

Electrode Localization

Electrode localization in individual space as well as MNI space was based on co-registering a preoperative (no electrodes) and postoperative (with electrodes) structural MRI (in some cases a postoperative CT was employed depending on clinical requirements) using a rigid-body transformation. Electrodes were then projected to the surface of cortex (preoperative segmented surface) to correct for edema induced shifts following previous procedures (Yang, Wang et al. 2012) (registration to MNI space was based on a non-linear DARTEL algorithm (Ashburner 2007). Within participant anatomical locations of electrodes was based on the automated FreeSurfer segmentation of the participant’s pre-operative MRI. We recorded from a total of 3591 subdural and 1361 depth electrode contacts in 35 participants. Subdural electrode coverage extended over lateral temporal, frontal, parietal and lateral occipital cortices. Depth electrodes covered additional regions to a limited extent including the transverse temporal gyrus, insula and fusiform gyrus. Contacts that were localized to the cortical white matter were excluded from the analysis. To categorize electrodes in the STG into anterior and posterior groups, lateral termination of the transverse temporal sulcus was used as an anatomical landmark (Greenlee, Jackson et al. 2011, Nourski, Steinschneider et al. 2016).

Neural Data Analysis

Electrodes with epileptiform activity or artifacts caused by line noise, poor contact with cortex and high amplitude shifts were removed from further analysis. A common average reference was calculated by subtracting the average signal across all electrodes from each individual electrode’s signal (after rejection of electrodes with artifacts). The analysis of the electrophysiologic signals focused on changes in broadband high gamma activity (70–150 Hz). To quantify changes in the high gamma range, the data were bandpass filtered between 70 and 150 Hz, and then a Hilbert transform was applied to obtain the analytic amplitude.

Recordings from the DAF and VWR-AF experiments were analyzed using the multitaper technique, which yields a more sensitive estimate of the power spectrum with lower variance, thus is more beneficial when comparing neural responses to incremental changes in stimuli. Continuous data streams from each channel were epoched into trials (from −1.5 s to 3.5 s with respect to speech onset). Line noise at 60, 120 and 180 Hz were filtered out. 3 Slepian tapers were applied in timesteps of 10 ms and frequency steps of 5 Hz, using temporal smoothing (tw) of 200 ms and frequency smoothing (fw) of ±10 Hz. Tapered signals were then transformed to time-frequency space using discrete Fourier transform and power estimates from different tapers were combined (MATLAB, FieldTrip toolbox). The number of tapers (K) were determined by the Shannon number according to the formula: K=2*tw*fw-1 (Percival and Walden 1993). The high gamma broadband response (70-150 Hz) at each time point following stimulus onset was measured as the percent signal change from baseline, with the baseline calculated over all trials in a time window from −500 to −100 ms before stimulus onset.

Suppression Index (SuppI) Calculation

Suppression of neural activity is measured by comparing responses in two time periods in the AWR task. First time period was during listening the stimulus (0-0.5 s) and the second time period was during speaking (0-0.5 s). For each trial, average responses over Listen and Speak periods were found and suppression was measured by calculating Listen-Speak/Listen+Speak. Then suppression values were averaged across trials to calculate a single suppression index for each electrode. For the neural activity, raw high gamma broadband signal power was used instead of the percent signal change to ensure that the suppression index values varied between −1 to 1, indicating a range from complete enhancement to complete suppression respectively.

Sensitivity Index (SensI) Calculation

Sensitivity to DAF is measured by comparing neural responses to increasing amounts of feedback delay. Neural responses in each trial were averaged in a time period following the voice feedback (0-0.5 s). For each electrode, a sensitivity index was calculated by measuring the trial-by-trial Spearman correlation between the delay condition and the averaged neural response. A large sensitivity value indicated a strong response enhancement with increasing delays.

Results

In order to assess cortical responses during perception and production of speech, and quantify speech-induced auditory suppression, participants (N = 35) performed an auditory word repetition (AWR) task. We examined the response patterns in seven different cortical regions including superior temporal gyrus (STG), middle temporal gyrus (MTG), supramarginal gyrus (SMG), inferior frontal gyrus (IFG), middle frontal gyrus (MFG), precentral gyrus (preCG) and postcentral gyrus (postCG) (Fig 1A). As an index of the neural response, we used the high gamma broadband signal (70-150 Hz, see Methods), which correlates with the spiking activity of the underlying neuronal population (Mukamel, Gelbard et al. 2005, Crone, Sinai et al. 2006, Cardin, Carlen et al. 2009, Ray and Maunsell 2011, Lachaux, Axmacher et al. 2012).

Cortical responses during speech tasks.
A. Electrodes from all participants (n = 35) are shown on a template brain with different colors corresponding to different regions (number of electrodes in each region denoted in the parentheses). B. High gamma broadband responses (70-150 Hz) for individual trials in an Auditory Word Repetition task are shown for each region. C. High gamma responses for individual trials in a Visual Word Reading task are shown for each region. Trials are sorted with respect to speech onset (white line). D. Mean high gamma broadband response averaged across trials are shown for each region with the width representing the standard error of the mean across electrodes.

We analyzed the responses in two different time windows: During passive listening of the auditory stimulus (0-500 ms after stimulus onset) and during speaking when participants repeated the perceived auditory stimulus (0-500 ms after articulation onset). Average responses were larger during passive listening in STG (Average % signal change ± SEM; Listen: 62.1±0.6, Speak: 29.8±0.4), MTG (32.7±0.9, 22.3±0.9) and SMG (27.4±0.8, 25.8±0.7) compared with speaking. Conversely, responses were larger during speaking in IFG (29.2±1.3, 31.2±1.3), MFG (28.3±1.6, 31.4±1.3), preCG (27.4±0.4, 37±0.5) and postCG (26±0.4, 42±0.5). These results suggested that auditory regions responded more strongly during passive listening compared to speaking, verifying previous reports of neural response suppression to self-generated speech in auditory cortex (Fig 1B-D).

In the AWR task, participants heard the same auditory stimulus twice in each trial, once from a recorded female voice and once from their own voice. It is well known that repeated presentation of a stimulus results in the suppression of neural activity in regions that process that stimulus, a neural adaptation phenomenon referred to as repetition suppression (Grill-Spector, Henson et al. 2006, Todorovic and de Lange 2012). To ensure that our observed suppression of neural activity in auditory regions was not due to repetition suppression, but rather was induced by speech production, we performed a visual word reading (VWR) task, in which participants hear the auditory stimulus only once (from their own voice). Response magnitudes during speaking in the AWR and VWR tasks were similar (paired t-test: t (466) = 0.62, p = 0.53), characterized by a strong correlation across electrodes (Pearson’s Correlation: r = 0.9006, p = 0). These results suggested that repetition of the auditory stimulus in the AWR task did not affect response magnitudes and the observed reduction in response magnitudes was induced by speech production.

To quantify the amount of speech-induced suppression, we calculated a Suppression Index (SuppI) for each electrode by comparing neural responses during listening versus speaking in the AWR task (SuppI = Listen-Speak/Listen+Speak; see Methods). A positive SuppI indicated a response suppression during speaking compared to listening and was observed most strongly in middle to posterior parts of STG, followed by MTG and SMG. A negative SuppI indicated a response enhancement during speaking compared to listening and was observed in motor regions, most strongly in the postCG (Fig 2A-B).

Spatial topography of speech-induced auditory suppression.
A. Suppression indices for all electrodes are shown on a template brain. Red color tones indicate smaller neural activity during speaking, while blue electrodes indicate larger neural activity during speaking compared to listening in the Auditory Word Repetition task. B. Suppression indices averaged across electrodes are shown for each region sorted from largest to smallest mean suppression index. Boxplots indicate mean ± SD.

After mapping the topographical distribution of suppression indices across the cortex, we focused on understanding the functional role of auditory suppression in speech monitoring. We hypothesized that the degree of speech-induced auditory suppression should be tightly linked to sensitivity to speech errors, as predicted by current models (Houde and Nagarajan 2011, Tourville and Guenther 2011) and neural data in non-human primates (Eliades and Wang 2008). To test this hypothesis, we used an additional task, in which we delayed the auditory feedback (DAF) during speech production to disrupt speech fluency. In this task, 14 participants repeated the VWR task while they were presented with their voice feedback through earphones either simultaneously (no-delay) or with a delay (50, 100 and 200 ms; see Methods). In a previous study (Ozker, Doyle et al. 2022), using the same data set, we demonstrated that participants slowed down their speech in response to DAF (Articulation duration; DAF₀: 0.698, DAF₅₀: 0.726, DAF_100: 0.737, and DAF_200: 0.749 milliseconds). Moreover, auditory regions exhibited an enhanced response that varied as a function of feedback delay, likely representing an auditory error signal encoding the mismatch between the expected and the actual feedback. However, those results were not directly linked to auditory suppression.

Here, we compared neural responses in the AWR and the DAF tasks to test whether auditory regions that exhibit strong speech-induced suppression also exhibit large auditory error responses to DAF, which would indicate strong sensitivity to speech errors. In a single participant, we demonstrated that a representative electrode on the STG with strong auditory suppression (Average % signal change in 0-500 ms; Listen: 124±7, Speak: 20±3, SuppI: 0.27) exhibited significant response enhancement (DAF₀: 135±12, DAF₅₀: 134±8, DAF₁₀₀: 175±10, DAF₂₀₀: 208±17, ANOVA: F (3, 116) = 8.5, p = 3.7e-05) (Fig 3A-B), while a nearby electrode with weaker auditory suppression (Listen: 116±6, Speak: 80±4, SuppI: 0.06) did not exhibit significant response enhancement with feedback delays (DAF₀: 360±29, DAF₅₀: 328±24, DAF₁₀₀: 379±31, DAF₂₀₀: 419±30, ANOVA: F (3, 116) = 1.73, p = 0.16) (Fig 3C-D).

Speech-induced auditory suppression and sensitivity to delayed auditory feedback in representative electrodes in a single participant.
A. High gamma broadband response (70-150 Hz) in electrode G63 showing a large amount of auditory suppression during speaking words compared to listening to the same words. Error bars indicate SEM over trials. B. High gamma responses in electrode G63 to articulation of words with DAF. 0 seconds indicate the onset of the perceived auditory feedback. Inset figure shows the cortical surface model of the left hemisphere brain of a single participant. Black circles indicate the implanted electrodes. White highlighted electrodes are located on the middle (G63) and caudal (G54) STG. C. High gamma response in electrode G54 showing a small degree of auditory suppression during speaking words compared to listening. D. High gamma response in electrode G54 locked to articulation of words during DAF. 0 seconds indicate the onset of the perceived auditory feedback.

To quantify the auditory error response and measure the sensitivity of a cortical region to DAF, we calculated a Sensitivity Index (SensI) for each electrode by correlating the delay condition and the average neural response across trials (see Methods). A large SensI indicated a strong response enhancement (large auditory error response) with increasing delays. The degree of both speech-induced suppression and sensitivity to DAF were highly variable across the cortex, SuppI ranging from −0.46 to 0.53 and SensI ranging from −0.62 to 0.70. The largest suppression and sensitivity indices as well as a strong overlap between the two measures were observed in the STG, suggesting that auditory electrodes that show speech-induced suppression are also sensitive to auditory feedback perturbations (Fig 4A-C). We validated this relationship by revealing a significant correlation between suppression and sensitivity indices of auditory electrodes (n = 57, Pearson’s Correlation: r = 0.4006, p = 0.002) supporting our hypothesis and providing evidence for a common neural mechanism (Fig 4D).

Correlation between speech-induced auditory suppression and sensitivity to delayed auditory feedback.
A. Sensitivity indices for all electrodes are shown on a template brain (both right and left hemisphere electrodes were shown on the left hemisphere). Red tones indicate larger neural activity to increasing amount of delays in the Delayed Auditory Feedback task, while blue tones indicate the opposite. B. Suppression indices for all electrodes are shown on a template brain. Red tones indicate larger neural activity during listening compared to speaking in the Auditory Word Repetition task, while blue tones indicate the opposite. C. Electrodes that show either sensitivity to delayed auditory feedback (positive SensI value) or speech-induced auditory suppression (positive SuppI value), or both are shown on a template brain. D. Scatter plot and fitted regression showing a significant correlation between sensitivity to DAF and speech-induced auditory suppression across auditory electrodes. Each circle represents an electrode’s sensitivity and suppression index.

Our neural analysis revealed that response magnitudes in auditory cortex were much larger when participants heard their simultaneous voice feedback in a DAF paradigm compared with producing speech without any feedback (DAF₀: no-delay trials) (Average % signal change in 0-500 ms; DAF₀: 113±14, VWR: 41±7, compare gray lines in Fig 3A and C with black lines in Fig3B and D, respectively). We were interested in dissociating if these larger responses were merely an effect of perceiving voice feedback through earphones instead of air or rather were specific to our DAF design, likely due to increased attentional demands. Therefore, 4 participants performed an additional visual word reading task in which they were presented with their simultaneous voice feedback through earphones (VWR-AF). As previous studies have reported that DAF can increase voice intensity (Yates 1963, Howell and Archer 1984), we first verified whether participants spoke louder during the DAF task. A comparison of their voice intensity between DAF₀ (no-delay trials in the DAF task) and the VWR-AF (standard word reading with simultaneous feedback through earphones) conditions did not show a significant difference (Voice intensity; DAF₀: 50±11 dB, VWR: 49±12 dB; paired t-test: t (118) = 1.8, p = 0.08). After verifying that the sound volume entering the auditory system is not statistically different in the two conditions, we compared the responses in the auditory cortex and found that overall response magnitudes were now on par across the two conditions (DAF₀: 89±17, VWR-AF: 82±17, Fig 5A). However, a detailed inspection of individual electrode responses revealed that some electrodes showed larger response to DAF₀, while others showed either larger responses to VWR-AF or similar responses to both conditions (Fig 5B). In a single participant, we demonstrated that adjacent electrodes in the STG that are only 5 mm apart exhibited completely different response patterns. Electrodes in the more posterior parts of STG showed larger responses to DAF₀, while electrodes in more anterior parts showed similar responses to DAF₀ and VWR-AF (Fig 5C). To determine an anatomical landmark at which the reversal of response patterns occurred in the STG, we used the lateral termination of the transverse temporal sulcus (TTS) (Greenlee, Jackson et al. 2011, Nourski, Steinschneider et al. 2016) based on the individual FreeSurfer segmentation of the participant’s pre-operative MRI. Across participants, this landmark corresponded to y coordinate = −22±2.

Effect of the delayed auditory feedback paradigm on neural responses during speech.
A. High gamma broadband responses (70-150 Hz) averaged across auditory electrodes are similar during no-delay condition in the delayed auditory feedback task (DAF₀) and during visual word reading with auditory feedback (VWR-AF). Error bars indicate SEM across electrodes. B. Scatter plot shows averaged high gamma responses (0-500 ms) for VWR-AF versus DAF₀ conditions for auditory electrodes. C. High gamma responses for DAF₀ and VWR-AF are shown in representative auditory electrodes in a single participant. Electrodes that are posteriorly located on the STG show larger responses to DAF₀ condition, while electrodes that are anteriorly located on the STG show similar responses to the two conditions. The lateral termination of the transverse temporal sulcus (TTS) is identified as a landmark (white zigzagged line) that separates the two different response patterns. D. High gamma responses for DAF₀ and VWR conditions were compared and resulting t-values are shown for all electrodes on a template brain. Pink color tones indicate larger responses to DAF₀, while green color tones indicate larger responses to VWR condition. E. T-values calculated by comparing responses to DAF₀ and VWR conditions are shown for all auditory electrodes with respect to their anterior-to-posterior positions to the TTS.

Next, we compared the response patterns in the two conditions for all electrodes across participants by calculating a t-value for each electrode (unpaired t-test: average responses from −200 to 500 ms). We demonstrated that auditory regions in posterior STG showed larger responses to DAF₀ condition, while frontal motor regions showed larger responses to VWR-AF (Fig 5D). Lastly, we examined STG electrodes alone, sorted by their anterior-to-posterior positions with respect to the TTS. In line with the results from the single participant, electrodes that were located posteriorly within a 1 cm distance from this anatomical landmark showed significantly larger responses to the DAF₀ condition (Fig 5E). These results suggest that posterior STG is more activated when participants are engaged in a speech production task that requires increased effort and attention.

Discussion

Our study provides a detailed topographical investigation of speech-induced auditory suppression in a large cohort of neurosurgical participants. We found that while the strongest auditory suppression was observed in the STG, the degree of suppression was highly variable across different recording sites. To explain this variability, we considered the functional role of auditory suppression in speech monitoring. We showed that delaying auditory feedback during speech production enhanced auditory responses in the STG. The degree of sensitivity to feedback delays was also variable across different recording sites. We found a significant correlation between speech-induced suppression and feedback sensitivity, providing evidence for a shared mechanism between auditory suppression and speech monitoring. While there was no anatomical organization for auditory suppression and feedback sensitivity in the STG, we found an anterior-posterior organization for the effect of attention on feedback sensitivity. Auditory sites that lie posterior to the lateral termination of the TTS in the STG showed stronger activation during the DAF task compared to a standard word reading task, even for trials in which participants received simultaneous feedback, demonstrating attentional modulation of feedback sensitivity.

We observed the strongest speech-induced suppression in the middle and posterior parts of the STG. In line with previous iEEG studies, we found that degree of suppression was variable across different recording sites in the STG without any anatomical organization (Flinker, Chang et al. 2010, Greenlee, Jackson et al. 2011, Nourski, Steinschneider et al. 2016). So far, a clear gradient for speech-induced suppression has never been reported in the STG but only in the Heschl’s gyrus (HG) and superior temporal sulcus (STS) by studies that used comprehensive depth electrode coverage within the temporal lobe (Nourski, Steinschneider et al. 2016, Nourski, Steinschneider et al. 2021).

We found only a few sites with speech-induced enhancement and several sites with no response change. Based on single unit recordings in non-human primates, it is known that majority of neurons in the non-core auditory cortex exhibits suppression, while a smaller group exhibits excitation during vocalization. It is difficult to isolate speech-induced enhancement in human studies, because measurements reflect the average response of the underlying neural population, which is dominated by suppressed responses. A previous non-human primate study suggested that there might be a division of labor between the suppressed and excited neurons. They showed that when an external auditory stimulus is presented concurrently during vocalization, neurons that showed vocalization-induced suppression did not respond to the external stimulus. In contrary, neurons that showed vocalization-induced excitation responded even more when external stimulus is concurrently presented during vocalization, suggesting a role in maintaining sensitivity to the external acoustic environment (Eliades and Wang 2003). In humans there might be a similar division of labor between auditory sites that were suppressed and non-suppressed, such that while suppressed sites are engaged in monitoring self-generated sounds, non-suppressed sites maintain sensitivity to external sounds. But unfortunately, our study did not include the necessary experimental conditions to directly test this hypothesis.

Our broad topographical search using subdural electrodes revealed additional sites outside the canonical auditory regions in the STG that showed speech-induced suppression, mainly in the MTG, and a few others in the SMG and preCG. Sensorimotor regions in the preCG including inferior frontal and premotor cortices are known to activate during passive listening tasks (Wilson, Saygin et al. 2004, Pulvermuller, Huss et al. 2006, Cogan, Thesen et al. 2014), and show tuning to different acoustic properties of speech similar to the auditory regions in the STG (Mesgarani, Cheung et al. 2014, Cheung, Hamiton et al. 2016). Our results showed that isolated sites in these frontal motor regions were sensitive to DAF, confirming their auditory properties and suggesting their involvement in speech monitoring.

Current models of speech motor control predicted a shared mechanism between auditory suppression and sensitivity to speech errors suggesting a role for auditory suppression in speech monitoring (Houde and Nagarajan 2011, Tourville and Guenther 2011). Behavioral evidence in human studies showed that when auditory feedback is delayed in real time, speakers attempt to reset or slow down their speech (Lee 1950, Fairbanks 1955, Stuart, Kalinowski et al. 2002). Similarly, when fundamental frequency (pitch) or formant frequencies of the voice are shifted, speakers change their vocal output in the opposite direction of the shift to compensate for the spectral perturbation (Houde and Jordan 1998, Jones and Munhall 2000, Niziolek and Guenther 2013). Neurosurgical recordings and neuroimaging studies that investigate the brain mechanism of auditory feedback processing demonstrated that these feedback-induced vocal adjustments are accompanied by enhanced neural responses in various auditory regions (Tourville, Reilly et al. 2008, Behroozmand, Karvelis et al. 2009, Behroozmand, Shebek et al. 2015, Ozker, Doyle et al. 2022). However, it has not been clear whether it is the same or different neural populations that exhibit speech-induced suppression and enhanced responses to auditory feedback perturbations. Only in a non-human primate study, which recorded single-unit activity in auditory neurons of marmoset monkeys, it was shown that neurons that were suppressed during vocalization exhibited increased activity during frequency-shifted feedback (Eliades and Wang 2008). In contrast, in an attempt to replicate this finding in humans, a previous iEEG study that used frequency-shifted feedback during production of a vowel showed that majority of suppressed auditory sites did not overlap with sites that are sensitive to feedback alterations (Chang, Niziolek et al. 2013). Using DAF instead of frequency-shifted feedback, we demonstrated a wide overlap of the two neural populations in the STG as well as a significant correlation between the degree of speech-induced suppression and sensitivity to auditory feedback. It is possible that a larger auditory neural population in the STG is highly sensitive to temporal rather than spectral perturbation of the auditory feedback.

Forward models of speech production suggest that a mismatch between the predicted and the actual auditory feedback is encoded by a response enhancement in the auditory cortex signifying an error signal (Houde and Nagarajan 2011, Tourville and Guenther 2011, Hickok 2012). Our results suggested that attention to one’s own speech stream during adverse speaking conditions, such as during an auditory feedback perturbations task, might also contribute to the response enhancement in the auditory cortex. Auditory feedback control of speech was thought to be involuntary and not subject to attentional control, because several previous studies showed that participants produced compensatory responses to pitch shifts even when they were told to ignore feedback perturbations (Munhall, MacDonald et al. 2009, Zarate, Wood et al. 2010, Keough, Hawco et al. 2013). However, prolonging pitch shift duration resulted in an early vocal response that opposes the pitch shift direction and a later vocal response that follows the pitch shift direction suggesting an interplay between reflexive and top-down processes in controlling voice pitch (Hain, Burnett et al. 2000, Burnett and Larson 2002). More recent EEG studies demonstrated that dividing attention between auditory feedback and additional visual stimuli or increasing the attentional load of the task affected vocal responses as well as the magnitude of ERP components, suggesting that attention modulates auditory feedback control on both a behavioral and a cortical level (Tumber, Scheerer et al. 2014, Hu, Liu et al. 2015, Liu, Hu et al. 2015, Liu, Fan et al. 2018). In our study, we found that neural responses in the posterior STG were larger for DAF₀ (randomly presented simultaneous feedback condition in the DAF task) as compared with the VWR-AF condition (consistent simultaneous feedback throughout standard word reading task), even though participants displayed similar vocal behavior in these two conditions. In light of the previous literature, we interpret these response differences as arising from an attentional load difference between the two tasks. In the DAF experiment, the auditory feedback was not consistent since no-delay trials were randomized with delay trials. This randomized structure of the paradigm with interleaved long delay trials (causing slowed speech) required conscious effort for speech-monitoring and thus sustained attention. While remaining cautious about this interpretation and our study’s limitation in attentional controls, we believe that this response enhancement represents an increased neural gain driven by attention to auditory feedback (Hillyard, Vogel et al. 1998), and highlights the critical role of the posterior STG in auditory-motor integration during speech monitoring (Hickok and Poeppel 2000), with its close proximity to the human ventral attention network comprising temporoparietal junction (TPJ) (Vossel, Geng et al. 2014). We leave it to future studies to include additional conditions to manipulate the direction and load of attention to further validate the influence of attention on speech monitoring.

Funding Information

This study was supported by grants from the NIH (F32 DC018200 to M.O. and R01NS109367, R01DC018805, R01NS115929 to A.F.) and the NSF (CRCNS 1912286 to A.F.) and by the Leon Levy Foundation Fellowship (to M.O.). Open access funding is provided by Max Planck Society.

References

1. Ashburner J.
2007A fast diffeomorphic image registration algorithmNeuroimage 38:95–113Google Scholar
1. Behroozmand R.
2. Karvelis L.
3. Liu H.
4. Larson C. R.
2009Vocalization-induced enhancement of the auditory cortex responsiveness during voice F0 feedback perturbationClin Neurophysiol 120:1303–1312Google Scholar
1. Behroozmand R.
2. Shebek R.
3. Hansen D. R.
4. Oya H.
5. Robin D. A.
6. Howard M. A.
7. Greenlee J. D.
2015Sensory-motor networks involved in speech production and motor control: an fMRI studyNeuroimage 109:418–428Google Scholar
1. Boersma P.
2001Praat, a system for doing phonetics by computerGlot International 5:9/10:341–345Google Scholar
1. Burnett T. A.
2. Larson C. R.
2002Early pitch-shift response is active in both steady and dynamic voice pitch controlThe Journal of the Acoustical Society of America 112:1058–1063Google Scholar
1. Cardin J. A.
2. Carlen M.
3. Meletis K.
4. Knoblich U.
5. Zhang F.
6. Deisseroth K.
7. Tsai L. H.
8. Moore C. I.
2009Driving fast-spiking cells induces gamma rhythm and controls sensory responsesNature 459:663–667Google Scholar
1. Chang E. F.
2. Niziolek C. A.
3. Knight R. T.
4. Nagarajan S. S.
5. Houde J. F.
2013Human cortical sensorimotor network underlying feedback control of vocal pitchProc Natl Acad Sci U S A 110:2653–2658Google Scholar
1. Cheung C.
2. Hamiton L. S.
3. Johnson K.
4. Chang E. F.
2016The auditory representation of speech sounds in human motor cortexElife 5Google Scholar
1. Christoffels I. K.
2. Formisano E.
3. Schiller N. O.
2007Neural correlates of verbal feedback processing: an fMRI study employing overt speechHum Brain Mapp 28:868–879Google Scholar
1. Cogan G. B.
2. Thesen T.
3. Carlson C.
4. Doyle W.
5. Devinsky O.
6. Pesaran B.
2014Sensory-motor transformations for speech occur bilaterallyNature 507:94–98Google Scholar
1. Crapse T. B.
2. Sommer M. A.
2008Corollary discharge across the animal kingdomNature Reviews Neuroscience 9:587–600Google Scholar
1. Creutzfeldt O.
2. Ojemann G.
1989Neuronal activity in the human lateral temporal lobe. III. Activity changes during musicExp Brain Res 77:490–498Google Scholar
1. Crone N. E.
2. Sinai A.
3. Korzeniewska A.
2006High-frequency gamma oscillations and human brain mapping with electrocorticographyProg Brain Res 159:275–295Google Scholar
1. Curio G.
2. Neuloh G.
3. Numminen J.
4. Jousmaki V.
5. Hari R.
2000Speaking modifies voice-evoked activity in the human auditory cortexHum Brain Mapp 9:183–191Google Scholar
1. E. Vonholstan D.
2. Glenn H. M.
3. Mittelstaedt
2011The Principle of Reafference : Interactions Between the Central Nervous System and the Peripheral OrgansGoogle Scholar
1. Eliades S. J.
2. Wang X.
2003Sensory-motor interaction in the primate auditory cortex during self-initiated vocalizationsJournal of neurophysiology 89:2194–2207Google Scholar
1. Eliades S. J.
2. Wang X.
2008Neural substrates of vocalization feedback monitoring in primate auditory cortexNature 453:1102–1106Google Scholar
1. Fairbanks G.
1955Selective vocal effects of delayed auditory feedbackJournal of speech and hearing disorders 20:333–346Google Scholar
1. Flinker A.
2. Chang E. F.
3. Kirsch H. E.
4. Barbaro N. M.
5. Crone N. E.
6. Knight R. T.
2010Single-trial speech suppression of auditory cortex activity in humansJournal of Neuroscience 30:16643–16650Google Scholar
1. Ford J. M.
2. Roach B. J.
3. Mathalon D. H.
2010Assessing corollary discharge in humans using noninvasive neurophysiological methodsNat Protoc 5:1160–1168Google Scholar
1. Golumbic E. M. Z.
2. Ding N.
3. Bickel S.
4. Lakatos P.
5. Schevon C. A.
6. McKhann G. M.
7. Goodman R. R.
8. Emerson R.
9. Mehta A. D.
10. Simon J. Z.
2013Mechanisms underlying selective neuronal tracking of attended speech at a “cocktail party”Neuron 77:980–991Google Scholar
1. Greenlee J. D.
2. Behroozmand R.
3. Larson C. R.
4. Jackson A. W.
5. Chen F.
6. Hansen D. R.
7. Oya H.
8. Kawasaki H.
9. Howard M. A.
2013Sensory-motor interactions for vocal pitch monitoring in non-primary human auditory cortexPLoS One 8:e60783Google Scholar
1. Greenlee J. D.
2. Jackson A. W.
3. Chen F.
4. Larson C. R.
5. Oya H.
6. Kawasaki H.
7. Chen H.
8. Howard M. A.
2011Human auditory cortical activation during self-vocalizationPloS one 6:e14744Google Scholar
1. Grill-Spector K.
2. Henson R.
3. Martin A.
2006Repetition and the brain: neural models of stimulus-specific effectsTrends Cogn Sci 10:14–23Google Scholar
1. Hain T. C.
2. Burnett T. A.
3. Kiran S.
4. Larson C. R.
5. Singh S.
6. Kenney M. K.
2000Instructing subjects to make a voluntary response reveals the presence of two components to the audio-vocal reflexExperimental Brain Research 130:133–141Google Scholar
1. Hashimoto Y.
2. Sakai K. L.
2003Brain activations during conscious self-monitoring of speech production with delayed auditory feedback: an fMRI studyHum Brain Mapp 20:22–28Google Scholar
1. Hickok G.
2012Computational neuroanatomy of speech productionNat Rev Neurosci 13:135–145Google Scholar
1. Hickok G.
2. Houde J.
3. Rong F.
2011Sensorimotor integration in speech processing: computational basis and neural organizationNeuron 69:407–422Google Scholar
1. Hickok G.
2. Poeppel D.
2000Towards a functional neuroanatomy of speech perceptionTrends Cogn Sci 4:131–138Google Scholar
1. Hillyard S. A.
2. Vogel E. K.
3. Luck S. J.
1998Sensory gain control (amplification) as a mechanism of selective attention: electrophysiological and neuroimaging evidencePhilos Trans R Soc Lond B Biol Sci 353:1257–1270Google Scholar
1. Houde J. F.
2. Jordan M. I.
1998Sensorimotor adaptation in speech productionScience 279:1213–1216Google Scholar
1. Houde J. F.
2. Nagarajan S. S.
2011Speech production as state feedback controlFrontiers in human neuroscience 5:82Google Scholar
1. Houde J. F.
2. Nagarajan S. S.
3. Sekihara K.
4. Merzenich M. M.
2002Modulation of the auditory cortex during speech: an MEG studyJ Cogn Neurosci 14:1125–1138Google Scholar
1. Howell P.
2. Archer A.
1984Susceptibility to the effects of delayed auditory feedbackPercept Psychophys 36:296–302Google Scholar
1. Hu H.
2. Liu Y.
3. Guo Z.
4. Li W.
5. Liu P.
6. Chen S.
7. Liu H.
2015Attention modulates cortical processing of pitch feedback errors in voice controlScientific Reports 5:1–8Google Scholar
1. Jones J. A.
2. Munhall K. G.
2000Perceptual calibration of F0 production: evidence from feedback perturbationJ Acoust Soc Am 108:1246–1251Google Scholar
1. Keough D.
2. Hawco C.
3. Jones J. A.
2013Auditory-motor adaptation to frequency-altered auditory feedback occurs when participants ignore feedbackBMC Neuroscience 14:25Google Scholar
1. Kort N. S.
2. Nagarajan S. S.
3. Houde J. F.
2014A bilateral cortical network responds to pitch perturbations in speech feedbackNeuroimage 86:525–535Google Scholar
1. Lachaux J. P.
2. Axmacher N.
3. Mormann F.
4. Halgren E.
5. Crone N. E.
2012High-frequency neural activity and human cognition: past, present and possible future of intracranial EEG researchProg Neurobiol 98:279–301Google Scholar
1. Lee B. S.
1950Effects of delayed speech feedbackThe Journal of the Acoustical Society of America 22:824–826Google Scholar
1. Liu Y.
2. Fan H.
3. Li J.
4. Jones J. A.
5. Liu P.
6. Zhang B.
7. Liu H.
2018Auditory-motor control of vocal production during divided attention: behavioral and ERP correlatesFrontiers in Neuroscience 12:113Google Scholar
1. Liu Y.
2. Hu H.
3. Jones J. A.
4. Guo Z.
5. Li W.
6. Chen X.
7. Liu P.
8. Liu H.
2015Selective and divided attention modulates auditory-vocal integration in the processing of pitch feedback errorsEur J Neurosci 42:1895–1904Google Scholar
1. Mesgarani N.
2. Chang E. F.
2012Selective cortical representation of attended speaker in multi-talker speech perceptionNature 485:233–236Google Scholar
1. Mesgarani N.
2. Cheung C.
3. Johnson K.
4. Chang E. F.
2014Phonetic feature encoding in human superior temporal gyrusScience 343:1006–1010Google Scholar
1. Mukamel R.
2. Gelbard H.
3. Arieli A.
4. Hasson U.
5. Fried I.
6. Malach R.
2005Coupling between neuronal firing, field potentials, and FMRI in human auditory cortexScience 309:951–954Google Scholar
1. Munhall K. G.
2. MacDonald E. N.
3. Byrne S. K.
4. Johnsrude I.
2009Talkers alter vowel production in response to real-time formant perturbation even when instructed not to compensateThe Journal of the Acoustical Society of America 125:384–390Google Scholar
1. Niziolek C. A.
2. Guenther F. H.
2013Vowel category boundaries enhance cortical and behavioral responses to speech feedback alterationsJ Neurosci 33:12090–12098Google Scholar
1. Niziolek C. A.
2. Nagarajan S. S.
3. Houde J. F.
2013What does motor efference copy represent? Evidence from speech productionJournal of Neuroscience 33:16110–16116Google Scholar
1. Nourski K. V.
2. Steinschneider M.
3. Rhone A. E.
2016Electrocorticographic Activation within Human Auditory Cortex during Dialog-Based Language and Cognitive TestingFront Hum Neurosci 10:202Google Scholar
1. Nourski K. V.
2. Steinschneider M.
3. Rhone A. E.
4. Kovach C. K.
5. Banks M. I.
6. Krause B. M.
7. Kawasaki H.
8. Howard M. A.
2021Electrophysiology of the Human Superior Temporal Sulcus during Speech ProcessingCereb Cortex 31:1131–1148Google Scholar
1. Numminen J.
2. Salmelin R.
3. Hari R.
1999Subject’s own speech reduces reactivity of the human auditory cortexNeurosci Lett 265:119–122Google Scholar
1. Ozker M.
2. Doyle W.
3. Devinsky O.
4. Flinker A.
2022A cortical network processes auditory error signals during human speech production to maintain fluencyPLoS Biol 20:e3001493Google Scholar
1. Percival D. B.
2. Walden A. T.
1993Spectral analysis for physical applicationscambridge university press Google Scholar
1. Poulet J. F.
2. Hedwig B.
2002A corollary discharge maintains auditory sensitivity during sound productionNature 418:872–876Google Scholar
1. Poulet J. F.
2. Hedwig B.
2006The cellular basis of a corollary dischargeScience 311:518–522Google Scholar
1. Pulvermuller F.
2. Huss M.
3. Kherif F.
4. Moscoso del Prado Martin F.
5. Hauk O.
6. Shtyrov Y.
2006Motor cortex maps articulatory features of speech soundsProc Natl Acad Sci U S A 103:7865–7870Google Scholar
1. Ray S.
2. Maunsell J. H.
2011Different origins of gamma rhythm and high-gamma activity in macaque visual cortexPLoS Biol 9:e1000610Google Scholar
1. Rossion B.
2. Pourtois G.
2004Revisiting Snodgrass and Vanderwart’s Object Pictorial Set: The Role of Surface Detail in Basic-Level Object RecognitionPerception 33:217–236Google Scholar
1. Schneider D. M.
2. Mooney R.
2018How movement modulates hearingAnnual review of neuroscience 41:553–572Google Scholar
1. Shum J.
2. Fanda L.
3. Dugan P.
4. Doyle W. K.
5. Devinsky O.
6. Flinker A.
2020Neural correlates of sign language production revealed by electrocorticographyNeurology 95:e2880–e2889Google Scholar
1. Stuart A.
2. Kalinowski J.
3. Rastatter M. P.
4. Lynch K.
2002Effect of delayed auditory feedback on normal speakers at two speech ratesJ Acoust Soc Am 111:2237–2241Google Scholar
1. Todorovic A.
2. de Lange F. P.
2012Repetition suppression and expectation suppression are dissociable in time in early auditory evoked fieldsJ Neurosci 32:13389–13395Google Scholar
1. Tourville J. A.
2. Guenther F. H.
2011The DIVA model: A neural theory of speech acquisition and productionLang Cogn Process 26:952–981Google Scholar
1. Tourville J. A.
2. Reilly K. J.
3. Guenther F. H.
2008Neural mechanisms underlying auditory feedback control of speechNeuroimage 39:1429–1443Google Scholar
1. Tumber A. K.
2. Scheerer N. E.
3. Jones J. A.
2014Attentional demands influence vocal compensations to pitch errors heard in auditory feedbackPLoS One 9:e109968Google Scholar
1. Vossel S.
2. Geng J. J.
3. Fink G. R.
2014Dorsal and ventral attention systems: distinct neural circuits but collaborative rolesNeuroscientist 20:150–159Google Scholar
1. Wilson S. M.
2. Saygin A. P.
3. Sereno M. I.
4. Iacoboni M.
2004Listening to speech activates motor areas involved in speech productionNat Neurosci 7:701–702Google Scholar
1. Wise R. J.
2. Greene J.
3. Buchel C.
4. Scott S. K.
1999Brain regions involved in articulationLancet 353:1057–1061Google Scholar
1. Yang A. I.
2. Wang X.
3. Doyle W. K.
4. Halgren E.
5. Carlson C.
6. Belcher T. L.
7. Cash S. S.
8. Devinsky O.
9. Thesen T.
2012Localization of dense intracranial electrode arrays using magnetic resonance imagingNeuroimage 63:157–165Google Scholar
1. Yates A. J.
1963Delayed auditory feedbackPsychol Bull 60:213–232Google Scholar
1. Zarate J. M.
2. Wood S.
3. Zatorre R. J.
2010Neural networks involved in voluntary and involuntary vocal pitch regulation in experienced singersNeuropsychologia 48:607–618Google Scholar

Article and author information

Author information

Muge Ozker
Neurology Department, New York University, New York, 10016, NY, USA, Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands
ORCID iD: 0000-0001-7472-4528
- Corresponding Author: Muge Ozker, e-mail:⠀muge.ozker-sertel@mpi.nl
Leyao Yu
Neurology Department, New York University, New York, 10016, NY, USA, Biomedical Engineering Department, New York University, Brooklyn, 11201, NY, USA
Patricia Dugan
Neurology Department, New York University, New York, 10016, NY, USA
Werner Doyle
Neurosurgery Department, New York University, New York, 10016, NY, USA
Daniel Friedman
Neurology Department, New York University, New York, 10016, NY, USA
Orrin Devinsky
Neurology Department, New York University, New York, 10016, NY, USA
Adeen Flinker
Neurology Department, New York University, New York, 10016, NY, USA, Biomedical Engineering Department, New York University, Brooklyn, 11201, NY, USA
ORCID iD: 0000-0003-1247-1283

Version history

Sent for peer review: December 8, 2023
Preprint posted: February 7, 2024
Reviewed Preprint version 1: February 26, 2024
Reviewed Preprint version 2: July 31, 2024
Version of Record published: September 10, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.94198. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 1,741
downloads: 100
citations: 14

Views, downloads and citations are aggregated across all versions of this paper published by eLife.