Abstract
Speech production and perception involve complex neural dynamics in the human brain. Using magnetoencephalography (MEG), our study explores the interaction between cortico-cortical and cortico-subcortical connectivities during these processes. Our connectivity findings during speaking revealed a significant connection from the right cerebellum to the left temporal areas in low frequencies, which displayed an opposite trend in high frequencies. Notably, high-frequency connectivity was absent during the listening condition. These findings underscore the vital roles of cortico-cortical and cortico-subcortical connections within the speech production and perception network. The results of our new study enhance our understanding of the complex dynamics of brain connectivity during speech processes, emphasizing the distinct frequency-based interactions between various brain regions.
Introduction
Human communication can be described as two dynamic systems (i.e., speaker and listener) that are coupled via sensory information (Silbert et al., 2014) and operate according to principles of active inference by minimising prediction errors (Friston and Frith, 2015). Within this framework, the speaker’s predictive processing model steers speech production, allowing adjustments in volume, speed, or articulation based on proprioceptive and auditory feedback. Similarly, the listener’s predictive processing model generates anticipations about upcoming speech which are continually updated and compared with incoming sensory data. The implementation of this model in the human brain was shown to be associated with brain rhythms (Abbasi et al., 2023; Arnal and Giraud, 2012; Park et al., 2015).
Brain rhythms have been extensively investigated during continuous speech perception. A consistent finding in these studies is the synchronization of frequency-specific brain activity with the rhythmic amplitude modulation in continuous speech (Giraud and Poeppel, 2012; Gross et al., 2013b). The exploration of the brain networks that underpin speech perception has also revealed a stronger causal influence from higher-order brain regions (such as the left inferior frontal gyrus and left motor areas) to the auditory cortex for intelligible speech compared to unintelligible speech (Park et al., 2015). In the domain of speech production, both invasive (Llorens et al., 2011; Ozker et al., 2022; Riès et al., 2017) and non-invasive (Ganushchak et al., 2011; Janssen et al., 2020) electrophysiological studies have significantly contributed, capitalizing on their superior temporal resolution compared to fMRI. However, invasive recordings offer only limited spatial coverage from specific recording sites in patients. Magnetoencephalography (MEG) has been used by several research teams to explore the intricate relationship between brain frequency-specific dynamics and speech production. Ruspantini and colleagues observed notable coherence between MEG activity originating from the motor cortex and EMG activity in lip muscles, with a peak frequency of approximately 2-3 Hz (Ruspantini et al., 2012). In a separate investigation, Alexandrou and colleagues identified high-frequency band modulation (60-90 Hz) within bilateral temporal, frontal, and parietal brain regions (Alexandrou et al., 2017). In our recent study, we also explored speech production and perception using MEG to map the cortical networks engaged during these two experimental conditions (Abbasi et al., 2023).
We reported significant connectivity from the motor area to STG in lower frequency bands (up to beta) during speaking, and the reverse pattern in high gamma frequencies (Abbasi et al., 2023).
Subcortical areas also contribute to updating the brain’s internal model during speech production and perception. Thalamic nuclei serve as central nodes for circuits associated with language processing, interacting with the frontal cortex, basal ganglia, cerebellum, and dopaminergic groups. Notably, the motor-related thalamic nuclei link the basal ganglia and cerebellum with the frontal cortex, contributing to both motor and cognitive aspects of language (Barbas et al., 2013; Silveri, 2021). Relatedly, previous studies have reported the cerebellum’s role in several functions such as speech production and perception (Giraud et al., 2007; Skipper and Lametti, 2021), movement coordination, timing, motor programming, speech motor control, and sensory prediction (Parrell et al., 2017). Lesion studies, neuroimaging investigations, and brain stimulation studies have consistently linked the cerebellum to critical aspects of timing and predictive processing in speech perception (Ackermann, 2008; Manto et al., 2012; Parrell et al., 2017). Temporo-cerebellar coupling has been suggested to support the continuous updating of internal models of the auditory environment (Kotz et al., 2014). In a more recent study, Stockert and colleagues reported bidirectional temporo-cerebellar anatomical connectivity (Stockert et al., 2021). The authors suggested that these anatomical temporo-cerebellar connections may support the encoding and modeling of rapidly modulated auditory spectro-temporal patterns.
However, to the best of our knowledge no study has so far investigated spectrally resolved, directed functional coupling between cortical and subcortical brain areas during listening and speaking. In this MEG study, we intend to investigate the specific frequency-related connections between both cortical and subcortical brain regions while participants engaged in speaking and listening tasks. Participants answered seven questions, each lasting 60 seconds (speaking condition), and also listened to recordings of their own speech from the previous speaking session (listening condition; see Methods for details). Initially, we identified the cortical and subcortical brain regions involved in speech perception and production through meta-analysis studies. Subsequently, we examined the brain network and directed connections within it to shed light on the cognitive processes underlying listening and speaking. This involved a direct examination of frequency-specific communication channels for both feedforward and feedback signaling during speech production and perception using multivariate Granger causality. Our connectivity findings confirmed the participation of subcortical regions such as the thalamus and cerebellum in both speech production and perception. We found that feedback signals, connecting the cerebellum to the auditory area, occur at slower rhythms (below 40 Hz), while feedforward signals (in the reverse direction) occur at faster rhythms (above 40 Hz).
Results
In this study, we investigated the connectivity between brain regions that are involved in speech production and perception. Our ROI-based analysis utilized the automatic meta-analysis provided by neurosynth.org using the term ‘speech’, which returned 642 fMRI studies highlighting the brain areas involved in speech production and perception. We extracted the MNI coordinates of active areas from the resulting neurosynth statistical map, yielding 14 parcels. For each voxel in a given parcel, the beamformer-derived time-series for all three source orientations were subjected to singular value decomposition (SVD). In every parcel, we used the three strongest components for further analysis. For estimating pairwise functional connectivity between all parcels (n=14), we used multivariate nonparametric Granger causality (mGC; Schaum et al., 2021) and related the results to the hallmarks of predictive processing models during continuous speaking and listening (see Fig. 1). The method uses three-dimensional representations of parcel activity and thus estimates connectivity more accurately than traditional one-dimensional representations (Chalas et al., 2022).
The calculation of mGC between two parcels yielded two spectra reflecting both directions (A->B and B->A). Next, we computed the directed asymmetry index (DAI) for each pair of spectra (Bastos et al., 2015). This measure captures the predominant direction of mGC between two parcels based on the relative difference between both directions (see ‘Methods’ for more information). A positive DAI indicates the dominant directionality from A toward B, while a negative DAI indicates the opposite directionality. First, we used group statistics to identify brain areas where DAI values in specific frequency bands in the speaking as well as listening conditions differed significantly from zero. We performed group statistics from 0 to 100 Hz. Next, we defined the following canonical frequency bands: Delta/Theta (0-7 Hz), alpha (7-13 Hz), beta (15-30 Hz), gamma (30-60 Hz), high gamma (60-90 Hz).
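The DAI computation and band averaging described above can be illustrated with a minimal sketch. It assumes the relative-difference form of the index, (A->B minus B->A) divided by their sum, and averages the DAI spectrum within each band using inclusive band edges. The spectra are made-up toy values, and the names `BANDS`, `dai`, and `band_mean` are illustrative rather than taken from the analysis scripts.

```python
# Canonical bands as listed in the text (Hz); the dict name is illustrative.
BANDS = {
    "delta/theta": (0, 7),
    "alpha": (7, 13),
    "beta": (15, 30),
    "gamma": (30, 60),
    "high gamma": (60, 90),
}


def dai(gc_ab, gc_ba):
    """Directed asymmetry index per frequency bin: (A->B - B->A) / (A->B + B->A)."""
    out = []
    for ab, ba in zip(gc_ab, gc_ba):
        total = ab + ba
        # Guard against empty bins; returning 0 there is an assumption of this sketch.
        out.append((ab - ba) / total if total > 0 else 0.0)
    return out


def band_mean(values, freqs, band):
    """Average values over the bins whose frequency lies in [lo, hi]."""
    lo, hi = band
    selected = [v for v, f in zip(values, freqs) if lo <= f <= hi]
    return sum(selected) / len(selected)


# Toy spectra on a 0-100 Hz grid in 1 Hz steps (made-up numbers).
freqs = list(range(0, 101))
gc_ab = [0.03 if f < 40 else 0.01 for f in freqs]  # A->B dominates below 40 Hz
gc_ba = [0.01 if f < 40 else 0.03 for f in freqs]  # B->A dominates from 40 Hz up

dai_spectrum = dai(gc_ab, gc_ba)
band_dai = {name: band_mean(dai_spectrum, freqs, band) for name, band in BANDS.items()}
```

With these toy spectra the band-averaged DAI is positive in the beta band (A toward B dominates) and negative in the high gamma band (B toward A dominates), mirroring the sign convention stated in the text.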
Using connectogram plots, we visualized the connectivity between the selected 14 ROIs and examined neural mechanisms associated with speech production and perception, as well as predictive processing models. The number of outgoing and incoming edges for each node was also calculated using the brain connectivity toolbox and represented as the node strength (Rubinov and Sporns, 2010). The upper row of Fig. 2 illustrates the strength for three different nodes in the speaking condition. In L-SCEF (pre-supplementary motor area) and R-CB6 (right cerebellum lobule VI), the number of outgoing edges in low frequencies is higher than the number of incoming edges. However, we observed a reverse pattern in high frequencies. Interestingly, in the left STG (L-A5), there are more incoming than outgoing connections in low frequencies, and this reverses in the higher frequencies, with more outgoing connections seen in this area. This result indicates that in low frequencies, the sensory-motor area and cerebellum predominantly transmit information, while in higher frequencies, they are more involved in receiving it. On the other hand, the left STG receives information in low frequencies and transmits it in high frequencies.
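The edge counting behind the node strength can be sketched as follows. This is a plain count over a list of directed edges, not the brain connectivity toolbox routine used in the study. The edge list is hypothetical and only reuses parcel labels from the text, and the function name `edge_counts` is an assumption of this sketch.

```python
def edge_counts(edges, node):
    """Count outgoing and incoming directed edges for one node.

    edges: iterable of (source, target) pairs, one per significant connection.
    """
    outgoing = sum(1 for src, _ in edges if src == node)
    incoming = sum(1 for _, dst in edges if dst == node)
    return outgoing, incoming


# Hypothetical significant low-frequency edges among four of the parcels.
low_freq_edges = [
    ("R-CB6", "L-A5"),
    ("L-SCEF", "L-A5"),
    ("L-3b", "L-A5"),
    ("R-CB6", "L-3b"),
]

counts = {n: edge_counts(low_freq_edges, n) for n in ("R-CB6", "L-SCEF", "L-A5")}
```

In this toy network R-CB6 and L-SCEF only send and L-A5 only receives, which is the qualitative low-frequency pattern described for the speaking condition.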
The connectograms in the middle parts of Fig. 2 illustrate the connections of the left sensory-motor area (left plots; L-3b and L-SCEF) and the right cerebellum (right plots) to other cortical and subcortical parcels in low (top: 7-20 Hz) and high (bottom: 60-90 Hz) frequencies during the speaking condition. In low frequencies, both the sensory-motor area and the right cerebellum primarily send information to other parcels, including the left and right temporal regions, while in the high frequencies, these nodes primarily receive information. Interestingly, our connectivity results illustrate that the top-down effects from higher-order cortical areas to lower-order areas during speaking occur in distinct frequency bands. Fig. 2 (lower panels) depicts that the strongest top-down connectivity from the left pre-supplementary motor area (pre-SMA; L-SCEF) to lower-order cortical areas such as L-A5 occur in the theta band (peak at 5-7 Hz). However, the top-down connectivity from L-3b to L-A5 was strongest in two distinct theta (peak at 7 Hz) and beta (peak at 21 Hz) frequency bands. Moreover, these results revealed significant bottom-up connectivity from lower-order cortical areas, namely L-A5 to left sensory-motor area (L-3b and L-SCEF) in high frequency bands. As we previously showed (Abbasi et al., 2023), this striking reversal indicates a dissociation of bottom-up and top-down information flow during speaking in distinct frequency bands. Signals communicated top-down are predominantly transmitted in low-frequency ranges, while those communicated bottom-up are transmitted in high-frequency ranges. Notably, Fig. 2 (bottom-right panel) also depicts directional connectivity from the right cerebellum lobule VI (R-CB6) to the left STG (L-A5) in frequencies below 50 Hz. In contrast, in frequencies above 50 Hz, we observed reverse signaling from the left STG to the right cerebellum during speaking.
The connectograms in Fig. 3 provide a comprehensive overview of significant connectivities across various frequency bands among the selected fourteen brain areas during speaking and listening conditions. Specifically highlighting the first column’s connectograms during speaking, we observe comparable connectivity patterns to those of the left sensory-motor area. Connections from the right higher-order cortical regions, such as R-SCEF and R-3b, towards lower-order areas like R-A5, are noticeable in low-frequency bands (<30 Hz). Conversely, for higher frequencies (>30 Hz), we noticed the reverse directional connections, from lower-order to higher-order brain regions. Furthermore, Fig. 4A reveals strong top-down connectivity from bilateral 3b and left FOP to bilateral A5 and left STS, particularly prominent in two distinct frequency bands: theta (peak at 7 Hz) and beta (peak at 21 Hz). Additionally, similar to the left hemisphere, our results in Fig. 4A show significant bottom-up connectivity from lower-order cortical areas (bilateral A5 and left STS) to higher-order cortical regions (bilateral 3b and bilateral SCEF) across both low and high frequency bands.
We observed a similar pattern during the listening condition compared to the speaking condition, including reversed directionality between low and high frequencies (Fig. 3, second column). However, both top-down effects in the low frequency band and bottom-up effects in the high frequency band were less pronounced during listening compared to speaking. The third column in Fig. 3 as well as the third panel of Fig. 4 confirm these results by showing the direct comparisons of DAI connectivity between speaking and listening. This figure illustrates significant differences in connectivity between speaking and listening conditions in different frequency bands. Specifically, stronger top-down signals originating from higher-order cortical areas (such as bilateral SCEF and 3b) towards lower-order brain areas, such as bilateral A5 and left STS, were observed during speaking compared to listening. Conversely, the individual spectrally-resolved DAI plots for the listening conditions, along with the connectograms, indicate the presence of significant connectivity solely from lower-order areas (such as bilateral A5) towards higher-order cortical areas (bilateral SCEF and 3b) during the speaking condition, as opposed to the listening condition. More detailed spectrally-resolved DAI between all ROIs and both conditions are illustrated in Fig. 4.
Notably, we found significant connections between subcortical and cortical areas, indicating their involvement in speech production and perception networks. Fig. 4 depicts the connectivity from the right cerebellum to other cortical and subcortical parcels. In contrast with what we reported for the speaking condition, during listening there is significant connectivity to the left temporal area only in low frequencies, without a reverse connection in the high frequencies. Our analysis also revealed robust connectivity between the right cerebellum and the left parietal cortex, evident in both speaking and listening conditions, with stronger connectivity observed during speaking. Notably, Fig. 4 depicts a prominent frequency peak in the alpha band, illustrating the specific frequency range through which information flows from the cerebellum to the parietal areas. There is also an intriguing connection between the left thalamus and right cerebellum, which reveals a distinct pattern of connectivity. Specifically, we found significant connectivity from the cerebellum to the thalamus in the low-frequency range during speaking, while the opposite pattern was observed in the high-frequency range (Fig. 4).
Finally, we examined the relationship between the connectivity patterns from the selected cortical parcels to the right cerebellum and the coupling of the speech envelope with the oscillations in the superior temporal gyrus (STG). The speech-brain coupling was computed using a multivariate mutual information (MI) approach presented in our recently published study (Abbasi et al., 2023). We investigated the correlations between the directional connectivity indices and MI values across participants for each parcel and frequency band in the speaking condition. Our analysis showed significant negative correlations between the R-CB6 to L-A5 connectivity and the speech-STG coupling in the theta band (at 130 ms lag) for the speaking condition (Fig. 5; p < .05).
Discussion
The present study utilized multivariate Granger causality analysis to investigate the frequency-specific brain networks during continuous speaking and listening. Our results revealed significant connectivity from bilateral higher-order cortical areas to bilateral auditory areas (STG and STS) in lower frequency bands (up to beta) during speaking, and in the opposite direction in gamma frequencies. Notably, subcortical areas also contributed to the speech network, with directional communication observed from the right cerebellum lobule VI (R-CB6) to the left auditory area below 30 Hz. Conversely, in frequencies above 30 Hz, we observed reverse signaling from the left auditory areas to the right cerebellum.
Directed connectivity in speech production and perception
In this study, we aimed to investigate directional connectivity in speech production and perception. Using the automatic meta-analysis provided by neurosynth.org, we included cortical and subcortical brain regions involved in these processes. Our multivariate Granger causality analysis revealed the contribution of distinct frequency channels in both continuous speaking and listening in the auditory-motor-cerebellum domain. Specifically, feedback signals were communicated via slow rhythms below 40 Hz, whereas feedforward signals were communicated via faster rhythms. Our study builds upon our previous work, which demonstrated top-down signaling in low frequencies from higher-order cortical areas to STG and STS and the reverse pattern in higher frequencies. Our findings are also supported by previous studies showing distinct frequency channels for feedforward and feedback communication between two hierarchically different auditory areas (Fontolan et al., 2014) and demonstrating that prediction errors are represented in gamma power while predictions are represented in lower frequency beta power (Sedley et al., 2016).
Based on our meta-analysis results, several subcortical areas such as cerebellum and thalamus are involved in speech production and perception. Previous studies have supported the cerebellum’s role in speech production and perception (Giraud et al., 2007; Skipper and Lametti, 2021). Our connectivity analysis findings demonstrate the involvement of the cerebellum within speech networks. Specifically, our results indicate directed connectivity from the right cerebellum to left temporal areas (L-A5 and L-STS) in lower frequencies, and the reverse direction in higher frequencies. Similar to the connections between higher-order cortical areas and temporal areas, feedback signaling between the cerebellum and temporal areas occurs in low frequencies (below 40 Hz), which could potentially play a role in conveying timing information for upcoming speech. In contrast, feedforward signaling occurs in gamma frequency (above 60 Hz) from temporal areas to the cerebellum, facilitating the update of sensory predictions. Comparison between speech production and perception conditions revealed stronger feedback signaling from the cerebellum to temporal areas during speech production, aligning with the nature of updating sensory predictions in this context. Our findings closely align with recent studies that have investigated the role of the cerebellum in predictive internal modeling mechanisms in speech production and perception (Stockert et al., 2021; Todorović et al., 2023). These studies suggest that the combined yet distinct activation of temporal, parietal, and cerebellar regions during internal and external monitoring points toward their involvement in auditory and somatosensory targets and continuous updating of auditory environmental models.
Moreover, temporo-cerebellar coupling may underlie the precise encoding of temporal structure and support the ongoing optimization of spectro-temporal models of the auditory environment within a network comprising the prefrontal cortex, temporal cortex, and cerebellum (Kotz et al., 2014; Stockert et al., 2021; Todorović et al., 2023).
Our findings of significant negative correlations between right cerebellar connectivity to the left temporal area and speech-STG coupling in the theta band during speech production resonate with our prior work (Abbasi et al., 2023). In our previous study, we reported a similar negative correlation involving theta-range speech-brain coupling in the left auditory area and top-down beta connectivity from motor areas. These observations parallel existing research on sensory attenuation, where the brain predicts the sensory outcomes of self-generated actions, resulting in reduced cortical responses (Martikainen et al., 2005). In earlier studies, we associated this phenomenon with the modulation of beta-range directional couplings originating from motor cortices toward bilateral auditory regions, indicating predictive processes (Abbasi and Gross, 2020). Now, our new results introduce the cerebellum as another key node in this sensory attenuation mechanism, expanding our understanding of how the brain anticipates and minimizes sensory responses during speech production.
The similarities observed between the feedback and feedforward signaling from the cerebellum and higher-order cortical areas to the temporal areas suggest a shared contribution in predicting the sensory consequences of generated speech. It is proposed that cortico-cerebellar and corticocortical predictions interact in speech networks, with the cerebellum potentially involved in predicting well-learned speech, while the cortex flexibly applies predictions in novel contexts (Ackermann, 2008; Ackermann et al., 1998; Skipper and Lametti, 2021). These findings deepen our understanding of the role of the cerebellum in speech production and perception and align with previous research highlighting its involvement in predictive processing mechanisms.
We also observed significant connectivity between the right cerebellum and the left parietal cortex, specifically in the low-frequency ranges, which peaked in the alpha range. This result is consistent with previous research demonstrating the alpha band connectivity between parietal and temporal areas during speech production and perception (Abbasi et al., 2023). We and others previously suggested that the parietal cortex might control and modulate the alpha rhythms in the early auditory areas, which serve as a multi-dimensional filter (Abbasi et al., 2023; Lakatos et al., 2019). Moreover, our current results reveal that during speaking and listening, the parietal cortex receives inputs not only from higher cortical areas such as the motor cortex, but also from the cerebellum. These inputs enable the parietal cortex to encode predicted sensory consequences during speaking and provide top-down signals to early auditory areas. Additionally, our new findings demonstrate stronger connectivity from the cerebellum to the parietal areas during the speaking condition compared to the listening condition. This enhanced connectivity may be attributed to the selective inhibition of auditory signals during speaking, as predicted by the internal predictive coding model (Cao et al., 2017; Floegel et al., 2020).
The thalamus is another subcortical structure involved in speech perception and production which appears to play a significant role as it serves as a crucial intermediary in the cortico-subcortical neural network involving the cerebellum (Barbas et al., 2013; Silveri, 2021). Our findings reinforce this notion, revealing significant connectivity from the right cerebellum to the left thalamus in the low-frequency band, with the opposite direction observed in the high-frequency band during speaking. These connectivity patterns align with the results observed between the right cerebellum and other cortical areas, providing further support for the thalamus’s role in interconnecting cortical and subcortical structures, facilitating communication and coordination in various stages of verbal production (Crosson, 2013).
This study identifies separate frequency bands used in both speaking and listening. The findings highlight the importance of feedback and feedforward communication in these processes, suggesting potential involvement in optimizing neural processing during speech tasks. Notably, we have also unveiled the anticipated role of the cerebellum within this framework. However, definitive conclusions should be drawn with caution given recent studies raising concerns about the notion that top-down and bottom-up signals can only be transmitted via separate frequency channels (Ferro et al., 2021; Schneider et al., 2021; Vinck et al., 2023). Therefore, further investigation is warranted to thoroughly assess the alignment between our results and the contribution of the predictive coding framework in speech production and perception.
Methods
Participants
Thirty native German-speaking participants (15 males, mean age 25.1 ± 2.8 years [M ± SD], range 20-32 years) were recruited for this study. Prior written informed consent was obtained before measurements and participants received monetary compensation after partaking. The study was approved by the local ethics committee and conducted in accordance with the Declaration of Helsinki.
Recordings
Magnetoencephalography, EMG, and speech signals were recorded simultaneously. The speech recording had a sampling rate of 44.1 kHz, whereas the MEG, recorded with a 275-channel whole-head sensor system (OMEGA 275, VSM Medtech Ltd., Vancouver, BC, Canada), was sampled at 1200 Hz.
To avoid artefacts from the microphone used for capturing audio data, it was placed at a distance of 155 cm from the participant’s mouth. Three pairs of EMG surface electrodes were placed after tactile inspection to find the correct location to capture muscle activity from the m. genioglossus, m. orbicularis oris, and m. zygomaticus major (for exact location see Fig. 1 in (Abbasi et al., 2021)). One pair of electrodes was used for each muscle with about 1 cm between electrodes. A low-pass online filter with a 300 Hz cut-off was applied to the recorded MEG and EMG data.
Paradigm
Participants were asked to sit relaxed while performing the given tasks and to keep their eyes on a white fixation cross. The experiment was split into two separate parts. The first part consisted of answering given questions, each for 60 s, thus recording overt speech. Participants had to answer seven questions covering neutral topics, such as ‘What does a typical weekend look like for you?’. A color change of the fixation cross from white to blue indicated the beginning of the time period in which participants should speak, and the end was marked by a color change back to white. The second part focused on speech perception: participants listened to their own answers from part one. The list of questions as well as further details of the paradigm can be found in (Abbasi et al., 2023).
Preprocessing and data analysis
Prior to data analysis, MEG data were visually inspected. No jump artefacts or bad channels were detected. A discrete Fourier transform (DFT) filter was applied to eliminate 50 Hz line noise from the continuous MEG and EMG data. Moreover, EMG data was highpass-filtered at 20 Hz and rectified. Continuous head position and rotation were extracted from the fiducial coils placed at anatomical landmarks (nasion, left, and right ear canals). MEG, EMG, and head movement signals were downsampled to 256 Hz and segmented to non-overlapping 60s trials corresponding to each of their overt answers. In the preprocessing and data analysis steps, custom-made scripts in Matlab R2020 (The Mathworks, Natick, MA, USA) in combination with the Matlab-based FieldTrip toolbox (Oostenveld et al., 2011) were used in accord with current MEG guidelines (Gross et al., 2013a).
Artefact rejection
For removing the speech-related artefacts we used the pipeline presented in (Abbasi et al., 2021). In a nutshell, the artefact rejection comprises four major steps: (i) Head movement-related artefact was initially reduced by incorporating the head position time-series into the general linear model (GLM) using regression analysis (Stolk et al., 2013). (ii) To further remove the residual artefact, singular value decomposition (SVD) was used to estimate the spatial subspace (components) containing the speech-related artefact from the MEG data. (iii) Artefactual components were detected via visual inspections and mutual information (MI) analysis and then removed from the single-trial data (Abbasi et al., 2016). (iv) Finally, all remaining components were back-transformed to the sensor level.
Source localization
For source localisation we aligned individual T1-weighted anatomical MRI scans with the digitised head shapes using the iterative closest point algorithm. Then, we segmented the MRI scans and generated single-shell volume conductor models (Nolte, 2003), and used these to create forward models. For group analyses, individual MRIs were linearly transformed to an MNI template provided by FieldTrip. Next, the linearly constrained minimum variance (LCMV) algorithm was used to compute time-series for each voxel on a 5-mm grid. The time-series were extracted for each dipole orientation, resulting in three time-series per voxel. The reduced version of the HCP brain atlas as well as the AAL atlas were applied to the source space time-series in order to reduce the dimensionality of the data, resulting in 230 cortical parcels (Tait et al., 2020) and 116 subcortical parcels respectively. Since the HCP atlas only covers cortical areas, we used the AAL atlas for subcortical areas. Finally, for each parcel we extracted the first three components of a singular value decomposition (SVD) of the time-series from all dipoles in that parcel, explaining most of the variance.
ROI selection
In this study, we focused on the brain regions that are involved in speech production and perception. In order to find the areas involved in speech production and perception network, we utilized the automatic meta-analysis provided by neurosynth.org using the term ‘speech’ that resulted in a meta-analysis of 642 fMRI studies highlighting the brain areas involved in speech production and perception. We extracted the MNI coordinates of active areas from the presented map, found their corresponding voxels and identified the respective parcels on HCP and AAL atlases where these voxels are located. This resulted in 14 cortical and subcortical parcels (see Fig. 1).
Connectivity analysis
We performed connectivity analysis by using a multivariate nonparametric Granger causality approach (mGC; Schaum et al., 2021). We computed the mGC to determine the directionality of functional coupling between all the detected involved parcels, in pairwise steps, during speech production and perception. Initially, the source signals were divided into trials of four seconds, with 500 milliseconds overlap. We used the fast Fourier transform in combination with multitapers (2 Hz smoothing) to compute the cross-spectral density (CSD) matrix of the trials. Next, using a blockwise approach, we considered the first three SVD components of each parcel as a block and estimated the connectivity between STG and other parcels. Finally, we computed the directed influence asymmetry index (DAI) defined by (Bastos et al., 2015) as
$$\mathrm{DAI}_{A \rightarrow B}(f) = \frac{\mathrm{mGC}_{A \rightarrow B}(f) - \mathrm{mGC}_{B \rightarrow A}(f)}{\mathrm{mGC}_{A \rightarrow B}(f) + \mathrm{mGC}_{B \rightarrow A}(f)}$$
Therefore, a positive DAI for a given frequency indicates that the selected parcel conveys feedforward influences to STG in this frequency, and a negative DAI indicates feedback influences. Note that for the connectivity analysis, we used MEG data with 1200 Hz sampling rate without downsampling.
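The segmentation step (four-second trials with 500 ms overlap at the 1200 Hz sampling rate) can be sketched as below. The sketch assumes that "overlap" means consecutive segments share their last and first 500 ms, so the step between segment onsets is 3.5 s. The function name `segment_bounds` and its parameters are illustrative and not part of FieldTrip.

```python
def segment_bounds(n_samples, fs, win_s=4.0, overlap_s=0.5):
    """Return (start, stop) sample indices of overlapping segments.

    stop is exclusive; only segments that fit entirely in the recording are kept.
    """
    win = int(round(win_s * fs))                 # segment length in samples
    step = win - int(round(overlap_s * fs))      # onset-to-onset distance
    return [(start, start + win) for start in range(0, n_samples - win + 1, step)]


fs = 1200                              # MEG sampling rate used for connectivity (Hz)
bounds = segment_bounds(60 * fs, fs)   # one 60 s answer
```

Under these assumptions a single 60 s answer yields 17 segments of 4800 samples each, with the last segment ending exactly at the end of the recording.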
Statistical analysis
We determined significant connectivity patterns (DAI values) in both speaking and listening conditions using non-parametric cluster-based permutation tests (Maris and Oostenveld, 2007). First, we contrasted the connectivity values during speaking and during listening against zero for each parcel and participant. Second, the DAI values in the speaking condition were contrasted with the DAI values in the listening condition at the group level. The statistical analysis was conducted for different frequency bands (delta/theta (0-7 Hz), alpha (7-13 Hz), beta (15-30 Hz), gamma (30-60 Hz), and high gamma (60-90 Hz)) using a dependent-samples t-test. We used a cluster-based correction to account for multiple comparisons across frequencies and parcels. We performed five thousand permutations and set the critical alpha value at 0.05.
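The logic of a cluster-based permutation test against zero can be sketched in one dimension (frequency only). This is a simplified illustration, not the FieldTrip implementation used in the study: clustering here is over adjacent frequency bins only, the null distribution is built by random sign flips per participant, and the cluster-forming threshold of 2.0, the function names, and the simulated data are assumptions. For a speaking-versus-listening contrast, the same procedure would be applied to the per-participant differences, which is equivalent to a dependent-samples t-test.

```python
import numpy as np

def t_values(x):
    """One-sample t-value per column of x (participants x frequency bins)."""
    n = x.shape[0]
    return x.mean(axis=0) / (x.std(axis=0, ddof=1) / np.sqrt(n))

def max_cluster_mass(t, threshold):
    """Largest summed |t| over runs of adjacent supra-threshold bins of equal sign."""
    best, current, prev_sign = 0.0, 0.0, 0
    for value in t:
        sign = int(np.sign(value)) if abs(value) > threshold else 0
        if sign != 0 and sign == prev_sign:
            current += abs(value)      # extend the running cluster
        elif sign != 0:
            current = abs(value)       # start a new cluster
        else:
            current = 0.0              # bin below threshold: no cluster
        best = max(best, current)
        prev_sign = sign
    return best

def cluster_permutation_p(dai, threshold=2.0, n_perm=5000, seed=0):
    """Monte Carlo p-value of the largest observed cluster (test against zero)."""
    rng = np.random.default_rng(seed)
    observed = max_cluster_mass(t_values(dai), threshold)
    null = np.empty(n_perm)
    for i in range(n_perm):
        # Randomly flip the sign of each participant's whole spectrum.
        flips = rng.choice([-1.0, 1.0], size=(dai.shape[0], 1))
        null[i] = max_cluster_mass(t_values(dai * flips), threshold)
    p_value = (np.sum(null >= observed) + 1) / (n_perm + 1)
    return observed, p_value

# Simulated DAI values: 20 participants x 30 frequency bins, effect in bins 10-15.
rng = np.random.default_rng(1)
dai = 0.2 * rng.standard_normal((20, 30))
dai[:, 10:16] += 0.3
observed, p_value = cluster_permutation_p(dai)
print(observed, p_value)
```

Because only the maximum cluster statistic of each permutation enters the null distribution, the resulting p-value controls the family-wise error rate across all frequency bins at once.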
Correlation analysis
To examine the potential relationship between the connectivity from all parcels to L-CB6 and speech-STG coupling, we conducted non-parametric cluster-based permutation tests. We first calculated top-down connectivity values for the selected parcels and frequency bands and then computed speech-STG coupling for each frequency band. We used the Pearson correlation implemented in the ft_statfun_correlationT function in FieldTrip, with cluster-based correction to account for multiple comparisons across parcels. The analysis was repeated separately for each frequency band; the results are therefore not corrected across frequencies. We performed five thousand permutations and set the critical alpha value at 0.05.
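The test statistic behind a correlation-based permutation test can be sketched as follows, assuming the standard conversion of a Pearson coefficient r over n participants into a t-value, t = r * sqrt((n - 2) / (1 - r^2)). This is a plain NumPy illustration rather than a call to FieldTrip; the function name and the toy values are hypothetical.

```python
import numpy as np

def correlation_t(x, y):
    """Pearson correlation between x and y and its associated t-value.

    x : connectivity value per participant (one parcel, one frequency band).
    y : speech-STG coupling per participant (same frequency band).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    r = np.corrcoef(x, y)[0, 1]
    # Standard r-to-t conversion with n - 2 degrees of freedom.
    t = r * np.sqrt((n - 2) / (1.0 - r ** 2))
    return r, t

# Made-up values for five participants.
connectivity = [1.0, 2.0, 3.0, 4.0, 5.0]
coupling = [2.0, 1.0, 4.0, 3.0, 5.0]
r, t = correlation_t(connectivity, coupling)
print(r, t)
```

In a permutation scheme, this t-value would be recomputed after shuffling the coupling values across participants, and the observed statistic compared with the resulting null distribution.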
Data availability
An example dataset is available at https://osf.io/9fq47/?view_only=e6bad57efb854f069474c2d6f93a00a6. Raw data, however, are protected by data privacy laws and cannot be made widely available, but may be shared upon reasonable request (subject to these privacy laws).
References
- Beta-band oscillations play an essential role in motor-auditory interactions. Hum. Brain Mapp 41:656–665. https://doi.org/10.1002/hbm.24830
- Rejecting deep brain stimulation artefacts from MEG data using ICA and mutual information. J. Neurosci. Methods 268:131–141. https://doi.org/10.1016/j.jneumeth.2016.04.010
- Spatiotemporal dynamics characterise spectral connectivity profiles of continuous speaking and listening. PLoS Biol 21. https://doi.org/10.1371/journal.pbio.3002178
- Correcting MEG artifacts caused by overt speech. Front. Neurosci 15. https://doi.org/10.3389/fnins.2021.682419
- Does the cerebellum contribute to cognitive aspects of speech production? A functional magnetic resonance imaging (fMRI) study in humans. Neurosci. Lett 247:187–190. https://doi.org/10.1016/s0304-3940(98)00328-0
- Cerebellar contributions to speech production and speech perception: psycholinguistic and neurobiological perspectives. Trends Neurosci 31:265–272. https://doi.org/10.1016/j.tins.2008.02.011
- The right hemisphere is highlighted in connected natural speech production and perception. Neuroimage 152:628–638. https://doi.org/10.1016/j.neuroimage.2017.03.006
- Cortical oscillations and sensory predictions. Trends Cogn Sci (Regul Ed) 16:390–398. https://doi.org/10.1016/j.tics.2012.05.003
- Frontal-thalamic circuits associated with language. Brain Lang 126:49–61. https://doi.org/10.1016/j.bandl.2012.10.001
- Visual areas exert feedforward and feedback influences through distinct frequency channels. Neuron 85:390–401. https://doi.org/10.1016/j.neuron.2014.12.018
- The role of brain oscillations in predicting self-generated sounds. Neuroimage 147:895–903. https://doi.org/10.1016/j.neuroimage.2016.11.001
- Multivariate analysis of speech envelope tracking reveals coupling beyond auditory cortex. Neuroimage 258. https://doi.org/10.1016/j.neuroimage.2022.119395
- Thalamic mechanisms in language: a reconsideration based on recent findings and concepts. Brain Lang 126:73–88. https://doi.org/10.1016/j.bandl.2012.06.011
- Directed information exchange between cortical layers in macaque V1 and V4 and its modulation by selective attention. Proc Natl Acad Sci USA 118. https://doi.org/10.1073/pnas.2022097118
- Differential contributions of the two cerebral hemispheres to temporal and spectral speech feedback control. Nat. Commun 11. https://doi.org/10.1038/s41467-020-16743-2
- The contribution of frequency-specific activity to hierarchical information processing in the human auditory cortex. Nat. Commun 5. https://doi.org/10.1038/ncomms5694
- Active inference, communication and hermeneutics. Cortex 68:129–143. https://doi.org/10.1016/j.cortex.2015.03.025
- The use of electroencephalography in language production research: a review. Front. Psychol 2. https://doi.org/10.3389/fpsyg.2011.00208
- Endogenous cortical rhythms determine cerebral specialization for speech perception and production. Neuron 56:1127–1134. https://doi.org/10.1016/j.neuron.2007.09.038
- Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci 15:511–517. https://doi.org/10.1038/nn.3063
- Good practice for conducting and reporting MEG research. Neuroimage 65:349–363. https://doi.org/10.1016/j.neuroimage.2012.10.001
- Speech rhythms and multiplexed oscillatory sensory coding in the human brain. PLoS Biol 11. https://doi.org/10.1371/journal.pbio.1001752
- Exploring the temporal dynamics of speech production with EEG and group ICA. Sci. Rep 10. https://doi.org/10.1038/s41598-020-60301-1
- Cerebellum, temporal predictability and the updating of a mental model. Philos. Trans. R. Soc. Lond. B Biol. Sci 369. https://doi.org/10.1098/rstb.2013.0403
- A new unifying account of the roles of neuronal entrainment. Curr. Biol 29:R890–R905. https://doi.org/10.1016/j.cub.2019.07.075
- Intra-cranial recordings of brain activity during language production. Front. Psychol 2. https://doi.org/10.3389/fpsyg.2011.00375
- Consensus paper: roles of the cerebellum in motor control--the diversity of ideas on cerebellar involvement in movement. Cerebellum 11:457–487. https://doi.org/10.1007/s12311-011-0331-9
- Nonparametric statistical testing of EEG- and MEG-data. J. Neurosci. Methods 164:177–190. https://doi.org/10.1016/j.jneumeth.2007.03.024
- Suppressed responses to self-triggered sounds in the human auditory cortex. Cereb. Cortex 15:299–302. https://doi.org/10.1093/cercor/bhh131
- The magnetic lead field theorem in the quasi-static approximation and its use for magnetoencephalography forward calculation in realistic volume conductors. Phys. Med. Biol 48:3637–3652. https://doi.org/10.1088/0031-9155/48/22/002
- FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. https://doi.org/10.1155/2011/156869
- A cortical network processes auditory error signals during human speech production to maintain fluency. PLoS Biol 20. https://doi.org/10.1371/journal.pbio.3001493
- Frontal top-down signals increase coupling of auditory low-frequency oscillations to continuous speech in human listeners. Curr. Biol 25:1649–1653. https://doi.org/10.1016/j.cub.2015.04.049
- Impaired Feedforward Control and Enhanced Feedback Control of Speech in Patients with Cerebellar Degeneration. J. Neurosci 37:9249–9258. https://doi.org/10.1523/JNEUROSCI.3363-16.2017
- Spatiotemporal dynamics of word retrieval in speech production revealed by cortical high-frequency band activity. Proc Natl Acad Sci USA 114:E4530–E4538. https://doi.org/10.1073/pnas.1620669114
- Complex network measures of brain connectivity: uses and interpretations. Neuroimage 52:1059–1069. https://doi.org/10.1016/j.neuroimage.2009.10.003
- Corticomuscular coherence is tuned to the spontaneous rhythmicity of speech at 2-3 Hz. J. Neurosci 32:3786–3790. https://doi.org/10.1523/JNEUROSCI.3191-11.2012
- Right inferior frontal gyrus implements motor inhibitory control via beta-band oscillations in humans. eLife 10. https://doi.org/10.7554/eLife.61679
- A mechanism for inter-areal coherence through communication based on connectivity and oscillatory power. Neuron 109:4050–4067. https://doi.org/10.1016/j.neuron.2021.09.037
- Neural signatures of perceptual inference. eLife 5. https://doi.org/10.7554/eLife.11476
- Coupled neural systems underlie the production and comprehension of naturalistic narrative speech. Proc Natl Acad Sci USA 111:E4687–96. https://doi.org/10.1073/pnas.1323812111
- Contribution of the Cerebellum and the Basal Ganglia to Language Production: Speech, Word Fluency, and Sentence Construction-Evidence from Pathology. Cerebellum 20:282–294. https://doi.org/10.1007/s12311-020-01207-6
- Speech Perception under the Tent: A Domain-general Predictive Role for the Cerebellum. J. Cogn. Neurosci 33:1517–1534. https://doi.org/10.1162/jocn_a_01729
- Temporo-cerebellar connectivity underlies timing constraints in audition. eLife 10. https://doi.org/10.7554/eLife.67303
- Online and offline tools for head movement compensation in MEG. Neuroimage 68:39–48. https://doi.org/10.1016/j.neuroimage.2012.11.047
- Cortical source imaging of resting-state MEG with a high resolution atlas: An evaluation of methods. BioRxiv. https://doi.org/10.1101/2020.01.12.903302
- Cortico-Cerebellar Monitoring of Speech Sequence Production. Neurobiology of Language :1–21. https://doi.org/10.1162/nol_a_00113
- Principles of large-scale neural interactions. Neuron 111:987–1002. https://doi.org/10.1016/j.neuron.2023.03.015
Copyright
© 2024, Abbasi et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.