Abstract
Complex motor skills like speech and dance are composed of ordered sequences of simpler elements, but the neuronal basis for syntactic ordering of individual actions into sequences is poorly understood. Birdsong is a learned vocal behavior composed of syntactically ordered sequences of individual syllables. Activity in song premotor nucleus HVC (proper name) has been linked to the control of syllable sequencing, but sequencing may also be affected by its recurrent inputs. We here test the contribution of one of HVC’s inputs, mMAN (medial magnocellular nucleus of the anterior nidopallium), to the variable songs of adult male Bengalese finches (Lonchura striata domestica). The syntax of Bengalese song includes several patterns: 1) chunks, where syllables follow stereotypical order 2) branch points, where a given syllable can be followed by two or more different syllables in a probabilistic manner and 3) repeat phrases, where an individual syllable is repeated a variable number of times. We found that after bilateral lesions of mMAN, the acoustic structure of syllables remained largely intact, but sequencing became more variable for each of these patterns, seen by ‘breaks’ in previously stereotyped chunks, increased uncertainty at branch points and increased variability of repeat numbers. This increase in sequencing variability highlights the potential importance of regions projecting to HVC in the ordering of vocal elements. Previous studies on zebra finches found little effect of mMAN lesions on their relatively stereotyped adult song. In contrast, our results suggest that mMAN contributes to sequencing the variable songs of Bengalese finches and highlight the utility of species with more complex song syntax in investigating neuronal control of ordered motor sequences.
Introduction
Complex behaviors are composed of sequences of simpler motor elements. For example, in human speech, sentences are formed by syllables that are flexibly sequenced according to syntactic rules (Berwick et al., 2011), which define the order of individual elements. Production of motor sequences is important not only for speech, but also for most other natural movements. Despite the ubiquity of syntactically organized sequential behaviors, little is known about how the brain produces flexible motor sequences (Lashley, 1951; Tanji, 2001; Aldridge and Berridge, 2003; Wiltschko et al., 2015; Veit et al., 2021).
Birdsong is a learned behavior that is well suited for the study of the neural control of motor sequences. Song is composed of discrete elements, called syllables, that are organized according to syntactic rules. The complexity of song syntax varies widely across different songbird species - from the simple and repetitive songs of owl finches (Wang et al., 2019) and zebra finches (Zann, 1996) to the variable songs of Bengalese finches (Honda and Okanoya, 1999) and canaries (Nottebohm, Stokes and Leonard, 1976; Cohen et al., 2020) up to the immense repertoire of nightingales (Hultsch et al., 2004; Costalunga et al., 2023). The neuronal mechanisms of birdsong production have been primarily studied in the zebra finch, partly by leveraging the structure of zebra finch song, in which syllables are produced in a relatively stereotyped and linear sequence. Consequently, we have a detailed understanding of the neuronal mechanisms for producing individual song syllables in this species, but we know little about how syllables are organized into the more complex and variable sequences that are characteristic of many other species (Ivanitskii and Marova, 2022).
Song production is controlled by neuronal activity in a set of brain nuclei called the song motor pathway, including premotor nucleus HVC (proper name). HVC neurons projecting to primary motor nucleus RA (robust nucleus of the arcopallium) encode syllable identity and have been shown to play a role in controlling syllable timing (Simpson and Vicario, 1990; Vu, Mazurek and Kuo, 1994; Hahnloser, Kozhevnikov and Fee, 2002; Long and Fee, 2008; Aronov et al., 2011; Ölveczky et al., 2011; Lynch et al., 2016; Picardo et al., 2016; Zhang et al., 2017). HVC activity in Bengalese finches (Sakata and Brainard, 2008; Fujimoto, Hasegawa and Watanabe, 2011) and canaries (Cohen et al., 2020) has also been shown to encode syllable identity, and can additionally influence sequencing and reflect the sequential context in which syllables are produced (Fujimoto, Hasegawa and Watanabe, 2011; Zhang et al., 2017; Cohen et al., 2020). The nuclei of the motor pathway are essential for song production (Nottebohm, Stokes and Leonard, 1976; Scharff and Nottebohm, 1991), but the extent to which these sequence-selective activity patterns in HVC reflect internal neural dynamics or are shaped by recurrent inputs is poorly understood.
Although HVC is often considered to be at the top of a hierarchy in the song control circuit, it is part of a network of brain nuclei involved in song production and receives multiple inputs that might also contribute to premotor activity (McCasland, 1987; Hosino and Okanoya, 2000; Jin, 2009; Hamaguchi, Tanaka and Mooney, 2016; Vyssotski et al., 2016). One prominent source of input to HVC is mMAN (medial magnocellular nucleus of the anterior nidopallium) which is part of a recurrent loop that bilaterally connects the two hemispheres through RA and bilateral connections from thalamic nucleus DMP (dorsomedial nucleus of the posterior thalamus, Figure 1E) and could thus be ideally positioned to convey sequence-related activity across hemispheres (Vates, Vicario and Nottebohm, 1997; Schmidt, Ashmore and Vu, 2004; Williams, Nast and Coleman, 2012). A parallel recurrent pathway, which projects to RA through nucleus lMAN (lateral magnocellular nucleus of the anterior nidopallium, Figure 1E) has been shown to contribute variability to syllable pitch, a parameter that is controlled by RA premotor activity (Kao, Doupe and Brainard, 2005; Sober, Wohlgemuth and Brainard, 2008; Miller, Cheung and Brainard, 2017). lMAN-guided variability can serve as a form of motor exploration which is essential for learning (Sober and Brainard, 2012; Dhawale, Smith and Ölveczky, 2017), regulating social context-dependent changes in pitch variability (Kao and Brainard, 2006; Hampton, Sakata and Brainard, 2009), and biasing pitch in the direction of improved motor performance during learning (Andalman and Fee, 2009; Warren et al., 2012; Tian and Brainard, 2017). This raises the question of whether mMAN, as the output of a parallel recurrent pathway that projects onto HVC, might have a similar role in contributing to premotor activity in its projection target, with the potential to influence syllable identity or sequencing variability (Kubikova, Turner and Jarvis, 2007; Seki and Okanoya, 2008; Ali et al., 2013). Previous studies in zebra finches showed that mMAN lesions in juveniles disrupt normal song learning, while lesions in adults affected song initiation, but not subsequent production of song in this species (Foster and Bottjer, 2001; Horita, Wada and Jarvis, 2008; Ali et al., 2013).
We tested the hypothesis that mMAN contributes to syllable sequencing by studying the effect of bilateral mMAN lesions on song production in adult Bengalese finches. We found that lesions had little effect on the acoustic structure of individual syllables but led to an increase in the variability of multiple aspects of syllable sequencing. These results highlight the potential importance of recurrent inputs such as mMAN in shaping the syntactical structure of adult birdsongs.
Methods
Subjects and sound recordings
Experiments were carried out on seven adult male Bengalese finches (Lonchura striata domestica) obtained from the Brainard lab’s breeding colony at University of California, San Francisco (median age 640 days post-hatch, range 149-1049 at start of experiment). Birds were raised with their parents and then housed in same-sex group cages. For the experiments, birds were placed in individual sound-attenuating boxes (Acoustic Systems, Austin, TX) and maintained on a 14:10 hour light: dark period. Song was recorded using an omnidirectional microphone above the cage using custom LabView software for continuous monitoring of song output (Tumer and Brainard, 2007). All procedures were performed in accordance with animal care protocols approved by the University of California, San Francisco Institutional Animal Care and Use Committee (IACUC).
mMAN lesions
After several days of sound recording, birds were anesthetized using ketamine and midazolam with isoflurane. mMAN was localized using stereotaxic coordinates and was destroyed bilaterally using current injections of 100uA current for 80s at four sites per hemisphere. After conclusion of the study, birds were deeply anesthetized and perfused with paraformaldehyde. 40um sections were cut on a microtome and processed with either Nissl stain (cresyl violet) or calcitonin gene-related peptide (CGRP) (Hampton, Sakata and Brainard, 2009). mMAN lesion size was estimated by experienced observers and estimated to be either complete (bird 1, birds 3-6) or greater than 75% (bird 2 and 7) for all birds used in this analysis (Figure 1-figure supplement 1).
Sequence analysis
Bengalese finch songs were recorded for several days before and after lesions. Singing activity resumed on average 3.8 days post lesions (range 2-6 days, Figure 1-figure supplement 1), and we analyzed songs starting on average 5.3 days post lesions (range 3-11 days) up to 7.4 days post lesions (range 5-12 days). We analyzed at least two days pre and post lesions, and subsampled the larger of the two datasets pooled over all days such that the pre and post datasets were equal in size for each bird (average of 299 song bouts, range 102-601, for all analyses). All our main analyses were carried out on this dataset. To analyze specifically the persistence of effects after the lesion, Figure 1-figure supplement 2 includes four additional data points (14, 19, 33 and 33 days post lesion for birds 2, 3, 6 and 7 respectively) after additional behavioral manipulation, from which birds typically recover (Warren et al., 2012).
Annotation
We annotated the entire syllable sequence in an automated way. We first used a subset of the data to generate a training set for classification. Separate classifiers were trained for each bird. In the training data, syllable onsets and offsets were determined by amplitude thresholding. We then performed dimensionality-reduction and unsupervised clustering of the spectrograms of detected syllables using UMAP (McInnes et al., 2018; Sainburg, Thielk and Gentner, 2020), in order to determine the number of different syllables in an objective way (mean 13, range 11-14). On average, we used 181 files (range 70-505) for creating this training dataset, which was then used to train a deep neural network (TweetyNet) for the annotation of birdsong (Cohen et al., 2022). TweetyNet was used to segment and annotate all songs, followed by semi-automated hand-checking using custom-written software in MATLAB R2021b to ensure quality annotation. In rare cases, where there was some ambiguity in the assignment of syllable identity during hand-checking, we additionally took into account the sequential context, for example during stereotyped chunks, to assign the most likely label. Thus, any errors in assignment in such cases would have tended to reduce rather than accentuate the magnitude of the lesions’ effects on reported sequencing changes.
Simplification of the syllable sequence
We defined song bouts as continuous sequences of syllables separated by at least 2 s of silence. We then simplified the syllable sequence by merging introductory notes into a single introductory state, i.e., a single node ‘I’ in the transition diagram (Figure 1A,F). Introductory notes were defined as up to three syllable types occurring at the start of bouts which were quieter, and more variable in timing and structure than other syllable types (Rajan and Doupe, 2013; Veit et al., 2021). Similarly, syllables which were repeated a variable number of at least two times were reduced to a single state (e.g. ‘bbbbbbbbb’ to ‘B’, Figure 1A,F) in the syllable sequence, and the number of repetitions within these repeat phrases was analyzed separately (Hampton, Sakata and Brainard, 2009; Zhang et al., 2017; Jaffe and Brainard, 2020). We only analyzed repeat phrases with variable numbers of syllable repetitions in this way, therefore, we did not consider cases in which a syllable was repeated an exact number of times on each occurrence (occurring with repeat numbers of 2 or 3 in our dataset, see Figure 3A) a variable repeat phrase.
Separating different syllable states based on sequential context
Transition probabilities between syllables in Bengalese finch song can depend not only on the identity of the syllable, but also on the identity of at least one preceding syllable (Jin and Kozhevnikov, 2011; Katahira et al., 2011; Morita et al., 2021). In order to consider this sequential context-dependency of transition probabilities, we followed the procedure of Katahira et al. 2011 to classify spectrally similar syllables (i.e., with the same labels) into different ‘states’ based on the identity of the preceding syllable. For example, in Figure 1A, the syllable ‘g’ is preceded by ‘I’ or ‘f’ and is followed by ‘h’ or ‘a’. If ‘g’ is preceded by ‘I’ it is always followed by ‘h’ and if it is preceded by ’f’, it can be followed by ‘h’ or ‘a’. We would therefore like to introduce two separate states g1 and g2 in the transition diagram representing this sequential context. The input for this analysis was the syllable sequence with introductory and repeat phrases already condensed into single states. We compared the distribution of transition probabilities from these syllables by themselves (e.g., syllable ‘g’) to the distribution of transition probabilities from these syllables considering the context of one preceding syllable (e.g., syllable ‘f-g’) using the chi-square goodness of fit test. We split the syllable into a new state (e.g. ‘g1’) if p < 0.01/n where n is the number of comparisons made (Bonferroni correction for multiple comparisons) (Jin, 2009; Katahira et al., 2013). After completing this for all syllables of a given type (e.g., ‘g1’, ‘g2’, ‘g3’), we compared the different output states to each other using chi-square goodness of fit test and merged them back together if p > 0.01/n (mean 15, range 6-21 additional states/bird).
Determining chunks and branchpoints
To determine chunks (Suge and Okanoya, 2010), we used the modified syllable sequence after chi-square analysis. We calculated a syllable-to-syllable transition diagram, while eliminating any syllables occurring with less than a threshold frequency of 0.9% and omitting branches with a probability of less than 5%. From the resultant graph, we merged all non-overlapping linear sequences (which consisted of nodes with only a single input and output branch) into chunks. The transition probabilities within resultant chunks were typically higher than 90% (range 87-100%). Values smaller than 100% for non-branching paths result from the omission of one or several small branches of less than 5%. We then re-calculate the transition probabilities between chunked nodes. We defined branchpoints as the set of single syllables that are retained after this processing and the end syllables of the newly defined chunks. To simplify diagrams for visualization, we eliminated any syllables which occurred less than once per bout and branches with transition probabilities of less than 5%.
Transition entropy
Transition entropy is a measure of uncertainty of sequence at a given syllable (Sakata and Brainard, 2006). With c different outputs from the given syllable ‘a’ and P(i) the probability of the ith outcome, we calculate the entropy Ha, as:
We call this value ‘transition entropy per branchpoint’ in Figure 3A.
To determine the overall variability of song before and after mMAN lesions, we calculated total transition entropy TE, over all syllables ‘b’ as:
where Hb is the transition entropy at ‘b’ and P(b) is the frequency of syllable ‘b’ (Chatfield and Lemon, 1970; Katahira et al., 2013).
History dependence
History dependence is a previously established metric that measures the extent to which the most common transition at a given syllable is influenced by the transition at the last occurrence of this syllable (Warren et al., 2012). It has been used to characterize instances of apparent sequence variability, where seemingly variable transitions are always strictly alternating. For example, if the possible transitions from syllable ‘a’ are ‘ab’ or ‘ac’ but these strictly alternated (‘ab…ac…ab…ac’ and so on) then the seemingly variable branchpoint ‘a’ is perfectly predictable based on its history (Warren et al., 2012). Such apparent variability should be largely eliminated in our sequence analysis by the introduction of context dependent states (i.e., in this example, the ‘a’ would be re-labelled as ‘a1’ or ‘a2’ depending on the context in which it occurs) and identification of chunks. However, if higher-order dependencies in the song determine the order of chunks, we might still expect some variable transitions to be governed by history dependence. If ‘ab’ is the most frequently occurring transition from ‘a’, and ‘ac’ is the collection of all other transitions from ‘a’, we define history dependence D of ‘a’ as:
where 𝑃(𝑎𝑏n|𝑎𝑏n-1) is the conditional probability of ‘ab’ transition given that ‘ab’ transition occurred at the previous instance of ‘a’ and 𝑃(𝑎𝑏n|𝑎cn-1) is the conditional probability of ‘ab’ transition given that ‘ac’ transition occurred at the previous instance of ‘a’.
Chunk consistency
As defined above, a chunk is defined by a single dominant sequence, but may have a small amount of variability across different instances. To quantify the stereotypy of chunks, we used a measure based on sequence consistency previously defined for relatively stereotyped zebra finch songs (Scharff and Nottebohm, 1991). Across all instances of a given chunk, we identified the syllable sequence that occurred most often as the ‘dominant sequence’. We then defined ‘n_dominant’ as the number of instances of the dominant sequence, and ‘n_other’ as the number of instances of other sequence variants for the chunk.
We quantified chunk consistency C as the proportion of total instances that were the dominant sequence:
To compare a chunk before and after mMAN lesions, the dominant sequence for the pre lesion chunk was used as a reference, regardless of whether the same sequence qualified as a chunk post lesion. To quantify chunk consistency post lesion, the most dominant sequence post lesion was used (even if that was not the same as the most dominant sequence pre lesion).
Repeat number variability
To study the influence of mMAN on repeat phrases, we examined the distribution of repeat numbers before and after lesions. We quantified the variability of these distributions as their coefficient of variation (CV = standard deviation/mean).
Analysis of syllable acoustic structure
Average spectrogram
For visual inspection of preservation of syllable types before and after mMAN lesions, we calculated average spectrograms for 200 instances of each syllable type for 7 birds (Figure 2-figure supplement 1). Individual spectrograms of each syllable type were rescaled to have matching time axes and then overlaid.
Syllable similarity analysis
We used Sound Analysis Pro (SAP) software (Tchernichovski et al., 2000) to quantify the similarity in syllable structure for all syllable types per bird (Figure 2-figure supplement 2). We used the settings ‘Symmetric’ and ‘Time Courses’ and compared twenty instances of each syllable type.
We first calculated SAP similarity of all syllables to syllables of the same type from two separate control recordings before mMAN lesions. We termed this as ‘Self Similarity’. We next calculated SAP similarity of the same syllable types before and after mMAN lesions. We termed this as ‘Pre vs Post’ similarity. Lastly, we calculated SAP similarity of all syllable types compared to all other syllable types per bird. This was used as a measure of dissimilarity of structure across syllable types. We termed this as ‘Cross Similarity’.
Analysis of fundamental frequency
We used the method previously described by Hampton, Sakata and Brainard, 2009 to calculate fundamental frequency and coefficient of variation (CV) of fundamental frequency for select syllables for each bird (Figure 2-figure supplement 3) (Hampton, Sakata and Brainard, 2009). Briefly, each syllable was visually inspected to have a segment with clearly defined fundamental frequency. We then found the peak of the autocorrelation function (using parabolic interpolation) of the sound waveform in this segment.
Analysis of acoustic features
To further compare syllable phonology before and after the mMAN lesions, we used the SoundSig python package (Elie and Theunissen, 2016) to calculate a set of acoustic features for all syllables for 7 birds (Figure 2-figure supplement 4). The features used were Entropy of spectral envelope (entS), Temporal centroid for the temporal envelope (meanT), first, second and third formants (F1, F2, F3).
Results
Structure of Bengalese finch song
Bengalese finch songs are composed of a fixed number of syllable types, which are arranged into variable sequences following syntactical rules. In this study, we sought to first characterize the different features of Bengalese finch song, before investigating their change following mMAN lesions. For each bird, we first annotated all songs with labels for each of the syllable types, and then defined a transition diagram for analysis according to the following procedure (see Methods):
1) We identified introductory notes (Rajan and Doupe, 2013; Veit et al., 2021) and grouped them together into a single introductory state ‘I’ in the transition diagram. This reduced the overall transition entropy of song for 7 birds from 0.95 to 0.85 (Figure 1A).Rajan and Doupe, 2013; Veit et al., 2021) and grouped them together into a single introductory state ‘I’ in the transition diagram. This reduced the overall transition entropy of song for 7 birds from 0.95 to 0.85 (Figure 1A).
2) Repeat phrases, in which the same syllable is repeated a variable number of times, were also reduced to a single state in the transition diagram (indicated by capital letters) and the distribution of repeat numbers was analyzed separately (Hampton, Sakata and Brainard, 2009; Zhang et al., 2017; Jaffe and Brainard, 2020) (Figure 1D).Hampton, Sakata and Brainard, 2009; Zhang et al., 2017; Jaffe and Brainard, 2020) (Figure 1D).
3) In Bengalese finch song a single spectrally defined and labelled syllable can occur in different fixed sequences (for example, syllable ‘b’ in the sequences a-b-c and d-b-e) and therefore reflects distinct states in the song sequence (Wohlgemuth, Sober and Brainard, 2010; Katahira et al., 2011). We followed the procedure of Katahira et al. to determine if any initially identified syllable types corresponded to more than one state based on whether the preceding syllable carried information on transition probabilities (see Methods, Katahira et al., 2013). After identifying and splitting these ‘hidden states’ (for example b1, b2) for each bird, overall transition entropy was further reduced from 0.85 to 0.54 (Figure 1F).‘b’ in the sequences a-b-c and d-b-e) and therefore reflects distinct states in the song sequence (Wohlgemuth, Sober and Brainard, 2010; Katahira et al., 2011). We followed the procedure of Katahira et al. to determine if any initially identified syllable types corresponded to more than one state based on whether the preceding syllable carried information on transition probabilities (see Methods, Katahira et al., 2013). After identifying and splitting these ‘hidden states’ (for example b1, b2) for each bird, overall transition entropy was further reduced from 0.85 to 0.54 (Figure 1F).
4) In the new syllable sequence, we identified ‘chunks’ (Suge and Okanoya, 2010), sub-sequences of high-probability transitions. We defined these as paths of continuous, non-branching sequences in the transition diagram, after omitting branches that occurred with probability < 5% (Figure 1C). Lastly, we identified branch points in the transition diagram as the set of single syllables and endpoints of chunks, i.e., all states where the syllable sequence proceeds in a probabilistic manner (Figure 1B,F).Suge and Okanoya, 2010), sub-sequences of high-probability transitions. We defined these as paths of continuous, non-branching sequences in the transition diagram, after omitting branches that occurred with probability < 5% (Figure 1C). Lastly, we identified branch points in the transition diagram as the set of single syllables and endpoints of chunks, i.e., all states where the syllable sequence proceeds in a probabilistic manner (Figure 1B,F).
Transition entropy increased after bilateral mMAN lesions
After bilateral mMAN lesions, all syllable types were preserved (Figure 2-figure supplement 1-4), but the sequencing of syllables became more variable. In the example in Figure 2B,C, we show the transition diagrams of one example bird before and after mMAN lesions. We observed a change in transition probabilities at existing branch points, as well as the appearance of novel branches. We quantified changes in sequencing variability for all syllables using transition entropy (Sakata and Brainard, 2006; Katahira et al., 2013). Total transition entropy increased significantly after mMAN lesions, indicating that song syntax overall became more variable (Figure 2D). These effects appeared as soon as the bird resumed singing after the lesions and did not recover over the time for which the birds were observed (Figure 1-figure supplement 2).
Figure 2-figure supplement 5A shows the corresponding changes at all individual branch points. To investigate the source of differences in the magnitude of changes across branch points, we considered the history-dependence of each of these branchpoints. History-dependence captures apparent variability governed by long-range dependencies in the sequence and history-dependent transitions previously have been found to be difficult to modify in a sequence modification training protocol (Warren et al., 2012). They might therefore be less affected by lesions as well. Alternatively, we might expect lesion effects to be stronger for these transitions, if mMAN contributes specifically to long-range dependencies. Consistent with the first possibility, we observed that there was a nonsignificant trend toward larger changes after mMAN lesion for transitions with low history dependence. (Figure 2-figure supplement 5B).
Chunks became more variable after bilateral mMAN lesions
Bengalese finch songs contain short, relatively stereotyped sequences of syllables, here called ‘chunks’ (Okanoya and Yamaguchi, 1997; Suge and Okanoya, 2010; Isola, Vochin and Sakata, 2020; Veit et al., 2021). We speculated that transitions within chunks might be differently affected by mMAN lesions, if for example the relatively fixed sequences within chunks are determined within premotor song nucleus HVC and inputs to HVC are only relevant at variable branch points. We found that the relatively fixed sequences within chunks were altered after mMAN lesions. An example bird (Figure 3A-C) exhibited one prominent chunk which was occasionally observed to ‘break’ after lesions, i.e., branching within the chunk increased to a degree where it would no longer be considered a chunk after the lesions. The branching increased most noticeably in the ‘l-f’ transition (Figure 3C), which is characterized by a longer gap than other transitions within the chunk (Figure 3A). We therefore wondered whether sequencing changes were related to gap durations and found that gap durations within chunks were not significantly correlated with the increase in transition entropy at the corresponding transitions (Figure 3-figure supplement 1A). Overall, we found that changes within chunks were approximately the same magnitude as changes at branch points (Figure 3-figure supplement 1B).
We quantified changes in within-chunk transitions using sequence consistency (Scharff and Nottebohm, 1991), a measure previously used to describe the relatively consistent sequence of zebra finch song. Sequence consistency measures how consistently the most probable sequence is followed (see Methods). Sequence consistency within chunks significantly decreased across 23 chunks from all birds, indicating that this aspect of song structure was affected by mMAN lesions (Figure 3D).
Repeat numbers became more variable after mMAN lesions
We next tested how mMAN lesions affected repeat phrases. Repeat phrases might be governed by separate neural mechanisms than branch points (Fujimoto, Hasegawa and Watanabe, 2011; Jin and Kozhevnikov, 2011; Wittenbach et al., 2015; Zhang et al., 2017). In our dataset of 7 birds, only 5 birds had songs which contained repeat phrases.
In the example bird in Figure 4A,B, the average repeat number for the indicated syllable increased after mMAN lesions. Across birds, the mean repeat number did not change consistently, although individual birds could exhibit quite dramatic effects on repeat number, (Figure 4B,C; p>0.05, n=6, Wilcoxon signed rank test). The distributions of repeat numbers became wider and the coefficient of variation for repeat numbers increased significantly after mMAN lesions (Figure 4D; * p<0.05, n=6, Wilcoxon signed rank test).
Discussion
Bird songs are vocal sequences composed of individual syllables which appear in sequential order dictated by syntactic rules, but the neuronal control of song syntax is largely unknown. Previous studies in zebra finches showed that mMAN lesions in adult birds had minimal effects on song production (Foster and Bottjer, 2001; Horita, Wada and Jarvis, 2008; Ali et al., 2013). We speculated that the modest effect of lesions in adult zebra finches might be due to the relatively stereotyped nature of zebra finch song. Consistent with this possibility, our results demonstrate that lesions of pallial nucleus mMAN affect the more complex and variable song syntax in Bengalese finches. We showed that while mMAN lesions did not abolish the production of song itself or grossly disrupt the acoustic structure of individual syllables, they increased the variability of syllable sequencing; the transition probabilities between syllables became less predictable (increased transition entropy), new branches were introduced into previously stereotyped chunks, and the number of repetitions of repeated syllables (repeat number) became more variable. These results indicate that mMAN is involved in controlling aspects of song syntax in a songbird species that exhibits complex syllable sequencing and suggest that mMAN might play a similar or prominent role in additional species with even greater song complexity, such as canaries or nightingales.
Song in passerine birds relies on a network of recurrently connected nuclei, with different pathways for song production and learning. The song motor pathway consists of primary motor nucleus RA, which projects directly to vocal motor nuclei in the brainstem, and is involved in the moment-by-moment control of the structure of individual syllables (Vu, Mazurek and Kuo, 1994; Sober, Wohlgemuth and Brainard, 2008; Miller, Cheung and Brainard, 2017). Neurons in premotor song nucleus HVC that project to RA have been shown to encode syllable identity and contribute to syllable timing (Simpson and Vicario, 1990; Vu, Mazurek and Kuo, 1994; Hahnloser, Kozhevnikov and Fee, 2002; Long and Fee, 2008; Aronov et al., 2011; Ölveczky et al., 2011; Zhang et al., 2017). Consistent with a possible role of HVC in also controlling variable syllable sequencing, cooling HVC influences syntax in Bengalese finches (Zhang et al., 2017). However, lesions of one input to HVC – the nucleus Nif (nucleus interfacialis of the anterior nidopallium) – can also influence syllable sequencing in the Bengalese finch by making the sequence more stereotyped (Hosino and Okanoya, 2000; Cardin, Raksin and Schmidt, 2005; Otchy et al., 2015) raising the question of how much of the control of syntax relies on the internal dynamics of HVC, versus activity relayed to HVC by its inputs (Okanoya and Yamaguchi, 1997).
In this study, we were especially interested in the possibility that mMAN’s inputs to HVC might play an analogous role in modulating syllable sequencing to the role of lMAN’s inputs to RA in modulating syllable acoustic structure. In adult birds, lMAN’s inputs to RA have been shown to contribute variability to the pitch of individual syllables (Kao, Doupe and Brainard, 2005; Kao and Brainard, 2006; Stepanek and Doupe, 2010), regulate changes to pitch variability between different social contexts (Kao and Brainard, 2006; Hampton, Sakata and Brainard, 2009), and bias pitch in the direction of improved performance during learning experiments (Andalman and Fee, 2009; Warren et al., 2011; Tian and Brainard, 2017). The variability contributed by lMAN may facilitate motor exploration which forms the basis of pitch adaptations during learning (Andalman and Fee, 2009; Sober and Brainard, 2012; Dhawale, Smith and Ölveczky, 2017). These effects are mediated by top-down influences of lMAN on RA via a recurrent loop that passes through the songbird basal ganglia (Kao, Doupe and Brainard, 2005; Andalman and Fee, 2009; Kojima et al., 2018; Tian, Warren and Brainard, 2022), and are consistent with a role of RA in controlling aspects of syllable acoustic structure. In contrast to these effects on syllable structure, lesions of lMAN in adults have not been found to appreciably influence syllable sequencing (Hampton, Sakata and Brainard, 2009). However, because mMAN forms the output of a parallel recurrent loop that passes through the basal ganglia and projects to HVC, we hypothesized that analogous modulation of syllable sequencing during production and learning of song might be mediated by mMAN. Our finding that lesions of mMAN alter multiple aspects of syllable sequencing are consistent with such an influence on song syntax.
The finding that mMAN can influence song syntax raises the question of whether and how this pathway modulates syllable sequencing in the service of motor exploration, contextual modulation, and learning, in a fashion that is analogous to lMAN’s modulation of syllable acoustic structure. That mMAN lesions consistently altered sequence variability indicates a possible role in regulating variability for motor exploration. However, whereas lMAN lesions consistently decrease the variability of acoustic structure, we found that mMAN lesions generally increase the variability of syllable sequencing, suggesting that top-down contributions to the control of syntax may be different or more distributed than for a syllable feature such as pitch. One possibility is that several inputs to HVC could work synergistically to modulate the syntactical structure of Bengalese finch songs. Consistent with this view, a prior study in Bengalese finches found that lesions of Nif decreased sequence variability (Hosino and Okanoya, 2000), suggesting that Nif and mMAN could be playing opposing roles in regulating variability. Our findings also raise the question of whether the medial anterior forebrain pathway including mMAN, plays a similar role in learning to change syllable sequences during development (Foster and Bottjer, 2001; Lipkind et al., 2013) or adulthood (Warren et al., 2012; Veit et al., 2021). In our study, we only recorded song sequencing of male Bengalese finches singing in isolation. Social context, such as female-directed song, can also change song sequencing (Hampton, Sakata and Brainard, 2009; Chen, Matheson and Sakata, 2016). It would be interesting to test whether mMAN plays a role in the social context-modulated changes in sequencing (Jarvis et al., 1998), similar to how lMAN contributes to social context-modulated changes in syllable structure (Sakata, Hampton and Brainard, 2008). Further studies with careful manipulations of the basal ganglia-mMAN-HVC circuit will be required to elucidate the mechanism with which this circuit affects sequencing in Bengalese finch song and contributes to these behaviors.
Bengalese finch syllable sequences contain structure at multiple levels, mainly the organization of individual stereotyped chunks, resembling the stereotyped motifs of zebra finch song, and the variable branch points between chunks, repeat phrases, and other individual syllables. One possibility to reconcile findings that both dynamics within HVC and its recurrent inputs affect sequencing could be that the sequence within chunks is encoded predominantly in the synaptic connections of HVC neurons (Jin, 2009), while the branching transitions found in Bengalese finch song are influenced by input from other nuclei (Hosino and Okanoya, 2000). We here show that mMAN lesions affect both transition entropy at previously variable branch points and transitions within previously stereotyped chunks, indicating that such a simple split of function between mMAN and HVC is not the case.
Besides these, other parts of the song circuit have been shown to affect syllable sequencing. The basal ganglia are involved in selection and sequencing of motor elements in mammals, and manipulations of songbird basal ganglia Area X have previously been shown to affect syllable repetitions in zebra finches and Bengalese finches (Kobayashi, Uno and Okanoya, 2001; Kubikova et al., 2014; Tanaka et al., 2016; Xiao et al., 2021). These effects might be partly mediated via connections through mMAN to HVC (Kubikova, Turner and Jarvis, 2007). Additionally, the control of syllable sequencing must include bilateral coordination of the motor pathways in the two hemispheres (Schmidt, Ashmore and Vu, 2004; Wang et al., 2008). As the two hemispheres in birds are not connected by a corpus callosum, the only pathways mediating interhemispheric synchronization of the two HVCs are by bilateral connections through thalamic nuclei. One such pathway involves a projection from thalamic nucleus Uva to HVC (Coleman and Vu, 2005; Hamaguchi, Tanaka and Mooney, 2016; Moll et al., 2023), which is required for song production and contributes to syllable initiation and timing. Another is through thalamic nucleus DMP, which projects bilaterally through mMAN to HVC (Schmidt, Ashmore and Vu, 2004; Seki and Okanoya, 2008). mMAN might therefore be well situated to coordinate syllable sequencing in the two hemispheres. Future studies will need to determine whether the effects we observed are the results of removing neuronal processing within mMAN itself, or whether mMAN is conveying signals to HVC from other brain areas, including the contralateral hemisphere, which contribute to the control of song syntax.
Acknowledgements
We thank Jacqueline Göbl, Lioba Fortkord, Yarden Cohen, Tim Gardner, Felix Moll for helpful discussions and comments on earlier versions of this manuscript. We thank David Nicholson and Peter Pilz for help with technical questions. Michael Brainard was supported by the Howard Hughes Medical Institute. Lena Veit was supported as a Howard Hughes Medical Institute Fellow of the Life Sciences Research Foundation and by a Daimler Benz postdoctoral fellowship. Avani Koparkar was supported by the Studienstiftung des deutschen Volkes and International Max Planck Research School for Mechanisms of Mental Function and Dysfunction (IMPRS-MMFD). Jonathan Charlesworth and Timothy Warren were supported by the National Science Foundation (NSF) Graduate Research Fellowship. We acknowledge support from the Open Access Publication Fund of the University of Tübingen.
Competing interests
The authors declare no competing interests.
Supplementary Figs
References
- ‘Basal Ganglia Neural Coding of Natural Action Sequences’, in The Basal Ganglia VIAdvances in Behavioral Biology :279–287https://doi.org/10.1007/978-1-4615-0179-4_28
- The Basal Ganglia Is Necessary for Learning Spectral, but Not Temporal, Features of BirdsongNeuron 80:494–506https://doi.org/10.1016/j.neuron.2013.07.049
- A basal ganglia-forebrain circuit in the songbird biases motor output to avoid vocal errorsProceedings of the National Academy of Sciences 106:12518–12523https://doi.org/10.1073/pnas.0903214106
- Two Distinct Modes of Forebrain Circuit Dynamics Underlie Temporal Patterning in the Vocalizations of Young SongbirdsJournal of Neuroscience 31:16353–16368https://doi.org/10.1523/JNEUROSCI.3009-11.2011
- Songs to syntax: the linguistics of birdsongTrends in Cognitive Sciences 15:113–21https://doi.org/10.1016/j.tics.2011.01.002
- Sensorimotor Nucleus NIf Is Necessary for Auditory Processing But Not Vocal Motor Output in the Avian Song SystemJournal of Neurophysiology 93:2157–2166https://doi.org/10.1152/jn.01001.2004
- Analysing sequences of behavioural eventsJournal of Theoretical Biology 29:427–445https://doi.org/10.1016/0022-5193(70)90107-4
- Mechanisms underlying the social enhancement of vocal learning in songbirdsProceedings of the National Academy of Sciences 113:6641–6646https://doi.org/10.1073/pnas.1522306113
- Hidden neural states underlie canary song syntaxNature 582:539–544https://doi.org/10.1038/s41586-020-2397-3
- Automated annotation of birdsong with a neural network that segments spectrogramseLife 11https://doi.org/10.7554/eLife.63853
- Recovery of impaired songs following unilateral but not bilateral lesions of nucleus uvaeformis of adult zebra finchesJournal of Neurobiology 63:70–89https://doi.org/10.1002/neu.20122
- Wild nightingales flexibly match whistle pitch in real timeCurrent Biology :1–10https://doi.org/10.1016/j.cub.2023.06.044
- The Role of Variability in Motor LearningAnnual Review of Neuroscience 40:479–498https://doi.org/10.1146/annurev-neuro-072116-031548
- The vocal repertoire of the domesticated zebra finch: a data-driven approach to decipher the information-bearing acoustic features of communication signalsAnimal Cognition 19:285–315https://doi.org/10.1007/s10071-015-0933-6
- Lesions of a telencephalic nucleus in male zebra finches: Influences on vocal behavior in juveniles and adultsJournal of Neurobiology 46:142–165https://doi.org/10.1002/1097-4695(20010205)46:2<142::AID-NEU60>3.0.CO;2-R
- Neural coding of syntactic structure in learned vocalizations in the songbirdJournal of Neuroscience 31:10023–10033https://doi.org/10.1523/JNEUROSCI.1606-11.2011
- An ultra-sparse code underlies the generation of neural sequences in a songbirdNature 419:65–70https://doi.org/10.1038/nature00974
- A Distributed Recurrent Network Contributes to Temporally Precise VocalizationsNeuron 91:680–693https://doi.org/10.1016/j.neuron.2016.06.019
- An avian basal ganglia-forebrain circuit contributes differentially to syllable versus sequence variability of adult Bengalese finch songJournal of Neurophysiology 101:3235–3245https://doi.org/10.1152/jn.91089.2008
- Acoustical and Syntactical Comparisons between Songs of the White-backed Munia (Lonchura striata) and Its Domesticated Strain, the Bengalese Finch (Lonchura striata var. domestica)Zoological Science 16:319–326https://doi.org/10.2108/zsj.16.319
- Early onset of deafening-induced song deterioration and differential requirements of the pallial-basal ganglia vocal pathwayEuropean Journal of Neuroscience 28:2519–2532https://doi.org/10.1111/j.1460-9568.2008.06535.x
- Lesion of a higher-order song nucleus disrupts phrase level complexity in Bengalese finchesNeuroReport 11:2091–5https://doi.org/10.1097/00001756-200007140-00007
- Long-term persistence of song performance rules in nightingales (Luscinia megarhynchos): a longitudinal field study on repertoire size and compositionBehaviour 141:371–390https://doi.org/10.1163/156853904322981914
- Manipulations of inhibition in cortical circuitry differentially affect spectral and temporal features of Bengalese finch songJournal of Neurophysiology 123:815–830https://doi.org/10.1152/jn.00142.2019
- The Syntactic Organization of Bird SongBiology Bulletin 49:1158–1170https://doi.org/10.1134/S1062359022080076
- Acetylcholine acts on songbird premotor circuitry to invigorate vocal outputeLife 9https://doi.org/10.7554/eLife.53288
- For Whom The Bird Sings: context-dependent gene expressionNeuron 21:775–788https://doi.org/10.1016/S0896-6273(00)80594-2
- Generating variable birdsong syllable sequences with branching chain networks in avian premotor nucleus HVCPhysical Review E 80https://doi.org/10.1103/PhysRevE.80.051902
- A Compact Statistical Model of the Song Syntax in Bengalese FinchPLOS Computational Biology 7https://doi.org/10.1371/journal.pcbi.1001108
- Lesions of an Avian Basal Ganglia Circuit Prevent Context-Dependent Changes to Song VariabilityJournal of Neurophysiology 96:1441–1455https://doi.org/10.1152/jn.01138.2005
- Contribution of an avian basal ganglia-forebrain circuit to real-time modulation of songNature 433:638–643https://doi.org/10.1038/nature03127
- Complex Sequencing Rules of Birdsong Can be Explained by Simple Hidden Markov ProcessesPLOS ONE 6https://doi.org/10.1371/journal.pone.0024516
- A simple explanation for the evolution of complex song syntax in Bengalese finchesBiology Letters 9:3–7https://doi.org/10.1098/rsbl.2013.0842
- Partial lesions in the anterior forebrain pathway affect song production in adult Bengalese finchesNeuroReport 12:353–358https://doi.org/10.1097/00001756-200102120-00034
- The Avian Basal Ganglia Are a Source of Rapid Behavioral Variation That Enables Vocal Motor ExplorationJournal of Neuroscience 38:9635–9647https://doi.org/10.1523/JNEUROSCI.2915-17.2018
- Basal ganglia function, stuttering, sequencing, and repair in adult songbirdsScientific Reports 4https://doi.org/10.1038/srep06590
- The pallial basal ganglia pathway modulates the behaviorally driven gene expression of the motor pathwayEuropean Journal of Neuroscience 25:2145–2160https://doi.org/10.1111/j.1460-9568.2007.05368.x
- ‘The problem of serial order in behavior.’, in Cerebral Mechanisms in BehaviorOxford: Bobbs-Merrill :112–146
- Stepwise acquisition of vocal combinatorial capacity in songbirds and human infantsNature 498:104–108https://doi.org/10.1038/nature12173
- Using temperature to analyse temporal dynamics in the songbird motor pathwayNature 456:189–194https://doi.org/10.1038/nature07448
- Rhythmic Continuous-Time Coding in the Songbird Analog of Vocal Motor CortexNeuron 90:877–892https://doi.org/10.1016/j.neuron.2016.04.021
- Neuronal control of bird song productionJournal of Neuroscience 7:23–39https://doi.org/10.1523/JNEUROSCI.07-01-00023.1987
- UMAP: Uniform Manifold Approximation and ProjectionJournal of Open Source Software 3https://doi.org/10.21105/joss.00861
- Vocal learning promotes patterned inhibitory connectivityNature Communications 8https://doi.org/10.1038/s41467-017-01914-5
- Thalamus drives vocal onsets in the zebra finch courtship songNature 616https://doi.org/10.1038/s41586-023-05818-x
- Measuring context dependency in birdsong using artificial neural networksPLOS Computational Biology 17https://doi.org/10.1371/journal.pcbi.1009707
- Central control of song in the canary, Serinus canariusJournal of Comparative Neurology 165:457–486https://doi.org/10.1002/cne.901650405
- Adult bengalese finches (Lonchura striata var.domestica) require real-time auditory feedback to produce normal song syntaxJournal of Neurobiology 33:343–356https://doi.org/10.1002/(SICI)1097-4695(199710)33:4<343::AID-NEU1>3.0.CO;2-A
- Changes in the neural control of a complex motor sequence during learningJournal of Neurophysiology 106:386–397https://doi.org/10.1152/jn.00018.2011
- Acute off-target effects of neural circuit manipulationsNature 528:358–363https://doi.org/10.1038/nature16442
- Population-Level Representation of a Temporal Sequence Underlying Song Production in the Zebra FinchNeuron 90:866–876https://doi.org/10.1016/j.neuron.2016.02.016
- Behavioral and Neural Signatures of Readiness to Initiate a Learned Motor SequenceCurrent Biology 23:87–93https://doi.org/10.1016/j.cub.2012.11.040
- Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoiresPLOS Computational Biology 16:1–48https://doi.org/10.1371/journal.pcbi.1008228
- Real-time contributions of auditory feedback to avian vocal motor controlJournal of Neuroscience 26:9619–9628https://doi.org/10.1523/JNEUROSCI.2027-06.2006
- Online contributions of auditory feedback to neural activity in avian song control circuitryJournal of Neuroscience 28:11378–11390https://doi.org/10.1523/JNEUROSCI.3254-08.2008
- Social Modulation of Sequence and Syllable Variability in Adult BirdsongJournal of Neurophysiology 99:1700–1711https://doi.org/10.1152/jn.01296.2007
- A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learningJournal of Neuroscience 11:2896–913https://doi.org/10.1523/JNEUROSCI.11-09-02896.1991
- ‘Bilateral control and interhemispheric coordination in the avian song motor system’, in Annals of the New York Academy of SciencesNew York Academy of Sciences :171–186https://doi.org/10.1196/annals.1298.014
- Functional evidence for internal feedback in the songbird brain nucleus HVCNeuroReport 19:679–682https://doi.org/10.1097/WNR.0b013e3282fb7dae
- Brain pathways for learned and unlearned vocalizations differ in zebra finchesJournal of Neuroscience 10:1541–1556https://doi.org/10.1523/JNEUROSCI.10-05-01541.1990
- Vocal learning is constrained by the statistics of sensorimotor experienceProceedings of the National Academy of Sciences 109:21099–21103https://doi.org/10.1073/pnas.1213622109
- Central contributions to acoustic variation in birdsongJournal of Neuroscience 28:10370–10379https://doi.org/10.1523/JNEUROSCI.2448-08.2008
- Activity in a Cortical-Basal Ganglia Circuit for Song Is Required for Social Context-Dependent Vocal VariabilityJournal of Neurophysiology 104:2474–2486https://doi.org/10.1152/jn.00977.2009
- Perceptual chunking in the self-produced songs of Bengalese finches (Lonchura striata var. domestica)Animal Cognition 13:515–523https://doi.org/10.1007/s10071-009-0302-4
- Focal expression of mutant huntingtin in the songbird basal ganglia disrupts cortico-basal ganglia networks and vocal sequencesProceedings of the National Academy of Sciences 113:E1720–E1727https://doi.org/10.1073/pnas.1523754113
- Sequential Organization of Multiple Movements: Involvement of Cortical Motor AreasAnnual Review of Neuroscience 24:631–651https://doi.org/10.1146/annurev.neuro.24.1.631
- A procedure for an automated measurement of song similarityAnimal Behaviour 59:1167–1176https://doi.org/10.1006/anbe.1999.1416
- Discrete Circuits Support Generalized versus Context-Specific Vocal Learning in the SongbirdNeuron 96:1168–1177https://doi.org/10.1016/j.neuron.2017.10.019
- Dynamic top-down biasing implements rapid adaptive changes to individual movementsbioRxiv https://doi.org/10.1101/2022.06.16.496455
- Performance variability enables adaptive plasticity of “crystallized” adult birdsongNature 450:1240–4https://doi.org/10.1038/nature06390
- Reafferent thalamo- “cortical” loops in the song system of oscine songbirdsJournal of Comparative Neurology 380:275–90
- Songbirds can learn flexible contextual control over syllable sequencingeLife 10:1–19https://doi.org/10.7554/eLife.61610
- Identification of a forebrain motor programming network for the learned song of zebra finchesJournal of Neuroscience https://doi.org/10.1523/jneurosci.14-11-06924.1994
- A Neural Code That Is Isometric to Vocal Output and Correlates with Its Sensory ConsequencesPLOS Biology 14:1–21https://doi.org/10.1371/journal.pbio.2000317
- Rapid interhemispheric switching during vocal production in a songbirdPLOS Biology 6:2154–2162https://doi.org/10.1371/journal.pbio.0060250
- Transcriptional regulatory divergence underpinning species-specific learned vocalization in songbirdsPLOS Biology 17https://doi.org/10.1371/journal.pbio.3000476
- Mechanisms and time course of vocal learning and consolidation in the adult songbirdJournal of Neurophysiology 106:1806–1821https://doi.org/10.1152/jn.00311.2011
- Variable sequencing is actively maintained in a well learned motor skillJournal of Neuroscience 32:15414–15425https://doi.org/10.1523/JNEUROSCI.1254-12.2012
- Characterization of Synaptically Connected Nuclei in a Potential Sensorimotor Feedback Pathway in the Zebra Finch Song SystemPLOS ONE 7https://doi.org/10.1371/journal.pone.0032178
- Mapping Sub-Second Structure in Mouse BehaviorNeuron 88:1121–1135https://doi.org/10.1016/J.NEURON.2015.11.031
- An Adapting Auditory-motor Feedback Loop Can Contribute to Generating Vocal RepetitionPLOS Computational Biology 11https://doi.org/10.1371/journal.pcbi.1004471
- Linked control of syllable sequence and phonology in birdsongJournal of Neuroscience 30:12936–12949https://doi.org/10.1523/JNEUROSCI.2690-10.2010
- Expression of FoxP2 in the basal ganglia regulates vocal motor sequences in the adult songbirdNature Communications 12https://doi.org/10.1038/s41467-021-22918-2
- The Zebra Finch: A Synthesis of Field and Laboratory StudiesOxford ornithology series. Oxford University Press
- Temperature manipulation in songbird brain implicates the premotor nucleus HVC in birdsong syntaxJournal of Neuroscience 37:2600–2611https://doi.org/10.1523/JNEUROSCI.1827-16.2017
Article and author information
Author information
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Copyright
© 2024, Koparkar et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 1,134
- downloads
- 89
- citations
- 2
Views, downloads and citations are aggregated across all versions of this paper published by eLife.