Abstract
Age-related hearing loss is a complex phenomenon. The earliest-onset degenerative event is the gradual loss of neural connections between cochlea and auditory brainstem. To probe for perceptual deficits that might arise from this loss, cochlear synaptopathy was induced pharmacologically in young-adult gerbils which were then tested in a challenging listening task for the perception of temporal fine structure. Treated gerbils behaved no differently than normal-hearing, young-adult animals. In contrast, old gerbils, which typically express many cochlear and central-neural pathologies, showed impaired perception. To probe for the underlying mechanisms, single-unit responses were obtained from the auditory nerve to the same test stimuli. Responses from old gerbils showed no impairment in temporal locking to the stimulus fine structure. However, responses were significantly more driven by slower temporal fluctuations of the stimulus envelope, suggesting that the central auditory system may be unable to extract the relevant information for discrimination from such altered inputs.
Introduction
Many elderly people have difficulties understanding speech in noisy environments, despite having normal tone-detection thresholds in quiet. This condition of compromised supra-threshold perception, for example of speech in background noise (e.g., Füllgrabe et al. 2015, Gómez-Álvarez et al. 2023, Lopez-Poveda 2014, Moore 2014, Parthasarathy et al. 2020), despite an absence of detectable hearing deficits in the pure-tone audiogram in quiet, is referred to as hidden hearing loss (HHL, Schaette and McAlpine 2011). It has been suggested that HHL is associated with a reduced number of synapses between auditory-nerve fibers and inner hair cells, which is commonly referred to as cochlear synaptopathy (e.g., Bharadwaj et al. 2014, Kujawa and Liberman 2009, Liberman 2017, Liberman and Kujawa 2017). Synaptopathy is one of the aging processes in the cochlea, and, in addition, can be caused by excessive noise exposure, both accumulating during a human’s life time (e.g., Liberman et al. 2016, Wu et al. 2019, but see Johannesen et al. 2019). Animal experiments demonstrate that exposure to loud sounds can produce such synaptopathy, even without permanent loss in hearing sensitivity (e.g., Kujawa and Liberman 2009). Furthermore, the number of functional synapses decreases with increasing age (e.g., gerbil: Bovee et al. 2024, Gleich et al. 2016, Steenken et al. 2021; mouse: Jeng et al. 2020, Parthasaraty and Kujawa 2018, Sergeyenko et al. 2013; rat: Cai et al. 2018; human: Viana et al. 2015).
Synaptopathy is postulated to be causal for compromised processing of temporal stimulus features (e.g., Parthasarathy and Kujawa 2018), especially of the temporal fine structure (TFS) of sounds (reviewed in Moore 2019), resulting in difficulties of speech comprehension in background noise. Specific psychophysical tests that evaluate the performance to distinguish between stimuli that differ in TFS have been developed. These tasks are then expected to be sensitive to synaptopathy (for review see Moore 2014). The most commonly used test is the TFS1 test (Moore and Sek 2009) which probes the discrimination of harmonic and inharmonic tone complexes that strongly differ in TFS but only marginally in the envelope of the waveform. The previously found link between diminished perception of stimuli differing in TFS (Moore et al. 2006) and SNHL in human patients has also been found in TFS1 test results (Mathew et al. 2015). Furthermore, a correlation between age and TFS1 sensitivity has been observed (e.g., Eipert et al. 2019, Füllgrabe et al. 2015, Moore et al. 2012).
Whereas post-mortem studies of human subjects have provided evidence for age-related synapse loss (Viana et al. 2015, Wu et al. 2019), it has been impossible to relate this loss to HHL; that is, deficits in supra-threshold sound perception of living subjects. Animal studies have suggested a close correlation between the number of surviving synapses and supra-threshold amplitude growth of auditory brainstem responses (ABRs) or compound action potentials (e.g., Bourien et al. 2014, Bramhall et al. 2018, Buran et al. 2010, Kujawa and Liberman 2009, Lin et al. 2011, Sergeyenko et al., 2013, Shi et al. 2016). A number of attempts have been made to infer the synapse loss in humans from non-invasive physiological measures, such as ABR, and psychophysical measures of the processing of sounds. However, even studies involving large samples of human subjects have not produced strong evidence that synaptopathy in humans has functional consequences for supra-threshold temporal perception (e.g., Bramhall et al. 2019, Carcagno and Plack 2022, Guest et al. 2018). Attempts to relate life-time noise exposure of human subjects to their supra-threshold perceptual deficits have also not been conclusive (e.g., Prendergast et al. 2017, 2019, Füllgrabe et al. 2020). Controlled animal studies are one strategy for testing the influence of synapse loss on discrimination of complex sounds that differ in TFS. In those, synaptopathy can be experimentally manipulated and the degree of cochlear dysfunction can be related to neural responses and performance in behavioral tests, using the same stimuli. With this approach we can test the hypothesis whether synaptopathy negatively affects the discrimination of stimuli that differ in their TFS.
Animal studies relating behavioral measures of performance in response to a variety of stimuli to synaptopathy have so far also provided mixed results (reviewed in Henry 2022). An investigation by Henry and Abrams (2021), which experimentally induced synaptopathy in budgerigars by kainic acid infusions at the round window, failed to demonstrate a relationship between the amplitude of the compound action potential (taken as a measure of cochlear synaptopathy) and tone-detection thresholds in noise. Studies in Macaque monkeys that produced a noise-induced hearing loss (NIHL) by acoustic trauma demonstrate differences between normal-hearing and acoustically traumatized monkeys for a) tone detection in noise (Burton et al. 2020), b) masking release for tone detection in modulated maskers compared to unmodulated maskers, and c) spatial release from masking for tone detection in noise (Mackey et al. 2021). However, these studies used high trauma levels and long exposure durations. Thus, effects were not limited to synapse loss at the inner hair cell but included both outer- and inner-hair-cell damage. Studies in chinchilla auditory-nerve (AN) fibers, involving normal-hearing subjects and subjects with NIHL, did reveal differences in the neural envelope representation but failed to demonstrate differences in the neural representation of TFS that match the psychophysical effects observed in human subjects (e.g., Henry and Heinz 2012, Kale and Heinz 2010, Kale et al. 2014).
To clarify the relation between synaptopathy, AN fiber responses, and behavior, we embarked on a study in Mongolian gerbils (Meriones unguiculatus) that combined behavioral measurements of sensitivity in the TFS1 test and the representation of harmonic and inharmonic complex tones with the same fundamental frequency (f0) by AN fibers. In addition, synaptopathy of many of the experimental subjects was characterized by directly counting the numbers of functional synapses on inner hair cells, and these measures were related to sensitivity in the TFS1 test. Because compromised performance in the TFS1 test has been observed in elderly humans, we compared the perception of young-adult and old gerbils. In order to experimentally induce synaptopathy in young-adult gerbils and, thus dissociate the effects of synaptopathy from effects of age, we treated young-adult gerbils with an infusion of ouabain at the round window. This treatment has been shown to reduce the number of synapses on gerbil inner hair cells (Bourien et al. 2014). We observed a lower perceptual sensitivity of old compared to young-adult gerbils in the TFS1 test. However, contrary to our hypothesis, ouabain treatment did not affect the performance in the TFS1 task, and the responses of AN fibers that survived ouabain treatment did not differ from gerbils that were not treated.
Results
Synapse loss was similar in old gerbils and gerbils treated with a high dosage of ouabain
Synapses were counted at the cochlear location corresponding to 2 kHz in 54 gerbils, divided into the five gerbil groups, as listed in Table 1, row 4. Young-adult, untreated gerbils had, on average, 23 (SE ±0.5) synapses per inner hair cell (Fig. 1). Synapse numbers were significantly different between gerbil groups (univariate ANOVA: F = 5.496, p = 0.001). Compared to untreated young adults, the average number of synapses per inner hair cell in gerbils treated with 70 µM ouabain and in old gerbils was significantly reduced, to 19 (SE ±1.4) and 19 (SE ±0.7), respectively (Bonferroni corrected pairwise comparisons: young-adult vs. 70 µM [p = 0.011]; young-adult vs. old [p = 0.002]). No significant difference between ears treated with a high dose of ouabain and old ears was found, indicating that both animal groups had similar degrees of synaptopathy in the region of interest. In surgery-only ears and ears treated with 40 µM ouabain, average synapse numbers (surgery only: 23 (SE ±0.5), 40 µM ouabain: 22 (SE ±0.7)) were not significantly different compared to untreated young-adult ears, suggesting no effect of the surgical procedure itself and no effect of the lower ouabain dose. These results were also consistent across more basal cochlear locations (up to 8 kHz equivalent frequency, see Supplemental Materials). Based on this, we pooled the single-unit data from surgery-only and 40 µM ouabain treated gerbils into a “sham” group. The gerbils treated with 70 µM ouabain are henceforth referred to simply as “ouabain”.
Average rate does not carry sufficient information about inharmonic frequency shift
Spike responses to TFS1 stimuli of 94 AN fibers from 26 animals were recorded (for details see Table 1 in Materials and Methods). BFs ranged from 470 Hz to 11,528 Hz, with a median of 3,995 Hz. The SRs of these fibers ranged from 0 sp/s to 144 sp/s with a median of 60 sp/s. Average stimulated rates (4.8 sp/s - 223 sp/s, median 135 sp/s) and driven rates (i.e., average rate minus SR, 4.8 sp/s - 215 sp/s, median 67.6 sp/s) were calculated from the spike counts across the stimulus duration of 375 ms, excluding on- and offset ramps.
Most importantly, the inharmonic frequency shift Δf% - as a covariate in the ANOVA - had no significant effect on mean driven rate [F(1;943) = 0.018, p = 0.8933] (Fig. 2, overlapping colored and black distribution curves in each panel). The overall distribution of driven rate also did not change substantially with Δf%. Thus, the inharmonic frequency shift had no effect on the mean driven rate, and AN spike rate was not a potential cue for the gerbil in the behavioral task.
Beyond this robust, basic result, there were differences in discharge rates between the animal groups and stimulus conditions that might reflect different sampling biases of single units in the different experimental groups (see Supplemental Material for details).
Neural representation of TFS was not degraded in AN fibers of old or synaptopathic gerbils
The same set of AN fibers was probed for the neural responses to the fine structure and envelope of TFS1 stimuli (for details see Table 1) using the TFS log-z-ratio (see section “Neural recordings - Data Analysis”). A high TFS log-z-ratio indicates strong phase locking to the stimulus TFS. The main findings are listed below; for more details see Supplemental Material. Importantly, the inharmonic frequency shift, Δf%, as a covariate in the ANOVA, affected the TFS log-z-ratio significantly and strongly [main effect: F(1;943) = 104.403, p = 2.569×10-23], suggesting that AN fibers followed the fine structure of the stimulus and thus represented the frequency shift in their spiking pattern (Fig. 3A-J, L). Δf% significantly interacted with stimulus condition and BF-class [interaction: F(2;943) = 20.292, p = 2.353×10-9; F(1;943) = 31.050, p = 3.282×10-8, respectively]. In response to 400/1600 Hz, the shifts affected the temporal representation most (i.e., higher Δf% resulted in higher TFS log-z-ratios). Additionally, TFS log-z-ratios of low-BF fibers were influenced more by the inharmonic shifts than those of high-BF fibers (again, higher Δf% result in higher TFS log-z-ratios).
TFS representation by the TFS log-z-ratio metric was not significantly different between AN fibers from different gerbil groups (p = 0.154). Also, no interaction between group and Δf% occurred. Together, this indicates that aged fibers (Fig. 3D, H, L) and fibers that survived the treatment with ouabain (Fig. 3C, G) had responses that did not differ from young-adult (Fig. 3A, E, I) and sham-treated fibers (Fig. 3B, F, J). As expected, TFS log-z-ratios of AN fibers responding to the stimulus condition with higher fc were lower, compared to those responding to stimuli with lower fc (200/1600 Hz: 0.987, 400/1600 Hz: 1.111, 400/3200 Hz: 0.394; [main effect: F(2;943) = 7.495, p = 5.894×10-4]). This reflects the known decrease in phase-locking with increasing stimulus frequency (Versteegh et al., 2011).
Representation of f0 was enhanced in AN fibers of old gerbils
As a metric for the relative representation of TFS vs. stimulus envelope, the ENV/TFS log-z-ratio was used (see section “Neural recordings - Data Analysis”). This metric reflects the relative emphasis in the AN fibers’ responses of either TFS (negative ENV/TFS log-z-ratio) or envelope locking (positive ENV/TFS log-z-ratio).
Here, over all animal groups, Δf% had a marginally significant effect on the ENV/TFS log-z-ratio [main effect: F(1;943) = 4.278, p = 0.039] and stimulus condition had a highly significant effect on the ENV/TFS log-z-ratio [main effect: F(2;943) = 43.929, p = 5.744×10-19]. Posthoc comparisons revealed that, for 400/3200 Hz, envelope representation (mean ENV/TFS log-z-ratio = 0.263) was stronger than fine structure representation, indicating stronger phase-locking of AN fibers to low-frequency stimulus components. Furthermore, stimulus condition
Over all animal groups, low-SR fibers had, on average, positive ENV/TFS log-z-ratios, that is, phase-locked more strongly to the stimulus envelope, whereas high-SR fibers showed, on average, negative ENV/TFS log-z-ratios, that is, phase-locked more strongly to the TFS [main effect: F(1;943) = 29.138, p = 8.533×10-8].
Age, but not synapse loss, affected behavioral discrimination of TFS1 stimuli
To investigate the effect of synapse loss on performance in the TFS1 task, the behavior of young-adult gerbils, without and with cochlear synaptopathy, and old gerbils was compared for the different stimulus conditions defined by the combinations of f0 and fc. The sensitivity of the four groups of gerbils in the different stimulus conditions is illustrated in Fig. 5, in relation to the frequency shift Δf%. The sensitivity of young-adult gerbils of the different treatment groups was quite similar, whereas the old gerbils had considerably lower sensitivity. A GLMM ANOVA with the sensitivity d’ as dependent variable, condition and treatment group as factors, and frequency shift Δf% as a covariate (only trials with Δf% > 5 included, for which data from all treatment groups were obtained), revealed significant main effects of treatment group [F(3,387) = 8.18, p = 2.725×10-5], stimulus condition [F(2,387) = 22.71, p = 4.736×10-10, and Δf% [F(1,387) = 20.457, p = 8.121×10-6]. No significant interactions were observed. There was a general increase in sensitivity with increasing Δf%. Old gerbils showed lower sensitivities than most young-adult gerbils, irrespective of their treatment (all p < 0.002, Bonferroni corrected). There was no significant difference in sensitivity between the treatment groups of young gerbils. To separately investigate the effects of f0 and harmonic number, N, on sensitivity, we conducted planned comparisons in two additional GLMM ANOVAs, each containing only the data of two conditions, with the same dependent variable and factors as the initial ANOVA. The planned comparison with the data of stimulus conditions 200/1600 Hz and 400/1600 Hz was used to investigate the effect of f0 (which inherently includes a change in harmonic number, N) on sensitivity. This ANOVA revealed an effect of f0, with a higher sensitivity for increasing f0 [F(1,257) = 45.001, p = 1.248×10-10]. Furthermore, an effect of Δf% [F(1, 257) = 13.539, p = 2.847 ×10-4] was observed. Similar to the previous analysis, the main effect of gerbil group [F(3, 257) = 4.878, p = 2.570×10-3] reflected the low sensitivity of the old gerbils compared to all other groups. The second planned comparison with the data of stimulus conditions 400/3200 Hz and 400/1600 Hz, with equal f0, investigated the dependence of sensitivity on fc. An effect of fc [F(1,256) = 12.656, p = 4.459×10-4] was shown, with decreasing sensitivity for increasing fc. As in the previous comparison, this ANOVA also revealed increased sensitivity with increasing Δf% [F(1,256) = 13.484, p = 2.929×10-4], whereas the main effect of gerbil group [F(3,256) = 4.692, p = 3.299×10-3] again reflected the lower sensitivity of the old group compared to all other groups. No interactions were observed in either planned comparison.
The frequency-shift threshold reflects Δf% for a criterion of d’ = 1 and was calculated for each gerbil and condition. If no threshold could be derived because all d’ values were below 1, a threshold of 100% f0 was assumed for further statistical analysis. On average, old gerbils were only able to discriminate inharmonically shifted TFS1 stimuli for the condition 400/1600 Hz (see d’ values > 1 in Fig. 5D, H, L). All young-adult gerbil treatment groups achieved average sensitivities above threshold for all stimulus conditions.
Frequency-shift thresholds for the three different stimulus conditions are shown in Fig. 6. A GLMM ANOVA was conducted with Δf% thresholds as the dependent variable and condition and treatment group as factors. There were main effects for stimulus condition [F(2,60) = 8.99, p < 0.0005] and treatment group [F(3,60) = 6.84, p<0.0005], with no interaction between both factors. Average thresholds were not significantly different between the 200/1600 Hz and 400/3200 Hz conditions, while thresholds were lower in the 400/1600 Hz than in the 200/1600 Hz condition (p < 0.0005). The pairwise comparison revealed that old gerbils had significantly higher thresholds than gerbils of all other animal groups (all p < 0.005).
No differences were observed between the young-adult gerbil groups with different treatments. To evaluate the effect of f0 on the threshold, a planned comparison with the data of stimulus conditions 200/1600 Hz and 400/1600 Hz (i.e., harmonic numbers 8 and 4, respectively) was conducted in an additional ANOVA, with frequency-shift thresholds as dependent variable, and condition and gerbil group as factors. A main effect of condition was revealed, indicating that f0 (respectively harmonic number N at 1600 Hz fc) did affect the threshold [F(1,40) = 15.219, p = 3.577×10-4]. There was no significant effect of gerbil group; all groups showed similarly low thresholds for the 400/1600 Hz condition. A second additional ANOVA with the data of stimulus conditions 400/3200 Hz and 400/1600 Hz (i.e., harmonic numbers 8 and 4, respectively) investigated the dependency of threshold on fc (respectively harmonic number N). An effect of gerbil group [F(3,40) = 10.460, p = 3.272×10-5] and condition [F(1,40) = 16.213, p = 2.455×10- 4] was observed. The pairwise comparison revealed lower thresholds with decreasing fc (respectively harmonic number N) and a significant difference between untreated and old gerbils (all p ≤ 0.001), whereas the young-adult gerbil treatment groups did not differ.
In summary, the behavioral data showed that the old gerbils differed from all other treatment groups of young-adult gerbils. This result suggests that age considerably affected both perceptual sensitivity and thresholds while synapse numbers had little effect.
Discussion
The ability to process temporal fine structure (TFS) of sounds is a key factor for speech perception and its age-related decline has been held responsible for compromised speech comprehension in the elderly, especially in challenging noisy acoustic backgrounds (Füllgrabe et al. 2015, Moore 2014, 2019). It has been suggested that age-related or noise-induced synaptopathy results in the observed perceptual deficits regarding the TFS of sounds (e.g., Moore 2014, 2019). However, so far a direct test of this hypothesis is lacking. Here, we present data that investigate the performance of the Mongolian gerbil in the TFS1 test (Moore and Sek 2009), a psychoacoustic paradigm that has been applied in human subjects to evaluate the ability to discriminate between stimuli based on the TFS. We determined the TFS1 test performance in young-adult gerbils and compare this to the performance of old gerbils (≥ 36 months of age, corresponding to humans ≥ 60 years of age, Castaño-González et al. 2024), demonstrating an age-related decline in TFS sensitivity similar to that found in human subjects (Füllgrabe et al. 2015, Moore et al. 2012). However, young-adult gerbils with experimentally induced synaptopathy (by applying ouabain), to a degree corresponding to the synaptopathy in old gerbils (Steenken et al. 2021, this study), showed no decline in TFS sensitivity. This result suggests that reduced synapse numbers alone cannot explain the perceptual deficits for TFS in old subjects. Auditory-nerve (AN) fiber responses of young-adult, young-adult ouabain-treated, and old gerbils, elicited by the stimuli presented in the TFS1 test, indicate that the representation of TFS in AN fibers is not affected by ouabain treatment or age. In old gerbils that show compromised TFS1 test performance behaviorally, however, the representation of the signal envelope by AN fibers was enhanced compared to the other groups. We propose that this enhancement of the signal envelope cue, which does not differentiate the stimuli in the TFS1 test, may cause the perceptual deficits in old gerbils by overriding the usability of the TFS cues. Thus, cue representation may cause the perceptual deficits, but not reduced synapse numbers, as originally proposed.
Gerbil as a model for human temporal-fine-structure perception
In the TFS1 test, subjects must discriminate an inharmonic (I) complex from a harmonic (H) complex, in which the frequency difference between the components determines the period of the envelope. The I complex is created by shifting all components of the H complex by a fixed frequency. This shift results in pairs of I and H complexes with similar envelopes but different TFS. Thus, TFS is the only acoustic cue which allows to discriminate the stimuli. Excitation-pattern differences due to the shift of the frequency components in the I complex compared to the H complex can be eliminated in the TFS1 test by appropriate spectral filtering (e.g., Marmel et al. 2015). In addition, the gerbil has larger auditory filter bandwidths than human subjects (Kittel et al. 2002). Since we used a similar filter bandwidth for the gerbil as that applied in studies involving humans (e.g., Eipert et al. 2019, Marmel et al. 2015, Moore and Sek 2009), cochlear excitation pattern differences are unlikely to have provided usable cues for discrimination by the gerbils in the present study. Furthermore, the noise masker in which the I and H complexes were presented ensured that the stimuli did not produce usable distortion products for the gerbils. Thus, as in the study with human subjects, the only usable cue for the discrimination was the TFS.
Figure 7 provides an overview of gerbil TFS1 thresholds in comparison to human thresholds. Human thresholds were selected from studies involving stimuli with spectral components and fundamental frequencies (f0) similar to those presented to the gerbils in the present study, allowing a comparison of gerbil and human performance. There is a broad overlap between gerbil and human TFS1 thresholds. Both, in humans and gerbils, old subjects showed higher (i.e., worse) TFS1 thresholds than young subjects (for human studies see Eipert et al. 2019, Füllgrabe et al. 2015, Moore et al. 2012). This similarity indicates that the gerbil is a good model for investigating the mechanisms underlying TFS perception that may also apply to humans. Behavioral TFS1 thresholds in young-adult gerbils with ouabain-induced synaptopathy, similar to that typical for old gerbils, were similar to TFS1 thresholds in untreated young-adult gerbils. This result suggests that increased thresholds in old subjects were not due to the reduced number of synapses. Thus, in the gerbil, the hypothesis that age-related deterioration of TFS perception is a direct consequence of synaptopathy can be rejected.
Auditory-nerve fiber responses do not explain differences in behavioral sensitivity
Since the use of excitation patterns by the gerbil is unlikely due to its larger auditory-filter bandwidths, we explored the temporal responses of AN fibers for alternative explanations of compromised TFS perception. TFS representation requires the ability of AN fibers to phase-lock (reviewed in Joris et al. 2004, Rose et al. 1967). Here we showed that, throughout all tested gerbil groups, the H and I complexes were neurally distinguishable based on phase-locking to the stimulus fine structure (Fig. 3, 4). In contrast, no distinction was possible based on the average firing rate of AN fibers (Fig. 2). Neural phase-locking reached highest z-values for stimuli with fc of 1600 Hz and were lower for 3200 Hz (Fig. 4), which reflects the known decrease in phase-locking of AN fibers for stimuli above 1.5 kHz (Versteegh et al. 2011). Several studies in animal models of cochlear dysfunction showed that phase locking is not compromised in single AN fibers, including quiet-aged gerbils (Heeringa et al. 2020), noise-exposed cats (Miller et al. 1997), chinchillas (Henry and Heinz 2012, Kale et al. 2013), and kanamycin-treated guinea pigs (Harrison and Evans 1979). Consistent with these findings, the surviving fibers in old, synaptopathic gerbils did not show compromised temporal fine structure representation of I and H complexes compared to young-adult gerbils (Fig. 3, 4).
Old gerbils showed significant synapse loss compared to untreated young-adult gerbils at the cochlear location corresponding to 2 kHz (Fig. 1), consistent with what was shown previously (Steenken et al. 2021). Ouabain, a cardiac glycoside that inhibits Na+-K+-ATPase pumps of AN fibers (O’Brien et al. 1994), applied to the round window, results in a dose-dependent loss of AN fibers in gerbils (Bourien et al. 2014, Lang et al. 2005, Schmiedt et al. 2002). In our hands, a low dose (40 µM) of ouabain had no effect at the cochlear location equivalent to 2 kHz. In contrast, a higher dose of 70 µM resulted in a loss that was similar to that observed in old gerbils (Fig. 1). The surviving fibers of ouabain-treated gerbils showed no deterioration of TFS representation compared to untreated or sham-treated young-adult gerbils (Fig. 3A,C,E,G); however, the behavioral outcomes of this study clearly showed that only old gerbils had deteriorated discrimination abilities between H and I complexes. We thus showed conclusively that neither synapse loss, to an extent typical for old gerbils, nor impaired TFS representation, at the level of single AN fibers, could explain the deteriorated perception in old gerbils.
Enhancement of confounding/distracting AN cues and changed central processing may explain the behavioral deficit in old subjects
Surviving AN fibers in old gerbils did not show compromised fine-structure representation (Fig. 3, 4). However, old gerbils exhibited deteriorated performance in the behavioral test in response to TFS1 test stimuli (Fig. 5, 6). Despite the overall robust phase-locking ability, other pathologies are known to develop with advancing age.
First, the age-related atrophy of the stria vascularis within the lateral wall of cochleae (Gratton and Schulte 1995) results in a decreased endocochlear potential (Schulte and Schmiedt 1992), which in turn increases the threshold (i.e., decreases the sensitivity) of AN fibers (Schmiedt et al. 1996). Envelope representation, measured as synchronization to amplitude-modulated stimuli, is non-monotonic with sound level, with a pronounced peak near the threshold of the fiber (Heeringa et al. 2023, Joris and Yin 1992). Thus, the typical sensitivity loss in old subjects will cause a difference in neural representations between young-adult and old subjects, even if the test stimulus levels are kept constant. Alternatively, stimulus levels can be raised for old subjects, to an equivalent sensation level that compensates for their sensitivity loss. Consistent with that prediction, AN fibers of old gerbils show enhanced representation of the fundamental frequency of speech vowels, compared to fibers from young adults, when responding to fixed-level stimuli (Heeringa et al. 2023), but do not show enhanced envelope representation in response to frozen noise presented at similar sensation level (Heeringa et al. 2020). Old gerbils of the same age as in the present study invariably show hearing loss, although to variable extents (Hellstrom and Schmiedt 1990, Steenken et al. 2021). Thus, since TFS1 stimuli in the current study were presented at a fixed absolute level of 68 dB SPL, fibers of old animals were stimulated closer to their individual thresholds than fibers of young adults, which is expected to increase their phase locking to f0, as observed. This increase in phase-locking to f0 may have confounded their perception. In old gerbils, the frequency selectivity of AN fibers is not significantly affected (Heeringa et al. 2020, Hellstrom and Schmied 1996), unlike that in noise-induced hearing loss which can cause additional changes to temporal coding. Chinchillas with noise-induced hearing loss showed compromised temporal coding in response to TFS1 stimuli (Kale et al. 2013), which is explained by a mismatch between the AN fiber’s characteristic frequency and the frequency of the stimulus component. In AN fibers from noise-damaged animals, this mismatch arises as the consequence of changes to their frequency tuning (Henry et al. 2019). In chinchillas treated with furosemide, a drug that reversibly reduces the endocochlear potential, the AN fibers’ frequency tuning curves remained unaffected, whereas their sensitivity is decreased (Henry et al., 2019). The same can be seen in aged gerbils (Heeringa et al. 2020, Hellstrom and Schmied 1996), and, thus, a distorted tonotopy caused by changed frequency tuning is unlikely.
Second, age-related changes in GABAergic processing in the central auditory system have been found (Caspary et al. 1995), including the gerbil (Gleich et al. 2014, Kessler et al. 2020) and shown to affect temporal processing (Gleich et al. 2003). The gerbils used by Kessler and colleagues (2020) were derived from the same population as the gerbils used for the present study and had the same age range for old animals. Therefore, it can be assumed that the gerbils in the current study showed a change in GABAergic processing in the brain that could affect processing of such supra-threshold stimuli as being used in the TFS1 test.
Conclusion
We demonstrated that the gerbils’ ability to discriminate TFS1 stimuli is neither affected by synapse loss nor by alterations to TFS representation by their AN fibers. However, the elevated neural response thresholds in old gerbils enhanced the neural envelope representation. This enhancement may have overridden the still intact TFS representation.
Materials and Methods
Experimental model and subject details
Sixty adult, agouti-colored Mongolian gerbils (Meriones unguiculatus) of both sexes (32 M, 28 F) were studied (Table 1). The gerbils originated from Charles River Laboratories and were bred in an in-house facility of the University of Oldenburg. Groups of gerbils were tested, differing in age and ouabain treatment: untreated young-adult, surgery-only young-adult, ouabain-treated young-adult, and old gerbils. Young-adult gerbils were between 3 and 19 months of age and old gerbils were 36 months or older. All animals were housed individually or in pairs in Type IV cages, which were provided with nesting material. During behavioral testing, access to water was not limited, but animals were food-deprived, such that the weight of young-adult and old gerbils was 65 - 75 g and 70 - 85 g, respectively, to motivate the animals to perform for a food reward. All protocols and procedures were approved by the Niedersӓchsisches Landesamt für Verbraucherschutz und Lebensmittelsicherheit (LAVES), Germany, permit AZ 33.19-42502-04-15/1990. All procedures were performed in compliance with the NIH Guide on Methods and Welfare Consideration in Behavioral Research with Animals (National Institute of Mental Health, 2002).
Young-adult gerbils were on average 10.3 ± 5.7 months of age (N = 21) and old gerbils were 38.4 ± 2.5 months old (N = 17) when electrophysiological recordings were conducted and cochleae were harvested. Treated young-adult gerbils were either 15.1 ± 3.6 (treated with a low concentration of ouabain, N = 8) or 12.7 ± 4.2 (high concentration of ouabain, N = 10). All surgery-only gerbils were 18 months old (N = 4). Gerbil groups had significantly different ages (univariate ANOVA: F=99.343; p < 0.0005). Bonferroni corrected post-hoc tests revealed that old gerbils were significantly different from all other young-adult gerbil groups with respect to age, while all young-adult gerbil groups, irrespective of treatment, did not differ.
Methods details
Stimulus paradigm
The stimuli were similar to those used for the TFS1 test in humans (for a detailed description see Moore and Sek 2009). In short, the reference stimulus was a harmonic tone (H) complex. The test stimulus that was discriminated from the reference by the gerbils in the behavioral tests or by analyses of AN-fiber responses was an inharmonic tone (I) complex created by shifting all frequency components upwards. This procedure creates H and I complexes with the same envelope periodicity. In the following, the frequency shift, Δf, will be expressed by the percentage of the fundamental frequency, f0 (Weber fraction).
The f0 was either 200 or 400 Hz. H and I complexes were generated with components from f0 up to 9 kHz. H and I complexes were passed through the same 3rd- or 5th-order Butterworth infinite-impulse-response band-pass filter (for AN recording and behavioral experiments, respectively), with a center frequency, fc, of 1600 or 3200 Hz, to reduce possible cues related to differences in the excitation pattern on the basilar membrane (Moore and Moore 2003). The 3-dB down points of the filter were separated by 7 f0, which resulted in a flat spectral region of 5 f0. The amplitude at the filter skirts decreased by at least 30 dB per octave (except for the 3rd-order filter with fc of 1600 Hz and f0 of 400 Hz, for which the slope was 18 dB/oct).
The phase of each harmonic was randomly chosen for each test trial, but was matched for the H and I complexes within a trial to ensure similar envelopes for both stimuli (see Fig. 1 A-D). A pink noise with frequencies between 100 and 11,050 Hz was added to the signal to mask distortion products possibly emerging in the inner ear, and signal components outside the filter passband. The masker level was 20 dB SPL at 1 kHz, to mask distortion products of H and I stimuli in the gerbil (Faulstich and Kössl, 1999). The level of each component in the tone complex was 60 dB SPL resulting in an overall level of 68 dB SPL. The duration of each H or I complex stimulus was 0.4 s. In the behavioral experiment, stimulus onset and offset were ramped with a 25-ms Hann window. Stimuli presented to AN fibers were not ramped but still filtered. By analyzing the steady-state response only, any on- and offset effects were excluded.
In both the behavioral experiments and in AN-fiber recordings, three combinations of f0 and fc were used: fc = 1600 Hz, and f0 = 200 Hz; fc = 1600 Hz, and f0 = 400 Hz, and fc = 3200 Hz and f0 = 400 Hz. The combination of f0 = 200 Hz and fc = 3200 Hz was not used because a pilot behavioral test showed that two young untreated gerbils were not able to discriminate I and H even for the largest value of Δf.
The harmonic rank, N, of a condition describes the rank of the center frequency in the harmonic tone complex. The highest value for Δf was 42.5% of the fundamental f0. Lower values for Δf were computed by dividing 42.5% by 2n, with n being 1 or increasing integers. The smallest Δf presented in the behavioral tests was 0.33% f0. In old gerbils tested behaviorally, additional test stimuli with Δf = 31.87% f0 and 15.94% f0 were presented to ensure that old gerbils were presented with a sufficient number of trials allowing a salient percept for discrimination.
In recordings from gerbil AN fibers, only subsets of the stimuli used in behavior were presented to save time and minimize the number of incomplete datasets due to deterioration or loss of single-unit isolation. The frequency shifts were further grouped into either “high” (10.6, 15.9, 21.25, 31.8, and 42.5 Hz) or “low” (0.66, 1.3, and 5.31 Hz) and the stimuli were presented in three different blocks per stimulus condition. The first block comprised the H and two I complex stimuli, one with a high and one with a low shift. The second block comprised I complex stimuli, two high shifts and another low shift. The third block always comprised the remaining shifts (1.3, 10.6, and 42.5 Hz). The order of stimuli within the blocks was random between fibers. A silent period of 0.6 s occurred between stimuli. This organization of stimuli ensured that the H complex was always presented and could be compared to one low and one high shift. The second and third block of stimuli were recorded with declining success.
The periodic envelopes of H and I complexes created in the way described above are similar and offer no cues for discrimination, whereas the TFS of H and I complexes is not identical (Fig. 8A-D). The TFS of I complexes changes across sequential periods while the TFS of the H complex does not; only this difference can be used for discrimination. The small random change in the TFS and envelope of both H and I due to the addition of pink noise offers no discrimination cues. The bandpass filtering of the tone complexes served to minimize cochlear excitation pattern cues.
Experimentally introduced synaptopathy using ouabain
For experiments including ouabain treatment, gerbils were anesthetized with ketamine (Ketamin 10%, bela-pharm GmbH, Vechta, Germany; 135 mg/kg body weight) and xylazine (Xylazin 2%, Ceva Tiergesundheit GmbH, Düsseldorf, Germany; 6 mg/kg body weight) mixed with 0.9% saline injected intra-peritoneal. Maintenance doses of one third of the initial dose were given hourly or as needed. Depth of anesthesia was monitored by withdrawal reflex to toe pinches and via an ECG with needle electrodes in the front- and contralateral hind leg, continuously displayed on an oscilloscope (SDS 1102CNL, SIGLENT Technologies, Hamburg, Germany). Body temperature was continuously controlled with a homeothermic blanket (Harvard Apparatus, Holliston, MA, USA). Experiments were carried out in a sound-attenuating booth. The gerbil’s head was fixed within a bite-bar (David Kopf Instruments, Tujunga, CA, USA). Pure, moisturized oxygen (1.5 l/min) was delivered from a tube pointing towards the nose.
Twenty-two gerbils (11M, 11F, aged 3 – 8 months at the beginning of treatment) received either ouabain or artificial perilymph solution. Twenty-four hours before the oubain or artificial perilymph injection, gerbils were treated with antibiotics (oral; Baytril (0.5%, Bayer Animal Health GmbH, Monheim am Rhein, Germany; 0.3 ml per gerbil/day) and a non-steroidal antiphlogistic/ analgetic agent (oral; Meloxidyl (0.5 mg/ml, CevaTiergesundheit GmbH; 0.3 ml per gerbil/day; or Novalgin; oral; 500 mg/ml, Sanofi-Aventis Deutschland GmbH, Frankfurt, Germany, 0.21 ml per gerbil/day). The antibiotic treatment was continued for nine more days, the antiphlogistic/ analgetic agent for at least three days. Additionally, gerbils received an antiemetic (oral; Emeprid, Ceva Tiergesundheit GmbH; 0.21 ml per gerbil/day) and, if needed, also sterofundin (4 ml s.c.; B. Braun SE, Melsungen, Germany) or amynin (4 ml s.c.; Boehringer Ingelheim Pharma GmbH Co KG, Ingelheim am Rhein, Germany) as infusion solutions if gerbils were dehydrated, all post-operatively.
Ouabain treatment was carried out according to Bourien et al. (2014). After gerbils were anesthetized, they received a sub-cutaneous infusion with sterofundin (4 ml). The location where the headholder met the skin of the face, was treated with 2% xylocaine ointment (Lidocainhydrochloride, Aspen Pharma Trading Limited, Dublin, Ireland). The skin covering the bulla was disinfected with 70% ethanol. A small incision was placed behind the ear to access the bulla. A hole was drilled into the bone with an angled blade and subsequently widened with forceps to reach the round window. A syringe needle tip was placed onto the round window to apply one of the following solutions onto the round window membrane: Eight gerbils were treated with 40 µM ouabain (Ouabain octahydrate, Sigma-Aldrich, Saint Louis, MO, USA), ten with 70 µM ouabain, dissolved in artificial perilymph solution (137 mM NaCl, 5 mM KCl, 2 mM CaCl2, 1 mM MgCl2, 1 mM NaHCO3, and 11 mM glucose; pH 7.4). Four gerbils received only artificial perilymph solution (surgery-only treatment). In 15 gerbils, the solution was continuously delivered by a pump (150 µl for 30 Min). In seven gerbils, the round window niche was bathed with the solution by manually operating the syringe and after ten minutes, the solution was aspirated and reapplied. This procedure was repeated three times. Surgery was concluded by suturing the skin incision over the open, middle ear. Gerbils were binaurally treated (n = 19), except those that were destined for electrophysiology only, which were monaurally treated (n = 3).
Note that the experimenter conducting the electrophysiology and histology was blinded to the type of treatment: the solution that was transferred to the round window was aliquoted by another person. Only after the synapse counts were completed, the treatment was revealed.
Counting inner-hair-cell synapses
Immunohistochemistry
Processing of cochleae was carried out as described in detail in Steenken et al. (2022). After concluding the single-unit recordings, gerbils were euthanized with an overdose of pentobarbital (i.p.; Narcoren, Merial GmbH, Hallbergmoos, Germany, 480 mg/kg body weigh). Twenty gerbils were cardiovascularly perfused with a mixture of phosphate-buffered saline (PBS; 137 mM NaCl, 10 mM phosphate and 2.7 mM KCl, pH 7.4) followed by 4% paraformaldehyde in PBS. To prevent blood coagulation, in 15 of these gerbils, heparin (ratiopharm, 25,000 IE/5 ml; 0.2 ml/100 ml) was added to PBS. In 34 gerbils, the bulla was rapidly exposed after euthanasia and small openings were carefully created at the apex and base of the bony cochlear walls. Cochleae were then fixed in 4% paraformaldehyde in PBS for two days on a shaker at 8 °C. After fixation, all cochleae were decalcified using 0.5 M ethylenediaminetetraacetic acid for two days. Subsequently, cochleae were treated with 1% Triton X-100-PBS for 1 hour to permeabilize the tissue and were then rinsed with 0.2% Triton X-100-PBS. To block unspecific binding sites, cochleae were then incubated in 3% bovine serum albumin (BSA, + 0.2% Triton X-100-PBS) blocking solution for 1 hour at room temperature.
Antibodies to label hair cells (anti-MyosinVIIa, IgG polyclonal rabbit; Proteus Biosciences, Ramona, CA, USA; diluted 1:400), presynaptic ribbons (anti-CtBP2 (C-terminal binding protein) antibody, IgG1 monoclonal mouse; BD Biosciences, Eysins, Switzerland; diluted 1:400), and postsynapstic receptor patches (anti-GluR2a antibody, IgG2a monoclonal mouse; Millipore, Burlington, MA, USA; diluted 1:200) in blocking solution were used. Cochleae were incubated in this mixture at 37 °C overnight. Next, cochleae were again rinsed in 0.2% Triton X-100-PBS.
Secondary antibodies were chosen to match the hosts of the primary antibodies: goat anti-mouse (IgG1)-AF488 (Molecular Probes Inc., Eugene, OR, USA; diluted 1:1000), goat anti-mouse (IgG2a)-AF568 (Invitrogen, Carlsbad, CA, USA; diluted 1:500), and donkey anti-rabbit (IgG)-AF647 (Molecular Probes Inc.; diluted 1:1000). Secondary antibodies were diluted in blocking solution and cochleae were incubated in this solution at 37 °C overnight, and subsequently rinsed in PBS.
After the immunostaining, a subset of the cochleae from untreated animals older than 12 months received treatment with an auto-fluorescence quencher (n = 13; TrueBlack Lipofuscin Autofluorescence Quencher, 20x in DMF, Biotum, Hayward, CA, USA). These cochleae were cut in half and incubated in a 5% true black-70% ethanol-mixture for 1 minute and then rinsed in PBS. Finally, all cochleae were micro-dissected under a stereomicroscope and the resulting 5 - 11 organ-of-Corti pieces were mounted on slides using Vectashield Mounting Medium (Vector Laboratories, Burlingame, CA, USA, H-1000).
Image acquisition
Overview images of cochlear pieces were acquired with a light microsope (Nikon 90i with NIS Elements software, Version 4.30, Nikon, Minato, Tokio, Japan) using a 4x objective. After measuring the length of the cochlea as a line along the row of inner hair cells with the NIS software, the cochlear location at 3.8 mm from the apex, corresponding to 2 kHz (Müller, 1996), was chosen, because 2 kHz was part of all stimulus conditions (data for additional locations corresponding to 4, 8, and 16 kHz are provided in the Supplemental Materials). Images were taken at this position with a confocal microscope (Leica Microsystem CMS GmbH, Wetzlar, Germany, Leica TCS SP8 system) using an oil-immersion objective (40x, numerical aperture 1.3). Lasers (488-nm and 522-nm optically pumped semiconductor laser), or a 638-nm diode, respectively, excited the different fluorescence tags, and released photons were counted with a hybrid detector. Image stacks had voxel dimensions of 0.05 - 0.0998 µm (XY) and 0.3 µm (Z). All confocal stacks were deconvolved (Huygens Essentials, Version 15.10, SVI, Hilversum, Netherlands) with default settings (maximum iteration: 80; signal to noise ratio: 10; quality threshold: 0.01), using a theoretical point-spread function.
Synapses were counted using ImageJ (FIJI, Schindelin et al. 2012). For each cochlear position, the number of functional synapses was evaluated for 5 inner hair cells by manually counting the co-localized ribbons and glutamate patches. Contrast and brightness were manually adjusted for each stack.
ABR recordings
Hearing sensitivity of all gerbils was evaluated prior to and during AN recordings using ABRs. The recording electrode was placed near the ipsilateral mastoid, the reference electrode was placed in the neck, and a ground electrode was placed on one hindleg. The stimulus presentation was controlled via a custom-written MATLAB software (Mathworks, Natick, MA, USA) and delivered through a RME Hammerfall DSP Multiface II sound card (RME Audio, Haimhausen, Germany). The stimuli were amplified (HB7, TDT Inc.) and presented to the gerbil’s ear via earphones (IE 800, Sennheiser, Wedemark, Germany) attached to an ear-bar. ABRs were amplified (x10^3) and band-pass filtered (0.3 - 3 kHz) using an ISO-80 pre-amplifier (World Precision Instruments, Sarasota, FL, USA), and sampled using the digital signal processor (Hammerfall DSP Multiface II, RME Audio; 48 kHz sampling rate) controlled by custom MATLAB software (Mathworks, Natick, MA, USA).
Prior to AN recordings, responses to chirps were recorded at a range of levels, from below threshold up to 20 dB above threshold in steps of 5 dB, repeated 300 times. Thresholds from averaged ABR waveforms were detected visually during the experiment. During single-unit recording sessions, ABR thresholds were re-checked periodically, and the experiment was terminated if threshold deteriorated by more than 30 dB.
Single-unit recordings
Stimulus presentation
Stimuli for single-unit recordings were generated with custom-written MATLAB-scripts controlling a digital signal processor (RX6, TDT Inc., Alachua, FL, USA) and an attenuator (PA5, TDT Inc.).The signal was then routed through a headphone buffer (HB7, TDT Inc.) and presented via earphones (IE 800, Sennheiser, Wedemark, Germany) attached to an ear-bar. Stimuli were calibrated at the start of each experiment using a probe-tube microphone (ER7-C, Etymotic Research Inc., Elk Grove Village, IL, USA) attached to the ear-bar and a microphone amplifier (MA3, TDT Inc., 40 dB amplification).
Single-unit recordings
Single-unit data was obtained in 22 gerbils (see Table 1). After the gerbil was anesthetized, a single dose of non-steroidal antiphlogistic agent (meloxicam, 0.2 mg/kg, Boehringer Ingelheim Pharma GmbH Co KG) was administered subcutaneously. The head of the gerbil was oriented with a bite bar and firmly fixed to the setup via a screw glued with dental cement to the exposed skull. The pinna of one ear was removed to place an ear-bar directly at the bony edge of the ear canal. The ear-bar was sealed with petroleum jelly to the bone to form a closed sound system. To prevent pressure buildup in the middle ear, a small hole was drilled into the dorsal bulla with an angled blade.
The AN was accessed via a dorsal approach. The skin covering the occipital bone and the bone itself were removed and the lateral part of the cerebellum was aspirated. The brainstem was left intact and was gently pushed medially using tiny saline soaked cotton balls to access the proximal part of the eighth cranial nerve. A glass electrode (GB120F-10, Science Products GmbH, Hofheim am Taunus, Germany; pulled using a P-2000, Sutter Instruments Co., Novato, CA, USA), filled with 3 M KCl-solution and with a resistance of ∼10 - 30 MΩ, was placed above the AN under visual control. The electrode was advanced through the AN via a remote-controlled piezo motor (Burleigh 6000 ULN inchworm motor controller and 6005 ULN handset, Burleigh, Inc., Fishers, NY, USA). Recordings were amplified (WPI 767, World Precision Instruments Inc., Sarasota, FL, USA), filtered for line-frequency noise (50 Hz, Hum Bug, Quest Scientific Instruments Inc., North Vancouver, BC, Canada), and digitized (sampling rate: 48828 Hz; RX6, TDT Inc., Alachua, FL, USA). Additionally, the analog signal was guided to an audio monitor (MS2, TDT Inc.) and displayed on an oscilloscope (SDS 1102CNL, SIGLENT Technologies, Hamburg, Germany).
Search stimuli (broadband noise, 1 - 9 kHz at 50 dB SPL) were played while the electrode was advanced into the AN. When a unit was isolated, its frequency response range was first assessed audio-visually using 50-ms duration tones. Next, the best frequency (BF) of each fiber was assessed by presenting pure tone bursts (50-ms duration, including 5-ms raised-cosine rise/fall times, 0 - 30 dB above threshold, 3 repetitions) at a range of frequencies within approximately ±1 octave around the audio-visually determined best response frequency. Subsequently, tone bursts at BF were presented over a range of levels (10 - 79 dB SPL, 50-ms duration including 5-ms raised-cosine rise/fall times, 10 repetitions) to derive a rate-level function. At randomly inserted trials, no stimulus was presented to determine the spontaneous rate (SR). Subsequently, TFS1 stimuli were presented at a fixed level of 60 dB SPL.
Spike detection from the recorded signal was conducted off-line with a custom-written MATLAB script (Steenken et al., 2022b). In short, the bandpass-filtered (300 – 3000 Hz) recordings were manually screened for threshold-crossing events. The threshold could be adjusted on a trial-by-trial basis, to compensate for variations in spike amplitude. The time of each spike’s peak amplitude was saved as the spike time. Spike trains were excluded if: 1.) more than 10% of the spikes for a given stimulus condition were judged to be below the set threshold criterion, or 2.) any inter-spike intervals < 0.6 ms occurred, indicating inadequate unit isolation (Heil et al. 2007), or 3.) a pre-potential was present in the spike waveform, suggesting a recording from the ventral cochlear nucleus (Keine and Rubsamen 2015), or 4.) the rate-level function was non-monotonic, also suggesting a potential cochlear nucleus recording (e.g., Davis et al. 1996, Joris et al. 1994, Sinex et al. 2001).
Neural recordings - Data Analysis
A number of measures were derived from the AN spike times: a) Peri-stimulus time histograms (PSTHs), b) rate histograms, and c) vector strengths (VS) corresponding to certain periodicities in the stimuli. Only the 375-ms long steady-state part of the responses, excluding a 12.5 ms onset and offset fringe, respectively, was analyzed.
First, peri-stimulus time histograms (PSTHs) were calculated for each animal/fiber/condition/Δf combination, pooling over repeated stimulus presentations within each combination. Spike times were collected into histograms using bin widths of 1/10 of the f0 period (0.5 ms for 200 Hz and 0.25 ms for 400 Hz). The histogram counts were converted into instantaneous spike rates by dividing by the bin width and by the number of stimulus repetitions.
Second, average rate and SR histograms were calculated for each group/condition/Δf combination, pooling over repeated stimulus presentations, animals, and fibers within each combination. Average rates were computed over the 375-ms steady-state portion of the response. SR was calculated from trials without stimulation. Driven rate was calculated as the difference between stimulated absolute rate and SR. Average driven rates and SRs were collected into histograms using bin widths of 2.5 spikes per second and 5 spikes per second, respectively. The average-driven-rate histogram counts were divided by the number of spike trains to convert the counts to relative frequency and smoothed with a moving-average window of 3 bin lengths.
Third, VS frequency spectra were calculated for each group/condition/Δf combination, pooling over repeated stimulus presentations, animals, and fibers within each combination. For this, all-order inter-spike intervals (ISIs) were calculated from each spike train, and collected into histograms using a bin width of 50 µs, corresponding to a sampling rate of 20 kHz. The bin positions ranged from −375 ms to +375 ms. A zero-centered gaussian window with a standard deviation of 200 ms was multiplied with the histogram to suppress fringe effects and spurious side lobes in the spectra. The histogram counts were divided by the total number of ISIs. A subsequent fast Fourier transform of the histograms resulted in the ISI power spectrum, and the square root of the spectral magnitudes was identical to the VS at each bin center frequency. The spectral resolution was 1.33 Hz.
VS single-peak data was calculated for each animal/fiber/condition/Δf combination, pooling over repeated stimulus presentations within each combination. In contrast to the spectra, the peak VS values were calculated individually for each relevant frequency f:
where ti are the spike times, and N is the number of spikes in each spike train. For significance analysis of the VS, the Rayleigh statistic z (henceforth referred to as z-value) was calculated: z = Navg · Vavg2 (f), where Navg is the average number of spikes per spike train and Vavg is the average VS across stimulus presentations. The Rayleigh z statistic has useful properties for statistical analyses because VS values resulting from higher numbers of spikes have larger weights, and logarithms of ratios of z-values are approximately normally distributed, as opposed to VS ratios or differences.
As a metric for TFS representation, the log ratio between two z-values was derived: First, from responses to the H complex, the z-value, z0, at the frequency component that the fibers on average responded to maximally, fmaxpeak, was obtained. Second, this value was divided by the z-value, zΔf, at the shifted frequency of the I complex, also in the maximal average response range (filled circles in Fig. 9). This ratio was used as the dependent variable in the first ANOVA model and reflects high temporal resolution; large positive values correspond to stronger phase-locked responses to the TFS of the stimulus. This metric was termed the TFS log-z-ratio.
Another log ratio between z-values was used as a metric for the balance between envelope and TFS representation (crosses in Fig. 9): The (harmonic, unshifted) z0 value at the fundamental frequency, f0, was divided by the (inharmonic, shifted) zΔf value in the maximal average response range, fmaxpeak. Large positive log ratios indicate a stronger response to the stimulus envelope, whereas large negative log ratios indicate a stronger response to the stimulus TFS. This metric was termed ENV/TFS log-z-ratio.
The representation of Δf shift in the responses was also tested by calculating z-values at the different harmonic frequencies, combined with all possible Δf shifts.. The actual position of the maximum z-value in terms of Δf and the corresponding z-value is summarized in Fig. 4 for two TFS peak frequencies (to which the Δf shifts were added): at f0 of the corresponding condition and at 1 kHz for the 200/1600 Hz condition, at 800 Hz for the 400/1600 Hz condition, and at 2 kHz for the 400/3200 Hz condition, respectively. The latter TFS peak frequencies were identified as the harmonics actually present in the acoustic stimulus, which had the maximum z-value in the summarized spectra (Fig. 3).
For the single-unit data, several main factors were defined. The low-SR fiber class comprised fibers with SR ≤ 18 spikes/s, and the high-SR class comprised fibers with SR > 18 spikes/s. In addition, fibers in the low-BF class had BF ≤ 1850 Hz, and fibers in the high-BF class had BF > 1850 Hz.
Behavioral procedure
The behavioral experiments were conducted in a single-walled sound-attenuating chamber (Industrial Acoustics, Type 401 A, Industrial Acoustic Company GmbH, Mönchengladbach, Germany) lined with a 15-cm thick layer of sound-absorbing foam (Pinta Acoustics Pyramide 100/50 on Pinta Acustics PLANO Type 50/0, Seyboth & Co., Regensburg, Germany). The walls and all devices within the chamber produced no relevant sound reflections. The time for the reverberation in the chamber to decay by 60 dB was 12 ms, indicating nearly anechoic conditions. These conditions allowed faithful presentation of the TFS1 stimuli with the desired acoustic cues in the free sound field.
In the center of the chamber, a 30-cm-long wire-mesh platform was located at 90-cm height, with an elevated pedestal at its center where the gerbils waited during the measurement. This construction minimized sound reflections. At one end of the platform, a food bowl was located, attached via a tube to a custom-made feeder that did not obstruct the sound path. Gerbils were rewarded for correct discrimination with a 10-mg custom-made food pellet (Altromin International, type 1324 rodent pellets enriched with sunflower oil and spelt flour; Altromin Spezialfutter GmbH & Co. KG, Lage, Germany). A loudspeaker (Canton Plus XS, frequency range: 150 Hz - 21 kHz, Canton, Weilrod, Germany) was mounted 30 cm in front of the pedestal to the side of the food bowl, at 0° elevation and azimuth relative to the gerbil’s normal head position when sitting on the pedestal. Out of the gerbil’s reach, a system of two custom-made light barriers was installed to detect the gerbil’s position and facing direction on the pedestal. An infrared camera (Conrad 150001 C-MOS camera module, Conrad Electronics, Hirschau, Germany) was positioned above the platform to observe the gerbil’s movements on the platform under invisible infrared light. The stimulus presentation, registration of light barriers switching, and feeder were controlled by custom-written software on a Linux-based PC with an RME sound-card (Hammerfall DSP Multiface II, sampling frequency 48 kHz). The signal output from the sound card was manually attenuated (Texio type RA-902A, TEXIO Technology Corporation, Kanagawa, Japan), amplified (Rotel type RMB 1506, Rotel Tokyo, Japan), and presented by the loudspeaker. Before testing started on each day, the system was calibrated (±1 dB) with a sound level meter (Brüel and Kjaer type 2238 Mediator, Hottinger Brüel & Kjær, Virum, Denmark) positioned on the elevated pedestal at about the position of the gerbil’s head.
The behavioral experiment used an operant Go/NoGo paradigm. Gerbils were trained to wait on the pedestal facing in the direction of the loudspeaker. During the whole session, the H complex was played every 1.3 s as a reference stimulus. As soon as the gerbil jumped onto the pedestal facing the loudspeaker, a trial was started with a random waiting time between 2 and 7 s. When the waiting time had elapsed, a target stimulus, the I complex, was played instead of the reference H complex. The target stimuli could either be equal to the reference H complex (sham trial) for estimating the false-alarm rate or an I complex (test trial) for testing discrimination ability. Leaving the pedestal before the waiting time elapsed resulted in the restart of the trial with a new waiting time. Leaving the pedestal within 1.3 s after the onset of the target stimulus of a test trial, as indicated by the light barriers, was registered as a “hit” and resulted in a food reward. Staying on the pedestal in a test trial was counted as a “miss” with no food reward. The animal had to leave the pedestal and jump back onto it in the correct sequence to start a new trial after a “miss” occurred. Leaving the pedestal within 1.3 s after the start of the target stimulus in a sham trial was registered as a “false alarm”, while staying on the pedestal during a sham trial was registered as a “correct rejection”. No food reward was provided in sham trials. To keep the gerbils motivated during the session, a salient test trial not included in the analysis was inserted after each correct rejection.
Each session contained 10 blocks with 9 trials each (six test trials, three sham trials). All stimuli within a given block were from the same condition (200/1600 Hz, 400/1600 Hz, or 400/3200 Hz). The first block of a session was a warmup block with Δf = 42.5% that was not included in the analysis. To interleave simple and more difficult conditions, blocks of the different conditions were pseudo-randomly distributed throughout the session. The least salient condition (200/1600 Hz) did not occur within two consecutive blocks. Additionally, the sham and test trials within each block were pseudo-randomly distributed with the constraint that no sham trials followed one another within and across blocks. Data collection started when the gerbil achieved a false alarm rate below 20% in three consecutive sessions with hit rates for the three highest Δf values that would allow determining a discrimination threshold.
Sessions were included in the analysis if the gerbil completed all 90 trials and the false alarm rate did not exceed 20%. Data collection continued until the sample size for each condition and Δf value was at least 20 trials. The sensitivity for each Δf value was defined by applying the z-transform, d’ = z(hit rate) - z(false-alarm rate). Individual discrimination thresholds for each animal and condition were then calculated from sensitivity, d’, as a function of inharmonic frequency shift, Δf%. A threshold at d’ = 1 was obtained by linearly interpolating between the first Δf% shift where d’ > 1 and the previous Δf% shift with d’ <= 1. If all d’ values were below 1, a linear fit was estimated through all d’ values as a function of Δf% shifts and threshold was extrapolated at d’ = 1. If all d’ values were larger than 1, the threshold was set to half of the minimal Δf% shift measured in the specific experiment. If none of the above was possible and either all d’ < 1, the fitted slope was not positive, or the extrapolated threshold larger than 100% shift (equivalent to Δf = f0 at threshold), the final threshold was limited to 100% (5 out of 72 thresholds).
Statistical analysis
All analyses of variance were calculated with IBM SPSS Statistics for Windows, Version 29.0 (IBM Corp, Armonk, NY). The family-wise error rate of all multiple post-hoc comparisons was controlled by Bonferroni correction of the significance levels.
Supplemental Information
This supplemental information summarizes extended statistical results, which are not essential for the support of the findings in the main manuscript.
Synapse counts
supplemental to “Synapse loss was similar in old gerbils and gerbils treated with a high dosage of ouabain”
The mean number of functional synapses per IHC was significantly different between gerbil groups for cochlear locations corresponding to all four frequencies. (univariate ANOVAs: 2 kHz: F = 5.496, p = 9.782×10-4; 4 kHz: F = 4.995, p = 1.988×10-3; 8 kHz: F = 7.933, p = 5.974×10-5; 16 kHz: F = 6.176, p = 4.305×10-4)
Post-hoc tests revealed significant differences (at the 5% level, with Bonferroni correction) between the young-adult and the ouabain-high groups for all equivalent frequencies, between the young-adult and the old group for 2, 4, and 8 kHz, and between the ouabain-low and the ouabain-high group for 16 kHz.
Below, we list the full ANOVA results, including the principal effects that are reported in the main manuscript. The factors used in the ANOVAs reported here had the following factor levels:
group: young-adult, sham, ouabain-high, old
condition: 200/1600, 400/1600, 400/3200 (f0/fc in Hz)
BF class: low: BF ≤ 1850 Hz, high: BF > 1850 Hz
SR class: low: SR ≤ 18 spikes/s, high: SR > 18 spikes/s
Δf%: as a covariate ranging from 0 to 21.25%
All significances are reported at the 5% level, with Bonferroni correction for post-hoc tests and planned comparisons.
Effects on mean driven rate
supplemental to “Average rate does not carry sufficient information about inharmonic frequency shift”
Note that driven rate (mean rate - spontaneous rate) was used here. The differences found below are consistent with known differences between low-SR (higher driven rates) and high-SR fibers (lower driven rates; Steenken et al., 2021), and with expected differences across BF classes (low-BF fibers less driven by stimuli with fc = 3200 Hz and high-BF fibers less driven by stimuli with fc = 1600 Hz). Differences between groups are also consistent with the sample distributions shown in Figs. S1 and S2.
Significant main effects:
group: F(3,943)=10.017, p=1.676×10-6
– Mean driven rate was lower in sham group than in any other group.
BF class: F(1,943)=6.297, p=0.0123
– Mean driven rate was lower for high BF class (BF > 1850 Hz).
Significant two-way interactions:
condition × group: F(6,943)=2.525, p=0.0198
– Planned comparisons between groups within each condition, and between conditions within each group:
In the 400/3200 condition, the mean driven rate of the sham group was significantly lower than in any other group. The mean driven rate in the sham group was significantly lower than in the young-adult group for each condition.
condition × BF class: F(2,943)=10.562, p=2.909×10-5
– Planned comparisons between conditions within each BF class, and between BF classes within each condition:
In the x/1600 conditions the mean driven rate in the high-BF class was significantly lower than in the low-BF class. No significant differences between conditions within each BF class.
group × BF class: F(3,943)=10.189, p=1.316×10-6
– Planned comparisons between groups within each BF class, and between BF classes within each group:
In the low-BF class, the mean driven rate in the sham group was significantly lower than in all other groups. For the ouabain-high and old groups, the mean driven rate in the high-BF class was lower than in the low-BF class.
group × SR class: F(2,943)=8.148, p=3.103×10-4
– Planned comparisons between groups within each SR class, and between SR classes within each group:
In the young-adult group, the mean driven rate in the high-SR class was lower than in the low-SR class. In the high-SR class, the mean driven rate in the sham group was lower than in any other group. (No samples for the sham group in the low-SR class).
Non-significant main effects and interactions:
Δf%: F(1,943)=0.018, p=0.8933 (reported in main manuscript)
condition: F(2,943)=2.187, p=0.1129
SR class: F(1,943)=1.884, p=0.1702
group × Δf%: F(3,943)=0.394, p=0.7575
BF class × Δf%: F(1,943)=0.052, p=0.8195
condition × Δf%: F(2,943)=0.081, p=0.9219
SR class × Δf%: F(1,943)=0.002, p=0.9657
condition × SR class: F(2,943)=0.698, p=0.4977
BF class × SR class: F(1,943)=0.388, p=0.5336
Effects on TFS representation (TFS log-z-ratio)
supplemental to “Neural representation of TFS was not degraded in AN fibers of old or synaptopathic gerbils”
A high TFS log-z-ratio indicates strong phase locking to the stimulus TFS.
Significant main effects:
Δf%: F(1,943)=104.403, p=2.569×10-23 (reported in main manuscript)
condition: F(2,943)=7.495, p=5.894×10-4 (reported in main manuscript)
– TFS log-z-ratio was significantly lower in the 400/3200 condition, compared to both the 200/1600 and 400/1600 conditions
Significant two-way interactions:
condition × Δf%: F(2,943)=20.292, p=2.353×10-9 (reported in main manuscript)
– Based on the parameter estimates of the ANOVA model: The slope of the TFS log-z-ratio vs. Δf% was significantly different from 0, for the 200/1600 and 400/1600 condition. It was significantly higher in the 400/1600 condition, compared to the 200/1600 condition.
BF class × Δf%: F(1,943)=31.05, p=3.282×10-8 (reported in main manuscript)
– Based on the parameter estimates of the ANOVA model:
The slope of the TFS log-z-ratio vs. Δf% was significantly higher in the low-BF class than in the high-BF class.
condition × group: F(6,943)=7.583, p=5.808×10-8
– planned comparisons between groups within each condition, and between conditions within each group:
In the 200/1600 condition, TFS log-z-ratio in the ouabain-high group was significantly higher than in the old group. For all groups except the old group, TFS log-z-ratio in the 400/1600 condition was significantly higher than in the 400/3200 condition. In the sham group, the TFS log-z-ratio in the 400/1600 condition was significantly higher than in the 200/1600 condition. In the ouabain-high group, the TFS log-z-ratio in the 200/1600 condition was significantly higher than in the 400/3200 condition.
condition × BF class: F(2,943)=5.774, p=3.200×10-3
– Planned comparisons between BF classes within each condition, and between conditions within each BF class: In the high-BF class, TFS log-z-ratio in the 400/3200 condition was significantly lower than in any other condition. In the 400/3200 condition, the TFS log-z-ratio in the high-BF class was significantly lower than in the low-BF class.
group × BF class: F(3,943)=4.228, p=5.600×10-3
– Planned comparisons between groups within each BF class, and between BF classes within each group:
In the high-BF class, the TFS log-z-ratio in the ouabain-high group was significantly higher than in the old group.
BF class × SR class: F(1,943)=4.918, p=0.0268
– In the high-SR class, TFS log-z-ratio in the low-BF class was significantly higher than in the high-BF class.
Non-significant main effects and interactions:
group: F(3,943)=1.757, p=0.1537 (reported in main manuscript)
group × Δf%: F(3,943)=1.193, p=0.3112 (reported in main manuscript)
BF class: F(1,943)=0.103, p=0.7479
SR class: F(1,943)=0.985, p=0.3212
condition × SR class: F(2,943)=2.302, p=0.1006
group × SR class: F(2,943)=0.556, p=0.5736
SR class × Δf%: F(1,943)=1.203, p=0.2730
Effects on ENV/TFS representation balance (ENV/TFS log-z-ratio)
supplemental to “Representation of f0 was enhanced in AN fibers of old gerbils“ The ENV/TFS log-z-ratio reflects the relative emphasis in the AN fibers’ responses on either TFS (negative ENV/TFS log-z-ratio) or envelope locking (positive ENV/TFS log-z-ratio)
Significant main effects:
condition: F(2,943)=43.929, p=5.744×10-19 (reported in main manuscript)
– The 200/1600 and 400/1600 conditions had negative ENV/TFS log-z-ratios, the 400/3200 condition had a positive ENV/TFS log-z-ratio. All were significantly different from each other, with the 200/1600 condition showing the most negative value.
SR class: F(1,943)=29.138, p=8.533×10-08 (reported in main manuscript)
– In the high-SR class, the ENV/TFS log-z-ratio was negative and significantly different from zero. In the low-SR class, it was not significantly different from zero.
Δf%: F(1,943)=4.278, p=0.0389 (reported in main manuscript)
Significant two-way interactions:
condition × group: F(6,943)=11.439, p=2.242×10-12 (reported in main manuscript)
– Planned comparisons between groups within each condition, and between conditions within each group:
In addition to the main condition effect: In the 200/1600 condition, the ENV/TFS log-z-ratio was more negative in the sham group than both in the young-adult group, and in the old group. In the 400/3200 condition, the ENV/TFS log-z-ratio was more positive in the old group than in the young-adult group. In the sham group, the ENV/TFS log-z-ratio was more negative in the 200/1600 condition than in the 400/1600 condition.
group × BF class: F(3,943)=11.354, p=2.551×10-7
– Planned comparisons between groups within each BF class, and between BF classes within each group:
In the high-BF class, the ENV/TFS log-z-ratio was significantly more negative in the sham and ouabain-high groups than in the young-adult and old groups.
BF class × SR class: F(1,943)=5.234, p=0.0224
– In the order from most negative to most positive ENV/TFS log-z-ratio: low-BF/high-SR, high-BF/high-SR, high-BF/low-SR, low-BF/low-SR. Both high-SR values were significantly below zero, and different from each other; both low-SR values were not significantly different from zero, and not different from each other.
Non-significant main effects and interactions:
group: F(3,943)=0.838, p=0.4731
BF class: F(1,943)=0.439, p=0.5076
group × Δf%: F(3,943)=0.236, p=0.8711
condition × Δf%: F(2,943)=1.673, p=0.1883
BF class × Δf%: F(1,943)=0.005, p=0.9410
SR class × Δf%: F(1,943)=0.58, p=0.4464
condition × BF class: F(2,943)=0.414, p=0.6611
condition × SR class: F(2,943)=0.522, p=0.5936
group × SR class: F(2,943)=2.442, p=0.0875
Acknowledgements
This work was supported by the DFG priority program “PP 1608”, the DFG Cluster of Excellence EXC 1077/1 “Hearing4all” (Project ID 390895286). We thank Laurel Carney for helpful comments on a previous version of this manuscript. Susanne Groß for assistance with the behavioral experiments. Amarins N. Heeringa for providing analysis scripts for AN data. Nadine Thiele for generous assistance with the gerbil care and for implementing the blinding process. Furthermore, many thanks go to Sonja Standfest for the great assistance with gerbil care. Sonny Bovee is acknowledged for processing cochleae, Julia Forst and Carina Lützow for assistance with the spike detection and immunostainings, and Julia Winter for assistance with the immunostainings. We acknowledge the Fluorescence Microscopy Service Unit, Carl von Ossietzky University of Oldenburg, for the use of the imaging facilities.
References
- Cochlear neuropathy and the coding of supra-threshold soundFrontiers in Systems Neuroscience 8https://doi.org/10.3389/fnsys.2014.00026
- Contribution of auditory nerve fibers to compound action potential of the auditory nerveJournal of Neurophysiology 112:1025–1039https://doi.org/10.1152/jn.00738.2013
- Cochlear ribbon synapses in aged gerbilsInternational Journal of Molecular Sciences 25https://doi.org/10.3390/ijms25052738
- The search for noise-induced cochlear synaptopathy in humans: Mission impossible?Hearing Research 377:88–103https://doi.org/10.1016/j.heares.2019.02.016
- Use of non-invasive measures to predict cochlear synapse countsHearing Research 370:113–119https://doi.org/10.1016/j.heares.2018.10.006
- Onset coding is degraded in auditory nerve fibers from mutant mice lacking synaptic ribbonsThe Journal of Neuroscience 30https://doi.org/10.1523/JNEUROSCI.0389-10.2010
- Changes in audiometric threshold and frequency selectivity correlate with cochlear histopathology in macaque monkeys with permanent noise-induced hearing lossHearing Research 398https://doi.org/10.1016/j.heares.2020.108082
- The FBN rat model of aging: investigation of ABR waveforms and ribbon synapse changesNeurobiology of Aging 62:53–63https://doi.org/10.1016/j.neurobiolaging.2017.09.034
- Relations between speech-reception, psychophysical temporal processing, and subcortical electrophysiological measures of auditory function in humansHearing Research 417https://doi.org/10.1016/j.heares.2022.108456
- Central auditory aging: GABA changes in the inferior colliculusExperimental Gerontology 30:349–360https://doi.org/10.1016/0531-5565(94)00052-5
- The crucial role of diverse animal models to investigate cochlear aging and hearing lossHearing Research 445https://doi.org/10.1016/j.heares.2024.108989
- Response properties of units in the dorsal cochlear nucleus of unanesthetized decerebrate gerbilJournal of Neurophysiology 75:1411–1431https://doi.org/10.1152/jn.1996.75.4.1411
- Uncertainty in location, level and fundamental frequency results in informational masking in a vowel discrimination task for young and elderly subjectsHearing Research 377:142–152https://doi.org/10.1016/j.heares.2019.03.015
- Neuronal response to cochlear distortion products in the anteroventral cochlear nucleus of the gerbilThe Journal of the Acoustical Society of America 105:491–502https://doi.org/10.1121/1.424586
- No evidence for a link between noise exposure and auditory temporal processing for young adults with normal audiogramsThe Journal of the Acoustical Society of America 147:EL465–EL470https://doi.org/10.1121/10.0001346
- Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognitionFrontiers in Aging Neuroscience 6https://doi.org/10.3389/fnagi.2014.00347
- Boosting GABA improves impaired auditory temporal resolution in the gerbilNeuroreport 14:1877–1880https://doi.org/10.1097/00001756-200310060-00024
- Comparing the inferior colliculus of young and old gerbils (Meriones unguiculatus) with an emphasis on GABAExperimental Gerontology 57:155–162https://doi.org/10.1016/j.exger.2014.05.018
- Behavioral auditory thresholds and loss of ribbon synapses at inner hair cells in aged gerbilsExperimental Gerontology 84:61–70https://doi.org/10.1016/j.exger.2016.08.011
- The relative contribution of cochlear synaptopathy and reduced inhibition to age-related hearing impairment for people with normal audiogramsTrends in Hearing 27https://doi.org/10.1177/23312165231213191
- Alterations in microvasculature are associated with atrophy of the stria vascularis in quiet-aged gerbilsHearing Research 82:44–52https://doi.org/10.1016/0378-5955(94)00161-I
- Impaired speech perception in noise with a normal audiogram: No evidence for cochlear synaptopathy and no relation to lifetime noise exposureHearing Research 364:142–151https://doi.org/10.1016/j.heares.2018.03.008
- Some aspects of temporal coding by single cochlear fibres from regions of cochlear hair cell degeneration in the guinea pigArchives of Oto-Rhino-Laryngology 224:71–78https://doi.org/10.1007/BF00455226
- Altered neural encoding of vowels in noise does not affect behavioral vowel discrimination in gerbils with age-related hearing lossFrontiers in Neuroscience 17https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2023.12389
- Temporal coding of single auditory nerve fibers is not degraded in aging gerbilsThe Journal of Neuroscience 40:343–354https://doi.org/10.1523/JNEUROSCI.2784-18.2019
- Spontaneous activity of auditory-nerve fibers: insights into stochastic processes at ribbon synapsesThe Journal of Neuroscience 27:8457–8474
- Compound action potential input/output functions in young and quiet-aged gerbilsHearing Research 50:163–174https://doi.org/10.1016/0378-5955(90)90042-N
- Measures of tuning and suppression in single-fiber and whole-nerve responses in young and quiet-aged gerbilsThe Journal of the Acoustical Society of America 100:3275–3285https://doi.org/10.1121/1.417211
- Animal models of hidden hearing loss: Does auditory-nerve-fiber loss cause real-world listening difficulties?Molecular and Cellular Neuroscience 118https://doi.org/10.1016/j.mcn.2021.103692
- Normal tone-in-noise sensitivity in trained budgerigars despite substantial auditory-nerve injury: No evidence of hidden hearing lossThe Journal of Neuroscience 41:118–129https://doi.org/10.1523/JNEUROSCI.2104-20.2020
- Diminished temporal coding with sensorineural hearing loss emerges in background noiseNature Neuroscience 15:1362–1364https://doi.org/10.1038/nn.3216
- Divergent auditory nerve encoding deficits between two common etiologies of sensorineural hearing lossThe Journal of Neuroscience 39:6879–6887https://doi.org/10.1523/JNEUROSCI.0038-19.2019
- The role of excitation-pattern, temporal-fine-structure, and envelope cues in the discrimination of complex tonesThe Journal of the Acoustical Society of America 135:1356–1370https://doi.org/10.1121/1.4864306
- Pathophysiological changes in inner hair cell ribbon synapses in the ageing mammalian cochleaJournal of Physiology 598:4339–4355https://doi.org/10.1113/JP280018
- Evidence for age-related cochlear synaptopathy in humans unconnected to speech-in-noise intelligibility deficitsHearing Research 374:35–48https://doi.org/10.1016/j.heares.2019.01.017
- Neural processing of amplitude-modulated soundsPhysiological Reviews 84:541–577https://doi.org/10.1152/physrev.00029.2003
- Enhancement of neural synchronization in the anteroventral cochlear nucleus. II. Responses in the tuning curve tailJournal of Neurophysiology 71:1037–1051https://doi.org/10.1152/jn.1994.71.3.1022
- Responses to amplitude-modulated tones in the auditory nerve of the catThe Journal of the Acoustical Society of America 91:215–232https://doi.org/10.1121/1.402757
- Envelope coding in auditory nerve fibers following noise-induced hearing lossJournal of the Association for Research in Otolaryngology: JARO 11:657–673https://doi.org/10.1007/s10162-010-0223-6
- Effects of sensorineural hearing loss on temporal coding of harmonic and inharmonic tone complexes in the auditory nerveAdvances in Experimental Medicine and Biology 787:109–118https://doi.org/10.1007/978-1-4614-1590-9_13
- Implications of within-fiber temporal coding for perceptual studies of F0 discrimination and discrimination of harmonic and inharmonic tone complexesJournal of the Association for Research in Otolaryngology: JARO 15:465–482https://doi.org/10.1007/s10162-014-0451-2
- Inhibition shapes acoustic responsiveness in spherical bushy cellsThe Journal of Neuroscience 35:8579–8592https://doi.org/10.1523/JNEUROSCI.0133-15.2015
- GABA(A) receptors in the Mongolian gerbil: a PET study using [(18)F]flumazenil to determine receptor binding in young and old animalsMolecular Imaging and Biology 22:335–347https://doi.org/10.1007/s11307-019-01371-0
- An estimate of the auditory-filter bandwidth in the Mongolian gerbilHearing Research 164:69–76https://doi.org/10.1016/S0378-5955(01)00411-7
- Adding insult to injury: Cochlear nerve degeneration after “temporary” noise-induced hearing lossThe Journal of Neuroscience 29:14077–14085https://doi.org/10.1523/JNEUROSCI.2845-09.2009
- Ouabain induces apoptotic cell death in type I spiral ganglion neurons, but not type II neuronsJournal of the Association for Research in Otolaryngology: JARO 6:63–74https://doi.org/10.1007/s10162-004-5021-6
- Noise-induced and age-related hearing loss: new perspectives and potential therapies [version 1; peer review: 4 approved]F1000Research 6https://doi.org/10.12688/f1000research.11310.1
- Toward a differential diagnosis of hidden hearing loss in humansPLOS One 11https://doi.org/10.1371/journal.pone.0162726
- Cochlear synaptopathy in acquired sensorineural hearing loss: Manifestations and mechanismsHearing Research 349:138–147https://doi.org/10.1016/j.heares.2017.01.003
- Primary Neural Degeneration in the Guinea Pig Cochlea After Reversible Noise-Induced Threshold ShiftJournal of the Association for Research in Otolaryngology 12:605–616https://doi.org/10.1007/s10162-011-0277-0
- Why do I hear but not understand? Stochastic undersampling as a model of degraded neural encoding of speechFrontiers in Neuroscience 8https://doi.org/10.3389/fnins.2014.00348
- Correlations between cochlear pathophysiology and behavioral measures of temporal and spatial processing in noise exposed macaquesHearing Research 401https://doi.org/10.1016/j.heares.2020.108156
- The role of excitation-pattern cues in the detection of frequency shifts in bandpass-filtered complex tonesThe Journal of the Acoustical Society of America 137:2687–2697https://doi.org/10.1121/1.4919315
- Electrophysiological and behavioural processing of complex acoustic cuesClinical Neurophysiology 127:779–789https://doi.org/10.1016/j.clinph.2015.04.002
- Effects of acoustic trauma on the representation of the vowel /ε/ in cat auditory nerve fibersThe Journal of the Acoustical Society of America 101:3602–3616https://doi.org/10.1121/1.418321
- Influence of musical training on sensitivity to temporal fine structureInternational Journal of Audiology 54:220–226https://doi.org/10.3109/14992027.2014.969411
- Frequency discrimination of complex tones by hearing-impaired subjects: Evidence for loss of ability to use temporal fine structureHearing Research 222:16–27https://doi.org/10.1016/j.heares.2006.08.007
- Auditory processing of temporal fine structure: effects of age and hearing lossWorld Scientific Publishing Co. Pte. Ltd
- The roles of temporal envelope and fine structure information in auditory perceptionAcoustical Science and Technology 40:61–83https://doi.org/10.1250/ast.40.61
- The influence of age and high-frequency hearing loss on sensitivity to temporal fine structure at low frequencies (L)The Journal of the Acoustical Society of America https://doi.org/10.1121/1.3672808
- Development of a fast method for determining sensitivity to temporal fine structureInternational Journal of Audiology 48:161–171https://doi.org/10.1080/14992020802475235
- Perception of the low pitch of frequency-shifted complexesThe Journal of the Acoustical Society of America 113:977–985https://doi.org/10.1121/1.1536631
- The cochlear place-frequency map of the adult and developing mongolian gerbilHearing Research 94:148–156https://doi.org/10.1016/0378-5955(95)00230-8
- Methods and Welfare Considerations in Behavioral Research with Animals: Report of a National Institutes of Health WorkshopWashington, DC: U.S. Government Printing Office
- Ouabain binding kinetics of the rat alpha two and alpha three isoforms of the sodium-potassium adenosine triphosphateArchives of Biochemistry and Biophysics 310:32–39https://doi.org/10.1006/abbi.1994.1136
- Bottom-up and top-down neural signatures of disordered multi-talker speech perception in adults with normal hearingELife 9https://doi.org/10.7554/eLife.51419
- Synaptopathy in the aging cochlea: characterizing early-neural deficits in auditory temporal envelope processingThe Journal of Neuroscience 38:7108–7119https://doi.org/10.1523/JNEUROSCI.3240-17.2018
- Effects of age and noise exposure on proxy measures of cochlear synaptopathyTrends in Hearing 23https://doi.org/10.1177/2331216519877301
- Effects of noise exposure on young adults with normal audiograms II: Behavioral measuresHearing Research 356:74–86https://doi.org/10.1016/j.heares.2017.10.007
- Phase-locked response to low-frequency tones in single auditory nerve fibers of the squirrel monkeyJournal of Neurophysiology 30:769–793https://doi.org/10.1152/jn.1967.30.4.769
- Tinnitus with a normal audiogram: Physiological evidence for hidden hearing loss and computational modelThe Journal of Neuroscienc 31:13452–13457https://doi.org/10.1523/JNEUROSCI.2156-11.2011
- Fiji: an open-source platform for biological-image analysisNature Methods 9:676–682https://doi.org/10.1038/nmeth.2019
- Age-related loss of activity of auditory-nerve fibersJournal of Neurophysiology 76:2799–2803https://doi.org/10.1152/jn.1996.76.4.2799
- Ouabain application to the round window of the gerbil cochlea: A model of auditory neuropathy and apoptosisJournal of the Association for Research in Otolaryngology: JARO 3:223–233https://doi.org/10.1007/s1016200220017
- Lateral wall Na,K-ATPase and endocochlear potentials decline with age in quiet-reared gerbilsHearing Research 61:35–46https://doi.org/10.1016/0378-5955(92)90034-k
- Age-related cochlear synaptopathy: An early-onset contributor to auditory functional declineThe Journal of Neuroscience 33:13686–13694https://doi.org/10.1523/JNEUROSCI.1783-13.2013
- Coding deficits in noise-induced hidden hearing loss may stem from incomplete repair of ribbon synapses in the cochleaFrontiers in Neuroscience 10https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2016.00231
- Electrophysiological responses of cochlear root neuronsHearing Research 158:28–38https://doi.org/10.1016/S0378-5955(01)00293-3
- Immunolabeling and Counting Ribbon Synapses in Young Adult and Aged Gerbil CochleaeJoVE 182https://doi.org/10.3791/63874
- Age-related decline in cochlear ribbon synapses and its relation to different metrics of auditory-nerve activityNeurobiology of Aging https://doi.org/10.1016/j.neurobiolaging.2021.08.019
- The effect of musicianship, contralateral noise, and ear of presentation on the detection of changes in temporal fine structureThe Journal of the Acoustical Society of America 146:1–10https://doi.org/10.1121/1.5114820
- Response characteristics in the apex of the gerbil cochlea studied through auditory nerve recordingsJournal of the Association for Research in Otolaryngology 12:301–316https://doi.org/10.1007/s10162-010-0255-y
- Cochlear neuropathy in human presbycusis: Confocal analysis of hidden hearing loss in post-mortem tissueHearing Research 327:78–88https://doi.org/10.1016/j.heares.2015.04.014
- Primary neural degeneration in the human cochlea: Evidence for hidden hearing loss in the aging earNeuroscience 407:8–20https://doi.org/10.1016/j.neuroscience.2018.07.053
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
Copyright
© 2025, Steenken et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 31
- downloads
- 0
- citations
- 0
Views, downloads and citations are aggregated across all versions of this paper published by eLife.