MRI experiments have revealed how throat singers from Tuva produce their characteristic sound.
Many people in Tuva – a republic in southern Siberia – have the remarkable ability to sing in two different pitches at the same time (as can be seen and heard in this video of the Alash Ensemble). This form of singing, known as throat or overtone singing, was little known outside Tuva until the author Ralph Leighton wrote a book called Tuva or Bust! (Leighton, 1991). The book described how Leighton and his friend Richard Feynman (the Nobel prize-winning physicist) tried and failed to travel to Tuva to study throat singing and Tuvan culture. Now, in eLife, Christopher Bergevin of York University in Canada, Brad Story of the University of Arizona and co-workers report how they have used MRI to uncover how throat singers control their vocal tracts when singing (Bergevin et al., 2020).
Before considering dual-pitch production we need to understand how normal single-pitch singing works. When we sing, the vocal cords in our larynx open and close periodically at a particular frequency (the glottal-pulse rate), and this frequency determines the pitch of the note that we produce. However, we also produce harmonics with frequencies that are multiples of this fundamental frequency. Moreover, the waveform produced by this combination of frequencies is filtered by the resonances in the vocal tract, which we can adjust by moving our lower jaw, tongue, cheeks and lips to change the effective shape of our vocal tract. This filtering causes different frequencies in the sounds we produce to be emphasised, but it does not usually alter pitch: instead, it determines the timbral quality of the sounds in a way that can be associated with meaning. For example, vowel sounds in the English language can be identified, independent of pitch, because each vowel sound has a distinctive pattern of peaks in its frequency spectrum.
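The separation between pitch and timbre described above can be illustrated with a minimal numerical sketch. It is not taken from Bergevin et al.: the resonance shape, the 1/k fall-off of the source amplitudes, and the two sets of resonance values (`VOWEL_A`, `VOWEL_I`) are all illustrative assumptions, chosen only to show that changing the filter moves the spectral peaks while the harmonic frequencies, and hence the pitch, stay the same.

```python
def harmonic_frequencies(f0, n_harmonics=30):
    """Fundamental plus harmonics: integer multiples of the glottal-pulse rate f0."""
    return [k * f0 for k in range(1, n_harmonics + 1)]


def resonance_gain(freq, centre, bandwidth):
    """Gain of one idealised vocal-tract resonance (assumed Lorentzian-style peak)."""
    return 1.0 / (1.0 + ((freq - centre) / (bandwidth / 2.0)) ** 2)


def filtered_spectrum(f0, resonances, n_harmonics=30):
    """Return (frequency, amplitude) pairs after the vocal-tract filter.

    Source amplitudes are assumed to fall off as 1/k for the k-th harmonic.
    """
    spectrum = []
    for k, freq in enumerate(harmonic_frequencies(f0, n_harmonics), start=1):
        source_amplitude = 1.0 / k
        gain = sum(resonance_gain(freq, c, bw) for c, bw in resonances)
        spectrum.append((freq, source_amplitude * gain))
    return spectrum


def strongest_harmonic(spectrum):
    """Frequency of the harmonic with the largest filtered amplitude."""
    return max(spectrum, key=lambda pair: pair[1])[0]


# Hypothetical (centre, bandwidth) resonances in Hz for two vowel-like shapes.
VOWEL_A = [(700.0, 100.0), (1200.0, 120.0)]
VOWEL_I = [(300.0, 100.0), (2300.0, 150.0)]

# Same glottal-pulse rate (same pitch), two different vocal-tract shapes.
spectrum_a = filtered_spectrum(100.0, VOWEL_A)
spectrum_i = filtered_spectrum(100.0, VOWEL_I)
```

With these assumed values both spectra contain exactly the same harmonic frequencies, but the strongest harmonic sits at 700 Hz for the first shape and at 300 Hz for the second, which is the kind of pattern of peaks that distinguishes one vowel from another.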
In the brain, different frequencies are processed in different neural channels. For a periodic input sound, the fundamental frequency and the first ten or so harmonics are each processed by a different neural channel. However, the neural channels that process higher harmonics handle more than one harmonic, and interactions between these produce oscillations at the same rate as the fundamental frequency. A prominent model put forward in 1994 posits two mechanisms for pitch perception (Shackleton and Carlyon, 1994): at low frequencies, the pitch is conveyed by which neural channels are active, with each channel corresponding to a multiple of the pitch value; at high frequencies, the pitch depends on the temporal pattern produced by interacting harmonics. In general, when we hear a sung note, these 'place' and 'temporal' coding mechanisms reinforce one another and contribute to perception of the same pitch. A sung vowel with a given pitch will contain low harmonics represented in separate frequency channels that represent multiples of the fundamental frequency, and high harmonics that interact in high-frequency channels to produce oscillations at the same rate as the fundamental frequency.
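The 'temporal' mechanism can be made concrete with a small sketch of what happens when two neighbouring high harmonics share a channel. The harmonic number and fundamental used here are arbitrary assumptions; the point is only that the sum of harmonics k and k+1 has an envelope that repeats once per period of the fundamental, whichever high harmonics are chosen.

```python
import math


def two_harmonic_sum(t, f0, k):
    """Waveform produced by harmonics k and k+1 of f0 interacting in one channel."""
    return (math.cos(2 * math.pi * k * f0 * t)
            + math.cos(2 * math.pi * (k + 1) * f0 * t))


def envelope(t, f0):
    """Envelope of the sum, from cos(a) + cos(b) = 2 cos((b - a)/2) cos((a + b)/2).

    The slow factor depends only on f0, so the envelope repeats every 1/f0 seconds.
    """
    return abs(2 * math.cos(math.pi * f0 * t))


f0 = 100.0   # assumed fundamental frequency in Hz
k = 15       # assumed harmonic number, high enough to share a channel

period = 1.0 / f0
times = [i * period / 50 for i in range(50)]

# The waveform never exceeds the envelope, and the envelope repeats at rate f0.
within_envelope = all(
    abs(two_harmonic_sum(t, f0, k)) <= envelope(t, f0) + 1e-9 for t in times
)
repeats_at_f0 = all(
    abs(envelope(t + period, f0) - envelope(t, f0)) < 1e-9 for t in times
)
```

Both checks hold for any choice of `k`, which is why interacting high harmonics can signal the same pitch as the separately resolved low harmonics.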
From first principles, there are a number of possible ways of producing a dual pitch. Birds can sing at two different pitches because they have two oscillators in their equivalent of the larynx (Riede and Goller, 2010), but there is no evidence for a similar mechanism in humans. It might also be possible, in principle, for nonlinear oscillation in the larynx to produce a complex signal comprising two distinct pitches, but again there is no evidence for this. In the latest research, Bergevin et al. carried out careful MRI work, which suggests that Tuvan singers use their larynx just like a typical singer, but they also create an extra pitch by controlling the shape of their vocal tract. Specifically, they create a shape that filters out many but not all of the higher harmonics (Figure 1). Bergevin et al. suggest that the range of high frequencies that remains is so narrow that it does not contain enough harmonics to produce oscillations at the fundamental frequency, as usually happens. The result is a new high pitch (determined by the shape of the vocal tract) along with a more typical sung pitch (determined by the lower harmonics).
The study of Bergevin et al. focuses on a style of singing called khoomei, but this is just one of a number of styles practised in Tuva and beyond, so there is plenty more ground to cover for researchers interested in the biomechanics of throat or overtone singing.
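The effect of a very narrow range of surviving high frequencies can be sketched by counting the harmonics that fall inside a passband. The fundamental and band edges below are hypothetical values, not measurements from Bergevin et al., and the rectangular passband is a deliberate simplification of a real vocal-tract resonance.

```python
def harmonics_in_band(f0, low, high, max_harmonic=60):
    """Harmonics of f0 (in Hz) that fall inside the passband [low, high]."""
    return [k * f0 for k in range(1, max_harmonic + 1) if low <= k * f0 <= high]


f0 = 150.0  # assumed sung fundamental in Hz

# A broad band of high frequencies, as in ordinary singing (assumed edges).
broad = harmonics_in_band(f0, 1500.0, 4000.0)

# A very narrow band, standing in for the focused vocal-tract shape (assumed edges).
narrow = harmonics_in_band(f0, 1700.0, 1900.0)
```

Under these assumptions the broad band passes 17 neighbouring harmonics, which can interact to produce oscillations at the fundamental, whereas the narrow band passes only the 1800 Hz harmonic. A single harmonic has no neighbour to interact with, so it can be heard as a separate high pitch at its own frequency.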
Leighton R (1991) Tuva or Bust!: Richard Feynman's Last Journey. W. W. Norton & Company.
Shackleton TM, Carlyon RP (1994) The role of resolved and unresolved harmonics in pitch perception and frequency modulation discrimination. Journal of the Acoustical Society of America 95:3529–3540. https://doi.org/10.1121/1.409970