Statistical modeling of subcoda structure in sperm whales.
A Sperm whale communication consists of rhythmic sequences of clicks, called codas. A coda is specified by a sequence of inter-click intervals (ICIs). Codas are classified into types based on their rhythmic pattern, which can have various degrees of regularity (e.g., 4R2 vs 1+3). B Social groups of sperm whales employ specific vocal repertoires: the set of coda types they use and their associated usage frequencies. As an illustration, we show those of the EC1 and EC2 clans from the Dominica dataset [12]. Only the most numerous coda types are shown: the rest of the vocal repertoires consists of more coda types with residual frequencies. C The subcoda structure can be modeled by considering rhythmic variations within codas of the same coda type. To do so, codas are represented as sequences of discrete inter-click intervals (dICIs), by discretizing absolute ICIs into discrete bins, which can then be considered akin to symbols (e.g. A, B, C, …), providing a tokenization of codas. Different instances of a single coda type can correspond to slightly different dICI sequences. The resulting dICI sequences are modeled using variable-length Markov chains, which can be represented as subcoda trees. These trees can be built for an individual speaker or for a group of speakers, and capture the statistical and memory structures of rhythm variations within codas and in the transitions between those. In other words, the tree captures a vocal style—how they say what they say. The vocal styles of different groups of sperm whales can be quantitatively compared by calculating a distance between their subcoda trees.