EEG-based detection of the locus of auditory attention with convolutional neural networks

Abstract

In a multi-speaker scenario, the human auditory system is able to attend to one particular speaker of interest and ignore the others. It has been demonstrated that it is possible to use electroencephalography (EEG) signals to infer to which speaker someone is attending by relating the neural activity to the speech signals. However, classifying auditory attention within a short time interval remains the main challenge. We present a convolutional neural network-based approach to extract the locus of auditory attention (left/right) without knowledge of the speech envelopes. Our results show that it is possible to decode the locus of attention within 1 to 2 s, with a median accuracy of around 81%. These results are promising for neuro-steered noise suppression in hearing aids, in particular in scenarios where per-speaker envelopes are unavailable.

Data availability

Code used for training and evaluating the network has been made available at https://github.com/exporl/locus-of-auditory-attention-cnn. The CNN models used to generate the results shown in the paper are also available at that location. The dataset used in this study had been made available earlier at https://zenodo.org/record/3377911.

The following previously published data sets were used

Article and author information

Author details

  1. Servaas Vandecappelle

    Department of Neurosciences, Katholieke Universiteit Leuven, Leuven, Belgium
    For correspondence
    servaas.vandecappelle@gmail.com
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0266-7293
  2. Lucas Deckers

    Department of Neurosciences, Katholieke Universiteit Leuven, Leuven, Belgium
    Competing interests
    The authors declare that no competing interests exist.
  3. Neetha Das

    Department of Neurosciences, Katholieke Universiteit Leuven, Leuven, Belgium
    Competing interests
    The authors declare that no competing interests exist.
  4. Amir Hossein Ansari

    Department of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium
    Competing interests
    The authors declare that no competing interests exist.
  5. Alexander Bertrand

    Department of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4827-8568
  6. Tom Francart

    Dept. of Neurosciences, Katholieke Universiteit Leuven, Leuven, Belgium
    For correspondence
    tom.francart@kuleuven.be
    Competing interests
    The authors declare that no competing interests exist.

Funding

KU Leuven Special Research Fund (C14/16/057)

  • Tom Francart

KU Leuven Special Research Fund (C24/18/099)

  • Alexander Bertrand

Research Foundation Flanders (1.5.123.16N)

  • Alexander Bertrand

Research Foundation Flanders (G0A4918N)

  • Alexander Bertrand

European Research Council (637424)

  • Tom Francart

European Research Council (802895)

  • Alexander Bertrand

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: The experiment was approved by the Ethics Committee Research UZ/KU Leuven (S57102) and every participant signed an informed consent form approved by the same commitee.

Copyright

© 2021, Vandecappelle et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,746
    views
  • 524
    downloads
  • 72
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Servaas Vandecappelle
  2. Lucas Deckers
  3. Neetha Das
  4. Amir Hossein Ansari
  5. Alexander Bertrand
  6. Tom Francart
(2021)
EEG-based detection of the locus of auditory attention with convolutional neural networks
eLife 10:e56481.
https://doi.org/10.7554/eLife.56481

Share this article

https://doi.org/10.7554/eLife.56481

Further reading

    1. Neuroscience
    Geoffrey W Meissner, Allison Vannan ... FlyLight Project Team
    Research Article

    Techniques that enable precise manipulations of subsets of neurons in the fly central nervous system (CNS) have greatly facilitated our understanding of the neural basis of behavior. Split-GAL4 driver lines allow specific targeting of cell types in Drosophila melanogaster and other species. We describe here a collection of 3060 lines targeting a range of cell types in the adult Drosophila CNS and 1373 lines characterized in third-instar larvae. These tools enable functional, transcriptomic, and proteomic studies based on precise anatomical targeting. NeuronBridge and other search tools relate light microscopy images of these split-GAL4 lines to connectomes reconstructed from electron microscopy images. The collections are the result of screening over 77,000 split hemidriver combinations. Previously published and new lines are included, all validated for driver expression and curated for optimal cell-type specificity across diverse cell types. In addition to images and fly stocks for these well-characterized lines, we make available 300,000 new 3D images of other split-GAL4 lines.

    1. Neuroscience
    Hyun Jee Lee, Jingting Liang ... Hang Lu
    Research Advance

    Cell identification is an important yet difficult process in data analysis of biological images. Previously, we developed an automated cell identification method called CRF_ID and demonstrated its high performance in Caenorhabditis elegans whole-brain images (Chaudhary et al., 2021). However, because the method was optimized for whole-brain imaging, comparable performance could not be guaranteed for application in commonly used C. elegans multi-cell images that display a subpopulation of cells. Here, we present an advancement, CRF_ID 2.0, that expands the generalizability of the method to multi-cell imaging beyond whole-brain imaging. To illustrate the application of the advance, we show the characterization of CRF_ID 2.0 in multi-cell imaging and cell-specific gene expression analysis in C. elegans. This work demonstrates that high-accuracy automated cell annotation in multi-cell imaging can expedite cell identification and reduce its subjectivity in C. elegans and potentially other biological images of various origins.