Emergence of brain-like mirror-symmetric viewpoint tuning in convolutional neural networks
Abstract
Primates can recognize objects despite 3D geometric variations such as in-depth rotations. The computational mechanisms that give rise to such invariances are yet to be fully understood. A curious case of partial invariance occurs in the macaque face-patch AL and in fully connected layers of deep convolutional networks in which neurons respond similarly to mirror-symmetric view (e.g., left and right profiles). Why does this tuning develop? Here, we propose a simple learning-driven explanation for mirror-symmetric viewpoint tuning. We show that mirror-symmetric viewpoint tuning for faces emerges in the fully connected layers of convolutional deep neural networks trained on object recognition tasks, even when the training dataset does not include faces. First, using 3D objects rendered from multiple views as test stimuli, we demonstrate that mirror-symmetric viewpoint tuning in convolutional neural network models is not unique to faces: it emerges for multiple object categories with bilateral symmetry. Second, we show why this invariance emerges in the models. Learning to discriminate among bilaterally symmetric object categories induces reflection-equivariant intermediate representations. AL-like mirror-symmetric tuning is achieved when such equivariant responses are spatially pooled by downstream units with sufficiently large receptive fields. These results explain how mirror-symmetric viewpoint tuning can emerge in neural networks, providing a theory of how they might emerge in the primate brain. Our theory predicts that mirror-symmetric viewpoint tuning can emerge as a consequence of exposure to bilaterally symmetric objects beyond the category of faces, and that it can generalize beyond previously experienced object categories.
Data availability
The stimulus set and the source code required for reproducing our results are available at https://gitfront.io/r/afarzmahdi/p666tmWy7YuY/AL-symmetry-manuscript-codes/.
-
Object categories across viewshttps://github.com/amirfarzmahdi/AL-Symmetry.
Article and author information
Author details
Funding
National Eye Institute (R01EY021594)
- Winrich A Freiwald
National Eye Institute (R01EY029998)
- Winrich A Freiwald
National Institute of Neurological Disorders and Stroke (RF1NS128897)
- Nikolaus Kriegeskorte
Naval Research Laboratory (N00014-20-1-2292)
- Winrich A Freiwald
Charles H. Revson Foundation
- Tal Golan
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Copyright
© 2024, Farzmahdi et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 845
- views
-
- 143
- downloads
-
- 0
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Neuroscience
Processing pathways between sensory and default mode network (DMN) regions support recognition, navigation, and memory but their organisation is not well understood. We show that functional subdivisions of visual cortex and DMN sit at opposing ends of parallel streams of information processing that support visually mediated semantic and spatial cognition, providing convergent evidence from univariate and multivariate task responses, intrinsic functional and structural connectivity. Participants learned virtual environments consisting of buildings populated with objects, drawn from either a single semantic category or multiple categories. Later, they made semantic and spatial context decisions about these objects and buildings during functional magnetic resonance imaging. A lateral ventral occipital to fronto-temporal DMN pathway was primarily engaged by semantic judgements, while a medial visual to medial temporal DMN pathway supported spatial context judgements. These pathways had distinctive locations in functional connectivity space: the semantic pathway was both further from unimodal systems and more balanced between visual and auditory-motor regions compared with the spatial pathway. When semantic and spatial context information could be integrated (in buildings containing objects from a single category), regions at the intersection of these pathways responded, suggesting that parallel processing streams interact at multiple levels of the cortical hierarchy to produce coherent memory-guided cognition.
-
- Neuroscience
Orexin signaling in the ventral tegmental area and substantia nigra promotes locomotion and reward processing, but it is not clear whether dopaminergic neurons directly mediate these effects. We show that dopaminergic neurons in these areas mainly express orexin receptor subtype 1 (Ox1R). In contrast, only a minor population in the medial ventral tegmental area express orexin receptor subtype 2 (Ox2R). To analyze the functional role of Ox1R signaling in dopaminergic neurons, we deleted Ox1R specifically in dopamine transporter-expressing neurons of mice and investigated the functional consequences. Deletion of Ox1R increased locomotor activity and exploration during exposure to novel environments or when intracerebroventricularely injected with orexin A. Spontaneous activity in home cages, anxiety, reward processing, and energy metabolism did not change. Positron emission tomography imaging revealed that Ox1R signaling in dopaminergic neurons affected distinct neural circuits depending on the stimulation mode. In line with an increase of neural activity in the lateral paragigantocellular nucleus (LPGi) of Ox1RΔDAT mice, we found that dopaminergic projections innervate the LPGi in regions where the inhibitory dopamine receptor subtype D2 but not the excitatory D1 subtype resides. These data suggest a crucial regulatory role of Ox1R signaling in dopaminergic neurons in novelty-induced locomotion and exploration.