The oneirogen hypothesis: modeling the hallucinatory effects of classical psychedelics in terms of replay-dependent plasticity mechanisms

Colin Bredenberg; Fabrice Normandin; Blake Richards; Guillaume Lajoie

doi:10.7554/eLife.105968.1

eLife Assessment

This paper provides a useful new theory of the hallucinatory effects of psychedelics. The authors present convincing evidence that a computational model trained with the Wake-Sleep algorithm can reproduce some features of hallucinations by varying the strength of top-down connections in the model, but discussion of the model's relationships to the psychedelics and sleep literatures is incomplete. The work will be of interest to researchers studying hallucinations or offline activity and learning more broadly.

https://doi.org/10.7554/eLife.105968.1.sa3

Significance of findings

useful: Findings that have focused importance and scope

landmark
fundamental
important
valuable
useful

Strength of evidence

convincing: Appropriate and validated methodology in line with current state-of-the-art

incomplete: Main claims are only partially supported

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Classical psychedelics induce complex visual hallucinations in humans, generating percepts that are co-herent at a low level, but which have surreal, dream-like qualities at a high level. While there are many hypotheses as to how classical psychedelics could induce these effects, there are no concrete mechanistic models that capture the variety of observed effects in humans, while remaining consistent with the known pharmacological effects of classical psychedelics on neural circuits. In this work, we propose the “oneirogen hypothesis”, which posits that the perceptual effects of classical psychedelics are a result of their pharmacological actions inducing neural activity states that truly are more similar to dream-like states. We simulate classical psychedelics’ effects via manipulating neural network models trained on perceptual tasks with the Wake-Sleep algorithm. This established machine learning algorithm leverages two activity phases, a perceptual phase (wake) where sensory inputs are encoded, and a generative phase (dream) where the network internally generates activity consistent with stimulus-evoked responses. We simulate the action of psychedelics by partially shifting the model to the ‘Sleep’ state, which entails a greater influence of top-down connections, in line with the impact of psychedelics on apical dendrites. The effects resulting from this manipulation capture a number of experimentally observed phenomena including the emergence of hallucinations, increases in stimulus-conditioned variability, and large increases in synaptic plasticity. We further provide a number of testable predictions which could be used to validate or invalidate our oneirogen hypothesis.

1 Introduction

Classical psychedelics—including psilocybin, mescaline, DMT, and LSD—are a family of hallucinogenic compounds with a common mechanism of action: they are agonists for the 5-HT2a serotonin receptor commonly expressed on the apical dendrites of cortical pyramidal neurons [1]. These drugs induce numerous effects in human subjects, including: complex visual, auditory, and tactile hallucinations; intense spiritual experiences; long-lasting alterations in mood; changes in personality; and increases in synaptic plasticity [2, 3, 4]. They have further been used for millennia as medicine and in religious rituals [5]; more recently, they have been explored clinically as potential treatments for depression and anxiety [6], as well as PTSD [7].

The 5-HT2a receptor plays a critical role in psychedelic-induced hallucinations. Indeed, perceptual effects are largely eliminated by blocking these receptors in the cortex [8, 9]. However, very little is understood about why highly structured hallucinations and changes in synaptic plasticity emerge from activating cortical 5-HT2a receptors: to explain this, it is necessary to develop mechanistic theories that are capable of linking changes in neuron-level properties (receptor agonism) to changes in perception and behavior. Psychedelic drug users and therapists have long noted the ‘dream-like’ qualities of psychedelic drug hallucinations, which are realistic but untethered from the external world; this observation leads naturally to speculation that these drugs are ‘oneirogens,’ or dream-manifesting compounds [10]. However, beyond perceptual phenomenology (and some evidence pointing to the effects of psychedelics on sleep cycles [11, 12, 13]), we lack a mechanistic proposal that could explain the similarity between dreams and psychedelic drug experiences. Here, we articulate the ‘oneirogen hypothesis’, which describes one such potential mechanistic explanation. We propose that classical psychedelics induce a dream-like state by shifting the balance between bottom-up pathways transmitting sensory information and top-down pathways ordinarily used to create replay sequences in the brain. Replay sequences have been shown to be important for learning during sleep [14, 15, 16, 17, 18]: we propose that mechanisms supporting replaydependent learning during sleep are key to explaining the increases in plasticity caused by psychedelic drug administration. In total, our model of the functional effect of psychedelics on pyramidal neurons could provide a explanation for the perceptual psychedelic experience in terms of learning mechanisms for consolidation during sleep [19], and cortical ‘replay’ phenomena [20, 21, 22, 23, 24, 25, 26, 27, 28].

To explore the oneirogen hypothesis concretely, we use the aptly named Wake-Sleep algorithm [29], which has historically been used to train artificial neural networks (ANNs) that possess both a bottomup “recognition” pathway and a top-down “generative” pathway to learn a representation of incoming sensory data. It enables unsupervised learning in ANNs by alternating between periods of “waking perception” (wherein bottom-up recognition pathways drive activity) and “dreaming sequences” (wherein top-down generative pathways drive activity). With these alternate periods of distinct activity, connectivity parameters in each pathway are adjusted to match the activity of the opposite pathway. This way, the top-down pathway learns to generate activity consistent with that induced by sensory inputs, and the bottom-up pathway learns better representations thanks to generated activity.

In this work, we show that within a neural network trained via Wake-Sleep, it is possible to model the action of classical psychedelics (i.e. 5-HT2a receptor agonism) by shifting the balance during the wake state from the bottom-up pathways to the top-down pathways, thereby making the ‘wake’ network states more ‘dream-like’. Specifically, we model the effects of classical psychedelics by manipulating the relative influence of top-down and bottom-up connections in neural networks trained with the Wake-Sleep algorithm on images. Doing so, we capture a number of effects observed in experiments on individuals under the influence of psychedelics, including: the emergence of closed-eye hallucinations, increases in stimulus-conditioned variability, and large increases in synaptic plasticity. This data suggests that the oneirogen hypothesis may indeed help to explain why 5-HT2a agonists have the functional effects that they do. We subsequently identify several testable predictions that could be used to further validate the oneirogen hypothesis.

2 Results

Mapping the Wake-Sleep algorithm onto cortical architecture

The Wake-Sleep algorithm allows ANNs to optimize a global, unsupervised objective function for sensory representation learning— the Evidence Lower Bound (ELBO)—through local synaptic modifications to a bottom-up recognition pathway and a top-down generative pathway. As a precursor to the variational autoencoder [30, 31], the Wake-Sleep algorithm provides a mechanism for learning a probabilistic latent representation r responding to incoming sensory stimuli s, which obeys representational characteristics that are ideal for a neural system (e.g. sparsity and metabolic efficiency [32], compression and coding efficiency [33, 34], or disentanglement [35, 36]). To do this, Wake-Sleep optimizes the ELBO through an approximation of the Expectation Maximization (EM) algorithm [37] to train the two pathways (Figure 1a). For readers who are unfamiliar with the Wake-Sleep algorithm, a tutorial can be found here [38].

Notably, the Wake-Sleep algorithm requires two phases of activity (i.e. “Wake” and “Sleep”), where the network phase is controlled by a global state variable α ∈ [0, 1] that regulates the balance between the bottom-up and top-down pathways. In the Wake phase (α = 0), the network processes real sensory stimuli drawn from the environment, and network activity is sampled based on the bottom-up inputs (corresponding to the approximate inference distribution). In the Sleep phase (α = 1), the network internally samples neural activity from its generative model, which then produces generated activity in the stimulus layer s. We use this structure of the Wake-Sleep algorithm as a concrete model to express oneirogen hypothesis. Specifically, we use changes to the value of α as a means of modeling a 5-HT2a agonist-induced shift to a more dream-like state, as we detail below.

Within the Wake-Sleep algorithm, neurons alternate between ‘Wake’ and ‘Sleep’ modes, where activity during each mode is dominated by the bottom-up and top-down pathways, respectively. We can determine the neural activity for a given intermediate layer l with the following equation:

where h(r) defines bottom-up input, μ(r) defines top-down input, f (h, μ, α) is any interpolation function such that f (h, μ, 0) = h and f (h, μ, 1) = μ, σ_b and σ_p define the bottom-up and top-down activity standard deviations, and η ∼ 𝒩(0, 1) adds random noise to the neural activity (see Methods for more detail). Here, for notational conciseness we treat r as a concatenated vector of all r^(l) vectors from each layer. This equation means that α controls whether bottom-up inputs or top-down inputs control the dynamics of individual neural units.

Thus, as α moves from a value of 0 to a value of 1 the activity of the neurons shifts from being driven by the bottom-up recognition pathway to being driven by the top-down generative pathway. How could this occur in the brain? In the cortex, excitatory pyramidal neurons receive inputs from distinct sources: inputs that are from ‘higher order’ cortical areas target the apical dendrites, whereas inputs that are from ‘lower order’ cortical or sensory subcortical areas target the basal dendrites [39]. Thus, we can capture the core idea behind the oneirogen hypothesis using the Wake-Sleep algorithm, by postulating that the bottom-up basal synapses are predominantly driving neural activity during the Wake phase (when α is low), while top-down apical synapses are predominantly driving neural activity during the Sleep phase (when α is high; Figure 1) [40]; this is in agreement with several recent theoretical studies that have proposed that apical dendrites could serve as a site for integrating top-down learning signals [41, 42, 43, 44, 45, 46], particularly those which propose that the top-down signal corresponds to a predictive or generative model of neural activity [47, 48]. The idea that apical dendritic influence increases during sleep (or replay) in cortex has been only partially supported by experiments [49], and is one critical testable prediction of our model. Notably, 5-HT2a receptors are are expressed in the apical dendrites of pyramidal neurons [1] and basal dendrite-targeting parvalbumin inhibitory interneurons [50], and have an excitatory effect that positively modulates glutamatergic transmission [51, 52]. These data suggest that 5-HT2a agonists could have a push-pull effect on cortical pyramidal neurons, increasing the relative influence of apical dendrites and decreasing the relative influence of basal dendrites. Hence, we can model these effects by changing the α value in a Wake-Sleep trained network, and then ask whether the networks exhibit other phenomena that match the known impact of classical psychedelics on neural activity. We note that with this mapping of the Wake-Sleep algorithm to models of basal and apical processing, synaptic modifications at both apical and basal synapses correspond to minimizing a local prediction-error between top-down and bottom-up inputs (see Methods).

Modeling Hallucinations

To see whether a transition from waking to a more dream-like state would induce hallucinatory effects in our model, we trained multilayer neural networks with branched dendritic arbors (see Methods) on the MNIST digits dataset [53] using the Wake-Sleep algorithm and subsequently simulated hallucinatory activity by varying α (see Methods; Eq. (8)). We could visualize the effects of our simulated psychedelic with snapshots of the stimulus layer s at a fixed point in time for various values of α (Figure 2; see Supplemental Materials for videos). As α increased, we observed that network activity gradually deformed away from the ground-truth stimulus in a highly structured way, adding strokes to the original digit that were not originally present. At the highest values of α tested, we found that network states were wholly divorced from the ground-truth stimulus, but retained many characteristics of the MNIST digits on which the network was trained (e.g. smooth strokes and the rough form of digits). These results emphasize that hallucinations induced by a shift to a more dream-like state in these models are heavily influenced by the training dataset, which for an animal would correspond to the statistics of the sensory environment in which it learns its sensory representation. To emphasize this point, we further trained our networks on the CIFAR10 natural images dataset [54] (Figure 2c), to provide an example of a more naturalistic training dataset. In this case, our model was not powerful enough to reproduce realistic natural images—instead, we found that our modeled hallucinatory activity corresponded to ‘ripple’ effects, which are similar to the ‘breathing’ and ‘rippling’ phenomena reported by psychedelic drug users at low doses [2].

Visualizing the effects of psychedelics in the model.
We model the effects of classical psychedelics by progressively increasing α from 0 to 1 in our model, where α = 1 is equivalent to the Sleep phase. We visualize the effects of psychedelics on the network representation by inspecting the stimulus layer **s. a)** Example stimulus-layer activity (rows) in response to an MNIST digit presentation as psychedelic dose increases (columns, left to right). b) Same as (a) but for ‘eyes-closed’ conditions where an entirely black image is presented. **c-d)** Same as (a-b), but for the CIFAR10 dataset.

These simulations were produced with a complex, multicompartmental neuron model; however, we found similar results with two alternative network architectures, one with within-layer recurrence (Supplemental Figure S1a) and one which used a simpler single compartment neuron model (Supplemental Figure S1b). We found that our single compartment model produced qualitatively less realistic generated images than the multicompartment and recurrent models, justifying our use of the more complex models (Supplemental Figure S2). To demonstrate the importance of a learned top-down pathway to produce complex, structured hallucinations in the earliest layers of our network, we generated model hallucinations from two control networks: an untrained model and a trained network where psychedelic activity was alternatively modeled by a simple increase in the variance of individual neurons (we will refer to this latter control as the noise-based hallucination protocol). We found that hallucinations under these control conditions resembled additive white noise, rather than structured digit-like shapes (Supplemental Figure S1c-d).

Psychedelic drug users also report observing the emergence of hallucinations while their eyes are closed [2]. Interestingly, we found that our model recapitulated these phenomena: as α increased, networks trained on MNIST gradually began revealing increasingly complex and digit-like patterns (Figure 2b), whereas CIFAR10-trained networks again predominantly produced ‘ripple’ hallucinations (Figure 2d).

Effects of psychedelics on single neurons

Having recapitulated hallucinatory phenomena in stimulus space, we next explored how our proposed mechanism affected neural activity in our network model, in order to establish markers that could be used to experimentally validate or invalidate the oneirogen hypothesis. To start, we investigated the effects of learning and psychedelic drug administration on the activity of single neurons in the model. As noted previously, the learning algorithm used here trains synapses so that top-down inputs to apical dendritic compartments match bottom-up inputs to basal dendritic compartments. As a consequence, we observed that after training, inputs to apical and basal dendritic compartments were much more correlated on the same neuron than they were for random neurons (Figure 3a), which was not observed in untrained models (Supplemental Figure S3a). This form of strongly correlated tuning has been observed in both cortex and the hippocampus [55, 56].

Effects of psychedelics on single model neurons.
a) Correlations between the apical and basal dendritic compartments of either the same network neuron or between randomly selected neurons. Total plasticity for apical (left) and basal (right) synapses as α increases in the model when plasticity is either gated or not gated by α. Error bars indicate +/-1 s.e.m. c) Cosine similarity between plasticity induced under psychedelic conditions compared to baseline for apical (left) and basal (right) synapses.

There are many indicators that psychedelic drug administration in humans and animals can induce marked, long-lasting changes in behavior, as well as large increases in synaptic plasticity [3, 57, 58, 9, 4]. In Wake-Sleep learning, apical synapses learn during the Wake phase, whereas basal synapses learn during the Sleep phase—thus, plasticity at apical synapses is gated by (1− α), whereas plasticity at basal synapses is gated by α (see Methods). However, learning is still theoretically possible without this explicit gating, though it may be noisier and less efficient; furthermore, it is conceivable that classical psychedelics could increase the relative influence of apical inputs on the activity of a neuron without affecting this gating mechanism. As a consequence, we modeled the dose-dependent effects of psychedelics on plasticity both with and without gating (Figure 3b). Consistent with recent experimental results [3], for intermediate doses we found large increases in plasticity at both apical and basal synapses under both conditions, where plasticity was measured as a mean change in normalized synaptic strength across weight parameters in our network (see Methods). In our model, we found that the total evoked plasticity peaked at roughly α = 0.5; we further found that if gating was affected by psychedelics, apical plasticity would eventually be quenched at very high drug doses. We also found that plasticity induced by psychedelic drug administration gradually became unaligned from the weight updates that would have occurred in the absence of the drug (Figure 3c), indicating that these results were not simply due to modulation of the effective learning rate of the underlying plasticity. Rather, as has been suggested by other theoretical studies [59], plasticity in the model likely increased because aberrant hallucinatory activity pulled the learning mechanism out of a local optimum in which plasticity was minimal, producing much more plasticity across the network. Importantly, we observed these increases in plasticity in all network architectures and training datasets we explored, including for our noise-based hallucination protocol (Supplemental S4), demonstrating that changes in apical dendritic influence within a Wake-Sleep learning framework are sufficient, but not necessary to induce increases in synaptic plasticity: for trained networks, it would seem that even simple increases in neural variability can have similar effects.

Effects of psychedelics on neural variability

Having observed that increasing our modeled drug dosage caused heightened fluctuations and deviations from the ground-truth stimulus in the sensory layer of our network (Figure 2), we next investigated whether variability was affected at the level of individual neurons in higher layers of the model. Indeed, we found that for a fixed stimulus, neural variability increased markedly as the simulated psychedelic drug dose increased (Figure 4a). This result is consistent with the data supporting the Entropic Brain Theory [60, 61, 62, 63], in which neural activity in resting state fMRI recordings becomes increasingly ‘entropic’ (i.e. variable) under the influence of psychedelics; however, it is important to note that our noise-based hallucination protocol also produced these effects (Supplemental Figure S5a). Though most experimental data supporting the Entropic Brain Theory is taken from recordings with relatively poor spatial resolution, averaging activity over large cortical areas, our model predicts that this increase in variability should be reflected at the level of individual neurons; this increase in variability after psychedelic administration has been recently observed in auditory cortical neurons for active mice [64], but whether this phenomenon is general across tasks and cortical areas remains to be seen. We further found that this increase in variability corresponded to a decrease in ability to identify the stimulus being presented to the network: we trained a classifier to identify which MNIST digit was presented to our networks on Wake neural activity (see Methods), and found that the accuracy of our classifier decreased (Figure 4b) while the output variability of the classifier increased (Figure 4c) in response to drug administration.

Within our model, this increase in variability is quite sensible: in the ordinary Wake state, neural activity is constrained to correspond to the singular sensory stimulus being presented, whereas during Sleep states, neural activity is completely unconstrained by any particular sensory stimulus, reflecting instead the full distribution of possible sensory stimuli. As increasing α in our model interpolates between Wake and Sleep states, we can expect intermediate values of α to produce network states which are less constrained by the particular sensory stimulus being presented, reflected in increased neural variability.

Network-level effects of psychedelics

We next investigated the effects of psychedelics on network-level and inter-areal dynamics within our model. We first identified an important negative result: the pairwise correlation structure between neurons was largely preserved across psychedelic doses (Figure 5a-b), as was the effective dimensionality of population activity (Figure 5c). This was sensible, because a network that has been well-trained with the Wake-Sleep algorithm will have the same marginal distribution of network states in the Wake mode as in the Sleep mode—thus, pairwise correlations between neurons should also not differ (as measures of the second order moments of the marginal distribution). We found empirically that even for intermediate values of α in which activity is a mixture of Wake and Sleep modes, these correlations are largely unchanged; in contrast, we observed large changes in correlation structure for untrained networks, and increases in effective dimensionality for both untrained networks and for our simple noise-based hallucination protocol, suggesting that these results are more specific to our trained models in which hallucinations are caused by an increase in apical dendritic influence (Supplemental Figure S6a-b). Interestingly, these results are consistent with a recent study that has shown only minimal functional connectivity and effective dimensionality changes in task-engaged humans being presented audiovisual stimuli under the influence of psilocybin [63].

However, though the pairwise correlations between single neurons are largely preserved, the causal influence between lower and higher layers of our model network changes considerably both during hallucination and Sleep modes. Because psychedelic drug administration increases influence of apical dendritic inputs on neural activity in our model, we found that silencing apical dendritic activity reduced across-stimulus neural variability more as the psychedelic drug dose increases (Figure 5d). Further, we found that as α increased, inactivating the deepest network layer induced a large reduction in variability in the stimulus layer relative to baseline (Figure 5e), revealing that within our model, increases in top-down influence are responsible for much of the observed stimulus-conditioned variability at larger drug doses. These inactivations had no impact on neural variability in our noise-based hallucination protocol, but were observed for all network architectures and datasets that we tested in which hallucinations were caused by an increase in apical dendritic influence (Supplemental Figure S6), suggesting that these results are quite specific to our model. Further, these inactivations have not yet been performed in animals, and consequently constitute a critical testable prediction of our model.

3 Discussion

Experimental results captured by our model

In this study, we have examined a hypothetical mechanism explaining how the 5-HT2a receptor agonism of classical psychedelics could induce the highly structured hallucinations reported by people who have consumed these drugs. Specifically, we have explored the ‘oneirogen hypothesis’, which postulates that 5-HT2a agonists have the effects that they do because they shift the neocortex to a more dream-like state, wherein activity is more strongly driven by top-down inputs to apical dendrites than normally occurs during waking. To provide a concrete model to explore the ‘oneirogen hypothesis’ we used the classic Wake-Sleep algorithm, which learns by toggling between a Wake phase, where activity is driven by bottom-up sensory inputs, and a Sleep phase, where activity is driven by top-down generative signals. We modeled the ‘oneirogen hypothesis’ by simulating psychedelic administration as an increase in a neuronal state variable that switches neural activity between these two phases (α), such that the simulated psychedelic caused the network to enter a state somewhere between the Wake and Sleep phases, making activity during the Wake phase less tied to actual sensory inputs by increasing the relative influence of the top-down, apical compartment in the models (depending on the “dosage”). This formulation is consistent with anatomical wiring data [39], as well as several recent theoretical studies which propose a specialized learning role for top-down projections to the apical dendrites of pyramidal neurons [41, 42, 43, 44, 45, 46]. It is also consistent with the known cellular mechanism of action of classical psychedelics [1, 51, 52, 8]. Using this model, we were able to produce both stimulus-conditioned and “closed-eye” hallucinations that are consistent with the low-level effects reported by psychedelic drug users [2], and we were also able to recapitulate the large increases in plasticity observed at both apical and basal synapses at moderate psychedelic doses [3].

Our model uses a particular functional form of synaptic plasticity at both apical and basal synapses, reminiscent of the classical delta rule [65], which seeks to minimize a prediction error between inputs in apical and basal synapses. There are many theoretical models of learning that propose similar forms of plasticity [42, 43, 47], so while this plasticity is a necessary prediction of our model, it is not sufficient to validate it. Experimentally, plasticity dynamics which could, theoretically, minimize such a prediction error have been observed in cortex [66, 67], and it has also been proposed that behavioral timescale plasticity in the hippocampus could subserve a similar function [68]. We found that plasticity rules of this kind induce strong correlations between inputs to the apical and basal dendritic compartments of pyramidal neurons, which has been observed in the hippocampus and cortex [55, 56].

Interestingly, we also found that increasing the influence of apical dendrites in the model increased stimulus-conditioned variability in our individual neurons. In cortex, this effect has recently been shown at the level of single auditory neurons [64]; further, there have been numerous studies reporting similar increases in variability (or, analogously, entropy) in resting-state human brain recordings, previously modeled using Entropic Brain Theory [62]. This theory proposes that many of the effects of classical psychedelics on perception and learning can be explained in terms of increases in variability induced by drug administration (e.g. the increase in variability could introduce novel patterns of thinking, or perturb learning to allow it to break out of ‘local minima’). Our results are broadly consistent with this perspective, to which we have added explanatory layers that are both normative and mechanistic [69, 70]: namely, we speculate that this variability under ordinary conditions results from an ethologically important mechanism underlying generative replay for unsupervised learning during sleep or quiescence, and we propose that mechanistically this increase in variability is caused by the increased influence of top-down synapses that are not tied to incoming sensory stimuli.

Testable predictions

While our results are broadly consistent with existing experimental evidence, there are many unconfirmed aspects of our model which could be tested to validate or invalidate it (summarized in Table 1). As mentioned in the previous section, our model predicts that single neurons should increase variability in response to psychedelic drug administration in any cortical area affected by psychedelic drugs. Second, we propose that psychedelic drugs should not push network dynamics into wildly different operating regimes than normal wakefulness, beyond any differences observed between wakefulness and replay (dream) during sleep. In particular, we found that our simulated psychedelic drug administration did not perturb pairwise correlations between neurons within local circuits when averaged across an ecologically representative set of stimuli.

Within our model, psychedelic drug administration is expected to increase the relative influence of top-down projections. This could be explored experimentally in several ways: first, we have shown that apical dendrite-targeted silencing experiments can identify the amount of influence apical dendritic inputs exert on neuronal dynamics; second, we have shown that increases in top-down influence can in principle be identified with interareal silencing experiments. We caution that interpreting results in this second vein may be difficult, as establishing a clean distinction between a ‘higher order’ and ‘lower order’ cortical area may be much more difficult in a densely recurrent system such as the brain, compared to our simplified and fully observable network model. Interestingly, if psychedelic drugs are genuinely exploiting circuitry ordinarily reserved for generative replay during periods of offline quiescence or sleep, we would expect that the same changes in functional connectivity observed during psychedelic drug administration would also occur during periods of replay. In the hippocampus, periods of replay have been tied to increases in apical dendritic influence [71], and increases in the strength of apical inputs have also been observed during NREM cortical updates in prefrontal cortex [49], though in this latter case it is unclear whether apical inputs affected the firing properties of the cell: thus, though there is some circumstantial evidence supporting the idea, much work remains to assess whether apical dendritic inputs dominate pyramidal neuron activity during replay or dreaming in cortex.

Though we have provided a candidate explanation for several of the effects of psychedelic drugs, our model rests on a number of testable assumptions. Our goal has been to articulate these assumptions as clearly as possible, to facilitate experimental efforts to test them.

Comparisons to alternative models

Aside from our model, there are two prominent existing hypotheses as to how psychedelic drugs could induce hallucinations in neural networks. The first proposed that incredibly complex, geometric patterns formed by DMT administration could be attributed to pattern-formation effects in visual cortex caused by a disruption of the balance between excitation and inhibition in locally-coupled topographic recurrent neural networks [72, 73]. Our work differs from this approach in several respects. First, rather than disrupting E-I balance, we propose that psychedelics increase the relative influence of apical dendrites and top-down projections on the dynamics of neural activity. Second, though their model is able to generate geometric patterns, it is not able to generate patterns that are statistically related to the features of the sensory environment (e.g. MNIST digits). Lastly, for simplicity we avoided including topographic (or convolutional) recurrent connectivity in our model; however, it would be a very fruitful direction for future research to extend our work to generative modeling of temporal video sequences, as in [74, 75]. With such a development, it is conceivable that our model could directly generalize these pattern formation-based approaches.

Perhaps more closely related to our model is the ‘relaxed beliefs under psychedelics’ (REBUS) model, which proposes to explain the effects of classical psychedelics in terms of predictive coding theory [60]. Similar to the Wake-Sleep algorithm, predictive coding theory [76] models sensory representation learning with neural dynamics and local synaptic modifications that collectively optimize an ELBO objective function. However, at a mechanistic level, there are numerous differences, the most easily distinguishable feature being that the Wake-Sleep algorithm requires periods of offline ‘generative replay’ to train bottom-up synapses in its network, whereas predictive coding learning occurs concomitantly with stimulus presentation. Furthermore, the REBUS model of psychedelic effects is described at a computational level, in terms of a decrease in the ‘precision-weighting of top-down priors.’ While it is more difficult to map the REBUS model directly onto cortical microcircuitry, and the hallucinatory effects of such a model have, to our knowledge, not been directly analyzed, it has been shown that the proposed mechanism causes an increase in bottom-up information flow between cortical areas [77], in direct contrast to the effects that we have shown in our model (Figure 5c-d). However, because genuinely causal interareal information flow can be difficult to analyze due to dense recurrent connectivity, we stress that it would be easier to distinguish between the REBUS model and our ‘oneirogen hypothesis’ by exploring whether psychedelic drugs affect the same circuitry that induces ‘generative replay’ during periods of sleep and quiescence.

Lastly, it should be noted that the Wake-Sleep algorithm and our choice of network architecture constitute one particular model within a family of related models, all of which satisfy our key criteria for a good model of the ‘oneirogen hypothesis,’ namely that 1) the model has well-defined top-down and bottom-up pathways, 2) it learns a generative model of incoming sensory inputs, and 3) it uses periods of offline replay for learning through local synaptic plasticity. For example, in the Supplemental Materials, we have replicated all of our essential results for two alternative network architectures, also learned via the Wake-Sleep algorithm: one model uses within-layer recurrence to improve generative performance, while the other model uses a simpler single compartment neuron model. Furthermore, the closely related Contrastive Divergence learning algorithm for Boltzmann Machines [78] also involves alternations between Wake and generative Sleep phases and learns through local synaptic plasticity, though Boltmzann machines are computationally more cumbersome to train and require more non-biological network features than the Wake-Sleep algorithm. We feel as though it is important to recognize that models that satisfy these three criteria are more similar than they are different, and that it may be quite difficult to experimentally distinguish between them.

Limitations

While our model is capable of capturing several effects of classical psychedelics, it also has several clear limitations. First, and most notably, our top-down generative model does not have sufficient expressive power to induce complex hallucinations of naturalistic stimuli (though it does a better job of modeling MNIST digits), producing instead ‘ripples,’ or ‘breathing’ effects. While psychedelic drug users do report these phenomena, they also report observing much more complex figures, including people, animals, and complex scenes [79, 80]. Generative models trained through backpropagation have been much more successful in producing more complex generated sensory stimuli [31, 30, 81], and even model hallucinations [82], but, as is well known, backpropagation is itself not a good model of learning in the brain [83]. This suggests that while it is quite possible for generative modeling approaches to produce complex hallucinations through non-biological means, algorithmic or architectural improvements may be necessary in order to make the performance of the more plausible Wake-Sleep algorithm closer to that achieved by backpropagation.

Our model also oversimplifies several aspects of biology. In particular, we do not use neurons that respect Dale’s law [84, 85], and the majority of our efforts to map the Wake-Sleep algorithm onto biology focus on excitatory pyramidal neurons. Furthermore, though we do observe that neural dynamics can tolerate a significant amount of top-down input before disrupting perception, experiments and theoretical studies have shown that inputs to apical dendrites of pyramidal neurons do play an important role in waking perception [39, 86, 87], and are not just learning signals. We focused on clear distinctions between basally-driven Wake modes and apically-driven Sleep modes during training for computational efficiency reasons, and also due to the fact that parameter sharing across inference and generative networks in the Wake-Sleep algorithm is theoretically under-explored (though it is supported in closely related predictive coding approaches [76] and Boltzmann machines [78]).

Lastly, our modeling focus has been exclusively on cortical plasticity and hallucination effects: it should be noted that our model has little bearing on other important features of the psychedelic experience of potential therapeutic relevance, because we have not included the effects of psychedelics on subcortical structures including the serotonergic system [88], which plays an important role regulating mood and may be where psychedelics exert some of their antidepressant effects.

Conclusions

Here we have proposed a hypothesis for the mechanism of action of psychedelic drugs in terms of its excitatory effects on the apical dendrites of pyramidal neurons, which we propose pushes network dynamics into a state normally reserved for offline replay and learning; we have also proposed a number of testable predictions which could be used to validate or invalidate our hypothesis. If validated, our model would describe a mechanism by which the psychedelic experience causes ordinary sensory perception to become literally more dream-like; it further suggests that the plasticity increases observed during both sleep and psychedelic experience could occur via a common mechanism dedicated to sensory representation learning in the brain.

4 Methods

Model architecture and training

To model the effects of psychedelics on neural network dynamics and plasticity, we first constructed a simple model of the early visual system by training neural networks on two different image datasets (MNIST [53] and CIFAR10 [54]). Networks were trained with the Wake-Sleep algorithm [29], which requires, for each layer, two modes of stochastic network activity: a ‘generative mode’, and an ‘inference mode’. For the ‘inference’ mode we must specify a probability distribution b(r^(l) | r^(l−1)), while for the ‘generative’ mode we must specify a separate distribution p(r^(l) | r^(l+1)). As a notational convention, we will use letters when referring to mathematical objects from the generative, top-down distribution, and their vertical reflection when referring to the inference, bottom-up distribution (e.g. p and b). Notice here that activity in ‘inference’ mode is conditioned on ‘bottom-up’ network states (r^(l−1)), while activity in generative mode is conditioned on ‘top-down’ network states (r^(l+1)) (Figure 1a).

The ‘inference mode’ specifies a probability distribution over neural activity, conditioned on the next-lower layer (where the lowest layer is the stimulus layer, i.e. r⁽⁰⁾ = s)—mechanistically it corresponds to activity generated by feedforward projections. To increase the expressive power of our neural units, we use multicompartmental neuron models similar to [89] with N_d dendritic compartments, whose voltages are summed nonlinearly to form the full input to the basal dendrites. For l > 0, layer activity is sampled from the distribution [ineq,where for neuron i in layer l, h_i(r^(l−1)) is given by:

where is a 1× 𝒩 ^(l−1) matrix of synaptic weights onto dendrite n, c_in is the corresponding bias for the nth dendritic compartment, is the strictly positive weight given to the nth dendritic branch (roughly corresponding to a conductance), and is the bias for the entire basal compartment. ϕ_d(·) and ϕ(·) are nonlinearities for the dendritic branches and the total basal compartment, respectively: both are the sequential composition of the tanh nonlinearity, followed by batch normalization [90]. For the dendritic branch nonlinearities, we allow for learnable affine parameters (scale and bias), but for the entire basal dendritic compartment we constrain activity to be zero-mean and unit variance across batches in order to prevent indeterminacy between apical and basal scale parameters. For the final inference layer r^(L), as in the variational autoencoder [30], we parameterize both the mean and a diagonal covariance matrix of the inference distribution: r^(L) ∼ 𝒩 (h(r^(L−1)), diag(h₂(r^(L−1)))), where h₂(·) is also a multicompartmental model, in this case replacing the final batch normalization with an exponential nonlinearity to ensure positivity.

The ‘generative’ mode specifies a probability distribution over neural activity, conditioned on the next-higher layer—it corresponds mechanistically to activity generated by feedback projections. The highest layer, r^(L) is sampled from an N ^(L)-dimensional independent standard normal distribution, r^(L) ∼ N (0, I), and all subsequent layers are sampled from the distribution , where for the ith neuron, μ_i(r^(l+1)) is given by:

where is a 1 × N ^(l+1) matrix of synaptic weights onto apical dendritic branch n, is the corresponding bias for the nth dendritic compartment, is the strictly positive weight given to the nth dendritic branch, and is the bias for the entire apical compartment. Again, ϕ_d(·) and ϕ(·) are nonlinearities, identical to the inference (basal) pathway.

While the neuron model used here is more complicated than is normally used for single-unit neuron models, functions of this kind could feasibly be implemented by nonlinear dendritic computations [89]; we further found that using this nonlinearity qualitatively improved generative performance (Supplemental Figure S2). Given these parameterized probability distributions, we then determined the neural activity for each layer l according to Eq. 1. Our network trained on MNIST was composed of 3 layers, with widths [32, 16, 6], listed in ascending order. A full list of network hyperparameters for both our MNIST and CIFAR10-trained networks can be found in the Supplemental Methods.

All synaptic weights and parameters in our networks were trained via the Wake-Sleep algorithm [29], which is known to produce ‘local’ parameter updates for a wide range of neuron models (and rate or spike-based output distributions), though the specific functional form of the update may vary depending on the neuron model chosen [91]. These updates, for reasonable choices of neural network architecture, can be interpreted as predictions for how synaptic plasticity should look in the brain, if learning were really occurring via the Wake-Sleep algorithm or some approximation thereof.

Consider a generic inference (basal dendrite) parameter for neuron i, . The Wake-Sleep algorithm gives the following update, for a single stimulus presentation:

where η is a learning rate, and the gate α ensures that learning only occurs during sleep mode. Further, for reasons of computational efficiency, we average weight updates across a batch of 512 stimulus presentations; similar results could in principal be obtained with purely online updates [92], but we opted to present stimuli in batches here in order to parallelize computations. changes depending on the parameter θ, reflecting that particular parameter’s contribution to basal dendritic activity. For a dendritic branch weight we have:

where is the total input to the basal dendritic compartment, and is the total input to the nth dendritic branch. This update has the functional form of a classical ‘delta’ learning rule [65], where a compartmental prediction error between local dendritic activity and neuronal firing rate is multiplicatively combined with branch-specific input to provide changes in the conductance for the nth branch. Similarly, for the jth synapse on the nth dendritic branch, , we have:

Unlike for simple one-compartment neuron models, computation of parameter updates for dendritic synapses requires weighting the ‘delta’ error by the conductance of the corresponding dendritic branch (w_in), which could be approximated by the passive diffusion of signaling molecules from the principal basal dendritic compartment back along dendritic branches to individual synapses.

For generative parameters , we have a nearly identical update for a single stimulus presentation:

where now input in the apical dendritic compartment, μ_i(r^(l+1)), is being compared to the activity of the neuron as a whole to determine the magnitude and sign of plasticity. The (1 − α) gate in this case ensures that plasticity only occurs during the Wake mode.

We provide pseudocode (Algorithm 1) for our Wake-Sleep implementation, as well as a full list of algorithm and optimizer hyperparameters (Tables S1 and S2) in the Supplemental Materials. Code for reproducing all results from this study is available here: https://github.com/colinbredenberg/oneirogen-hypothesis.

Modeling Hallucinations

During training, neural network activity is either dominated entirely by bottom-up inputs (Wake, α = 0) or by top-down inputs (Sleep, α = 1). As a consequence, sampling neural activity is computationally low-cost, and can be performed in a single time step. During Wake, one can take a sampled stimulus variable s, determine the activity at layer 1, then 2, and so on until layer L, while during Sleep, one can sample a latent network state in layer L and traverse the layers in reverse order, down to the stimulus layer. However, this is not possible if α ∉ {0, 1}, because activity in each layer l should depend simultaneously on layer l + 1 and layer l − 1. For this reason, we chose to model hallucinatory neural activity dynamically, as follows:

where τ is a time constant that determines how much of the previous network state is retained, and η_t−1 ∼ 𝒩 (0, I). Critically, if we take τ = 1 these dynamics reduce to the sampling procedure used during training (Eq. 1). A priori, the choice of interpolation function f (a, b, α) is arbitrary. We selected the following function:

where κ = 0.35 is a free parameter. This function is equivalent to linear interpolation as κ → ∞, and is equivalent to the maximum function between arguments a and b as κ → 0 if α = 0.5. By selecting κ = 0.35, we are biasing the system towards registering positive inputs from apical or basal sources (in the inclusive sense). We found that this produced ‘hallucinatory’ percepts in stimulus space that did not reduce the intensity of input stimuli as α increased; rather, inputs maintained their intensity and hallucinations were added on top if they were of greater intensity than the ground-truth image. All simulations were run for 800 timesteps, with τ = 0.1. As a control, we compared our results to network dynamics produced purely by increases in noise, without increases in apical dendritic influence (which we refer to as our noise-based hallucination protocol). For these control simulations, we produced network activity time series with the following equation:

so that the standard deviation of the injected noise increased linearly with α.

Apical and Basal Alignment

To measure the alignment between inputs in the apical and basal dendritic compartments of our model neurons, we computed the ‘Wake’ neural responses to the full test dataset and measured the activity in both the basal and apical compartments of our neurons (h(r^(l−1)) and μ(r^(l+1)), respectively). We then calculated the correlation coefficient between apical and basal compartments for the same neuron, compared to the correlation between compartments for two randomly selected neurons.

Quantifying plasticity

To quantify the total amount of plasticity induced in our model system by the administration of psychedelic drugs, we measured the change in relative parameter strength (averaging across all synapses in the network and an ensemble of 512 test images). For each test image, we simulated network dynamics according to Eq. 8. Subsequently, for each parameter θ, we calculated the net amount of plasticity induced by viewing all test images, Δθ. We subsequently reported the relative change:

under conditions in which α values gate plasticity (as in ordinary Wake-Sleep) and under conditions in which psychedelic drug administration does not also affect plasticity gating. Here, we took ϵ = 10⁻² to avoid numerical instabilities.

Classifier training

As we trained our neural network using the Wake-Sleep algorithm, we simultaneously trained a separate classifier network based on Wake-phase neural activity in the second network layer on a cross-entropy loss, to identify the stimulus class of the input to the system. For our classifier, we used a multilayer perceptron neural network with a single 256-unit hidden layer and tanh(·) nonlinearities.

We then quantified the accuracy of the classifier on the test set, based on neural activity drawn from the final time step T of hallucination simulations with various values of α. We further measured the average variance of the 10-dimensional output logits of the neural network.

Quantifying correlation matrix similarity before and after psychedelics

To quantify how similar the pairwise correlations between neurons in our model networks were before and after the administration of psychedelics, we recorded hallucinatory network dynamics for an ensemble of 512 test images, and measured pairwise correlations between neurons in the first network layer. To compare these matrices, we then report the correlation coefficient between the flattened N × N matrices. For this metric, a value of 1 indicates that the correlation matrices are perfectly aligned, while a value of -1 indicates that pairwise correlations are fully inverted.

Quantifying interareal causality through inactivations

To quantify changes in interareal functional connectivity induced by psychedelics, we performed two different types of inactivation. In the first, we inactivated the apical dendritic compartments of all neurons in the stimulus layer, and measured how this inactivation affected across-stimulus variability of neurons relative to the fully active state. In the second method, we inactivated all neurons in the deepest layer, and measured the same effect in across-stimulus variability in the stimulus layer. For both inactivation schemes, we report the mean and standard error of the variance ratio:

where we added ϵ_v = 10⁻³ to the denominator to prevent numerical instability and to the numerator ensure that the ratio evaluates to 1 if the two variances are equivalent.

6 Ethics Declarations

Psychedelic drug research has a long history fraught with many instances of unethical research practice [93]. Further, psychedelic drug use itself has long been stigmatized and punished through legal measures [94], often at the expense of indigenous peoples who have incorporated psychoactive substances into their cultural and spiritual practices for millennia [5]. In the interest of avoiding a repetition of past mistakes, we feel compelled to provide explicit guidance on how our work should be interpreted and used. To do so, we will take inspiration from two principal ethical frameworks: the Montreal Declaration on Responsible AI [95], and the EQUIP framework for equity-oriented healthcare [96, 97]. We strongly encourage anyone considering extending our research or using our work in any form of clinical setting to ensure that subsequent research adheres to these frameworks.

Below, drawing from these ethical frameworks, we will provide a set of guidelines for how our work should be interpreted and used. Though these guidelines are by no means exhaustive, our hope is that adherence to them will help promote the potential positive outcomes of our work while limiting potential negative consequences.

Guidelines for the ethical use of this study

Do:

Ensure that the elements of our hypothesis have been adequately tested, as outlined in our discussion, before using our framework in any form of clinical or therapeutic setting.
Use our ideas to inform further basic neuroscience research on perception, learning, sleep, and replay phenomena.
Explore our ideas as an opportunity to inform your own understanding of cognition, learning, and perception, with the understanding that these ideas have not yet been validated experimentally.
Feel free to ask us if you are worried that your proposed use of our work may have negative impacts.

Do not:

Report our results as scientific fact. We have outlined a hypothesis, which is designed to be tested by the experimental neuroscience community.
Cite or interpret our results without an adequate understanding of the mathematics involved. Feel free to ask us if you are worried that you may be misinterpreting our results.
Use our results to extract undue or inequitable profit. The ideas developed in this paper are the product of decades of research and public funding, built upon millennia of exploration of psychedelics. Any knowledge or value contained within this paper is the common heritage of all humanity, with particular recognition due to the indigenous and marginalized communities that have historically suffered and are currently suffering from oppressive government and industry policies.
Use our results for any application that could violate human rights or harm human beings in any way.

A Supplementary Materials

Recurrent network model

To explore the extent to which our results hold for different neuron models, and to give our generative model more expressive power than the traditional Helmholtz machine [98], we constructed a network model with a single timestep of within-layer recurrent denoising in each layer, which gives our model some similarities to denoising diffusion approaches [99]. For both our ‘inference’ mode and our ‘generative’ mode we specify both a denoised network state and a noise-corrupted network state r^(l) for layer l; specifying a neural network model is then equivalent to specifying, for each layer, a joint probability distribution over denoised and noise-corrupted network states for both the inference and generative modes, i.e. for the ‘inference’ mode we must specify a probability distribution , while for the ‘generative’ mode we must specify a separate distribution . As a notational convention, we will use letters when referring to mathematical objects from the generative, top-down distribution, and their vertical reflection when referring to the inference, bottom-up distribution (e.g. p and b). Notice here that activity in ‘inference’ mode is conditioned on ‘bottom-up’ network states (r^(l−1)), while activity in generative mode is conditioned on ‘top-down’ network states (Figure 1a).

Subsequently, we add additional noise to get a noise-corrupted network state ; while noise corruption is a natural feature of network dynamics in the brain [100], we include it here in our model because it has been shown that denoising is a critical aspect of many powerful generative modeling approaches [101, 102, 103], and we have likewise found that it improves the quality of generated images in our learned networks (Supplemental Figure S2).

The ‘generative’ mode specifies a probability distribution over neural activity, conditioned on the next-higher layer—it corresponds mechanistically to activity generated by feedback projections. The highest layer, r^(L) is sampled from an N ^(L)-dimensional independent standard normal distribution, r^(L) ∼ 𝒩 (0, I), and all subsequent layers are sampled from the distribution , where is given by:

where m^(l) is a N^l ×N ^(l+1) weight matrix, and a is a bias term. Subsequently, the network goes through a single timestep of recurrent denoising, so that , where is given by:

where σ(·) is a sigmoid nonlinearity that acts as a gating function similar those used in the LSTM [104] and GRU [105], C₁ and C₂ are N ^(l) × N ^(l) recurrent weight matrices, and c₁ and c₂ are biases. While this is a more complicated nonlinearity than is normally used for single-unit neuron models, functions of this kind could feasibly be implemented by nonlinear dendritic computations [89]; we further found that using this nonlinearity qualitatively improved generative performance. Given these parameterized probability distributions, we then determined the neural activity for each layer l according to Eq. (1). As with our multicompartmental neuron model, inference and generative parameters were updated according to Eqs. (4) and (7), respectively. Recurrent network hyperparameters are available in Table S3.

Simplified neuron model

As a control, we also tested our results using a simplified multilayer perceptron neuron model, which used neither batch normalization nor multiple dendritic branches. For the ‘inference’ mode within the simplified model, for l > 0, layer activity is sampled from the distribution , where for neuron i in layer l, h_i(r^(l−1)) is given by:

where is a 1 × N ^(l−1) matrix of basal synaptic weights onto neuron i, and b_i is the corresponding bias.

The simplified ‘generative’ mode likewise replaces the branched neuron model used in the main text with a multilayer perceptron model. The highest layer, r^(L) is sampled from an N ^(L)-dimensional independent standard normal distribution, r^(L) ∼ 𝒩 (0, I), and all subsequent layers are sampled from the distribution , where for the ith neuron, μ_i(r^(l+1)) is given by:

where is a 1 ×N ^(l+1) matrix of apical synaptic weights onto neuron i, and is the corresponding bias. As with the branched neuron model, inference and generative parameters were updated according to Equations (4) and (7), respectively. For optimization, we used the identical hyperparameters to the multicompartment neuron model (Table S1).

Visualizing the effects of psychedelics for alternative model architectures.
We model the effects of classical psychedelics by progressively increasing α from 0 to 1 in alternative model architectures. We visualize the effects of psychedelics on the network representation by inspecting the stimulus layer **s. a)** Example stimulus-layer activity (rows) in response to an MNIST digit presentation as psychedelic dose increases (columns, left to right) in the recurrent network model. b) Same as (a) but for our single compartment neuron model. c) Same as (a) using the multicompartment neuron model used for our main results, but for our noise-based hallucination hallucination protocol. d) Same as (c), but in a network in which neither the generative nor inference pathways have been trained beyond initialization.

Example generated images for different model architectures and datasets.
Generated images sampled from Eq. (1) with α = 1 for: a) Our primary multicompartment neuron model trained on MNIST, b) A multicompartment neuron model trained on CIFAR10, c) The recurrent network model, d) The single compartment neuron model.

Alignment between apical and basal dendritic compartments for different model architectures and datasets.
Apical-basal alignment for: a) An untrained multicompartment neuron model trained on MNIST, b) A single compartment neuron model, c) A recurrent network model, d) A multicompartment neuron model trained on CIFAR10.

Hallucination-induced synaptic plasticity for different neuron models.
a) Basal (top) and apical (bottom) plasticity as a function of α for a multicompartment neuron model trained on MNIST, using our noise-based hallucination protocol as a control. b) Same as (a) for a single compartment neuron model, using our primary hallucination protocol. c) Same as (b) for a recurrent network model, d) Same as (b) for a multicompartment neuron model trained on CIFAR10. Error bars indicate +/-1 s.e.m.

Neural variability changes for different neuron models.
a) Stimulus-conditioned variability (top), classifier accuracy (middle), and classifier output variability (bottom) as a function of α for a multicompartment neuron model trained on MNIST, using our noise-based hallucination protocol as a control. b) Same as (b) for a single compartment neuron model, using our primary hallucination protocol. c) Same as (b) for a recurrent network model, d) Same as (b) for a multicompartment neuron model trained on CIFAR10. Error bars indicate +/-1 s.e.m.

Network-level effects of psychedelics for different network architectures and training datasets.
For each network architecture, we examine: correlation similarity as a function of α (top row), the proportion explained variance across stimuli as a function of principal component number (second row), the ratio of across-stimulus variance in stimulus layer neurons when apical dendrites have been inactivated compared to baseline conditions across different α values (third row), and the ratio of across-stimulus variance in stimulus layer neurons when the deepest network layer has been inactivated across different α values (fourth row). a) Results for an untrained multicompartment neuron. b) Results for a multicompartment neuron model trained on MNIST, using our noise-based hallucination protocol. c) Results for a single compartment neuron model. d) Results for a recurrent network model. e) Results for a multicompartment neuron model trained on CIFAR10. Error bars indicate +/-1 s.e.m.

MNIST multicompartment network hyperparameters

CIFAR10 multicompartment network hyperparameters

Algorithm 1 Wake-Sleep Pseudocode Algorithm 1 Wake-Sleep Pseudocode

Acknowledgements

We would like to thank members of both G.L. and B.R.’s labs, as well as James M. Shine, Brandon Munn, Christopher Whyte, Veronica Chelu, Jiameng Wu, Matthew Larkum, Santiago Jaramillo, Michael Wehr, Neil Savalia, Alexandra Klein, Sarah Cook, Conor Lane, Anousheh Bakhti-Suroosh, Runchong Wang, Michael Okun, and Jordan O’Byrne for insightful discussions and feedback. This work was supported by: [GL] NSERC Discovery Grant (RGPIN-2018-04821), Canada CIFAR AI Chair Program, Canada Research Chair in Neural Computations and Interfacing (CIHR, tier 2). [BR] NSERC (Discovery Grant: RGPIN-2020-05105; Discovery Accelerator Supplement: RGPAS-2020-00031; Arthur B. McDonald Fellowship: 566355-2022) and CIFAR (Canada AI Chair; Learning in Machine and Brains Fellowship). [CB] is supported in part by the FRQNT Strategic Clusters Program (Centre UNIQUE - Quebec Neuro-AI Research Center). The authors acknowledge the material support of NVIDIA in the form of computational resources.

Additional files

Figure 2a multicompartmental mnist hallucination video

Figure 2c multicompartmental cifar10 hallucination video

References

[1]
1. Jakab Robert L
2. Goldman-Rakic Patricia S
19985-hydroxytryptamine2a serotonin receptors in the primate cerebral cortex: possible site of action of hallucinogenic and antipsychotic drugs in pyramidal cell apical dendritesProceedings of the National Academy of Sciences 95:735–740Google Scholar
[2]
1. Preller Katrin H
2. Vollenweider Franz X
2018Phenomenology, structure, and dynamic of psychedelic statesBehavioral neurobiology of psychedelic drugs :221–256Google Scholar
[3]
1. Shao Ling-Xiao
2. Liao Clara
3. Gregg Ian
4. Davoudian Pasha A
5. Savalia Neil K
6. Delagarza Kristina
7. Kwan Alex C
2021Psilocybin induces rapid and persistent growth of dendritic spines in frontal cortex in vivoNeuron 109:2535–2544Google Scholar
[4]
1. Grieco Steven F
2. Castrén Eero
3. Knudsen Gitte M
4. Kwan Alex C
5. Olson David E
6. Zuo Yi
7. Holmes Todd C
8. Xu Xiangmin
2022Psychedelics and neural plasticity: therapeutic implicationsJournal of Neuroscience 42:8439–8449Google Scholar
[5]
1. Samorini Giorgio
2019The oldest archeological data evidencing the relationship of homo sapiens with psychoactive plants: A worldwide overviewJournal of Psychedelic Studies 3:63–80Google Scholar
[6]
1. Muttoni Silvia
2. Ardissino Maddalena
3. John Christopher
2019Classical psychedelics for the treatment of depression and anxiety: A systematic reviewJournal of affective disorders 258:11–24Google Scholar
[7]
1. Krediet Erwin
2. Bostoen Tijmen
3. Breeksema Joost
4. van Schagen Annette
5. Passie Torsten
6. Vermetten Eric
2020Reviewing the potential of psychedelics for the treatment of PTSDInternational Journal of Neuropsychopharmacology 23:385–400Google Scholar
[8]
1. Kraehenmann Rainer
2. Pokorny Dan
3. Vollenweider Leonie
4. Preller Katrin H
5. Pokorny Thomas
6. Seifritz Erich
7. Vollenweider Franz X
2017Dreamlike effects of LSD on waking imagery in humans depend on serotonin 2a receptor activationPsychopharmacology 234:2031–2046Google Scholar
[9]
1. Vargas Maxemiliano V
2. Dunlap Lee E
3. Dong Chunyang
4. Carter Samuel J
5. Tombari Robert J
6. Jami Shekib A
7. Cameron Lindsay P
8. Patel Seona D
9. Hennessey Joseph J
10. Saeger Hannah N
11. et al.
2023Psychedelics promote neuroplasticity through the activation of intracellular 5-HT2A receptorsScience 379:700–706Google Scholar
[10]
1. Carhart-Harris Robin
2007Waves of the unconscious: the neurophysiology of dreamlike phenomena and its implications for the psychodynamic model of the mindNeuropsychoanalysis 9:183–211Google Scholar
[11]
1. Thomas Christopher W
2. Blanco-Duque Cristina
3. Bréant Benjamin J
4. Goodwin Guy M
5. Sharp Trevor
6. Bannerman David M
7. Vyazovskiy Vladyslav V
2022Psilocin acutely alters sleep-wake architecture and cortical brain activity in laboratory miceTranslational psychiatry 12:77Google Scholar
[12]
1. Dudysová Daniela
2. Janků Karolina
3. Šmotek Michal
4. Saifutdinova Elizaveta
5. Kopřivová Jana
6. Bušková Jitka
7. Mander Bryce Anthony
8. Brunovský Martin
9. Zach Peter
10. Korčák Jakub
11. et al.
2020The effects of daytime psilocybin administration on sleep: implications for antidepressant actionFrontiers in pharmacology 11:602590Google Scholar
[13]
1. Barbanoj Manel J
2. Riba Jordi
3. Clos S
4. Giménez S
5. Grasa E
6. Romero S
2008Daytime ayahuasca administration modulates REM and slow-wave sleep in healthy volunteersPsychopharmacology 196:315–326Google Scholar
[14]
1. Girardeau Gabrielle
2. Benchenane Karim
3. Wiener Sidney I
4. Buzsáki György
5. Zugaro Michaël B
2009Selective suppression of hippocampal ripples impairs spatial memoryNature neuroscience 12:1222–1223Google Scholar
[15]
1. Deuker Lorena
2. Olligs Jan
3. Fell Juergen
4. Kranz Thorsten A.
5. Mormann Florian
6. Montag Christian
7. Reuter Martin
8. Elger Christian E.
9. Axmacher Nikolai
2013Memory consolidation by replay of stimulus-specific neural activityJournal of Neuroscience 33:19373–19383Google Scholar
[16]
1. Lavilléon Gaetan D.
2. Lacroix Marie Masako
3. Rondi-Reig Laure
4. Benchenane Karim
2015Explicit memory creation during sleep demonstrates a causal role of place cells in navigationNature neuroscience 18:493–495Google Scholar
[17]
1. Maingret Nicolas
2. Girardeau Gabrielle
3. Todorova Ralitsa
4. Goutierre Marie
5. Zugaro Michaël
2016Hippocampo-cortical coupling mediates memory consolidation during sleepNature neuroscience 19:959–964Google Scholar
[18]
1. Fernández-Ruiz Antonio
2. Oliva Azahara
3. de Oliveira Eliezyer Fermino
4. Rocha-Almeida Florbela
5. Tingley David
2019Long-duration hippocampal sharp wave ripples improve memoryScience 364:1082–1086Google Scholar
[19]
1. Walker Matthew P
2. Stickgold Robert
2004Sleep-dependent learning and memory consolidationNeuron 44:121–133Google Scholar
[20]
1. Nádasdy Zoltán
2. Hirase Hajime
3. Czurkó András
4. Csicsvari Jozsef
5. Buzsáki György
1999Replay and time compression of recurring spike sequences in the hippocampusJournal of Neuroscience 19:9497–9507Google Scholar
[21]
1. Lee Albert K
2. Wilson Matthew A
2002Memory of sequential experience in the hippocampus during slow wave sleepNeuron 36:1183–1194Google Scholar
[22]
1. Foster David J
2017Replay comes of ageAnnual review of neuroscience 40:581–602Google Scholar
[23]
1. Ji Daoyun
2. Wilson Matthew A
2007Coordinated memory replay in the visual cortex and hippocampus during sleepNature neuroscience 10:100–107Google Scholar
[24]
1. Euston David R
2. Tatsuno Masami
3. McNaughton Bruce L
2007Fast-forward playback of recent memory sequences in prefrontal cortex during sleepscience 318:1147–1150Google Scholar
[25]
1. Peyrache Adrien
2. Khamassi Mehdi
3. Benchenane Karim
4. Wiener Sidney I
5. Battaglia Francesco P
2009Replay of rule-learning related neural patterns in the prefrontal cortex during sleepNature neuroscience 12:919–926Google Scholar
[26]
1. Kenet Tal
2. Bibitchkov Dmitri
3. Tsodyks Misha
4. Grinvald Amiram
5. Arieli Amos
2003Spontaneously emerging cortical representations of visual attributesNature 425:954–956Google Scholar
[27]
1. Xu Shengjin
2. Jiang Wanchen
3. Poo Mu-ming
4. Dan Yang
2012Activity recall in a visual cortical ensembleNature neuroscience 15:449–455Google Scholar
[28]
1. Hoffman Kari L
2. McNaughton Bruce L
2002Coordinated reactivation of distributed memory traces in primate neocortexScience 297:2070–2073Google Scholar
[29]
1. Hinton Geoffrey E
2. Dayan Peter
3. Frey Brendan J
4. Neal Radford M
1995The “wake-sleep” algorithm for unsupervised neural networksScience 268:1158–1161Google Scholar
[30]
1. Rezende Danilo Jimenez
2. Mohamed Shakir
3. Wierstra Daan
2014Stochastic backpropagation and approximate inference in deep generative modelsIn: International conference on machine learning PMLR pp. 1278–1286Google Scholar
[31]
1. Kingma Diederik P
2. Welling Max
2013Auto-encoding variational bayesarXiv Google Scholar
[32]
1. Simoncelli Eero P
2003Vision and the statistics of the visual environmentCurrent opinion in neurobiology 13:144–149Google Scholar
[33]
1. Simoncelli Eero P
2. Olshausen Bruno A
2001Natural image statistics and neural representationAnnual review of neuroscience 24:1193–1216Google Scholar
[34]
1. Ballé Johannes
2. Laparra Valero
3. Simoncelli Eero P
2016End-to-end optimized image compressionarXiv Google Scholar
[35]
1. DiCarlo James J
2. Zoccolan Davide
3. Rust Nicole C
2012How does the brain solve visual object recognition?Neuron 73:415–434Google Scholar
[36]
1. Higgins Irina
2. Matthey Loic
3. Pal Arka
4. Burgess Christopher P
5. Glorot Xavier
6. Botvinick Matthew M
7. Mohamed Shakir
8. Lerchner Alexander
2017beta-vae: Learning basic visual concepts with a constrained variational frameworkICLR (Poster) 3Google Scholar
[37]
1. Ikeda Shiro
2. Amari Shun-ichi
3. Nakahara Hiroyuki
1998Convergence of the wake-sleep algorithmAdvances in neural information processing systems 11Google Scholar
[38]
1. Kirby Kevin G
2006A tutorial on Helmholtz MachinesDepartment of Computer Science, Northern Kentucky University Google Scholar
[39]
1. Larkum Matthew
2013A cellular mechanism for cortical associations: an organizing principle for the cerebral cortexTrends in neurosciences 36:141–151Google Scholar
[40]
1. Aru Jaan
2. Siclari Francesca
3. Phillips William A
4. Storm Johan F
2020Apical drive—a cellular mechanism of dreaming?Neuroscience & Biobehavioral Reviews 119:440–455Google Scholar
[41]
1. Körding Konrad P
2. König Peter
2001Supervised and unsupervised learning with two sites of synaptic integrationJournal of computational neuroscience 11:207–215Google Scholar
[42]
1. Urbanczik Robert
2. Senn Walter
2014Learning by the dendritic prediction of somatic spikingNeuron 81:521–528Google Scholar
[43]
1. Guerguiev Jordan
2. Lillicrap Timothy P
3. Richards Blake A
2017Towards deep learning with segregated dendriteseLife 6:e22901https://doi.org/10.7554/eLife.22901 Google Scholar
[44]
1. Sacramento João
2. Costa Rui Ponte
3. Bengio Yoshua
4. Senn Walter
2018Dendritic cortical microcircuits approximate the backpropagation algorithmAdvances in neural information processing systems 31Google Scholar
[45]
1. Richards Blake A
2. Lillicrap Timothy P
2019Dendritic solutions to the credit assignment problemCurrent opinion in neurobiology 54:28–36Google Scholar
[46]
1. Payeur Alexandre
2. Guerguiev Jordan
3. Zenke Friedemann
4. Richards Blake A
5. Naud Richard
2021Burst-dependent synaptic plasticity can coordinate learning in hierarchical circuitsNature neuroscience 24:1010–1019Google Scholar
[47]
1. Bredenberg Colin
2. Lyo Benjamin
3. Simoncelli Eero
4. Savin Cristina
2021Impression learning: Online representation learning with synaptic plasticityAdvances in Neural Information Processing Systems 34:11717–11729Google Scholar
[48]
1. George Tom M
2. Stachenfeld Kimberly L
3. Barry Caswell
4. Clopath Claudia
5. Fukai Tomoki
2024A generative model of the hippocampal formation trained with theta driven local learning rulesAdvances in Neural Information Processing Systems 36Google Scholar
[49]
1. Seibt Julie
2. Richard Clément J
3. Sigl-Glöckner Johanna
4. Takahashi Naoya
5. Kaplan David I
6. Doron Guy
7. De Limoges Denis
8. Bocklisch Christina
9. Larkum Matthew E
2017Cortical dendritic activity correlates with spindle-rich oscillations during sleep in rodentsNature communications 8:684Google Scholar
[50]
1. Almeida J De
2. Mengod G
2007Quantitative analysis of glutamatergic and GABAergic neurons expressing 5-HT2A receptors in human and monkey prefrontal cortexJournal of neurochemistry 103:475–486Google Scholar
[51]
1. Aghajanian GK
2. Marek GJ
1997Serotonin induces excitatory postsynaptic potentials in apical dendrites of neocortical pyramidal cellsNeuropharmacology 36:589–599Google Scholar
[52]
1. Aghajanian George K
2. Marek Gerard J
1999Serotonin and hallucinogensNeuropsychopharmacology 21:16–23Google Scholar
[53]
1. Deng Li
2012The mnist database of handwritten digit images for machine learning research [best of the web]IEEE signal processing magazine 29:141–142Google Scholar
[54]
1. Krizhevsky Alex
2. Hinton Geoffrey
3. et al.
2009Learning multiple layers of features from tiny imagesGoogle Scholar
[55]
1. Beaulieu-Laroche Lou
2. Toloza Enrique HS
3. Brown Norma J
4. Harnett Mark T
2019Widespread and highly correlated somato-dendritic activity in cortical layer 5 neuronsNeuron 103:235–241Google Scholar
[56]
1. O’Hare Justin K
2. Wang Jamie
3. Shala Margjele D
4. Polleux Franck
5. Losonczy Attila
2024Distal tuft dendrites shape and maintain new place fieldsbioRxiv Google Scholar
[57]
1. Nardou Romain
2. Sawyer Edward
3. Song Young Jun
4. Wilkinson Makenzie
5. Padovan-Hernandez Yasmin
6. de Deus Júnia Lara
7. Wright Noelle
8. Lama Carine
9. Faltin Sehr
10. Goff Loyal A
11. et al.
2023Psychedelics reopen the social reward learning critical periodNature 618:790–798Google Scholar
[58]
1. Revenga Mario de la Fuente
2. Zhu Bohan
3. Guevara Christopher A
4. Naler Lynette B
5. Saunders Justin M
6. Zhou Zirui
7. Toneatti Rudy
8. Sierra Salvador
9. Wolstenholme Jennifer T
10. Beardsley Patrick M
11. et al.
2021Prolonged epigenomic and synaptic plasticity alterations following single exposure to a psychedelic in miceCell reports 37Google Scholar
[59]
1. Juliani Arthur
2. Chelu Veronica
3. Graesser Laura
4. Safron Adam
2024A dual-receptor model of serotonergic psychedelics: therapeutic insights from simulated cortical dynamicsbioRxiv Google Scholar
[60]
1. Carhart-Harris Robin L
2. Friston Karl J
2019Rebus and the anarchic brain: toward a unified model of the brain action of psychedelicsPharmacological reviews 71:316–344Google Scholar
[61]
1. Lebedev Alexander V
2. Kaelen Mendel
3. Lövdén Martin
4. Nilsson Jonna
5. Feilding Amanda
6. Nutt David J
7. Carhart-Harris Robin L
2016LSD-induced entropic brain activity predicts subsequent personality changeHuman brain mapping 37:3203–3213Google Scholar
[62]
1. Carhart-Harris Robin Lester
2. Leech Robert
3. Hellyer Peter John
4. Shanahan Murray
5. Feilding Amanda
6. Tagliazucchi Enzo
7. Chialvo Dante R
8. Nutt David
2014The entropic brain: a theory of conscious states informed by neuroimaging research with psychedelic drugsFrontiers in human neuroscience 8:20Google Scholar
[63]
1. Siegel Joshua S
2. Subramanian Subha
3. Perry Demetrius
4. Kay Benjamin P
5. Gordon Evan M
6. Laumann Timothy O
7. Reneau T Rick
8. Metcalf Nicholas V
9. Chacko Ravi V
10. Gratton Caterina
11. et al.
2024Psilocybin desynchronizes the human brainNature :1–8Google Scholar
[64]
1. Horrocks Max
2. Mohn Jennifer L
3. Jaramillo Santiago
2024The serotonergic psychedelic DOI impairs deviance detection in the auditory cortexbioRxiv Google Scholar
[65]
1. Widrow Bernard
2. Lehr Michael A
199030 years of adaptive neural networks: perceptron, madaline, and backpropagationProceedings of the IEEE 78:1415–1442Google Scholar
[66]
1. Jesper Sjöström Per
2. Häusser Michael
2006A cooperative switch determines the sign of synaptic plasticity in distal dendrites of neocortical pyramidal neuronsNeuron 51:227–238Google Scholar
[67]
1. Froemke Robert C
2. Letzkus Johannes J
3. Kampa Björn M
4. Hang Giao B
5. Stuart Greg J
2010Dendritic synapse location and neocortical spike-timing-dependent plasticityFrontiers in synaptic neuroscience 2:29Google Scholar
[68]
1. Bittner Katie C
2. Milstein Aaron D
3. Grienberger Christine
4. Romani Sandro
5. Magee Jeffrey C
2017Behavioral time scale synaptic plasticity underlies ca1 place fieldsScience 357:1033–1036Google Scholar
[69]
1. Bredenberg Colin
2. Savin Cristina
2024Desiderata for normative models of synaptic plasticityNeural Computation 36:1245–1285Google Scholar
[70]
1. Levenstein Daniel
2. Alvarez Veronica A
3. Amarasingham Asohan
4. Azab Habiba
5. Chen Zhe S
6. Gerkin Richard C
7. Hasenstaub Andrea
8. Iyer Ramakrishnan
9. Jolivet Renaud B
10. Marzen Sarah
11. et al.
2023On the role of theory and modeling in neuroscienceJournal of Neuroscience 43:1074–1088Google Scholar
[71]
1. Bittner Katie C
2. Grienberger Christine
3. Vaidya Sachin P
4. Milstein Aaron D
5. Macklin John J
6. Suh Junghyup
7. Tonegawa Susumu
8. Magee Jeffrey C
2015Conjunctive input processing drives feature selectivity in hippocampal ca1 neuronsNature neuroscience 18:1133–1142Google Scholar
[72]
1. Ermentrout G Bard
2. Cowan Jack D
1979A mathematical theory of visual hallucination patternsBiological cybernetics 34:137–150Google Scholar
[73]
1. Bressloff Paul C
2. Cowan Jack D
3. Golubitsky Martin
4. Thomas Peter J
5. Wiener Matthew C
2001Geometric visual hallucinations, Euclidean symmetry and the functional architecture of striate cortexPhilosophical Transactions of the Royal Society of London. Series B: Biological Sciences 356:299–330Google Scholar
[74]
1. Keller T Anderson
2. Welling Max
2023Neural wave machines: learning spatiotemporally structured representations with locally coupled oscillatory recurrent neural networksIn: International Conference on Machine Learning PMLR pp. 16168–16189Google Scholar
[75]
1. Keller T Anderson
2. Muller Lyle
3. Sejnowski Terrence
4. Welling Max
2023Traveling waves encode the recent past and enhance sequence learningarXiv Google Scholar
[76]
1. Rao Rajesh PN
2. Ballard Dana H
1999Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effectsNature neuroscience 2:79–87Google Scholar
[77]
1. Rajpal Hardik
2. Mediano Pedro AM
3. Rosas Fernando E
4. Timmermann Christopher B
5. Brugger Stefan
6. Muthukumaraswamy Suresh
7. Seth Anil K
8. Bor Daniel
9. Carhart-Harris Robin L
10. Jensen Henrik J
2022Psychedelics and schizophrenia: Distinct alterations to Bayesian inferenceNeuroImage 263:119624Google Scholar
[78]
1. Ackley David H
2. Hinton Geoffrey E
3. Sejnowski Terrence J
1985A learning algorithm for Boltzmann machinesCognitive science 9:147–169Google Scholar
[79]
1. Shanon Benny
2002Ayahuasca visualizations a structural typologyJournal of Consciousness Studies 9:3–30Google Scholar
[80]
1. Díaz José Luis
2010Sacred plants and visionary consciousnessPhenomenology and the Cognitive Sciences 9:159–170Google Scholar
[81]
1. Goodfellow Ian
2. Pouget-Abadie Jean
3. Mirza Mehdi
4. Xu Bing
5. Warde-Farley David
6. Ozair Sherjil
7. Courville Aaron
8. Bengio Yoshua
2020Generative adversarial networksCommunications of the ACM 63:139–144Google Scholar
[82]
1. Suzuki Keisuke
2. Roseboom Warrick
3. Schwartzman David J
4. Seth Anil K
2017A deep-dream virtual reality platform for studying altered perceptual phenomenologyScientific reports 7:15982Google Scholar
[83]
1. Lillicrap Timothy P
2. Santoro Adam
3. Marris Luke
4. Akerman Colin J
5. Hinton Geoffrey
2020Back-propagation and the brainNature Reviews Neuroscience 21:335–346Google Scholar
[84]
1. O’Donohue Thomas L
2. Millington William R
3. Handelmann Gail E
4. Contreras Patricia C
5. Chronwall Bibie M
1985On the 50th anniversary of Dale’s law: multiple neurotransmitter neuronsTrends in Pharmacological Sciences 6:305–308Google Scholar
[85]
1. Cornford Jonathan
2. Kalajdzievski Damjan
3. Leite Marco
4. Lamarquette Amélie
5. Kullmann Dimitri M
6. Richards Blake
2020Learning to live with Dale’s principle: ANNs with separate excitatory and inhibitory unitsbioRxiv Google Scholar
[86]
1. Whyte Christopher J
2. Redinbaugh Michelle J
3. Shine James M
4. Saalmann Yuri B
2024Thalamic contributions to the state and contents of consciousnessNeuron 112:1611–1625Google Scholar
[87]
1. Munn Brandon R
2. Müller Eli J
3. Medel Vicente
4. Naismith Sharon L
5. Lizier Joseph T
6. Sanders Robert D
7. Shine James M
2023Neuronal connected burst cascades bridge macroscale adaptive signatures across arousal statesNature Communications 14:6846Google Scholar
[88]
1. Carhart-Harris Robin L
2. Nutt DJ
2017Serotonin and brain function: a tale of two receptorsJournal of psychopharmacology 31:1091–1120Google Scholar
[89]
1. Poirazi Panayiota
2. Brannon Terrence
3. Mel Bartlett W
2003Pyramidal neuron as two-layer neural networkNeuron 37:989–999Google Scholar
[90]
1. Ioffe Sergey
2015Batch normalization: Accelerating deep network training by reducing internal covariate shiftarXiv Google Scholar
[91]
1. Bredenberg Colin
2. Williams Ezekiel
3. Savin Cristina
4. Richards Blake
5. Lajoie Guillaume
2024Formalizing locality for normative synaptic plasticity modelsAdvances in Neural Information Processing Systems 36Google Scholar
[92]
1. Williams Ezekiel
2. Bredenberg Colin
3. Lajoie Guillaume
2023Flexible phase dynamics for bioplausible contrastive learningIn: International Conference on Machine Learning PMLR pp. 37042–37065Google Scholar
[93]
1. Strauss Dana
2. Salle Sara de la
3. Sloshower Jordan
4. Williams Monnica T
2022Research abuses against people of colour and other vulnerable groups in early psychedelic researchJournal of Medical Ethics 48:728–737Google Scholar
[94]
1. Bauml James A
2. Schaefer Stacy B
2016Peyote: History, tradition, politics, and conservationBloomsbury Publishing USA Google Scholar
[95]
1. Dilhac Marc-Antoine
2. Abrassart Christophe
3. Voarino Nathalie
2018Report of the Montréal declaration for a responsible development of artificial intelligenceGoogle Scholar
[96]
1. Rea Kerri
2. Wallace Bruce
2021Enhancing equity-oriented care in psychedelic medicine: Utilizing the EQUIP frameworkInternational Journal of Drug Policy 98:103429Google Scholar
[97]
1. Browne Annette J
2. Varcoe Colleen
3. Ford-Gilboe Marilyn
4. Wathen C Nadine
5. and EQUIP Research Team
2015EQUIP healthcare: An overview of a multi-component intervention to enhance equityoriented care in primary health care settingsInternational journal for equity in health 14:1–11Google Scholar
[98]
1. Dayan Peter
2. Hinton Geoffrey E
3. Neal Radford M
4. Zemel Richard S
1995The Helmholtz machineNeural computation 7:889–904Google Scholar
[99]
1. Toosi Tahereh
2. Issa Elias
2024Brain-like flexible visual inference by harnessing feedback feedforward alignmentAdvances in Neural Information Processing Systems 36Google Scholar
[100]
1. Faisal A Aldo
2. Selen Luc PJ
3. Wolpert Daniel M
2008Noise in the nervous systemNature reviews neuroscience 9:292–303Google Scholar
[101]
1. Vincent Pascal
2011A connection between score matching and denoising autoencodersNeural computation 23:1661–1674Google Scholar
[102]
1. Kadkhodaie Zahra
2. Simoncelli Eero
2021Stochastic solutions for linear inverse problems using the prior implicit in a denoiserAdvances in Neural Information Processing Systems 34:13242–13254Google Scholar
[103]
1. Rombach Robin
2. Blattmann Andreas
3. Lorenz Dominik
4. Esser Patrick
5. Ommer Björn
2022High-resolution image synthesis with latent diffusion modelsIn: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp. 10684–10695Google Scholar
[104]
1. Hochreiter Sepp
2. Schmidhuber Jürgen
1997Long short-term memoryNeural computation 9:1735–1780Google Scholar
[105]
1. Chung Junyoung
2. Gulcehre Caglar
3. Cho KyungHyun
4. Bengio Yoshua
2014Empirical evaluation of gated recurrent neural networks on sequence modelingarXiv Google Scholar

Article and author information

Author information

Colin Bredenberg
Mila - Quebec AI Institute, Montreal, Canada, University of Montreal, Montreal, Canada
ORCID iD: 0000-0002-9749-9228
- For correspondence: colin.bredenberg@mila.quebec
Fabrice Normandin
Mila - Quebec AI Institute, Montreal, Canada
Blake Richards
Mila - Quebec AI Institute, Montreal, Canada, McGill University, Montreal, Canada
- indicates equal contribution
Guillaume Lajoie
Mila - Quebec AI Institute, Montreal, Canada, University of Montreal, Montreal, Canada
- indicates equal contribution

Author Notes

Competing interests: No competing interests declared

Version history

Preprint posted: January 13, 2025
Sent for peer review: February 3, 2025
Reviewed Preprint version 1: June 6, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.105968. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 167
downloads: 2
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

1 Introduction

2 Results

Mapping the Wake-Sleep algorithm onto cortical architecture

Mapping the Wake-Sleep algorithm onto cortical architecture.

Modeling Hallucinations

Visualizing the effects of psychedelics in the model.

Effects of psychedelics on single neurons

Effects of psychedelics on single model neurons.

Effects of psychedelics on neural variability

Effects of psychedelics on neural variability.

Network-level effects of psychedelics

Network-level effects of psychedelics.

3 Discussion

Experimental results captured by our model

Testable predictions

Summarizing testable predictions of the ‘oneirogen hypothesis’.

Comparisons to alternative models

Limitations

Conclusions

4 Methods

Model architecture and training

Modeling Hallucinations

Apical and Basal Alignment

Quantifying plasticity

Classifier training

Quantifying correlation matrix similarity before and after psychedelics

Quantifying interareal causality through inactivations

6 Ethics Declarations

Guidelines for the ethical use of this study

A Supplementary Materials

Recurrent network model

Simplified neuron model

Visualizing the effects of psychedelics for alternative model architectures.

Example generated images for different model architectures and datasets.

Alignment between apical and basal dendritic compartments for different model architectures and datasets.

Hallucination-induced synaptic plasticity for different neuron models.

Neural variability changes for different neuron models.

Network-level effects of psychedelics for different network architectures and training datasets.

MNIST multicompartment network hyperparameters

CIFAR10 multicompartment network hyperparameters

Recurrent network hyperparameters

Algorithm 1 Wake-Sleep Pseudocode Algorithm 1 Wake-Sleep Pseudocode

Acknowledgements

Additional files

References

Article and author information

Author information

Colin Bredenberg

Fabrice Normandin

Blake Richards*

Guillaume Lajoie*

Author Notes

Version history

Cite all versions

Copyright

Metrics

Blake Richards

Guillaume Lajoie