Binocular rivalry reveals an out-of-equilibrium neural dynamics suited for decision-making

  1. Robin Cao
  2. Alexander Pastukhov
  3. Stepan Aleshin
  4. Maurizio Mattia
  5. Jochen Braun  Is a corresponding author
  1. Cognitive Biology, Center for Behavioral Brain Sciences, Germany
  2. Gatsby Computational Neuroscience Unit, United Kingdom
  3. Istituto Superiore di Sanità, Italy
15 figures

Figures

Proposed mechanism of binocular rivalry.

(a) When the left and right eyes see incompatible images in the visual field, phenomenal appearance reverses at irregular intervals, sometimes being dominated by one image and sometimes by the other (gray and white regions). Sir Charles Wheatstone studied this multistable percept with a mirror stereoscope (not as shown!). (b) Spiking neural network implementation of a ‘local attractor.’ An assembly of 150 neurons (schematic, dark gray circle) interacts competitively with multiple other assemblies (light gray circles). Population activity of the assembly explores an effective energy landscape (right) with two distinct steady states (circles), separated by a ridge (diamond). Driven by noise, activity transitions occasionally between ‘on’ and ‘off’ states (bottom), with transition rates ν± depending sensitively on external input to the assembly (not shown). Here, ν+=ν1Hz. Spike raster shows 10 representative neurons. (c) Nested attractor dynamics (central schematic) that quantitatively reproduces the dynamics of binocular rivalry (left and right columns). Independently bistable variables (‘local attractors,’ small circles) respond probabilistically to input, transitioning stochastically between on- and off-states (red/blue and white, respectively). The entire system comprises four pools, with 25 variables each, linked by excitatory and inhibitory projections. Phenomenal appearance is decided by competition between decision pools R and R forming ‘non-local attractors’ (cross-inhibition wcomp and self-excitation wcoop). Visual input c and c accumulates, respectively, in evidence pools E and E and propagates to decision pools (feedforward selective excitation wexc and indiscriminate inhibition winh). Decision pools suppress associated evidence pools (feedback selective suppression wsupp). The time course of the number of active variables (active count) is shown for decision pools (top left and right) and evidence pools (bottom left and right), representing the left eye (red traces) and the right eye image (blue traces). The state of individual variables (black horizontal traces in left and middle columns) and of perceptual dominance (gray and white regions) is also shown. In decision pools, almost all variables become active (black trace) or inactive (no trace) simultaneously. In evidence pools, only a small fraction of variables is active at any given time. (d) Fractional activity dynamics of decision pools R and R (top, red and blue traces) and evidence pools E and E (bottom, red and blue traces). Reversals of phenomenal appearance are also indicated (gray and white regions).

Joint dynamics of evidence habituation and recovery.

Exponential development of evidence activities is governed by input-dependent asymptotic values and characteristic times. (a) Fractional activities e (blue traces) and e (red traces) of evidence pools E and E, respectively, over several dominance periods for unequal stimulus contrast (c=78,c=18). Stochastic reversals of finite system (N=25 units per pool, left) and deterministic reversals of infinite system (N, right). Perceptual dominance (decision activity) is indicated along the upper margin (red or blue stripe). Dominance evidence habituates (dom), and non-dominant evidence recovers (sup), until evidence contradicts perception sufficiently (black vertical lines) to trigger a reversal (gray and white regions). (b) Development of stronger-input evidence e (blue) and weaker-input evidence e (red) over two successive dominance periods (c=1516,c=116). Activities recover, or habituate, exponentially until reversal threshold Δrev is reached. Thin curves extrapolate to the respective asymptotic values, e and e. Dominance durations depend on distance Δ and on characteristic times τe and τe. Left: incrementing non-dominant evidence e (dashed curve) raises upper asymptotic value e and shortens dominance T by ΔT. Right: incrementing dominant evidence e (dashed curve) raises lower asymptotic value e and shortens dominance T by ΔT. (c) Increasing asymptotic activity difference Δ accelerates the development of differential activity and curtails dominance periods T, T (and vice versa). As the dependence is hyperbolic, any change to Δ disproportionately affects longer dominance periods. If T§gt;T, then ΔT§gt;ΔT (and vice versa).

Dependence of mean dominance duration on dominant and suppressed image contrast (‘Levelt’s propositions’).

(a) Mean dominance duration T (color scale), as a function of dominant contrast cdom and suppressed contrast csup, in model (left) and experiment (right). (b) Model prediction (solid traces) and experimental observation (dashed traces and symbols) compared. Levelt I and II: weak increase of T with cdom when csup=1 (red traces and symbols), and strong decrease with csup when cdom=1 (brown traces and symbols). Levelt III: symmetric increase of T with cdom (orange traces and symbols) and decrease with csup (brown traces and symbols), when cdom+cdom=1. Alternation rate (green traces and symbols) peaks at equidominance and decreases symmetrically to either side. (c) Levelt IV: decrease of T with image contrast, when csup=cdom. (d) Predicted dependence of sequential correlation cc1 (color scale) on cdom and csup. (e) Model prediction (black trace, N=25) and experimental observation (blue trace and symbols, mean ± SEM, Spearman’s rank correlation ρ), when csup=cdom. Also shown is a second model prediction (red trace, N=40).

Shape of dominance distribution depends only weakly on image contrast (‘scaling property’).

Distribution shape is parametrized by coefficient of variation cv and relative skewness γ1/cV. (a) Coefficient of variation cv (color scale), as a function of dominant contrast cdom and suppressed contrast csup, in model (left) and experiment (right). (b) Model prediction (solid traces) and experimental observation (dashed traces and symbols) compared. Left: increase of cv with cdom (red traces and symbols), and symmetric decrease with csup (brown traces and symbols), when csup=1. Right: weak dependence when cdom=csup (black traces and symbols). (c) Predicted dependence of relative skewness γ1/cV (gray scale) on cdom and csup. (d) Model prediction (solid traces), when cdom=csup (black) and cdom=1csup (orange and brown) and experimental observation when cdom=csup (blue dashed trace and symbols, mean ± SEM).

Competitive dynamics of decision pools ensures Levelt IV.

(a) The joint stable state of decision pools (here r1 and r0) can be destabilized by sufficiently contradictory evidence, e§gt;e. (b) Effective potential U(e,e,r,r) (colored curves) and steady states r (colored dots) for different levels of contradictory input, Δe=ee. Increasing Δe destabilizes the steady state and shifts r rightward (curved arrow). The critical value rcrit (dotted vertical line), at which the steady state turns unstable, is reached when Δe reaches the reversal threshold Δrev. At this point, a reversal ensues with r1 and r0. (c) The reversal threshold Δrev diminishes with combined evidence e+e. In the deterministic limit, Δrev decreases linearly with e¯=(e+e)/2 (dashed red line). In the stochastic system, the average evidence bias Δe at the time of reversals decreases similarly with the average evidence mean e¯ (black dots). Actual values of Δe at the time of reversals are distributed around these average values (gray shading). (d) Average evidence mean e¯ (left) and average evidence bias Δe (middle) at the time of reversals as a function of image contrast c and c. Decrease of average evidence bias Δe with contrast shortens dominance durations (Levelt IV). At low contrast (blue dot), higher reversal thresholds Δrev result in less frequent reversals (bottom right, gray and white regions) whereas, at high contrast (red dot), lower reversal thresholds lead to more frequent reversals (top right).

Serial dependency predicted by model and confirmed by experimental observations.

(a) Conditional expectation of dominance duration T±n (top) and of average mean evidence activity, e¯±n (bottom), in model simulations with maximal stimulus contrast (c=c=1). Dominance periods T0 were grouped into octiles, from longest (yellow) to shortest (black). For each octile, the average duration T±n of preceding and following dominance periods, as well as the average mean evidence activity e¯±n at the end of each period, is shown. All times in multiples of the overall average duration, T, and activities in multiples of the overall average activity e¯. (b) Example reversal sequence from model. Bottom: stochastic development of evidence activities e and e (red and blue traces), with large, joint fluctuations raising or lowering mean activity e¯=(e+e)/2 above or below long-term average (dashed line). Top left: episode with e¯ above average, lower Δrev, and shorter dominance periods. Top right: episode with e¯ below average, higher Δrev, and longer dominance durations. (c) Examples of reversal sequences from human observers (c=c=1 and c=c=1/2). (d) Positive lagged correlations predicted by model (mean, middle) and confirmed by experimental observations (mean ± std, top). Alternative model (Laing and Chow, 2002) with adaptation and noise (mean, bottom), fitted to reproduce the values of T, cv, γ1, and cc1 predicted by the present model (blue stars).

Appendix 1—figure 1
Proposed mechanism of binocular rivalry dynamics (schematic).

Bistable variables are represented by white (inactive) or red (active) circles. Four pools, each with N=25 variables, are shown: two evidence pools E and E, with active counts ne(t) and ne(t), and two decision pools, R and R, with active counts nr(t) and nr(t). Excitatory and inhibitory synaptic couplings include selective feedforward excitation wexc, indiscriminate feedforward inhibition winh, recurrent excitation wcoop, and mutual inhibition wcomp of decision pools, as well as selective feedback suppression wsupp of evidence pools. Visual input to evidence pools f(c) and f(c) is a function of image contrast c and c.

Appendix 1—figure 2
Metastable dynamics of spiking neural network.

(a) Eight assemblies of excitatory neurons (schematic, light and dark gray disks) and one pool of inhibitory neurons (white disc) interact competitively with recurrent random connectivity. We focus on one ‘foreground’ assembly (dark gray), with firing rate νfore and selective external input Δνext. (b) ‘Foreground’ activity explores an effective energy landscape with two distinct steady states (circles), separated by ridge points (diamonds). As this landscape changes with external input Δνext, transition rates ν± between ‘on’ and ‘off’ states also change with external input. (c) Simulation to establish transition rates ν± of foreground assembly. External input Δνext is stepped periodically between 44Hz and 4Hz. Spiking activity of 10 representative excitatory neurons in a single trial, population activity over 25 trials, thresholded population activity over 25 trials, and activation probability (fraction of ‘on’ states). (d) Relaxation dynamics in response to step change of Δνext, with ‘on’ transitions (left) and ‘off’ transitions (right). (e) Average state transition rates ν± vary anti-symmetrically and exponentially with external input: ν+2.2 Hzexp(+0.8 5sΔνext) and ν0.5 Hzexp(0.79 sΔνext) (red and blue lines).

Appendix 1—figure 3
Information retained by stochastic pool activity from normally distributed inputs.

Inputs sN(μ,σ) provide Fisher information Js=1σ2 about mean μ. Stochastic activity n(t) of a birth-death process (N{10,20,40,80} and w=2.5) driven by such inputs accumulates Fisher information Jn(t) about mean μ. (a) Accumulation over input interval t=[0,1] of fractional information Jrel(t)=Jn(t)σ2 by an initially inactive pool of size N. (b) Information about μ retained by summed activity n^=n1++n4 of four independent pools (all initially inactive and of size N) receiving concurrently four independent inputs (sN(μ,σ)) over an interval t=[0,1]. Retained fraction Jrel=Jn^(1)σ2/4 depends on pool size N and input variance σ2. (c) Information about μ retained by activity n of one pool (initially inactive and of size N) receiving successively four independent inputs (sN(μ,σ)) over an interval t=[0,4]. Retained fraction Jrel=Jn(4)σ2/4 depends on pool size N and input variance σ2.

Appendix 1—figure 4
Decision response to fixed input I, I, for random initial conditions of e, e, r, r.

(a) Expected differential steady-state activation |rr| of decision level. Steady-state activity r+r1 implies a categorical decision with activity 1 of one pool and activity of another. (b) Probability that decision correctly reflects input bias (r§gt;r if I§gt;I), and vice versa.

Appendix 1—figure 5
Exponential habituation and recovery of evidence activities.

Dominance durations depend on distance between asymptotic values and on characteristic times. (a, b) Development of evidence e (blue) and e (red), over two successive dominance periods. Input c=15/16 is stronger, input c=1/16 weaker. Activities recover, or habituate, exponentially until reversal threshold Δrev is reached. Thin curves extrapolate to the respective asymptotic values, e and e. (a) Evidence e (with weaker input c) is dominant. Incrementing input c to non-dominant evidence e shortens dominance T. (b) Evidence e (with stronger input c) is dominant. Incrementing input c to e extends dominance T. (c–f) Contrast dependence of relaxation dynamics, as a function of differential contrast cc, for c+c=1. Values when evidence e is dominant (dom, thick solid curves), and when it is non-dominant (sup, thick dotted curves). Values for e are mirror symmetric (about vertical midline cc=0). (c) Effective potential Δue+ue. (d) Characteristic time τe. (e) Relaxation range ereve (bottom left, thin curves erev, thick curves e). (f) Effective rate ρe of development. Symbols and arrows correspond to subfigures (a, b) and represent recovery (up arrow) or habituation (down arrow) of stronger-input evidence (blue) or weaker-input evidence (red). Underlying color patches indicate dominance of stronger-input evidence (blue patches) or of weaker-input evidence (red patches). Dominance durations depend more sensitively on the slower development, with smaller ρ, which generally is the recovery of non-dominant evidence (up arrows).

Appendix 1—figure 6
Birth-death dynamics of evidence pools ensures gamma-like distribution and ‘scaling property’ (invariance of distribution shape).

(a) Representative examples for the time development of evidence bias Δe=ee between reversals (i.e., between Δrev and approximately +Δrev). (b) Dominance distributions for c=c=1/16 (blue), c=c=1/4 (green), and c=c=1 (yellow). Distribution mean μ changes approximately threefold, but coefficient of variation cV and skewness γ1 are nearly invariant (inset), largely preserving distribution shape. (c) Development of expectation Δx between reversals (schematic). Left: a Poisson variable process, such as the difference Δx between two birth-death processes. Mean Δx grows linearly with t (lines, with slopes μ, μ) and variance (ΔxΔx)2 grows linearly with t (dashed curves, with scaling factors σ, σ). Constants μ and σ change with stimulus contrast (blue and red). Proportionality μσ2 ensures constant dispersion of Δx at threshold (δx=δx), and, consequently, a dispersion of threshold-crossing times that grows linearly with mean threshold-crossing time (δt/trev=δt/trev=const), preserving distribution shape. Right: a process with constant variance, σ=σ. Dispersion of Δx at threshold increases with threshold-crossing time (δx§gt;δx) and dispersion of threshold-crossing times grows supra-linearly with mean threshold-crossing time (δt/trev<δt/trev§lt;δt/trev), broadening distribution shape.

Appendix 1—figure 7
Characteristic times of evidence activity.

(a) Characteristic times τe, τe for different image contrast c=c, when evidence pool is dominant (dom) and non-dominant (sup). (b) Autocorrelation of evidence activity e, e as a function of image contrast (color) and latency, expressed in multiples of average dominance duration T. (c) Autocorrelation of joint evidence activity e¯=(e+e)/2 as a function of image contrast (color) and latency. Note that autocorrelation time lengthens substantially for high image contrast.

Appendix 1—figure 8
Burstiness of reversal sequences predicted by model and confirmed by experimental observations.

(a) Burstiness index (BI) (mean) for n successive dominance periods in experimentally observed reversal sequences, for contrasts 12.5% (green), 50% (yellow), and 100% (red). (b) BI for reversal sequences generated by model (mean ± std).

Appendix 1—figure 9
Dependence of fit error on individual parameter values (with all other parameter values fixed).

30 equally spaced values were tested (blue dots) and fitted by a quadratic function (red solid curve, with 95% confidence intervals indicated by dotted curves). For each parameter, both the optimal value (red cross) and the extremum of the parabolic fit (green circle) are shown.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Robin Cao
  2. Alexander Pastukhov
  3. Stepan Aleshin
  4. Maurizio Mattia
  5. Jochen Braun
(2021)
Binocular rivalry reveals an out-of-equilibrium neural dynamics suited for decision-making
eLife 10:e61581.
https://doi.org/10.7554/eLife.61581