Introduction

For decades, understanding how people make rapid decisions has been a central focus in cognitive science. Studying quick decisions allows researchers to investigate the properties of latent cognitive processes underlying decision making and to develop process-level theories of human choice behavior. Among the various formal cognitive models proposed to explain decision making, evidence accumulation models (EAMs; Ratcliff, 1978; Ratcliff and McKoon, 2008; Stone, 1960; Laming, 1968) represent the most prominent class of cognitive computational models (Forstmann et al., 2016). A key advantage of EAMs is their ability to account for both choice and response time (RT) simultaneously (Forstmann et al., 2016; Ratcliff et al., 2016). The most widely used variant of EAMs is the well-known diffusion decision model (DDM; Ratcliff, 1978; Ratcliff and Rouder, 1998; Ratcliff and McKoon, 2008), originally proposed to explain behavior in two-alternative choice tasks. The DDM assumes that the decision maker accumulates noisy evidence (with the mean evidence accumulation rate governed by a constant drift rate) until the relative evidence for one option reaches a fixed decision threshold. This model has made substantial contributions to our understanding of the neural and cognitive mechanisms underlying decision making (e.g., Forstmann et al., 2016; Gold and Shadlen, 2007; Ratcliff et al., 2016). 
Moreover, it has been widely employed as a cognitive psychometric instrument for data analysis and for describing latent cognitive processes across various domains, including perceptual decisions (e.g., Dutilh and Rieskamp, 2016; Forstmann et al., 2011; Evans and Brown, 2017), risky decisions (e.g., Olschewski et al., 2025; Bhui, 2019; Zhao et al., 2020), value-based decisions (e.g., Krajbich et al., 2010; Fontanesi et al., 2019; Khodadadi et al., 2017; Gluth et al., 2020), numerical cognition (e.g., Ratcliff and McKoon, 2018; Ratcliff, 2022), intelligence (e.g., Schmiedek et al., 2007; Schubert and Frischkorn, 2020), and aging (e.g., Starns and Ratcliff, 2010; Ratcliff, 2008; Ratcliff et al., 2010; von Krause et al., 2022), among others. Additionally, this model has been employed to investigate the information processing patterns in clinical populations (e.g., Nejati et al., 2022; Ging-Jehli et al., 2021; Pirrone et al., 2017; Pedersen et al., 2017; Karalunas et al., 2012). The broad application of the DDM is attributed to the interpretability of its parameters, which has been validated in several studies (e.g., Voss et al., 2004; Arnold et al., 2015; Lerche and Voss, 2019), as well as their reliability in parameter estimation (Lerche and Voss, 2017; Yap et al., 2012).

Although the DDM has become a standard model for describing the latent cognitive constructs of decision making, alternative variants — such as the urgency gating model (Cisek et al., 2009; Thura et al., 2012; Trueblood et al., 2021) and collapsing threshold diffusion models (CT-DDMs; Drugowitsch et al., 2012; Tajima et al., 2016) — better align with certain behavioral and neuroscientific findings (Ratcliff and Frank, 2012; Ging-Jehli et al., 2025). In particular, the DDM assumes that the decision threshold remains fixed over time, implying that the decision maker requires a constant amount of evidence to make a decision, regardless of the time spent on the choice. However, several neuroimaging studies have shown that humans and non-human primates tend to become more urgent as time progresses (e.g., Ditterich, 2006; Thura et al., 2012; Cisek et al., 2009; Murphy et al., 2016; Steinemann et al., 2018; Gluth et al., 2012, 2013; Ging-Jehli et al., 2025; Grogan et al., 2025). These findings support the idea of the collapsing threshold or urgency gating models, in which the required amount of evidence for making a decision decreases over time, thereby increasing the urgency of the decision maker (although the urgency gating model and collapsing threshold diffusion model are theoretically distinct, Smith and Ratcliff (2022) demonstrated that they can be transformed into one another. Therefore, we primarily focus on collapsing threshold models for the remainder of this paper; however, we will explain how the current work can also be related to urgency gating models in more detail in the general discussion).

CT-DDMs also predict several behavioral effects that fixed-threshold diffusion models (FT-DDMs) fail to capture. First, CT-DDMs are more flexible in accounting for RT distributions with less asymmetry. Specifically, while FT-DDMs typically predict strongly right-skewed RT distributions (Ratcliff and Smith, 2004), CT-DDMs can account for Gaussian-like RT distributions (Evans and Hawkins, 2019; Hawkins et al., 2015). Several studies have observed such symmetric, Gaussian-like RT distributions under specific conditions (e.g., Roitman and Shadlen, 2002; Ditterich, 2006; Hawkins et al., 2019; Evans and Hawkins, 2019; O’Connell et al., 2012). A second key prediction of CT-DDMs is that slower responses tend to be less accurate — an effect supported by several empirical studies (e.g., Olschewski et al., 2025; Murphy et al., 2016; Steinemann et al., 2018). In other words, these studies showed that the error rate increases as time passes. In contrast, the FT-DDM predicts a constant error rate across both fast and slow responses (it is worth noting that including trial-to-trial variability in drift rate enables the FT-DDM to predict slow errors (Ratcliff and Rouder, 1998; Voskuilen et al., 2016)). Beyond these behavioral effects, some empirical studies have shown that introducing a decision deadline or emphasizing urgency encourages individuals to adopt a collapsing threshold strategy (Hawkins et al., 2015; Evans et al., 2020a; Evans and Hawkins, 2019; Katsimpokis et al., 2020). Furthermore, quantitative model comparison approaches have found broad empirical support for the idea that people adjust their decision threshold during a single decision (Olschewski et al., 2025; Palestro et al., 2018; Bhui, 2019; Khodadadi et al., 2017; Ging-Jehli et al., 2025; Ratcliff and Frank, 2012).

On top of the supporting evidence from behavioral and neuroimaging studies for CT-DDMs, there is also a strong theoretical foundation for these models. CT-DDMs offer a normative, optimal policy for evidence accumulation in several scenarios. The FT-DDM maximizes accuracy for a fixed mean RT — or, equivalently, minimizes mean RT for a fixed accuracy level — in environments with homogeneous choice difficulty and no external biases (Bogacz et al., 2006; Moran, 2015). However, this information processing strategy is not optimal in more complicated settings. For instance, in environments where the difficulty of the choice problem is unknown or varies across trials (Drugowitsch et al., 2012; Tajima et al., 2016; Fudenberg et al., 2018), or where a stochastic deadline governs decision timing (Frazier and Yu, 2007), or where external biases are present (Moran, 2015), CT-DDMs provide the optimal evidence accumulation policy.

Although there is both theoretical and empirical support for CT-DDMs, a major limitation that restricts their use as measurement models for exploring individual differences is that some CT-DDM parameters cannot be reliably estimated. Several systematic evaluations using various parameter estimation methods have reported poor parameter recovery for CT-DDMs (Evans et al., 2020b; Murrow and Holmes, 2024a; Fengler et al., 2021), especially when the threshold declines nonlinearly. In other words, these studies found that the actual generating parameters of CT-DDMs are not recoverable. Yet, reliable parameter recovery is essential for interpreting the parameters of a cognitive model. Accordingly, the poor parameter recovery of CT-DDMs prevents the use of parameter estimates to test hypotheses about latent cognitive constructs related to threshold dynamics in decision making (Kruschke and Liddell, 2018). Thus, developing a reliable method for estimating CT-DDM parameters is a crucial step toward understanding how individual differences influence within-trial threshold dynamics. This is particularly important given that whether people adjust their thresholds during the course of a single trial remains an open question (despite all the aforementioned evidence supporting CT-DDMs, some computational modeling studies have not found substantial improvements in model fit when incorporating a time-dependent threshold compared to a fixed threshold (e.g., Voskuilen et al., 2016; Smith and Ratcliff, 2022)).

Since CT-DDMs lack a closed-form likelihood function, most methodological efforts to improve parameter estimation have focused on developing numerical procedures to approximate the likelihood function. One such approach is the integral equation method (e.g., Buonocore et al., 1987, 1990; Smith, 2000; Zhang et al., 2014; Smith and Ratcliff, 2022; Hadian Rasanan et al., 2025), which is both simple and computationally efficient. In this method, the first-passage time distribution of the diffusion process is approximated by solving a linear Volterra integral equation of the second kind. Another prominent approach involves partial differential equations, where the first-passage time distribution is estimated by numerically solving a forward or backward Kolmogorov equation (e.g., Hadian Rasanan et al., 2023, 2024b; Murrow and Holmes, 2024b; Shinn et al., 2020; Boehm et al., 2021; Richter et al., 2023; Voss and Voss, 2008). In addition to these two numerical methods, which directly estimate the first-passage time distribution, there are simulation-based techniques that approximate the likelihood function using large-scale simulations. These include kernel density estimation approaches (e.g., Turner and Sederberg, 2014; Holmes, 2015) and neural network-based methods (e.g., Fengler et al., 2021; Radev et al., 2020, 2023a). Each of these likelihood approximation procedures has its strengths and limitations. However, it is important to note that none of these methods has fully resolved the reliability issues associated with parameter estimation in CT-DDMs. One reason that more precise likelihood approximation methods have not improved the reliability of parameter estimation is the trade-off between non-decision time and threshold parameters. In other words, there is an identifiability issue in CT-DDMs, which makes estimating threshold dynamics challenging and cannot be resolved by a more precise likelihood approximation. This issue will be discussed further in the next section.

A less explored yet promising solution for improving the reliability of parameter estimation in CT-DDMs is neural-informed cognitive modeling (also known as joint models). Traditionally, these models have been employed to simultaneously fit and predict both behavioral and neural data (e.g., Forstmann and Wagenmakers, 2015; Schall, 2004; Turner et al., 2017, 2015; Nunez et al., 2017; Ghaderi-Kangavari et al., 2022, Ghaderi-Kangavari et al., 2023). Recent studies have demonstrated that neural-informed approaches can also enhance parameter recovery in EAMs and even render previously unidentifiable parameters identifiable (Nunez et al., 2025; Ghaderi-Kangavari et al., 2023). Fundamentally, linking DDM parameters to external sources of information — such as neurophysiological signals — introduces additional constraints on the parameter space, thereby enhancing identifiability (e.g., Nunez et al., 2025; Ghaderi-Kangavari et al., 2023). For instance, Nunez et al. (2025) recently proposed a joint modeling framework to identify the diffusion coefficient (i.e., the noise parameter) in the FT-DDM. By constraining the threshold parameter using external observations (e.g., neural signals), the authors demonstrated that one could simultaneously estimate the drift rate, threshold, and diffusion coefficient parameters that cannot be jointly identified using only choice and RT data (Nunez et al., 2025).

In this work, we explain how non-decision time and collapsing threshold parameters trade off and introduce a non-decision time–informed (NDT-informed) diffusion modeling framework that links non-decision time to external observations, enabling more reliable estimation of collapsing threshold parameters. In contrast to previous methodological studies that primarily focused on likelihood approximation techniques, our contribution lies in proposing a novel joint modeling framework, rather than a new estimation algorithm. We demonstrate that informing the model with trial-level noisy measurements of non-decision time — potentially derived from neural data (e.g., Nunez et al., 2019) — substantially improves parameter recovery in CT-DDMs. In addition, linking non-decision time to external observations enhances the model’s fit to behavioral data (i.e., RTs and choices).

The remainder of this paper is organized as follows. First, we present the NDT-informed diffusion modeling framework and discuss a method for estimating non-decision time at the trial level using neural signals, along with theoretical justifications for how non-decision time can improve parameter estimation in CT-DDMs. Next, we report results from an extensive simulation study to assess the effectiveness of the proposed joint modeling framework. We then reanalyze two empirical datasets to illustrate the practical applicability of the method and to examine whether empirical evidence supports the collapsing threshold hypothesis. Finally, we discuss how the proposed NDT-informed diffusion modeling can address the mixed findings in support of CT-DDMs and explain how this method can be generalized to other types of diffusion models. We also discuss some behavioral techniques for estimating non-decision time, which do not require neural data.

Non-decision time-informed diffusion modeling

The evidence accumulation process considered in the CT-DDM can be represented by a Wiener process (also known as Brownian motion) fluctuating between two time-dependent thresholds (Ratcliff et al., 2016):

dX(t) = v dt + s dW(t),    X(0) = x0,    (1)

in which X(t) represents the accumulated evidence up to time t, v is the mean evidence accumulation rate (drift rate), s is the diffusion coefficient, x0 is the starting point bias (also known as pre-decision bias), and W(t) is the Wiener process. The process starts accumulating from x0 and continues until the accumulator crosses either the upper (bu(t)) or the lower (bl(t)) threshold (i.e., X(t) ≥ bu(t) or X(t) ≤ bl(t)). The drift rate v modulates the rate of evidence accumulation and corresponds to the quality of the input signal: when the signal-to-noise ratio is high, the task becomes easier and the speed of evidence accumulation increases. The diffusion coefficient s determines the level of noise in the process. In addition to these components related to decision time, the DDM also includes a non-decision time parameter (τ), which corresponds to the total time unrelated to the decision process, such as perceptual encoding time and motor execution time. These — namely, v, x0, bu(t), bl(t), and τ — are the main components of the model (note that DDMs can also contain trial-to-trial variability in drift rate, non-decision time, or starting point (Ratcliff and Rouder, 1998; Ratcliff and Tuerlinckx, 2002). While incorporating trial-to-trial variability parameters generally enhances model fit, they often face significant challenges in parameter recovery (Boehm et al., 2018; Lerche and Voss, 2016) and are typically not the primary focus of parameter inference. Thus, we did not consider trial-to-trial variability in the model’s parameters). Figure 1 illustrates a fixed threshold and a hyperbolic collapsing threshold.

An illustration of the effect of non-decision time on the threshold value at the final stopping point.

The illustration of the evidence accumulation process in the fixed threshold diffusion model (left panel; bu(t) = −bl(t) = θ) and hyperbolic collapsing threshold diffusion model (right panel; bu(t) = −bl(t) = θ/(1 + λt)).

Why constraining non-decision time improves parameter estimation in CT-DDMs

A crucial question that needs to be addressed before presenting the NDT-informed diffusion model is why constraining the non-decision time can enhance the parameter estimation of CT-DDMs. To answer this question, note that computing the likelihood function of a CT-DDM requires incorporating the threshold value at the final stopping time (see the discussion of the Girsanov change-of-measure theorem in Smith (2016) and Hadian Rasanan et al. (2025)). The final stopping time determines the decision time or, equivalently, equals the difference between the RT and the non-decision time (i.e., t = RT − τ). When the decision threshold is fixed over time, its value at the stopping point is the same for any decision time (RT − τ), meaning that non-decision time does not influence the threshold value at the stopping point. However, when the decision threshold is time-dependent, the threshold value at the final stopping point depends on non-decision time (i.e., we need to incorporate bu(RT − τ) and bl(RT − τ)). In other words, maximizing the likelihood amounts to maximizing the probability of reaching a particular threshold value at the final stopping point (RT − τ). Therefore, the estimated non-decision time parameter affects the threshold dynamics by forcing the threshold to reach the value at the stopping time that maximizes the likelihood function. Figure 1 illustrates the effect of non-decision time on the threshold value at the final stopping point.

Consequently, the dependence of the threshold value at the final stopping point on non-decision time causes a trade-off between the threshold parameters (e.g., starting threshold and decay rate) and non-decision time. For instance, if the threshold has a hyperbolic dynamic (bu(t) = θ/(1 + λt), in which θ is the starting threshold, and λ is the decay rate), the threshold value at the final stopping point becomes equal to θ/(1 + λ(RT − τ)). By dividing both the numerator and the denominator by θ, we obtain 1/(1/θ + (λ/θ)(RT − τ)), which shows that there is a trade-off between non-decision time, starting threshold, and decay rate. Accordingly, one cannot estimate these three parameters simultaneously. Moreover, this implies that even a small error in non-decision time estimation can lead to erroneous estimates of the time-dependent threshold parameters. However, constraining non-decision time in the NDT-informed diffusion modeling framework makes all parameters identifiable, thereby improving the reliability of parameter recovery (see also Kira et al., 2025).
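The ridge in parameter space described above can be verified numerically. The following sketch (with arbitrary illustrative parameter values, not values from the paper) constructs two distinct (θ, λ, τ) triples that yield exactly the same hyperbolic threshold value at every stopping point RT − τ:

```python
import numpy as np

def threshold_at_stopping_point(rt, theta, lam, tau):
    """Hyperbolic threshold b_u(t) = theta / (1 + lam * t),
    evaluated at the decision time t = rt - tau."""
    return theta / (1.0 + lam * (rt - tau))

# Two distinct parameter triples (illustrative values) lying on the same
# identifiability ridge: rewriting b_u(RT - tau) as
# 1 / ((1 - lam*tau)/theta + (lam/theta) * RT) shows that only the two
# combinations (1 - lam*tau)/theta and lam/theta are constrained by the
# threshold value at the stopping point.
set_a = dict(theta=1.5, lam=2.0, tau=0.3)
set_b = dict(theta=1.875, lam=2.5, tau=0.2)

rts = np.linspace(0.4, 3.0, 200)
b_a = threshold_at_stopping_point(rts, **set_a)
b_b = threshold_at_stopping_point(rts, **set_b)
print(np.max(np.abs(b_a - b_b)))  # numerically zero
```

Because only two combinations of the three parameters are constrained, fixing τ with an external measurement removes the remaining degree of freedom.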

Model specification

The traditional diffusion modeling approach estimates model parameters solely based on RT and choice data (e.g., Ratcliff and Rouder, 1998; Ratcliff and Tuerlinckx, 2002; Ratcliff and Smith, 2004). In contrast, in this work, we also assume that an additional source of data related to non-decision time is available. This additional data represents a trial-level noisy measurement of non-decision time, which differs from the actual non-decision time parameter (τ) in the model. Therefore, for each trial n, we assume that measurements of RTn, Choicen, and non-decision time (Zn) are available. It is worth highlighting the difference between Zn and τ: τ is the mean non-decision time parameter, which is fixed across trials and is estimated through model fitting, whereas Zn is a trial-level non-decision time measurement (observation) that can be extracted from an external source of data (e.g., from neural signals; see the following section). It is crucial to note that during parameter estimation, we treat Zn as an input to the model. We assume that the non-decision time measurements are approximately log-normally distributed as they must take on positive values and produce a right-skewed distribution (Verdonck and Tuerlinckx, 2016) (it is also common to assume the non-decision time is uniformly (e.g., Verdonck and Tuerlinckx, 2016) or normally distributed (e.g., Nunez et al., 2019; Christie and Luce, 1956; Kira et al., 2025). We also tested the normal distribution assumption, and all the results presented in this paper replicated under this assumption). Thus, we model the non-decision time measurements Zn using a log-normal distribution with parameters µ and σz. The available measurements (i.e., observed data) at trial n can be represented as follows:

(RTn, Choicen, Zn),    n = 1, …, N.

To link the non-decision time measurements (Zn) to the non-decision time parameter (τ), we assume that the actual non-decision time (τ) is the mean value of the observation distribution, which implies τ = exp(µ + σz²/2), or equivalently µ = log(τ) − σz²/2. Therefore, the joint negative log-likelihood of the NDT-informed CT-DDM can be formulated as follows:

−LLjoint = −LLCT-DDM − Σn=1…N log fLN(Zn; log(τ) − σz²/2, σz),    (3)

The first term (LLCT-DDM) corresponds to the likelihood of the choice behavior (i.e., RT and choice data) predicted by the CT-DDM. The second term represents the log-likelihood of the observed non-decision times, which are assumed to follow a log-normal distribution with a mean of τ and a shape parameter of σz. The non-decision time parameter τ is shared across both terms, and linking it to trial-level measurements of non-decision time (Zn) helps constrain its estimation. By minimizing this joint negative log-likelihood function, we can simultaneously estimate the CT-DDM parameters that predict human choice behavior and the summary statistics of the non-decision time observations.
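A schematic Python sketch of this joint objective may help make the structure concrete. The CT-DDM term is left as a placeholder (`ctddm_nll` is a hypothetical callable; any likelihood approximation method could supply it), and the parameter names are illustrative:

```python
import numpy as np
from scipy.stats import lognorm

def lognormal_nll(z, tau, sigma_z):
    """Negative log-likelihood of trial-level NDT measurements z under a
    log-normal distribution with mean tau, i.e. mu = log(tau) - sigma_z**2 / 2."""
    mu = np.log(tau) - 0.5 * sigma_z**2
    return -np.sum(lognorm.logpdf(z, s=sigma_z, scale=np.exp(mu)))

def joint_nll(params, rt, choice, z, ctddm_nll):
    """Joint objective: -LL_CTDDM - sum_n log LogNormal(Z_n | tau, sigma_z).
    `ctddm_nll` is a placeholder for a CT-DDM likelihood approximation
    (e.g., via the integral equation method); tau is shared across terms."""
    v, theta, lam, tau, sigma_z = params
    return ctddm_nll(v, theta, lam, tau, rt, choice) + lognormal_nll(z, tau, sigma_z)
```

Note that, with this parameterization, the log-normal mean exp(µ)·exp(σz²/2) reduces to τ, which is what ties the measurement model to the diffusion model.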

As previously noted, an exact likelihood function for CT-DDMs is not available. Nevertheless, several numerical methods exist for approximating this likelihood, including the integral equation method (e.g., Smith, 2000; Zhang et al., 2014; Smith and Ratcliff, 2022) and the partial differential equation method (e.g., Hadian Rasanan et al., 2023; Shinn et al., 2020; Boehm et al., 2021; Richter et al., 2023). Any of these methods can be used to compute −LLCT-DDM. In this work, we employed the integral equation method, which is a relatively simple and accurate approach (Richter et al., 2023; Hadian Rasanan et al., 2025). In Appendix 1, we detail the procedure for computing −LLCT-DDM using the integral equation method. Moreover, we replicated the main results with a BayesFlow approach (Radev et al., 2020) to check for any sensitivity to the likelihood approximation method.

How to measure non-decision time

The central assumption of the NDT-informed diffusion modeling is that the non-decision time measurement (Zn) is available for each trial. Thus, providing a reliable measurement of non-decision time is crucial for the proposed model. Measuring non-decision time using neurophysiological signals such as electromyography (EMG) or electroencephalography (EEG) has been studied in the field of model-based cognitive neuroscience for a decade (e.g., Servant et al., 2016; Nunez et al., 2019; Weindel et al., 2021a; Ghaderi-Kangavari et al., 2022). Accordingly, several techniques have been proposed in the literature to measure non-decision time. One class of methods involves measuring single-trial event-related potentials (ERPs), enabling researchers to link the ERP components to cognitive parameters of the diffusion model through a (non-)linear regressor (e.g., Nunez et al., 2017; Bridwell et al., 2018; Ghaderi-Kangavari et al., 2022, 2023). One main single-trial ERP component of interest is the N200 peak latency, which is thought to correlate with perceptual encoding time (Nunez et al., 2019). However, single-trial N200 latencies capture only a portion of non-decision time at best, and thus, cannot be used directly as a measurement of total non-decision time. Furthermore, incorporating the N200 component as a regressor for the non-decision time parameter does not resolve the identifiability issue of the CT-DDM. This limitation was highlighted by Ghaderi-Kangavari et al. (2023), who conducted a systematic study on parameter recovery for various neural-informed diffusion models. They reported unreliable parameter estimates for the CT-DDM in which non-decision time was linked to trial-level N200 latency via a regressor (see models 5 and 6 in Ghaderi-Kangavari et al., 2023).
The poor parameter recovery in their models arises from a trade-off between the threshold parameters and the regression coefficients — a problem similar to the identifiability issue we discussed earlier. Therefore, informing the model with N200 latency does not resolve the underlying identifiability problem.

Similarly, an EMG signal can be used to measure the execution time of motor responses (Weindel et al., 2021a). In this approach, we can detect and measure the motor execution time based on the muscle activity. Although this method provides an accurate measure of motor execution, it does not capture total non-decision time and fails to address the parameter recovery issues in CT-DDMs reported by Ghaderi-Kangavari et al. (2023). Furthermore, recent evidence shows that at least some portion of the EMG signal is related to evidence accumulation (Servant et al., 2021; Weindel, 2021; Weindel et al., 2021b). Although a potential approach would be to use both EMG and EEG signals together to estimate perceptual encoding and motor execution time, this approach is relatively complicated (see also Kelly et al. (2021) for a discussion on how to extract perceptual encoding onset, evidence accumulation onset, and post-decision motor-execution time from EEG signals).

More recent advancements in neural signal analysis have led to the development of methods to discover single-trial EEG or MEG events across the entire trial-level time series (e.g., Anderson et al., 2016; Weindel et al., 2024). These methods provide the sequence and timing of latent cognitive steps at each trial and have already been used to uncover the cognitive stages in various empirical paradigms, including perceptual decision tasks (Van Maanen et al., 2021), lexical decision tasks (Berberyan et al., 2021; Krause et al., 2024), and associative recognition tasks (Van Maanen et al., 2021). In this paper, we have employed the hidden multivariate pattern (HMP) method (Weindel et al., 2024) to extract non-decision time from the EEG signal. It has been demonstrated that this method can decompose the RT into components associated with decision-relevant features versus decision-unrelated physical properties of the stimuli (Weindel et al., 2025). Thus, we can extract the decision time from neural signals, and by subtracting the estimated decision time from observed RT, we can obtain a measurement of non-decision time in each trial. The entire procedure of NDT-informed diffusion modeling, which involves extracting non-decision time from neural signals using the HMP method, is presented in Figure 2.

Illustration of the non-decision time-informed diffusion model.

The HMP analysis method extracts the timing of cognitive states in each trial from EEG signals. The decision time is then taken as the duration between the cognitive events that delimit the decision process, such as the N200 latency, which marks the end of perceptual encoding (Nunez et al., 2019), and the peak of the centro-parietal positivity, which indicates the end of the decision process (O’Connell et al., 2012); see also Weindel et al. (2025) for a detailed discussion. Subtracting the decision time from the response time gives an approximation of the non-decision time in each trial. The extracted non-decision time measurements are employed to constrain the non-decision time parameter in the CT-DDM.

Simulation Study

Methods

We conducted a parameter recovery study using simulated data to evaluate whether the parameters of the NDT-informed CT-DDM can be estimated reliably. To assess the robustness of our method, we considered two distinct functional forms for the threshold dynamics. Theoretical studies on optimality have shown that the optimal decision threshold follows a nonlinear, monotonically decreasing trajectory (Fudenberg et al., 2018; Frazier and Yu, 2007). These studies suggest that hyperbolic or exponential functions are the most plausible candidates for modeling the optimal threshold (Fudenberg et al., 2018; Frazier and Yu, 2007). These collapsing thresholds have already been used in several studies (e.g., Olschewski et al., 2025; Bhui, 2019; Milosavljevic et al., 2010; Voskuilen et al., 2016; Hanks et al., 2011). Additionally, Smith and Ratcliff (2022) demonstrated that an urgency gating model with a linear urgency signal is mathematically equivalent to a hyperbolic CT-DDM (see also Trueblood et al., 2021). It is also important to note that prior research has reported poor parameter recovery for nonlinear collapsing threshold models (Evans et al., 2020b; Murrow and Holmes, 2024a). Therefore, we selected the exponential collapsing threshold diffusion model (ECT-DDM) and the hyperbolic collapsing threshold diffusion model (HCT-DDM) — two of the most theoretically motivated and widely supported nonlinear threshold dynamics — for our simulations (some studies have also employed the Weibull function for modeling threshold dynamics (e.g., Hawkins et al., 2015; Ging-Jehli et al., 2025; Evans et al., 2020a). This choice is typically made for its flexibility, allowing the model to mimic a variety of threshold dynamical forms. However, this function has three parameters, which causes some trade-off among the threshold parameters and makes the parameter recovery much more difficult):

ECT-DDM: bu(t) = θ exp(−λt),
HCT-DDM: bu(t) = θ/(1 + λt),

where θ is the starting threshold (i.e., bu(0) = θ) and λ > 0 is the decay rate. bu(t) stands for the upper threshold, and the lower threshold is equal to the reflection of the upper threshold (i.e., bl(t) = −bu(t)). In all simulations, we assumed that the accumulation process starts from zero (i.e., starting point x0 = 0), which implies that there is no a priori bias towards one option in the simulation data. Also, we set the diffusion coefficient equal to one (i.e., s = 1). To simulate RT and choice, we considered the following discrete form of the accumulation process (1) with a time step Δt = 0.001:

X(t + Δt) = X(t) + vΔt + s√Δt εt,    εt ∼ N(0, 1).

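A minimal Euler–Maruyama sketch of this discretized accumulation process, assuming hypothetical parameter values rather than the paper's actual simulation settings:

```python
import numpy as np

def simulate_ctddm_trial(v, theta, lam, tau, s=1.0, x0=0.0, dt=0.001,
                         t_max=10.0, threshold="hyperbolic", rng=None):
    """Simulate one CT-DDM trial with the Euler-Maruyama update
    X(t + dt) = X(t) + v*dt + s*sqrt(dt)*eps, eps ~ N(0, 1).
    Returns (rt, choice); choice is +1 (upper) or -1 (lower boundary)."""
    rng = np.random.default_rng() if rng is None else rng
    x, t = x0, 0.0
    while t < t_max:
        # symmetric collapsing thresholds: b_l(t) = -b_u(t)
        b = theta * np.exp(-lam * t) if threshold == "exponential" \
            else theta / (1.0 + lam * t)
        if x >= b:
            return t + tau, +1
        if x <= -b:
            return t + tau, -1
        x += v * dt + s * np.sqrt(dt) * rng.standard_normal()
        t += dt
    return np.nan, 0  # no boundary crossing before t_max

# illustrative parameter values (not drawn from the paper's priors)
rng = np.random.default_rng(1)
trials = [simulate_ctddm_trial(v=1.0, theta=1.5, lam=1.0, tau=0.3, rng=rng)
          for _ in range(200)]
```

Because the threshold keeps collapsing, almost every simulated trial terminates well before `t_max`, and every simulated RT exceeds the non-decision time τ.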
To cover all possible model behaviors in the simulations, we used the following uniform distributions for the CT-DDM parameters (i.e., v, τ, θ, and λ) and the shape parameter of the non-decision time measurements (σz; 𝒰[a, b] represents a uniform distribution over the interval between a and b):

After sampling random parameters from the uniform distributions described above, we generated RT, choice, and random non-decision time measurements (Zn) for various trial numbers (i.e., 100, 250, 500, 750, and 1000). For each number of trials, we generated 1000 datasets. As we already mentioned, we assumed that the actual non-decision time (τ) is the mean value of the log-normally distributed non-decision time measurements. Thus, the location parameter of the log-normal distribution is obtained by µ = log(τ) − σz²/2. Therefore, in each trial, we sampled a noisy observation of non-decision time from the following log-normal distribution:

Zn ∼ LogNormal(log(τ) − σz²/2, σz).

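As a quick numerical check of this parameterization (using illustrative values for τ and σz, not the paper's priors), the sample mean of draws from this log-normal distribution recovers τ:

```python
import numpy as np

rng = np.random.default_rng(0)
tau, sigma_z = 0.35, 0.3              # illustrative values
mu = np.log(tau) - 0.5 * sigma_z**2   # location parameter of the log-normal

# trial-level noisy NDT measurements Z_n with mean tau
z = rng.lognormal(mean=mu, sigma=sigma_z, size=100_000)
print(z.mean())  # close to tau = 0.35
```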
We employed the differential evolution optimization routine to minimize the joint negative log-likelihood (3). To approximate the likelihood function of the CT-DDM, we used the integral equation method with a time step Δt = 0.02 (see Appendix 1 for the details of the likelihood approximation). A problem with the maximum likelihood procedure is its sensitivity to outliers, which can yield likelihood values of zero (Ratcliff and Tuerlinckx, 2002). Outliers can affect the fitting procedure and cause convergence problems (Ratcliff and Tuerlinckx, 2002; Heathcote et al., 2002). To address this issue, we truncated likelihood values below 10^−14. To account for potential influences of the estimation method on parameter inference, we replicated the main results using amortized Bayesian inference as implemented in BayesFlow (Radev et al., 2020, 2023b). The results obtained with this alternative approach are presented in Appendix 2.

To evaluate the precision of parameter recovery, we employed the R-squared (R²) index, which is defined as follows:

in which ϑ and stand for the true generating parameter and estimated parameter, respectively, and in the denominator shows the mean of the true parameter. Based on the R2 index, we can assess how well the estimated parameters align with the true data-generating parameters. This metric has already been employed in the literature to evaluate parameter estimation methods and assessment of estimation reliability in cognitive models (e.g., Radev et al., 2020; Hadian Rasanan et al., 2025).
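For concreteness, this index is a direct translation of the verbal definition above (the function name is ours):

```python
import numpy as np

def r_squared(theta_true, theta_est):
    """R^2 between true generating parameters and their estimates:
    1 - SS_residual / SS_total, where SS_total is taken around the
    mean of the true parameters."""
    theta_true = np.asarray(theta_true, dtype=float)
    theta_est = np.asarray(theta_est, dtype=float)
    ss_res = np.sum((theta_true - theta_est) ** 2)
    ss_tot = np.sum((theta_true - theta_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot
```

Perfect recovery gives R² = 1, while estimates no better than the mean of the true values give R² ≤ 0.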

Results

Figure 3 illustrates the R2 value for the threshold parameters (i.e., starting threshold θ, and decay rate λ) for both the NDT-informed and uninformed CT-DDM, based on 500 trials. First, this plot demonstrates a substantial improvement in the reliability of threshold parameter recovery in the NDT-informed CT-DDM compared to the uninformed CT-DDM. This improvement is more pronounced for the hyperbolic threshold than the exponential threshold. Second, the results indicate that threshold parameter estimation is generally reliable for both the hyperbolic and exponential models, with particularly high precision observed in the exponential case.

R2 values measuring the agreement between estimated and ground truth threshold parameters in exponential (left) and hyperbolic (right) collapsing threshold models.

Figure 4 shows the estimated parameters against the true parameters for both NDT-informed CT-DDMs and uninformed CT-DDMs based on 500 trials. The results suggest that the starting threshold parameter in both hyperbolic and exponential uninformed CT-DDM is unrecoverable, which aligns with previous parameter recovery assessment studies (Evans et al., 2020b; Murrow and Holmes, 2024a; Ghaderi-Kangavari et al., 2023). Additionally, the decay rate in the hyperbolic uninformed CT-DDM exhibits poor parameter recovery. In contrast, the parameters estimated by the NDT-informed CT-DDM closely match the true data-generating parameters. The only challenging parameter to estimate is the decay rate in the hyperbolic CT-DDM. As shown in Figures 3 and 4, recovering this parameter is more challenging in the hyperbolic model compared to the exponential CT-DDM.

Estimated versus ground truth parameter values for two modeling approaches.

The top two rows show parameter recovery for models estimated without additional constraints (“Uninformed”), while the bottom two rows show recovery for joint models that incorporate additional non-decision time observations (“NDT-informed”). Each point represents a recovered parameter estimate plotted against its corresponding true data-generating value. Rows correspond to different functional forms of threshold (i.e., Exponential and Hyperbolic). Dashed diagonal lines indicate perfect recovery.

Figure 5 shows the sensitivity of the parameter estimation in NDT-informed CT-DDMs to the number of trials. Generally, increasing the number of trials improves the reliability of parameter estimation. The drift rate and non-decision time parameters can be estimated precisely, even based on 100 trials in both exponential and hyperbolic models. For an accurate estimation of the decay rate in the hyperbolic model, more than 250 trials are required. However, in the exponential model, both the starting threshold and decay rate can be estimated accurately, even using 100 trials.

Illustration of the sensitivity of parameter estimation to the number of trials in the NDT-informed models.

R2 values measure the agreement between estimated and ground truth parameters of the exponential (left) and hyperbolic (right) collapsing threshold models.

Figure 6 presents the sensitivity of the parameter estimation in the NDT-informed CT-DDM to the noise level in the non-decision time observations (Zn). We simulated data with three fixed levels of standard deviation (i.e., SD = 0.3, 0.6, and 0.9 on the scale of seconds) for the non-decision time measurements. The results demonstrate that even with a high level of variability (noise) in the non-decision time measurements, parameter estimation remains reliable. Moreover, these results demonstrate that the parameter recovery of the NDT-informed CT-DDMs is better than that of the uninformed CT-DDM, even under high levels of noise in the non-decision time measurements (in Appendix 3, we also examined the effect of biased non-decision time measurements on parameter recovery).
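Generating measurements with a fixed standard deviation on the seconds scale requires choosing the log-normal shape parameter accordingly. The conversion follows from the standard log-normal moment identity SD = mean · √(exp(σ²) − 1); a small sketch (the function name is ours):

```python
import numpy as np

def shape_for_target_sd(tau, target_sd):
    """Log-normal shape parameter sigma_z that yields a measurement
    distribution with mean tau and standard deviation target_sd,
    using SD = mean * sqrt(exp(sigma_z**2) - 1)."""
    return np.sqrt(np.log(1.0 + (target_sd / tau) ** 2))
```

For example, a true non-decision time of 0.4 s and a target SD of 0.3 s yield a shape parameter of about 0.67; larger target SDs map to larger shape parameters.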

Illustration of sensitivity of parameter estimation to the noise level in the non-decision time observations.

R2 values measuring the agreement between estimated and ground truth parameters of the exponential (left) and hyperbolic (right) collapsing threshold models.

Collapsing threshold versus variability in drift rate

One of the main counterarguments against CT-DDMs is that slow error responses can also be explained by across-trial variability in drift rate (Voskuilen et al., 2016). An FT-DDM that incorporates across-trial variability in drift rate can account for slow errors, and it often achieves fitting performance comparable to that of the CT-DDM when applied to empirical data (Voskuilen et al., 2016; Smith and Ratcliff, 2022). This similarity in model performance makes it difficult to determine, based solely on model fit, whether the underlying evidence accumulation process involves a collapsing threshold or a fixed threshold with drift variability. Therefore, it is essential to investigate how parameter estimation in CT-DDMs is influenced when the true generative process is an FT-DDM with across-trial variability in drift rate. To investigate this, we conducted a cross-fitting simulation study. Specifically, we generated data from an FT-DDM with across-trial variability in drift rate and then fitted a CT-DDM without trial-to-trial variability to the simulated data. Across-trial variability in drift rate was modeled as a normal distribution with mean δ and standard deviation η (i.e., δt ∼ 𝒩(δ, η)). The following uniform distributions were used to sample the parameters of the FT-DDM:

We generated 1000 parameter sets, and for each set, we simulated 500 trials from an FT-DDM with across-trial variability in drift rate. Consistent with the previous simulation studies, we also assumed that trial-level measurements of non-decision time are available, enabling us to fit NDT-informed diffusion models. Specifically, we fit both an NDT-informed CT-DDM with a hyperbolic collapsing threshold and an NDT-informed CT-DDM with an exponential collapsing threshold to each simulated dataset.
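The generative side of this cross-fitting study can be sketched as a simple Euler simulation (an illustrative simplification with symmetric boundaries at ±a and no start-point bias; the parameter names and step size are ours):

```python
import numpy as np

def simulate_ft_ddm_trial(delta, eta, a, tau, rng, dt=0.001, s=1.0):
    """Simulate one fixed-threshold DDM trial with across-trial drift
    variability: the trial-level drift is drawn as delta_t ~ N(delta, eta),
    and noisy evidence then accumulates from 0 until it crosses +a or -a.
    Returns (choice, rt), where choice is 1 for the upper boundary and 0
    for the lower one, and rt includes the non-decision time tau."""
    delta_t = rng.normal(delta, eta)   # across-trial drift variability
    x, t = 0.0, 0.0
    noise_sd = s * np.sqrt(dt)         # within-trial diffusion noise
    while abs(x) < a:
        x += delta_t * dt + rng.normal(0.0, noise_sd)
        t += dt
    return (1 if x >= a else 0), t + tau

# Example: simulate 500 trials from one sampled parameter set
rng = np.random.default_rng(42)
trials = [simulate_ft_ddm_trial(delta=1.5, eta=0.3, a=1.0, tau=0.3, rng=rng)
          for _ in range(500)]
```

With a clearly positive mean drift, most simulated choices terminate at the upper boundary, and every RT exceeds the non-decision time.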

Figure 7 illustrates the parameter recovery results for the model misspecification study. The most important result is related to the decay rate parameter (λ). For more than 75% of the datasets, the estimated decay rate is approximately zero, and only for 5% is the estimated decay rate larger than 0.1. Moreover, these results demonstrate that both the non-decision time parameter and the variability of non-decision time observations can be recovered with high accuracy. The recovered drift rate and starting threshold parameters showed a slight underestimation for larger values. However, the R2 value remains high for these parameters (i.e., R2 > 0.85).

Estimated versus true parameter values for cross-fitting of CT-DDM on FT-DDM with across-trial variability in drift rate.

In each panel except the rightmost one, each point represents a recovered parameter estimate plotted against its corresponding true data-generating value, and the dashed diagonal line indicates perfect recovery. The right column illustrates the density of the estimated decay rate parameter (λ). Rows correspond to different functional forms of threshold (i.e., Exponential and Hyperbolic).

In sum, the simulation results confirm that incorporating non-decision time information into the diffusion model enhances the reliability of parameter estimation in CT-DDMs. Moreover, these results suggest that NDT-informed diffusion modeling can be reliably applied to infer the underlying threshold dynamics. In other words, if the underlying generating process is an FT-DDM (with or without across-trial variability), then the estimated decay rate will be very close to zero or at least very small. Therefore, we can infer the underlying threshold dynamics from the estimated decay rate within the NDT-informed diffusion modeling framework.

Applications to empirical data

We present two case studies to: (1) demonstrate the feasibility of the proposed NDT-informed diffusion modeling approach, (2) explore whether informing the diffusion model with noisy measurements of non-decision time can enhance model fitting to behavioral data, and (3) investigate whether people adjust the decision threshold within a single trial, at least in the case studies considered here (this third aim is motivated by controversial findings in the literature that did not find evidence in support of CT-DDMs; e.g., Voskuilen et al., 2016; Smith and Ratcliff, 2022; Milosavljevic et al., 2010; Karşılar et al., 2014). To analyze the behavioral data, we consider five different diffusion models. For each dataset, we fitted the NDT-informed and uninformed versions of the HCT-DDM and ECT-DDM. As a benchmark, we also fitted an FT-DDM that includes trial-to-trial variability in drift rate. We then employed the Bayesian information criterion (BIC) to evaluate the goodness of fit and to compare the models. We compared the diffusion models in two ways: with respect to the fit on behavioral data (only RT and choice) and with respect to the fit on all observations, including RT, choice, and non-decision time measurements. Thus, we defined the following two BICs for model comparison:

BICBehavior = KCT-DDM ln(NObservation) − 2 LLCT-DDM,

BICJoint = KJoint ln(NObservation) − 2 LLJoint,

in which LLCT-DDM is the log-likelihood of the (collapsing threshold) diffusion model (see Appendix 1 for the details of the likelihood approximation using the integral equation method), LLJoint is the joint log-likelihood of behavioral data and non-decision time estimates defined in equation 3, KCT-DDM is the number of free parameters in the (collapsing threshold) diffusion model, KJoint is the number of free parameters in the joint model, and NObservation is the number of observed data points. BICBehavior only considers the goodness of fit on RT and choice, while BICJoint also considers how well the model can predict the non-decision time observations. Thus, we can compare all five considered models based on BICBehavior and check whether constraining the non-decision time improves the fit to behavioral data. BICJoint, in contrast, is only used for comparing the NDT-informed diffusion models.
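Both criteria follow the standard BIC form K ln(N) − 2 LL and differ only in which log-likelihood and parameter count they use; a minimal sketch (the numeric values are hypothetical placeholders):

```python
import numpy as np

def bic(log_likelihood, n_free_params, n_observations):
    """Bayesian information criterion: K * ln(N) - 2 * LL (lower is better)."""
    return n_free_params * np.log(n_observations) - 2.0 * log_likelihood

# Hypothetical values for illustration: the behavioral BIC uses the CT-DDM
# log-likelihood, while the joint BIC uses the joint log-likelihood, which
# additionally covers the non-decision time observations.
bic_behavior = bic(log_likelihood=-850.0, n_free_params=4, n_observations=1120)
bic_joint = bic(log_likelihood=-1210.0, n_free_params=5, n_observations=1120)
```

Because the joint likelihood scores more observations per trial, the two criteria are compared only within their own class: all models on BICBehavior, and NDT-informed models on BICJoint.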

Study 1: Weindel et al. (2025) dataset

In the first study, we considered data from Weindel et al. (2025). Twenty-six participants performed a binary choice task involving contrast discrimination. On each trial, two Gabor patches were presented, and participants indicated which option had the higher contrast. The difference in contrast level between the two options was always fixed at 5%, while the absolute contrast levels varied from 5% to 95%. In total, participants completed eight experimental blocks, each containing 140 trials. At the beginning of each block, participants received a speed-accuracy instruction: in half of the blocks, they were asked to focus on decision speed, and in the other half, on accuracy. The decision difficulty was designed to follow Fechner’s law (Fechner, 1860), according to which, when the difference between options is held constant, discriminating between two options becomes harder as their overall magnitude increases. To approximate this law, the following relation between contrast level and drift rate was implemented:

in which v0 and v1 are free parameters: v0 is the baseline drift rate, and v1 modulates the sensitivity to the contrast level. The non-decision time measurements were obtained from the fit of an HMP model using the standard cumulative procedure (Weindel et al., 2024) on preprocessed EEG data with minimal trial rejection (see Weindel et al., 2025, for the detailed procedure). The HMP model estimated five cognitive events. Their associated single-trial timing and electrode contributions allowed us to interpret these events as reflecting early visual processing, perceptual encoding, attention orientation, motor planning, and decision termination. For this analysis, we subtracted the time between attention orientation (the third event) and decision termination (the last event) from the RT on each trial to yield a measure of non-decision time. See Figure 8 for the sequence of these events and the time intervals between them.
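The trial-wise subtraction described above can be sketched as follows (the array layout for the HMP event onsets is an assumption made for illustration):

```python
import numpy as np

def ndt_from_hmp_events(rt, event_times):
    """Per-trial non-decision time: RT minus the interval between the
    attention-orientation event (third of five) and decision termination
    (the last event).

    rt          : array of response times, one per trial (seconds)
    event_times : array of shape (n_trials, 5) with HMP event onsets (seconds)
    """
    rt = np.asarray(rt, dtype=float)
    event_times = np.asarray(event_times, dtype=float)
    decision_interval = event_times[:, -1] - event_times[:, 2]
    return rt - decision_interval
```

For instance, a trial with RT = 1.0 s whose third and last events occur at 0.3 s and 0.8 s yields a non-decision time of 0.5 s.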

The event sequence and timing of each event estimated by the HMP method averaged over all participants (Weindel et al., 2025).

The quantitative results revealed that the NDT-informed HCT-DDM outperforms the other models. Table 1 presents the mean estimated parameters of the computational models and the model comparison results based on BIC. The best-fitting model in both the speed and accuracy conditions is the NDT-informed HCT-DDM. These results also show that both NDT-informed models have better predictive accuracy on RT and choice than their two uninformed counterparts (i.e., based on BICBehavior). The estimated parameters in both NDT-informed models show that the mean starting threshold in the speed condition is lower than in the accuracy condition, while the mean decay rate is larger in the speed condition than in the accuracy condition. This implies that the decision threshold starts at a lower point and declines faster in the speed condition than in the accuracy condition.

The mean estimated parameters and goodness of fit results for Study 1.

Figure 9 demonstrates the estimated threshold dynamics for each individual (gray lines) and the group average (blue lines) by NDT-informed CT-DDMs (see Appendix 4 for the estimated collapsing threshold by uninformed models). The estimated threshold for each individual indicates that most of the participants employed a collapsing threshold strategy (even in the accuracy condition).

The estimated threshold dynamics from NDT-informed CT-DDMs for individuals in Study 1.

The first row illustrates the estimated hyperbolic threshold dynamics, and the second row illustrates the estimated exponential threshold dynamics. The left column shows the threshold dynamics in the speed condition, and the right column shows the accuracy condition. Each gray line corresponds to a subject. The blue line corresponds to the average group level.

Figure 10 shows the predictions of the best-fitting model (the NDT-informed HCT-DDM) and its uninformed counterpart for both the speed and accuracy conditions. Both models predict the empirical data well, but the predictions of the NDT-informed model are better than those of the uninformed model, particularly in the accuracy condition. There, the uninformed model underestimates the RT quantiles of the incorrect choices, whereas the predictions of the NDT-informed model align more closely with the empirical data.

Prediction of best-fitting models against empirical data for speed (top row) and accuracy (bottom row) conditions in Study 1.

In each panel, the x-axis represents the response time quantiles in seconds, and the y-axis represents the cumulative choice proportion.

In sum, both the quantitative and qualitative results revealed that the NDT-informed models perform better than the uninformed models in this study. In other words, constraining the CT-DDM with non-decision time improved both the model fits and predictions. In addition to improving model fitting, the results obtained in this study also support the collapsing threshold over the fixed threshold. First, the mean estimated decay rates reported in Table 1 are substantially greater than zero. Second, all the considered CT-DDMs have a lower BIC and better posterior predictions than the FT-DDM, which includes across-trial variability in drift rate (see Appendix 5 for the results of the FT-DDM). Third, the estimated threshold dynamics in Figure 9 indicate that most participants adopted a collapsing threshold. Taken together, these results highlight the advantage of the NDT-informed modeling approach and support CT-DDMs.

Study 2: Boehm et al. (2014) dataset

The dataset for the second case study is taken from Boehm et al. (2014). In this experiment, twenty-five participants performed a random-dot motion task with a speed-accuracy trade-off manipulation. Before each trial, a cue (i.e., “AC” to emphasize accuracy in the subsequent trial or “SP” to emphasize speed) informed participants how they should decide in the next trial. On each trial, participants viewed a cloud of moving dots for 1.5 s, of which a subset moved coherently in one direction while the rest moved randomly, and indicated the direction of coherent motion in a two-alternative forced-choice task. During the experiment, the coherence level (the proportion of coherently moving dots) was fixed and determined in the practice block, ensuring that individuals achieved approximately 80% accuracy.

To model this dataset, we assumed that the drift rate remains fixed throughout the experiment, while the threshold settings and non-decision time were allowed to vary across the speed and accuracy conditions (since some studies have shown that the drift rate can be affected by speed/accuracy manipulations (e.g., Ratcliff and McKoon, 2023), we also tested free-drift-rate models in which the drift rate varies across speed/accuracy conditions; the results were similar to those reported here). As in Study 1, we fitted the HCT-DDM and ECT-DDM in both NDT-informed and uninformed versions, plus an FT-DDM with across-trial variability in drift rate, to the empirical data. The non-decision time measurements were again obtained from a standard cumulative HMP fit (as reported in Section 4.4 of Weindel et al., 2024). The HMP solution identified three events in the speed-focused condition and four events in the accuracy-focused condition. The sequence and timing of these events are presented in Figure 11. The non-decision time was taken as the difference between the RT and the duration of the second-to-last stage.

The event sequence and timing of each event estimated by the HMP method for Accuracy (top) and Speed (bottom) conditions averaged over all participants (Weindel et al., 2024).

As in Study 1, the quantitative results showed that an NDT-informed model performs best. The estimated mean parameter values and the model comparison results are presented in Table 2. The BIC-based model comparison suggests that the best-fitting model is the NDT-informed ECT-DDM. Therefore, the main result of the first study, an improvement in model fit when non-decision time is constrained, was replicated in this study.

The mean estimated parameters and goodness of fit results for Study 2.

The estimated thresholds differed notably between the speed and accuracy conditions, with both the starting level and the decay rate being lower in the speed condition. The estimated threshold dynamics for each participant, along with the mean group-level threshold, are presented in Figure 12 (see Appendix 4 for the collapsing thresholds estimated by the uninformed models). As in Study 1, the estimated individual thresholds suggest that most participants adopted a collapsing threshold stopping rule, and participants exhibited a lower starting threshold in the speed condition than in the accuracy condition. However, in contrast to Study 1, the decay rate in the speed condition was smaller. This reduction in the decay rate under speed emphasis is attributable to the large effect of the speed-accuracy manipulation on the starting threshold.

The estimated threshold dynamics from NDT-informed CT-DDMs for individuals in Study 2.

The first row illustrates the estimated hyperbolic threshold dynamics, and the second row illustrates the estimated exponential threshold dynamics. The left column shows the threshold dynamics in the speed condition, and the right column shows the accuracy condition. Each gray line corresponds to a subject. The blue line corresponds to the average group level.

The predictions of the best-fitting model (the NDT-informed ECT-DDM) and its uninformed counterpart are presented in Figure 13. Although the NDT-informed ECT-DDM outperforms the uninformed ECT-DDM with respect to BIC, both models predict the empirical data about equally well. Both slightly underestimate the rate of incorrect decisions in the speed condition but otherwise align closely with the empirical data.

Prediction of best-fitting models against empirical data for speed (top row) and accuracy (bottom row) conditions in Study 2.

In each panel, the x-axis represents the response time quantiles in seconds, and the y-axis represents the cumulative choice proportion.

The results from this study replicated the main findings of the first study. The model comparison results showed that NDT-informed models fit the empirical data better than the uninformed models (in this study, NDT-informed ECT-DDM was the best-fitting model). Also, comparing these results with the results of FT-DDM reported in Appendix 5 revealed that the CT-DDMs better account for empirical data than the FT-DDM. This model comparison result, along with the high mean decay rate (reported in Table 2), supports the CT-DDMs against FT-DDMs.

General discussion

Understanding whether people adjust their decision threshold over the time course of a single decision has been a central topic in cognitive science for decades. Despite the theoretical and empirical support for CT-DDMs, the main issue limiting their application in individual differences studies is the poor parameter recovery of these models. The present study addressed a long-standing question in computational cognitive neuroscience by proposing a diffusion modeling approach for the reliable estimation of time-dependent parameters. Specifically, we introduced a joint modeling framework that constrains non-decision time using additional observations extracted from neural signals. Simulation results demonstrated that incorporating non-decision time information significantly enhances the parameter recovery of collapsing threshold dynamics, making previously unrecoverable parameters recoverable. To test the robustness of this finding, the simulation results were replicated using two theoretically motivated forms of threshold dynamics: hyperbolic and exponential. While the precision of parameter estimation varied between these functional forms, the improvement in estimation was consistent across both models. That is, regardless of the threshold functional form, using noisy non-decision time measurements to inform the model consistently improved the reliability of parameter estimation. Furthermore, the simulations showed that even under conditions of high noise in the non-decision time measurements, the parameter recovery of the NDT-informed model remained superior to that of the uninformed model. Finally, the results of the model misspecification study showed that the NDT-informed model can be used reliably to infer the underlying evidence accumulation process.

In addition to the simulation studies, we reanalyzed two datasets on perceptual decision making to demonstrate the feasibility and applicability of the proposed method in real empirical settings. In both studies, EEG recordings were available, and we used the HMP estimates from neural signals presented in the study by Weindel et al. (2024) as trial-by-trial measurements of non-decision time. The cognitive modeling results revealed three main findings. First, informing CT-DDMs with non-decision time measurements improved model fit compared to their uninformed counterparts. This improvement likely reflects more accurate parameter estimation enabled by the additional information. Second, across both datasets, CT-DDMs outperformed the FT-DDM in terms of model fit, regardless of whether the models were informed by non-decision time (see Appendix 5). This result, along with the better qualitative predictions of the CT-DDMs, supports the notion that individuals adjust their decision thresholds dynamically within a single decision. Third, the estimated mean decay rate (λ) was notably high in both datasets, especially in the NDT-informed CT-DDMs, further supporting the presence of threshold collapse, at least in the analyzed datasets. Importantly, we found this evidence in two different types of experimental paradigms. In the second study, the information available to the participant for making a decision changed over the course of a single trial, whereas in the first study, the stimulus remained static throughout the trial and the amount of information remained constant. Most studies that have found evidence for urgency have employed information-changing paradigms (Trueblood et al., 2021; Evans et al., 2020a; Evans and Hawkins, 2019; Khodadadi et al., 2017; Gluth et al., 2012, 2013), and support for collapsing thresholds in experiments with static stimuli is limited (see Olschewski et al., 2025).

The obtained computational modeling results contribute to the literature on empirical support for collapsing threshold models. The debate on whether people become more urgent as time passes is still ongoing. Although there is some empirical and theoretical support for CT-DDMs, several studies based on quantitative model comparison across experiments did not find strong support for collapsing threshold models (Voskuilen et al., 2016; Smith and Ratcliff, 2022; Milosavljevic et al., 2010; Karşılar et al., 2014) or found that threshold collapse occurs only in specific circumstances (Hawkins et al., 2015). One reason for the mixed findings in the literature is the unreliability of parameter estimation in CT-DDMs (Evans et al., 2020b; Murrow and Holmes, 2024a), along with high model mimicry in diffusion models (Khodadadi and Townsend, 2015), particularly when key parameters, such as the drift rate, vary across trials. However, our simulation results showed that NDT-informed diffusion modeling allows the model’s parameters to be estimated reliably and the underlying evidence accumulation process to be inferred. Thus, the proposed NDT-informed diffusion modeling approach can support stronger conclusions.

In addition to evidence supporting CT-DDMs, the computational modeling results also revealed that different types of speed-accuracy trade-off manipulations have distinct effects on the dynamics of the decision threshold. In Study 1, participants received verbal instructions emphasizing speed. This manipulation led to a faster decay rate and a decrease in the starting threshold. However, in Study 2, where participants faced a clear response deadline, the speed manipulation caused a decrease in the decay rate. This decrease is mostly due to the large reduction in the starting threshold caused by speed emphasis in Study 2. Task- and instruction-dependent effects of speed-accuracy trade-off manipulations have already been reported in several studies (e.g., Katsimpokis et al., 2020; Evans et al., 2019), and our results are consistent with this pattern.

One promising avenue for future research is the investigation of individual differences in the shape of threshold dynamics. In the present work, we examined two theoretically motivated forms — hyperbolic and exponential — and fitted them at the individual level. However, recent evidence suggests that threshold dynamics may vary substantially across individuals (e.g., Kira et al., 2025). Non-parametric approaches have been proposed to recover the full threshold trajectory without assuming a specific functional form (e.g., Kira et al., 2025; Fudenberg et al., 2020). Findings from Kira et al. (2025) indicate marked heterogeneity across participants. Understanding such variability could provide critical insights into why choice behavior sometimes deviates from optimality. A limitation of current non-parametric approaches, however, is that they require a vast number of trials per participant, making them impractical for studies with larger sample sizes. Combining NDT-informed diffusion modeling with non-parametric methods may help alleviate this limitation by reducing the trial requirements, thereby making individual differences studies on threshold dynamics more accessible to empirical investigation.

Would NDT-informed modeling lead to a different conclusion?

It is important to note that the NDT-informed diffusion modeling approach can sometimes yield conclusions that differ from those obtained using traditional modeling methods. For example, in Study 1, the results of the uninformed ECT-DDM suggest that the speed–accuracy manipulation does not affect the starting threshold. In contrast, both the NDT-informed ECT-DDM and NDT-informed HCT-DDM indicate that the speed–accuracy trade-off manipulation does influence the starting threshold (see Table 1). This discrepancy arises from unreliable parameter estimation in the uninformed CT-DDMs. However, the simulation results and sensitivity analyses reported in this paper demonstrate that the parameter estimates provided by the NDT-informed diffusion modeling framework are substantially more reliable. Therefore, any parameter-based inference concerning the underlying mechanisms of decision-making, the effects of experimental manipulations, or individual and group differences should be drawn using the NDT-informed diffusion modeling framework.

Generalization to other computational models

Although the primary focus of the present study was on two-alternative choice tasks, understanding urgency is also crucial in the context of multi-alternative and multi-attribute decisions (e.g., Tajima et al., 2019; Gluth et al., 2024; Busemeyer et al., 2019), as well as in decisions involving a continuous range of options (e.g., circles, lines, or arcs; Hadian Rasanan et al., 2025, Hadian Rasanan et al., 2024a). To examine the generalizability of the proposed method, we conducted a parallel parameter recovery study using the n-dimensional hyper-spherical diffusion model (Smith and Corbett, 2019; Smith, 2016). This model is suitable for modeling multi-alternative/multi-attribute choices (Kvam, 2019a; Smith and Corbett, 2019; Smith, 2019) as well as decision-making tasks with continuous responses (Kvam, 2019b; Smith et al., 2023, 2020). The results of this simulation study are presented in Appendix 6. Consistent with the findings from the one-dimensional DDM, the results show that informing the n-dimensional diffusion model with non-decision time measurements significantly improves the precision of threshold dynamics estimation (see Appendix 6). These findings suggest that the improvement in parameter recovery of the time-dependent threshold is not limited to binary choice tasks but also extends to the more general n-dimensional hyper-spherical diffusion model. This makes the proposed method applicable to a broader range of decision-making paradigms involving multi-alternative, multi-attribute, or continuous response decision tasks.

The proposed NDT-informed diffusion modeling approach can also be applied to parameter estimation in urgency gating models. Unlike CT-DDMs, urgency-gating models assume a fixed decision threshold, but allow the incoming evidence to scale as time progresses (Trueblood et al., 2021; Cisek et al., 2009). However, similar to CT-DDMs, urgency-gating models also suffer from poor parameter recovery, as reported by Evans et al. (2020b). Thus, the issue of parameter unreliability extends to these models. Recently, Smith and Ratcliff (2022) formalized the connection between CT-DDMs and urgency gating models, demonstrating that the multiplicative urgency gating model is mathematically equivalent to a hyperbolic collapsing threshold diffusion model. In other words, urgency gating models and hyperbolic CT-DDMs yield identical behavioral predictions, and their parameters can be transformed into one another. Consequently, the findings from our simulation studies on hyperbolic collapsing threshold dynamics also apply to urgency gating models (Trueblood et al., 2021; Cisek et al., 2009).

Another way to implement urgency in the decision process is to include an independent accumulator for timing. The time-based racing diffusion model (Hawkins and Heathcote, 2021) posits independent accumulators corresponding to the evidence for the available options, plus an additional time-based accumulator. This time-based accumulator behaves like an internal clock: when it reaches the threshold before any of the evidence accumulators, an option is selected at random. Similar to the CT-DDM, this model can generate Gaussian-like RT distributions and fits empirical data well. However, the authors reported poor parameter recovery for the non-decision time (Hawkins and Heathcote, 2021); thus, the reliability issue here specifically concerns the estimation of non-decision time. Informing the model with external estimates of non-decision time could therefore improve parameter recovery in this model as well.
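A minimal simulation sketch of this architecture (our illustrative parameterization, not the exact specification of Hawkins and Heathcote, 2021) runs one racing-diffusion trial with an extra timing accumulator that triggers a random choice if it finishes first:

```python
import numpy as np

rng = np.random.default_rng(3)

def timed_race_trial(drifts, timer_drift, threshold, dt=1e-3, max_steps=5000):
    """Simulate one trial of a racing diffusion with an extra timing accumulator.
    If the timing accumulator reaches the threshold first, an option is chosen
    at random (illustrative parameterization, not the original model)."""
    n = len(drifts)
    v = np.append(np.asarray(drifts, dtype=float), timer_drift)
    acc = np.zeros(n + 1)                       # last entry: timing accumulator
    for step in range(1, max_steps + 1):
        acc += v * dt + np.sqrt(dt) * rng.standard_normal(n + 1)
        crossed = np.flatnonzero(acc >= threshold)
        if crossed.size:
            winner = crossed[np.argmax(acc[crossed])]
            choice = rng.integers(n) if winner == n else winner
            return int(choice), step * dt
    return None, max_steps * dt                 # no accumulator finished in time

choice, rt = timed_race_trial(drifts=(1.5, 0.5), timer_drift=0.8, threshold=1.0)
```

Trials on which the timer wins contribute random choices with timer-governed RTs, which is what allows the model to mimic urgency-like truncation of slow responses.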

More generally, collapsing thresholds are not the only evidence accumulation mechanisms that suffer from reliability issues. Poor parameter recovery has also been reported for time-dependent drift rate models, such as diffusion models of conflict or diffusion models with leakage (White et al., 2018; Evans et al., 2020b). Although several studies have attempted to improve parameter recovery in these models (e.g., Hübner and Pelzer, 2020; Miletić et al., 2017), the problem remains unresolved. A promising direction for enhancing parameter estimation in such models is to constrain the drift rate dynamics using neural data. In particular, several studies have shown that the centro-parietal positivity (CPP) in EEG signals is closely linked to the process of evidence accumulation (e.g., O’Connell et al., 2012; Steinemann et al., 2018; Kohl et al., 2020; Kelly et al., 2021; Grogan et al., 2025). Furthermore, a recent application of HMP has shown that single-trial estimates of this potential can be extracted with reasonable accuracy (Weindel et al., 2025). Thus, informing time-dependent drift rate models with CPP dynamics may improve the identifiability and reliability of their parameter estimates.
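As a concrete illustration of the leakage mechanism these models add, a leaky (Ornstein-Uhlenbeck) accumulator with input v and leak rate k can be simulated with a simple Euler scheme (parameter values are illustrative; the mean trajectory saturates at v/k as leakage balances the input, rather than growing linearly as in the standard DDM):

```python
import numpy as np

rng = np.random.default_rng(11)
v, k = 1.0, 2.0            # input strength and leak rate (hypothetical values)
dt, n_steps, n_paths = 1e-3, 2000, 5000

# Euler scheme for the leaky accumulator dx = (v - k*x) dt + dW
x = np.zeros(n_paths)
for _ in range(n_steps):
    x += (v - k * x) * dt + np.sqrt(dt) * rng.standard_normal(n_paths)

# With leakage, the mean trajectory approaches v/k instead of growing without bound
mean_final = x.mean()      # analytically: (v/k) * (1 - exp(-k*T)), with T = 2 s
```

The coupling between v and k visible in the mean trajectory (only their ratio determines the asymptote) hints at why such models recover poorly from behavior alone.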

Does only non-decision time improve parameter estimation?

In this work, we focused on constraining the non-decision time and provided both theoretical justification and simulation evidence demonstrating that constraining non-decision time in the diffusion model can enhance the reliability of parameter estimation in CT-DDMs. However, it remains an open question whether constraining other parameters — such as the drift rate — can similarly improve parameter estimation in CT-DDMs. To address this, it is essential to note that, from a mathematical standpoint, assuming a fixed diffusion coefficient, there is no trade-off between drift rate and threshold parameters (Nunez et al., 2025; Ratcliff, 1978). Therefore, constraining the drift rate using neural signals would not improve the estimation of collapsing threshold parameters.

Another neurophysiological signal that may aid in improving parameter estimation of CT-DDMs is pupil dilation. Several studies have demonstrated a correlation between pupil dilation and decision threshold (e.g., Murphy et al., 2016; Cavanagh et al., 2014), suggesting that the pupil dilation time series may contain valuable information about threshold dynamics. However, this approach presents certain challenges. Notably, accurately identifying the time window corresponding to the decision process using only eye-tracking data is difficult. To overcome this limitation, additional sources of information — such as EEG signals — may be necessary. As a result, directly constraining threshold dynamics is more complex than constraining non-decision time. Nevertheless, the potential advantages of incorporating pupil dilation into joint modeling approaches warrant further investigation in future research.

Bias in non-decision time measurement

One of the main assumptions of NDT-informed diffusion modeling is that the true non-decision time equals the mean of the non-decision time measurement distribution. However, this assumption may not hold in all situations: there might be a systematic bias (i.e., overestimation or underestimation) in the non-decision time measurements. To investigate the effect of such biases on parameter estimation, we conducted a model misspecification study in which the true non-decision time parameter deviates from the mean of the non-decision time measurements (see Appendix 3). The results showed that an overestimation bias in non-decision time measurements leads to underestimation of the starting threshold and decay rate parameters (while the estimates remain highly correlated with the true parameters). In contrast, an underestimation bias in non-decision time measurements leads to overestimation of the starting threshold and decay rate parameters.

Behavioral methods for estimating non-decision time

Although the focus of the current study was on extracting non-decision time from neural signal data using the HMP method, it is worth noting that some alternative approaches do not require neural data. Historically, disentangling decision time and non-decision time has been a long-standing question in mathematical psychology and psychophysics (e.g., Green and Luce, 1971; Kohfeld et al., 1981; Burbeck and Luce, 1982; Smith, 1990; Sheu and Ratcliff, 1995; Verdonck and Tuerlinckx, 2016; Bompas et al., 2025). These attempts resulted in the development of convolution-based methods. The core idea behind these methods is that the response time distribution can be represented as the convolution of the decision time distribution predicted by the diffusion model and the non-decision time distribution (Smith, 1990; Verdonck and Tuerlinckx, 2016). One way of using this relation is to deconvolve the non-decision time distribution from the observed RT distribution, yielding the decision time distribution (Smith, 1990). A simple one-choice reaction task (i.e., responding as soon as the stimulus appears on the screen) with stimuli identical to those presented in the main task provides an estimate of the non-decision time distribution: response times in such tasks include only perceptual encoding and motor execution, excluding the evidence accumulation process, and thus approximate the non-decision time in the main task. Deconvolving this non-decision time distribution from the response time distribution of the main task then provides an approximation of the decision time distribution in the main task. This approach, however, requires running an additional task and has some practical issues (Sheu and Ratcliff, 1995).
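The convolution relation underlying these methods can be illustrated numerically: when decision times and non-decision times are sampled independently, the density of their sum matches the discrete convolution of the two component densities. The component distributions and parameter values below are purely illustrative:

```python
import numpy as np

rng = np.random.default_rng(2)

# Illustrative component distributions (not fitted to any data)
dt_samples = rng.gamma(shape=2.0, scale=0.15, size=20000)   # decision times (s)
ndt_samples = rng.normal(0.30, 0.02, size=20000)            # non-decision times (s)
rt_samples = dt_samples + ndt_samples                        # observed RTs

# Discretize both densities on a common grid; the RT density is their convolution
h = 0.01
grid = np.arange(0.0, 3.0 + h, h)
dt_pdf, _ = np.histogram(dt_samples, bins=grid, density=True)
ndt_pdf, _ = np.histogram(ndt_samples, bins=grid, density=True)
rt_pdf = np.convolve(dt_pdf, ndt_pdf) * h                    # discrete convolution

# The mean of the convolved density should match the mean observed RT
t_conv = h * np.arange(rt_pdf.size) + h                      # approximate bin centers
mean_conv = np.sum(t_conv * rt_pdf) * h
```

Deconvolution runs this relation in reverse (dividing in the Fourier domain rather than multiplying), which is what makes it numerically delicate in practice.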

To address these issues, Verdonck and Tuerlinckx (2016) proposed an alternative convolution-based approach that can estimate the distribution of non-decision time entirely based on observed data in a single experiment. They estimate the non-decision time distribution by convolving the observed RT distribution with the predicted distribution from the diffusion model in different conditions and minimizing the distance between the resulting distributions. Their results showed that their proposed method can accurately estimate the distribution of non-decision times and cancel out the effect of non-decision time on RT. Recently, Kira et al. (2025) have employed a similar approach to cancel out the impact of non-decision time on RT and then estimate the entire threshold dynamics using a non-parametric approach.

Conclusion

The question of whether people become more urgent as time passes is a long-standing one in cognitive psychology. In this work, we introduced an NDT-informed diffusion modeling framework that significantly improves the reliability of parameter estimation in CT-DDMs. This approach enables researchers to draw inferences about the underlying cognitive mechanisms of the decision process from parameter values and to use CT-DDMs as measurement tools for investigating individual differences. Based on the proposed NDT-informed diffusion modeling framework, we leveraged EEG estimates of non-decision time and reanalyzed two empirical datasets. We found evidence for CT-DDMs in both studies, which implies that people become more urgent as time passes.

Data availability

All code and data supporting this paper are publicly available at: https://github.com/AmirHoseinHadian/CTDM_recovery

Additional information

Funding

AHHR and JR are supported by the Swiss National Science Foundation (Grant No. 214099). GW is supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement (No. 101066503).

Funding

Swiss National Science Foundation (SNF) (214099)

  • Amir Hosein Hadian Rasanan

  • Jörg Rieskamp

European Union Horizon 2020

https://doi.org/10.3030/101066503

  • Gabriel Weindel

Acknowledgements

The authors would like to thank Nathan J. Evans, Sebastian Gluth, and Amin Ghaderi-Kangavari for their insightful comments on the early version of this work.

Appendix 1

Model fitting using the integral equation method

Since diffusion models with time-dependent thresholds do not have an exact likelihood function, we need to approximate it using a numerical method. The integral equation method provides a reliable numerical procedure for approximating the first-passage time distribution of diffusion models with time-dependent parameters (Smith, 2000; Smith and Ratcliff, 2022). This method was first introduced by Buonocore et al. (1987; 1990) and has since been employed frequently in psychological research (e.g., Smith, 2000; Voskuilen et al., 2016; Smith and Ratcliff, 2022; Evans et al., 2020a; Hadian Rasanan et al., 2025; Smith, 2023; Zhang et al., 2014). In this method, the first-passage time distributions of the upper and lower boundaries are obtained by solving the following system of linear integral equations (in mathematics, these are known as Volterra integral equations of the second kind; Wazwaz, 2011):

$$g_u[b_u(t), t \mid z, 0] = -2\,\Psi[b_u(t), t \mid z, 0] + 2\int_0^t \Big( g_u[b_u(\tau), \tau \mid z, 0]\,\Psi[b_u(t), t \mid b_u(\tau), \tau] + g_l[b_l(\tau), \tau \mid z, 0]\,\Psi[b_u(t), t \mid b_l(\tau), \tau] \Big)\,\mathrm{d}\tau$$

and

$$g_l[b_l(t), t \mid z, 0] = 2\,\Psi[b_l(t), t \mid z, 0] - 2\int_0^t \Big( g_u[b_u(\tau), \tau \mid z, 0]\,\Psi[b_l(t), t \mid b_u(\tau), \tau] + g_l[b_l(\tau), \tau \mid z, 0]\,\Psi[b_l(t), t \mid b_l(\tau), \tau] \Big)\,\mathrm{d}\tau$$

where gu[bu(t), t|z, 0] and gl[bl(t), t|z, 0] represent the first-passage time distributions of crossing the upper and lower time-dependent thresholds, respectively. Also, Ψ is the kernel function and is defined as follows (Giorno et al., 1989; Richter et al., 2023):

$$\Psi[b(t), t \mid y, \tau] = \frac{1}{2}\, f[b(t), t \mid y, \tau] \left[ \frac{\mathrm{d}b(t)}{\mathrm{d}t} - \delta - \frac{b(t) - y - \delta(t - \tau)}{t - \tau} \right]$$

In this equation, f [x, t|y, τ] represents the free transition density function. For the one-dimensional diffusion process with drift rate δ (and unit diffusion coefficient), the free transition density function is defined as follows (Giorno et al., 1989; Richter et al., 2023):

$$f[x, t \mid y, \tau] = \frac{1}{\sqrt{2\pi (t - \tau)}} \exp\left[ -\frac{\big(x - y - \delta (t - \tau)\big)^2}{2 (t - \tau)} \right]$$

To approximate the solution of this integral equation, various numerical methods are available in the literature (see Chapter 3 in Wazwaz, 2011). However, the left-point rectangular scheme is the most straightforward and efficient numerical approximation scheme, offering relatively high precision in likelihood approximation (Richter et al., 2023). By discretizing the time interval [0, Tmax] into N equidistant points with step size Δt (where Tmax was set equal to the longest observed RT), the approximation of the first-passage time densities at the first time step t1 = Δt is obtained as follows (Buonocore et al., 1990; Smith, 2000):

$$g_u[b_u(t_1), t_1 \mid z, 0] \approx -2\,\Psi[b_u(t_1), t_1 \mid z, 0], \qquad g_l[b_l(t_1), t_1 \mid z, 0] \approx 2\,\Psi[b_l(t_1), t_1 \mid z, 0]$$

Also, for the subsequent time steps (i.e., ti = iΔt for i = 2, 3, …, N) we can use the following approximation scheme (Buonocore et al., 1990; Smith, 2000):

$$g_u[b_u(t_i), t_i \mid z, 0] \approx -2\,\Psi[b_u(t_i), t_i \mid z, 0] + 2\Delta t \sum_{j=1}^{i-1} \Big( g_u[b_u(t_j), t_j \mid z, 0]\,\Psi[b_u(t_i), t_i \mid b_u(t_j), t_j] + g_l[b_l(t_j), t_j \mid z, 0]\,\Psi[b_u(t_i), t_i \mid b_l(t_j), t_j] \Big)$$

$$g_l[b_l(t_i), t_i \mid z, 0] \approx 2\,\Psi[b_l(t_i), t_i \mid z, 0] - 2\Delta t \sum_{j=1}^{i-1} \Big( g_u[b_u(t_j), t_j \mid z, 0]\,\Psi[b_l(t_i), t_i \mid b_u(t_j), t_j] + g_l[b_l(t_j), t_j \mid z, 0]\,\Psi[b_l(t_i), t_i \mid b_l(t_j), t_j] \Big)$$

After approximating the first-passage time distributions corresponding to the upper and lower thresholds, the negative log-likelihood can be estimated using:

$$-\log \mathcal{L} = -\sum_{i=1}^{N_u} \log g_u\big[b_u(RT_i^u - \tau),\, RT_i^u - \tau \mid z, 0\big] - \sum_{i=1}^{N_l} \log g_l\big[b_l(RT_i^l - \tau),\, RT_i^l - \tau \mid z, 0\big]$$

where RT_i^u and RT_i^l represent the response times that ended at the upper and lower thresholds, respectively. Moreover, Nu is the number of trials in which the process stopped at the upper threshold, and similarly, Nl is the number of trials in which the process terminated at the lower threshold.
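The discretized scheme described in this appendix can be sketched compactly in NumPy. The implementation below is our own illustration (function and variable names are ours), assuming a Wiener process with drift μ and unit diffusion coefficient; the thresholds and their derivatives are passed in as callables:

```python
import numpy as np

def free_density(x, t, y, tau, mu):
    """Transition density of an unrestricted Wiener process with drift mu."""
    d = t - tau
    return np.exp(-(x - y - mu * d) ** 2 / (2.0 * d)) / np.sqrt(2.0 * np.pi * d)

def kernel(b_t, db_t, t, y, tau, mu):
    """Kernel Psi of the Volterra system for the Wiener process."""
    d = t - tau
    f = free_density(b_t, t, y, tau, mu)
    return 0.5 * f * (db_t - mu - (b_t - y - mu * d) / d)

def fpt_densities(b_u, b_l, db_u, db_l, z, mu, dt, n_steps):
    """Rectangular-rule approximation of the first-passage time densities
    g_u, g_l for time-dependent upper/lower thresholds b_u(t), b_l(t)."""
    t = dt * np.arange(1, n_steps + 1)
    bu, bl, dbu, dbl = b_u(t), b_l(t), db_u(t), db_l(t)
    g_u, g_l = np.zeros(n_steps), np.zeros(n_steps)
    g_u[0] = -2.0 * kernel(bu[0], dbu[0], t[0], z, 0.0, mu)
    g_l[0] = 2.0 * kernel(bl[0], dbl[0], t[0], z, 0.0, mu)
    for i in range(1, n_steps):
        j = np.arange(i)  # earlier grid points, all strictly before t[i]
        g_u[i] = (-2.0 * kernel(bu[i], dbu[i], t[i], z, 0.0, mu)
                  + 2.0 * dt * np.sum(g_u[j] * kernel(bu[i], dbu[i], t[i], bu[j], t[j], mu)
                                      + g_l[j] * kernel(bu[i], dbu[i], t[i], bl[j], t[j], mu)))
        g_l[i] = (2.0 * kernel(bl[i], dbl[i], t[i], z, 0.0, mu)
                  - 2.0 * dt * np.sum(g_u[j] * kernel(bl[i], dbl[i], t[i], bu[j], t[j], mu)
                                      + g_l[j] * kernel(bl[i], dbl[i], t[i], bl[j], t[j], mu)))
    return t, g_u, g_l
```

As a sanity check, for constant boundaries at ±1, starting point z = 0, and drift 0.5, the integrated upper density recovers the analytic choice probability 1/(1 + e⁻¹) ≈ 0.73, and the two densities together integrate to approximately 1.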

Appendix 2

Sensitivity to estimation method

Sometimes, problems in parameter recovery are specific to a particular estimation method and may not occur with others. To ensure the robustness of our results, we conducted the same simulation study using an alternative estimation approach – Amortized Bayesian Inference (ABI; Radev et al., 2020; Bürkner et al., 2023). ABI is a modern inference technique that leverages neural networks to approximate posterior distributions efficiently. Instead of performing Bayesian inference from scratch for each new dataset, ABI “amortizes” the computational cost by learning a mapping from observed data to posterior distributions across many simulated datasets.
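The amortization idea can be conveyed with a deliberately simple stand-in that replaces the neural networks with a linear least-squares map on a toy estimation problem. This is an illustration of the offline/online split only, not the BayesFlow pipeline used in our study; the toy model and all values are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(1)

# Offline ("amortization") phase: simulate many (parameter, data) pairs and
# learn a mapping from data summaries to parameters.
# Toy model: x ~ Normal(mu, 1) with 100 trials per dataset.
n_sims, n_trials = 5000, 100
mu_true = rng.uniform(-3.0, 3.0, size=n_sims)            # draws from the prior
data = rng.normal(mu_true[:, None], 1.0, (n_sims, n_trials))
summaries = np.column_stack([data.mean(axis=1), np.ones(n_sims)])

# A linear least-squares map stands in for the trained inference network
weights, *_ = np.linalg.lstsq(summaries, mu_true, rcond=None)

# Online phase: inference on any new dataset is a single matrix product
new_data = rng.normal(1.5, 1.0, size=n_trials)
mu_hat = np.array([new_data.mean(), 1.0]) @ weights
```

Once the offline cost is paid, estimates for any new dataset are nearly instantaneous, which is what makes large simulation studies with thousands of synthetic datasets feasible.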

For this purpose, we utilized the BayesFlow Python library (Radev et al., 2023b), which provides neural network architectures tailored for ABI. Our inference pipeline consists of two main components: a set transformer that encodes the trial-wise data into maximally informative summary statistics, and an invertible neural network (the inference network) that takes these summaries, along with the generative parameters, and learns to approximate the posterior distribution. We trained four separate neural approximators, corresponding to each of the models examined: NDT-informed ECT-DDM, NDT-informed HCT-DDM, uninformed ECT-DDM, and uninformed HCT-DDM. Each network was trained on 2,400,000 simulated datasets, with each dataset containing 500 trials and parameters sampled from prior distributions that matched those used in the main simulation study. Further implementation details, including network architectures and hyperparameters, are available in our publicly accessible GitHub repository: https://github.com/AmirHoseinHadian/CTDM_recovery

Figure 1 displays the R² values for the threshold parameters (i.e., θ and λ) under both the uninformed and NDT-informed models. Applying constraints on non-decision time notably enhances parameter recovery accuracy, with the greatest improvement observed in the hyperbolic collapsing threshold model. Compared to the integral equation estimation method, parameter recovery for the uninformed models was significantly better – again, especially for the hyperbolic collapsing boundary. In contrast, R² values for the NDT-informed models were comparable across both methods. For completeness, Figure 2 presents the full parameter recovery results for all four models, plotting the estimated parameters against the true generating parameters.

R² values measuring the agreement between estimated and ground truth threshold parameters in exponential (left) and hyperbolic (right) collapsing boundary models.

Estimated vs. true parameter values for two modeling approaches.

The top two rows show parameter recovery for models estimated without additional constraints (“Uninformed”), while the bottom two rows show recovery for joint models that incorporate additional non-decision time observations (“NDT informed”). Each point represents a recovered parameter estimate plotted against its corresponding true data-generating value. Rows correspond to different functional forms (Exponential and Hyperbolic). Dashed diagonal lines indicate perfect recovery.

Appendix 3

Bias in non-decision time measurements

To investigate the impact of bias in non-decision time measurement on parameter estimation of NDT-informed CT-DDMs, we conducted a simulation study. The simulation setup for data generation was identical to the one considered in the main text, except that the mean of non-decision time measurements (Zn) distribution deviated from the true non-decision time parameter (τ). To generate non-decision time measurements, we considered the following biased distribution:

where ε represents the bias on the scale of milliseconds. We considered six levels of measurement bias (i.e., −60, −40, −20, +20, +40, and +60 milliseconds), covering systematic biases ranging from a 60-millisecond underestimation to a 60-millisecond overestimation. Note that these values are not maximum deviations; rather, each represents a systematic bias added on top of the variability in the non-decision time measurements.
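For concreteness, biased non-decision time measurements of this kind can be generated as in the sketch below, assuming (purely for illustration) a normal measurement distribution; all numerical values are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

tau = 0.350       # true non-decision time parameter (s), hypothetical
eps = -0.040      # systematic bias (s): here a 40 ms underestimation
sigma_z = 0.030   # measurement variability (s), hypothetical

# Biased measurements: centered on tau + eps rather than on the true tau
z = rng.normal(tau + eps, sigma_z, size=500)
```

Fitting an NDT-informed model to such measurements anchors the estimated non-decision time near tau + eps, which is what pushes the threshold parameters in the opposite direction.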

Figure 1 illustrates the effect of measurement bias on parameter estimation in terms of R² values. The results suggest that the threshold parameters are sensitive to measurement bias, although this sensitivity is smaller for the exponential model. At first glance, this figure may suggest that the recovery of threshold parameters becomes increasingly unreliable as the bias increases. However, these low R² values reflect a systematic bias in the parameter estimates, induced by the bias in the non-decision time measurements, rather than unreliable estimation.

R² values measuring the agreement between estimated and ground truth parameters in exponential (left) and hyperbolic (right) collapsing threshold diffusion models as a function of bias in non-decision time measurements.

To illustrate this, Figure 2 shows the correlation between the true and estimated parameters in the presence of bias in non-decision time measurement. The obtained correlation values are very high, suggesting that the poor R² values reported in Figure 1 mostly reflect bias in the parameter estimates rather than variability or unreliability in the estimation.

The correlation values measuring the agreement between estimated and ground truth parameters in exponential (left) and hyperbolic (right) collapsing threshold diffusion models as a function of bias in non-decision time measurements.

Figure 3 presents the effect of bias in non-decision time measurements on the threshold parameters. Underestimation of non-decision time leads to overestimation of the starting threshold and decay rate. Conversely, overestimation of non-decision time leads to underestimation of both the starting threshold and the decay rate. The effect of bias is larger on the starting threshold than on the decay rate. Moreover, the exponential model was less sensitive to bias in non-decision time measurements than the hyperbolic model.

The effect of bias in non-decision time measurements on starting threshold (upper panels) and decay rate (lower panels) parameters for exponential (left panels) and hyperbolic (right panels) collapsing threshold diffusion models.

Appendix 4

Threshold estimation for uninformed CT-DDMs

This appendix presents the estimated threshold dynamics for uninformed CT-DDMs. Figure 1 and Figure 2 illustrate the estimated thresholds for each individual in Study 1 and Study 2, respectively. Similar to the results obtained for NDT-informed CT-DDMs, we observed that the decision threshold for most participants collapses substantially over time.

The estimated threshold dynamics from uninformed CT-DDMs for individuals in Study 1.

The first row illustrates the estimated hyperbolic threshold dynamics, and the second row illustrates the estimated exponential threshold dynamics. The left column shows the threshold dynamics in the speed condition, and the right column shows the accuracy condition. Each gray line corresponds to a subject. The blue line corresponds to the average group level.

The estimated threshold dynamics from uninformed CT-DDMs for individuals in Study 2.

The first row illustrates the estimated hyperbolic threshold dynamics, and the second row illustrates the estimated exponential threshold dynamics. The left column shows the threshold dynamics in the speed condition, and the right column shows the accuracy condition. Each gray line corresponds to a subject. The blue line corresponds to the average group level.

Appendix 5

Fixed-threshold diffusion model results

Study 1

Figure 1 presents the predictions of the uninformed FT-DDM, which includes trial-to-trial variability in drift rate. This model overestimates RT quantiles in the speed condition, particularly the later response time quantiles for both correct and incorrect responses. However, the predictions of the FT-DDM align more closely with the observed empirical data in the accuracy condition. Fitting this model to the speed condition yielded BIC_Behavior = 5817.66, and to the accuracy condition BIC_Behavior = 16577.20. These BIC values are much larger than those of all NDT-informed and uninformed CT-DDMs considered in the main text, implying that all CT-DDMs outperform the FT-DDM even though the latter accounts for across-trial variability in drift rate.

Posterior prediction of the uninformed FT-DDM against empirical data for the speed (left panel) and accuracy (right panel) conditions in Study 1.

In each panel, the x-axis represents the response time quantiles in seconds, and the y-axis represents the cumulative choice proportion.

Study 2

Similarly, the predictions of the FT-DDM with across-trial variability in drift rate are presented in Figure 2. The FT-DDM overestimates the RT quantiles for both correct and incorrect responses in both the speed and accuracy conditions. Moreover, this model underestimates the proportion of incorrect responses in the speed condition. The model fitting yielded BIC_Behavior = 5855.32, indicating that the CT-DDMs considered for Study 2 in the main text fit better than this model.

Posterior prediction of the uninformed FT-DDM against empirical data for the speed (left panel) and accuracy (right panel) conditions in Study 2.

In each panel, the x-axis represents the response time quantiles in seconds, and the y-axis represents the cumulative choice proportion.

Appendix 6

Application to N-dimensional diffusion models

The hyper-spherical diffusion model (HSDM; Smith, 2016; Smith and Corbett, 2019) is the n-dimensional extension of the well-known DDM (Ratcliff, 1978; Ratcliff and Rouder, 1998). The HSDM is an important model to consider because it is applicable to modeling decisions with continuous options (e.g., Smith et al., 2020; Smith et al., 2023), multi-alternative decisions (e.g., Kvam, 2019a; Smith and Corbett, 2019), and multi-attribute decisions (e.g., Smith, 2019; Smith and Corbett, 2019); thus, it covers a vast range of choice problems (see also Hadian Rasanan et al., 2024a). Recently, Hadian Rasanan et al. (2025) extended the collapsing threshold version of the HSDM and proposed an integral equation method for parameter estimation of this model. The evidence accumulation process in this model can be represented by an n-dimensional vector-valued Wiener process as follows:

Hadian Rasanan et al. (2025) showed that the squared distance of an n-dimensional zero-drift process (i.e., vi = 0 for i = 1, …, n) from the origin (i.e., ) satisfies the following one-dimensional Feller-type stochastic differential equation (see also Göing-Jaeschke and Yor, 2003; Kersting et al., 2023):

In this equation, n (the number of dimensions) is the drift rate, and is the diffusion coefficient. The authors showed that the first-passage time distribution of an n-dimensional zero-drift process starting from y0 in the presence of a time-dependent threshold b(t) satisfies the following integral equation (Hadian Rasanan et al., 2025):

where the kernel function Ψ is defined as follows (Giorno et al., 1989; Hadian Rasanan et al., 2025):

in which S(t) = b²(t), and . Also, I_ς(z) represents the modified Bessel function of the first kind of order ς, defined as follows (Bell, 2004):

Similar to the one-dimensional CT-DDM, we need to approximate the solution of this integral equation to estimate the first-passage time distribution of the zero-drift process (see Appendix 1). For this purpose, we employed the following discretized scheme, which is identical to the approximation scheme employed for the one-dimensional DDM (Buonocore et al., 1987; Hadian Rasanan et al., 2025):

After estimating the first-passage time distribution of the zero-drift process, we apply the Girsanov change-of-measure theorem (Girsanov, 1960) to obtain the joint distribution of choice and RT for the non-zero drift process (Smith, 2016; Hadian Rasanan et al., 2025). Therefore, the joint distribution of RT and choice can be obtained as follows:

in which v = [v1, …, vn] is the drift vector, φi (i = 1, …, n− 1) are the response angles with respect to the i-th axis, and X(RT − τ) denotes the final position of the accumulator in an n-dimensional space. The following Cartesian representation can be utilized for calculating the elements of the final stopping point X(RT − τ):

  • X1(RT − τ) = b(RT − τ) cos(φ1),

  • X2(RT − τ) = b(RT − τ) sin(φ1) cos(φ2),

  • Xn−1(RT − τ) = b(RT − τ) sin(φ1) sin(φ2) … sin(φn−2) cos(φn−1),

  • Xn(RT − τ) = b(RT − τ) sin(φ1) sin(φ2) … sin(φn−2) sin(φn−1).
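The Cartesian representation above can be written as a small helper function (a generic hyper-spherical coordinate transform; the function name is ours), which also makes it easy to check that the final stopping point always lies exactly on the threshold, i.e., |X(RT − τ)| = b(RT − τ):

```python
import numpy as np

def hypersphere_point(radius, phis):
    """Cartesian coordinates of a point at distance `radius` from the origin,
    given the n-1 response angles phi_1, ..., phi_{n-1}."""
    n = len(phis) + 1
    x = np.empty(n)
    sin_prod = 1.0   # running product sin(phi_1) * ... * sin(phi_{i-1})
    for i, phi in enumerate(phis):
        x[i] = radius * sin_prod * np.cos(phi)
        sin_prod *= np.sin(phi)
    x[-1] = radius * sin_prod
    return x
```

The sum of squared components telescopes to radius² for any angles, mirroring the fact that the accumulator terminates on the (collapsing) hyper-spherical threshold.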

To assess the generalizability of the proposed NDT-informed diffusion modeling, we conducted a parameter recovery study based on the two-dimensional circular diffusion model (Smith, 2016; Hadian Rasanan et al., 2025). Similar to the simulation study reported in the main text, we considered NDT-informed and uninformed collapsing threshold circular diffusion models (CT-CDM) with hyperbolic and exponential collapsing dynamics. We sampled 1000 random parameter sets from the following uniform distributions and then generated 500 trials from CT-CDM for each parameter set:

Figures 1 and 2 present the simulation results. Similar to the one-dimensional DDM, we also observe that constraining the non-decision time significantly improves parameter recovery of the two-dimensional CT-CDM.

R² values measuring the agreement between estimated and ground truth threshold parameters in exponential (left) and hyperbolic (right) collapsing boundary circular diffusion models.

Estimated vs. true parameter values for two modeling approaches based on the CT-CDM.

The top two rows show parameter recovery for models estimated without additional constraints (“Uninformed”), while the bottom two rows show recovery for joint models that incorporate additional non-decision time observations (“NDT-informed”). Each point represents a recovered parameter estimate plotted against its corresponding true data-generating value. Rows correspond to different functional forms (Exponential and Hyperbolic). Dashed diagonal lines indicate perfect recovery.