Visual attention modulates the integration of goal-relevant evidence and not value
Abstract
When choosing between options, such as food items presented in plain view, people tend to choose the option they spend longer looking at. The prevailing interpretation is that visual attention increases value. However, in previous studies, ‘value’ was coupled to a behavioural goal, since subjects had to choose the item they preferred. This makes it impossible to discern if visual attention has an effect on value, or, instead, if attention modulates the information most relevant for the goal of the decision-maker. Here, we present the results of two independent studies—a perceptual and a value-based task—that allow us to decouple value from goal-relevant information using specific task-framing. Combining psychophysics with computational modelling, we show that, contrary to the current interpretation, attention does not boost value, but instead it modulates goal-relevant information. This work provides a novel and more general mechanism by which attention interacts with choice.
Introduction
How is value constructed and what is the role played by visual attention in choice? Despite their centrality to the understanding of human decision-making, these remain unanswered questions. Attention is thought to play a central role, prioritising and enhancing which information is accessed during the decision-making process. How attention interacts with value-based choice has been investigated in psychology and neuroscience (Krajbich et al., 2010; Krajbich and Rangel, 2011; Cavanagh et al., 2014; Polanía et al., 2014; Gluth et al., 2015; Gluth et al., 2020; Folke et al., 2017; Tavares et al., 2017; Glickman et al., 2018; Gluth et al., 2018; Thomas et al., 2019) and this question is at the core of the theory of rational inattention in economics (Sims, 2003; Sims, 2010; Caplin and Dean, 2015; Hébert and Woodford, 2017).
In this context, robust empirical evidence has shown that people tend to look for longer at the options with higher values (Anderson et al., 2011; Gluth et al., 2018; Gluth et al., 2020) and that they tend to choose the option they pay more visual attention to (Krajbich et al., 2010; Krajbich and Rangel, 2011; Folke et al., 2017; Cavanagh et al., 2014; Thomas et al., 2019). The most common interpretation is that attention is allocated to items based on their value and that looking at or attending to an option boosts its value, either by amplifying it (Krajbich et al., 2010; Krajbich and Rangel, 2011; Smith and Krajbich, 2019) or by shifting it upwards by a constant amount (Cavanagh et al., 2014). This intuition has been elegantly formalised using models of sequential sampling, in particular the attentional drift diffusion model (aDDM), which considers that visual attention boosts the drift rate of the stochastic accumulation processes (Krajbich et al., 2010). More recently, this same model has also been used to study the role of attention in the accumulation of perceptual information (Tavares et al., 2017). These lines of investigation have been extremely fruitful, as they have provided an elegant algorithmic description of the interplay between attention and choice.
As a consequence of this development, the predominant assumption in the field of neuroeconomics has become that attention operates over the value of the alternatives (Smith and Krajbich, 2019). However, this view overlooks the fact that in the majority of these studies, value is coupled to the agents’ behavioural goal, that is, participants had to choose the item they found more rewarding. Some recent studies have called this assumption into question and have hinted at a flexible role of attention in sampling goal-relevant options (Kovach et al., 2014; Glickman et al., 2018). Furthermore, recent developments have shown that the ‘value networks’ in the brain could be tracking not purely reward value, but actually goal-congruent information (Frömer et al., 2019; Suri et al., 2020). Considering all this, our study aims to understand in more detail the role of goals in visual attention during both value-based and perceptual decisions: we aim to test the hypothesis that attention acts in a flexible way upon the accumulation of goal-relevant information and to examine the effects on the mechanism of preference formation and confidence.
Our experimental design decouples reward value from choice by means of a simple task-framing manipulation. In the main eye-tracking part of our value-based experiment, participants were asked to choose between different pairs of snacks. We used two frame manipulations: like and dislike. In the like frame, they had to indicate which snack they would like to consume at the end of the experiment; this is consistent with the standard tasks used in value-based decision studies. But in the dislike frame, subjects had to indicate the snack that they would prefer not to eat, equivalent to choosing the other option. Crucially, in the latter frame value is distinct from the behavioural goal of which item to select. In fact, in the dislike frame participants need to consider the ‘anti-value’ of the item to choose the one to reject.
To anticipate our results, in the like frame condition we replicated the typical gaze-boosting effect: participants looked for longer at the item they were about to choose – the item they deemed most valuable. In the dislike frame, however, participants looked for longer at the item that they then chose to eliminate, that is, the least valuable item. This means that agents paid more attention to the option they selected in the task, not to the option they deemed more valuable or wanted to consume. This suggests that attention does not boost value but rather is used to gather task-relevant information.
In order to understand the mechanism via which attention interacts with value in both framings, we use a dynamic accumulation model, which allows us to account for the preference formation process and its dependency on task variables (values of the options). We also show how goal-relevance shapes confidence and how confidence interacts with attention.
To test the generality of our findings, we also conducted a new perceptual decision-making experiment and tested a new set of participants. In this perceptual task, participants were asked to choose between two circles filled with dots. In some blocks, they had to indicate the circle with more dots – most frame; in others, the circle with fewer dots – fewest frame. In this second study, we replicated all the effects of the first, value-based one, corroborating the hypothesis of a domain-general role for attention in modulating goal-relevant information that drives choice.
This work questions the dominant view in neuroeconomics about the relationship between attention and value, showing that attention does not boost value per se but instead modulates goal-relevant information. We conclude our work by presenting an economic model of optimal evidence accumulation. Using this model, we suggest that the behavioural strategy we observe in our experiment may be the result of deploying, in the context of binary choice, a behavioural strategy that is optimal when agents face more natural larger sets of options.
Results
In our first experiment, hungry participants (n = 31) made binary choices between snacks in one of two task-frames, like and dislike. In the like frame, participants had to report the item they would prefer to eat; in the dislike frame, they chose the item they wanted to avoid eating (Figure 1A). After each choice, participants reported their confidence in having made a good choice (De Martino et al., 2013; Folke et al., 2017). At the beginning of the experiment, participants reported the subjective value of individual items using a standard incentive-compatible Becker-DeGroot-Marschak mechanism (BDM; see Materials and methods).
Our second experiment was done to test whether the results observed in value-based decisions could be generalised to perceptual decisions. A different group of participants (n = 32) made binary choices between two circles containing a variable number of dots (Figure 1D). In the most frame, participants reported the circle containing the higher number of dots; in the fewest frame, the one with the lower. As in the Value Experiment, at the end of each trial participants reported their confidence in their choice.
The effect of attention on choice
Value experiment
Our results confirmed that participants understood the task and chose higher value items in the like frame and lower value items in the dislike frame (Figure 1B,C). This effect was modulated by confidence (Figure 1B) similarly to previous studies (De Martino et al., 2013; Folke et al., 2017; Boldt et al., 2019). For a direct comparison of the differences between the goal manipulations in the two tasks (Value and Perceptual) see Appendix 1 (Appendix 1—figure 1).
We then tested how attention interacts with choice by examining the eye-tracking variables. Our frame manipulation, which orthogonalised choice and valuation, allowed us to distinguish between two competing hypotheses. The first hypothesis, currently dominant in the field, is that visual attention is always attracted to high-value items and that it facilitates their choice. The alternative hypothesis is that attention is attracted to items whose value matches the goal of the task. These two hypotheses make starkly different experimental predictions in our task. According to the first, gaze will mostly be allocated to the more valuable item independently of the frame. The second hypothesis instead predicts that in the like frame participants will look more at the more valuable item, while this pattern will reverse in the dislike frame, with attention mostly allocated to the least valuable item. In other words, according to this second hypothesis, visual attention should predict choice (and the match between value and goal) and not value, independently of the frame manipulation.
Our data strongly supported the second hypothesis: participants preferentially gazed at (Figure 2A) the higher value option in the like frame (t(30) = 7.56, p<0.001) and the lower value option in the dislike frame (t(30) = -4.99, p<0.001). In a hierarchical logistic regression analysis predicting choice (Figure 2B), the difference between the time participants spent observing the right and the left item (ΔDT) was a positive predictor of choice in both like (z = 6.448, p<0.001) and dislike (z = 6.750, p<0.001) frames. This means that participants looked for longer at the item that better fits the frame and not at the item with the highest value. Notably, the magnitude of this effect was slightly lower in the dislike case (t(30) = 2.31, p<0.05). Figure 2B also shows the effects of the other predictors of choice in the best fitting model.
Perceptual experiment
We then analysed the effect of attention on choice in the perceptual case to test the generality of our findings. As in the Value Experiment, our data confirmed that participants did not have issues in choosing the circle with more dots in the most frame and the one with the fewest dots in the fewest frame (Figure 1D,F). Furthermore, as in the Value Experiment and many other previous findings (De Martino et al., 2013; Folke et al., 2017), confidence modulated the accuracy of their decisions (Figure 1E). Critically for our main hypothesis, we found that participants’ gaze was preferentially allocated to the relevant option in each frame (Figure 2C): they spent more time observing the circle with more dots during the most frame (t(31)=13.85, p<0.001) and the one with fewer dots during the fewest frame (t(31)=-10.88, p<0.001). ΔDT was a positive predictor of choice (Figure 2D) in most (z = 10.249, p<0.001) and fewest (z = 10.449, p<0.001) frames. Contrary to the results in the Value Experiment, in which the effect of ΔDT on choice was slightly more marked in the like condition (Figure 2B), in the Perceptual Experiment the difference went in the opposite direction: ΔDT had a higher effect in the fewest frame (ΔDTMost-Few: t(31)=-2.17, p<0.05) (Figure 2D). However, and most importantly, in both studies ΔDT was a robust positive predictor of choice in both frame manipulations. To summarise, these results show that in the context of a simple perceptual task, visual attention also has a specific effect in modulating information processing in a goal-directed manner: subjects spend more time fixating the option they will select, not necessarily the option with the highest number of dots.
In both Value and Perceptual Experiments, the most parsimonious models are the ones reported in the manuscript and in Figure 2B and D. For a full model comparison see Appendix 2—figure 1 and Appendix 2—table 1. More details on the choice models are reported in Appendix 2.
Fixations effects in choice
An important prediction of attentional accumulation models is that the chosen item is generally fixated last (unless that item is much worse than the other alternative), with the magnitude of this effect related to the difference in value between the alternatives. This feature of the decision has been consistently replicated in various previous studies (Krajbich et al., 2010; Krajbich and Rangel, 2011; Krajbich et al., 2012). We therefore tested how the last fixation was modulated by the frame manipulation.
Value experiment
In the Value Experiment, in both frames, we replicated the last fixation effect and its modulation by the value difference between the last fixated option and the other one (Figure 3A). In the like frame, the probability of choosing the last item fixated upon increases when the value of the last item is higher, as is shown by the positive sign of the slope of the logistic curve (mean βLike = 0.922). Crucially, during the dislike frame the opposite effect was found: the probability of choosing the last seen option increases when its value is lower relative to the other item, as seen from the negative slope of the curve (mean βDislike = −0.951; ΔβLike-Dislike: t(30)=7.963, p<0.001).
Perceptual experiment
We observed the same pattern of results as in the Value Experiment (Figure 3B). In the most frame, it was more probable that the last fixation was on the chosen item when the fixated circle had a higher number of dots (mean βMost = 1.581). In the fewest frame, the effect flipped: it was more likely that the last circle seen was chosen when it had fewer dots (mean βFew = −0.944; ΔβMost-Few: t(31)=3.727, p<0.001).
The previous set of analyses shows that the last fixation is modulated by the difference in evidence according to the goal that the participant is set to achieve. However, since the last fixation is in general followed by the participant's response, one could suspect that the goal-dependent modulation of attention (i.e. ΔDT) we identified in our choice regression analysis (Figure 2) is entirely driven by the final fixation. This would be problematic since one would obtain results similar to those presented in Figure 2 even if participants’ pattern of attention were not modulated by the goal (i.e. attention is directed in both frames to the most valuable item) or even if the pattern of fixations before the last one were random. To control for this possibility, we performed a series of further analyses:
First of all, we repeated the analysis presented in the previous section (hierarchical choice regression – Figure 2), removing the last two fixations when calculating the ΔDT. Note that we removed the last two fixations and not just the last one to avoid statistical artefacts (i.e. since the final fixation is mostly directed towards the chosen item, there would be an increased probability that the second to last fixation is on the unchosen item). In Appendix 2—figure 3, we show that once the last two fixations are removed the pattern of results is unchanged.
Second, we specifically investigated the middle fixations. Previous studies (Krajbich et al., 2010; Krajbich and Rangel, 2011; Tavares et al., 2017) have reported that the duration of middle fixations increases when the difference in value ratings (or perceptual evidence) of the fixated minus unfixated item increases. We replicated this result for our like and most frames but, critically, the effect was reversed in dislike and fewest frames (i.e. the duration of middle fixations decreased when the relative value of the fixated item was higher). These results, which suggest that the goal-relevant modulation of attention also affects the middle fixations, are presented in Appendix 3—figure 4.
Finally, we investigated in more detail how the relation between attentional allocation and difference in value or perceptual evidence changed over time in the context of the goal manipulation. We calculated the Pearson correlation between fixation position (0: left, 1: right) and the difference in evidence (i.e. ΔValue or ΔDots, in both cases right – left item) at different time points (Figure 3C). We observed that after an initial phase in which there was no clear gaze preference for any of the items (note that given the gaze-contingent design participants must explore both alternatives), fixations were correlated with the frame-relevant item: during the like frame, fixation positions were positively correlated with ΔValue, that is, the fixations were directed towards the item with higher value; during the dislike frame the behaviour was the opposite: fixations were negatively correlated with ΔValue, indicating a preference for the option with lower value. Note that these results are in line with the ones reported by Kovach et al., 2014. We see a very similar pattern of results in the Perceptual Experiment too (Figure 3D).
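The first and third control analyses above can be illustrated with a minimal sketch. It assumes a hypothetical data layout that is not described in the text (fixations as ordered `(side, duration)` tuples, and gaze position sampled in discrete time bins per trial), and it uses raw dwell-time differences; the function names are illustrative only and this is not the authors' analysis code.

```python
import math

def delta_dwell_time(fixations, drop_last=0):
    """Right-minus-left dwell time for one trial.

    fixations: list of (side, duration_ms) tuples in temporal order,
               with side equal to "L" or "R" (hypothetical layout).
    drop_last: number of final fixations to exclude
               (2 corresponds to the control analysis described above).
    """
    kept = fixations[:max(0, len(fixations) - drop_last)]
    right = sum(d for side, d in kept if side == "R")
    left = sum(d for side, d in kept if side == "L")
    return right - left

def pearson(x, y):
    """Pearson correlation; returns NaN if either input has no variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return float("nan")
    return cov / (sx * sy)

def gaze_evidence_correlation(gaze_by_trial, delta_evidence, t):
    """Correlate fixation position (0: left, 1: right) at time bin t with the
    right-minus-left evidence, across the trials that are still running at t."""
    positions, deltas = [], []
    for gaze, delta in zip(gaze_by_trial, delta_evidence):
        if t < len(gaze):
            positions.append(gaze[t])
            deltas.append(delta)
    return pearson(positions, deltas)

# Toy trial: four fixations, alternating sides.
trial = [("L", 300), ("R", 450), ("L", 200), ("R", 600)]
print(delta_dwell_time(trial), delta_dwell_time(trial, drop_last=2))

# Toy like-frame data: late gaze drifts towards the higher-evidence side.
gaze_by_trial = [[0, 1, 1], [1, 0, 0], [0, 1, 1], [1, 0, 0]]
delta_evidence = [2.0, -1.0, 1.0, -2.0]
print([round(gaze_evidence_correlation(gaze_by_trial, delta_evidence, t), 3)
       for t in range(3)])
```

Under the goal-relevance account, the same computation on dislike or fewest trials would be expected to yield a negative correlation at late time points.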
Which factors determine confidence?
Value experiment
To explore the effect that behavioural factors had on confidence, we fitted a hierarchical linear model (Figure 4A). As was the case for the choice regression results presented above, the results for the confidence regression in the like frame replicated all the effects reported in a previous study from our lab (Folke et al., 2017). Again, we present here the most parsimonious model (see Appendix 4—figure 1 and Appendix 4—table 1 for model comparison). We found that the magnitude of ΔValue (|ΔValue|) had a positive influence on confidence in like (z = 5.465, p<0.001) and dislike (z = 6.300, p<0.001) frames, indicating that participants reported higher confidence when the items had a larger difference in value; this effect was larger in the dislike frame (t(30) = -4.72, p<0.01). Reaction time (RT) had a negative effect on confidence in like (z = −6.373, p<0.001) and dislike (z = −7.739, p<0.001) frames, that is, confidence was lower when the RTs were longer. Additionally, we found that, in both conditions, a higher number of gaze switches (i.e. gaze shift frequency, GSF) predicted lower values of confidence in like (z = −2.365, p<0.05) and dislike (z = −2.589, p<0.05) frames, as reported in Folke et al., 2017.
We then looked at the effect of the summed value of both options, ΣValue, on confidence. As in Folke et al., 2017, we found a positive effect of ΣValue on confidence in the like frame (z = 3.206, p<0.01); that is, participants reported a higher confidence level when both options were high in value. Interestingly, this effect was inverted in the dislike frame (z = −4.492, p<0.001), with a significant difference between the two frames (t(30)=9.91, p<0.001). This means that, contrary to what happened in the like frame, in which confidence was boosted when both items had high value, in the dislike frame confidence increased when both items had low value. This novel finding reveals that the change in context also generates a reassessment of the evidence used to generate the confidence reports; that is, confidence also tracks goal-relevant information.
Perceptual experiment
We repeated the same regression analysis in the perceptual decision experiment, replacing value evidence input with perceptual evidence (i.e. absolute difference in the number of dots, |ΔDots|). We directly replicated all the results of the Value Experiment, generalising the effects we isolated to the perceptual realm (Figure 4B). Specifically, we found that |ΔDots| had a positive influence on confidence in most (z = 3.546, p<0.001) and fewest frames (z = 7.571, p<0.001), indicating that participants reported higher confidence when the evidence was stronger. The effect of absolute evidence |ΔDots| on confidence was bigger in the fewest frame (t(31)=-4.716, p<0.001). RT had a negative effect on confidence in most (z = −7.599, p<0.001) and fewest frames (z = −5.51, p<0.001), that is, faster trials were associated with higher confidence. We also found that GSF predicted lower values of confidence in most (z = −4.354, p<0.001) and fewest (z = −5.204, p<0.001) frames. Critically (as in the Value Experiment), the effect of the sum of evidence (ΣDots) on confidence also changed sign depending on the frame. While ΣDots had a positive effect on confidence in the most frame (z = 2.061, p<0.05), this effect was the opposite in the fewest frame (z = −7.135, p<0.001), with a significant difference between the parameters in both frames (t(31)=14.621, p<0.001). The magnitude of the ΣDots effect was stronger in the fewest frame (t(31)=-10.438, p<0.001). For further details on the confidence models see Appendix 4 (Appendix 4—table 2 and Appendix 4—table 3).
Attentional model: GLAM
To gain further insights into the dynamics of the information accumulation process, we modelled the data from both experiments by adapting the Gaze-weighted Linear Accumulator Model (GLAM) recently developed by Thomas et al., 2019. The GLAM belongs to the family of race models and approximates the aDDM (Krajbich et al., 2010; Krajbich and Rangel, 2011): it discards the dynamic aspect of the aDDM in favour of a more efficient estimation of the parameters. This model was chosen since, unlike the aDDM, it allowed us to test the prediction of the confidence measures as balance of evidence (Vickers, 1979; Kepecs et al., 2008; De Martino et al., 2013). Crucially, in both experiments, we used goal-relevant evidence (not the value or the number of dots) to fit the models in the dislike and fewest frames (for further details see the Materials and methods Attentional Model: GLAM section).
Parameter fit and simulation
Value experiment
The simulations estimated with the parameters fitted for like and dislike frames data (even-trials) reproduced the behaviour observed in the data not used to fit the model (odd-trials). In both like and dislike frames, the model replicated the observed decrease of RT when |ΔValue| is high, that is, the increase in speed of response in easier trials (bigger value difference). The RT simulated by the models significantly correlated with the RT values observed in participants’ odd-numbered trials (Like: r(29)=0.90, p<0.001; Dislike: r(29)=0.89, p<0.001) (Figure 5A). In the like frame, the model also correctly predicted a higher probability of choosing the right item when ΔValue is higher. In the dislike frame, the model captured the change in the task goal and predicted that the right item is more likely to be selected when -ΔValue is higher, that is, when the value of the left item is higher. Overall, in both frames the observed and predicted probabilities of choosing the most valuable item were significantly correlated (Like: r(29)=0.80, p<0.001; Dislike: r(29)=0.79, p<0.001) (Figure 5B). See Appendix 5—figure 4A and Appendix 5—figure 5A for further details.
In both frames, the models also predicted choice depending on the difference in gaze (ΔGaze = gright – gleft), that is, that the probability of choosing the right item increases when the time spent observing that item is higher. However, in this case, we cannot say if gaze allocation itself is predicting choice if we do not account for the effect of |ΔValue|. To account for the relationship between choice and gaze, we used a measure devised by Thomas et al., 2019, ‘gaze influence’. Gaze influence is calculated taking the actual choice (1 or 0 for right or left choice, respectively) and subtracting the probability of choosing the right item given by a logistic regression for ΔValue calculated from actual behaviour. The averaged ‘residual’ choice probability indicates the existence of a positive or negative gaze advantage. Then, we compared the gaze influence predicted by GLAM with the empirical one observed for each participant. As in Thomas et al., 2019, most of the participants had a positive gaze influence and it was properly predicted by the model in both frames (Like: r(29)=0.68, p<0.001; Dislike: r(29)=0.63, p<0.001) (Figure 5C).
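The gaze influence measure described above can be sketched as follows. This is an illustrative reading of the description, not the authors' or the GLAM toolbox's code: the single-predictor logistic regression is fitted with a hand-written Newton-Raphson routine, the final step (mean residual on trials with positive ΔGaze minus mean residual on trials with negative ΔGaze) is an assumption about how the averaged residuals are summarised, and the synthetic data and all parameter values are hypothetical.

```python
import math
import random

def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clip to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(x, y, iters=25):
    """Newton-Raphson fit of P(y = 1) = sigmoid(b0 + b1 * x)."""
    b0 = b1 = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = sigmoid(b0 + b1 * xi)
            w = p * (1.0 - p)
            g0 += yi - p            # gradient w.r.t. intercept
            g1 += (yi - p) * xi     # gradient w.r.t. slope
            h00 += w                # entries of X'WX
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        if det < 1e-12:
            break
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

def gaze_influence(delta_value, delta_gaze, chose_right):
    """Choice minus the value-only choice probability, contrasted between
    trials with positive and negative gaze advantage for the right item."""
    b0, b1 = fit_logistic(delta_value, chose_right)
    resid = [c - sigmoid(b0 + b1 * dv)
             for dv, c in zip(delta_value, chose_right)]
    pos = [r for r, g in zip(resid, delta_gaze) if g > 0]
    neg = [r for r, g in zip(resid, delta_gaze) if g < 0]
    return sum(pos) / len(pos) - sum(neg) / len(neg)

# Synthetic participant whose choices depend on both value and gaze
# (the generating coefficients 0.8 and 1.5 are arbitrary).
rng = random.Random(0)
dv = [rng.uniform(-3, 3) for _ in range(2000)]
dg = [rng.uniform(-1, 1) for _ in range(2000)]
choice = [1 if rng.random() < sigmoid(0.8 * v + 1.5 * g) else 0
          for v, g in zip(dv, dg)]
gi = gaze_influence(dv, dg, choice)
print(round(gi, 3))
```

A positive value indicates that, after accounting for ΔValue, the item looked at for longer is chosen more often than the value difference alone would predict.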
Perceptual experiment
As in the Value Experiment, we fitted the GLAM to the data and we conducted model simulations. Again, these simulations showed that we could recover most of the behavioural patterns observed in participants. We replicated the relationship between RT and |ΔDots| (Most: r(26)=0.97, p<0.001; Fewest: r(26)=0.98, p<0.001) (Figure 5D). As in the value-based experiment, the model also predicted a higher probability of choosing the right-hand item when ΔDots is higher in the most frame and when -ΔDots is higher in the fewest frame. However, in the Perceptual Experiment, the simulated choices were significantly correlated with the observed data only in the most frame, although we observed a non-significant trend in the fewest frame (Most: r(26)=0.69, p<0.001; Fewest: r(26)=0.37, p=0.051) (Figure 5E). In both frames, we observed that the model predicted that choice was linked to ΔGaze and, as in the Value Experiment, we show that the gaze influence predicted by the model is indeed observed in the data (Most: r(26)=0.65, p<0.001; Fewest: r(26)=0.47, p<0.05) (Figure 5F). See Appendix 5—figure 4B and Appendix 5—figure 5B for further details.
The models fitted without accounting for the change in goal-relevant evidence provided a poor fit of the data; these results are presented in Appendix 5—figures 1–3 and 6. For a direct comparison of the different GLAM parameters see Appendix 6. Additionally, we were able to mirror the results obtained with GLAM using aDDM (Krajbich et al., 2010; Tavares et al., 2017). For dislike and fewest frames, the best model was the one fitted using goal-relevant evidence (see Appendix 7 for details).
Balance of evidence and confidence
The GLAM belongs to the family of race models in which evidence is independently accumulated for each option. Therefore, using the GLAM we were able to adapt the model to estimate a measure of confidence in the decision that is defined by the balance of evidence (Vickers, 1979; Vickers, 1970; Kepecs et al., 2008; De Martino et al., 2013) allowing us to characterise the pattern of the confidence measures. Balance of evidence is defined as the absolute difference between the accumulators for each option at the moment of choice, which is when one of them reaches the decision threshold (i.e., Δe = |Eright(tfinal) - Eleft(tfinal)|) (Figure 6A). To estimate Δe, we performed a large number of computer simulations using the fitted parameters for each participant in both experiments.
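To make the definition of balance of evidence concrete, the following is a minimal sketch of a gaze-weighted two-accumulator race, loosely modelled on the GLAM. The functional form used here (gaze-discounted evidence, a logistic transform of relative evidence, noisy linear accumulation to a fixed boundary) and all parameter names and values are assumptions made for illustration; they are not the fitted model or the parameters reported in this study.

```python
import math
import random

def simulate_race(r_left, r_right, g_left, gamma=0.3, tau=1.0,
                  nu=0.01, sigma=0.01, boundary=1.0,
                  rng=random, max_steps=100000):
    """Simulate one trial; return (choice, decision_step, delta_e).

    r_left, r_right: goal-relevant evidence for each option.
    g_left: fraction of trial time spent looking at the left option.
    gamma: gaze bias; evidence for the unattended option is scaled by gamma.
    """
    g_right = 1.0 - g_left
    # Gaze-weighted evidence for each option.
    a_left = g_left * r_left + (1.0 - g_left) * gamma * r_left
    a_right = g_right * r_right + (1.0 - g_right) * gamma * r_right
    # Relative evidence passed through a logistic function gives the drifts.
    d_left = 1.0 / (1.0 + math.exp(-tau * (a_left - a_right)))
    d_right = 1.0 / (1.0 + math.exp(-tau * (a_right - a_left)))
    e_left = e_right = 0.0
    for t in range(1, max_steps + 1):
        e_left += nu * d_left + rng.gauss(0.0, sigma)
        e_right += nu * d_right + rng.gauss(0.0, sigma)
        if e_left >= boundary or e_right >= boundary:
            choice = "right" if e_right >= e_left else "left"
            # Balance of evidence: gap between accumulators at decision time.
            return choice, t, abs(e_right - e_left)
    return None, max_steps, abs(e_right - e_left)

# Repeated simulations of one hypothetical trial type.
rng = random.Random(1)
sims = [simulate_race(2.0, 4.0, g_left=0.4, rng=rng) for _ in range(200)]
mean_delta_e = sum(s[2] for s in sims) / len(sims)
p_right = sum(s[0] == "right" for s in sims) / len(sims)
print(round(mean_delta_e, 3), round(p_right, 3))
```

Setting `gamma=1.0` removes the gaze bias, which is one way to contrast simulated Δe with and without the influence of attention under these assumptions.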
Value experiment
To confirm that the relationship between confidence and other experimental variables was captured by the balance of evidence simulations, we constructed a linear regression model predicting Δe as a function of the values and the RTs obtained in the simulations (Δe ∼ |ΔValue| + simulated RT + ΣValue). We found that this model replicated the pattern of results we obtained experimentally (Figure 4). We then explored whether the model was able to recover the effect of ΣValue on confidence (Figure 6B). As we have shown when analysing confidence, ΣValue boosted Δe in the like frame (βΣValue = 0.071, t(37196) = 14.21, p<0.001) and reduced Δe in the dislike frame (βΣValue = −0.061, t(37196) = −12.07, p<0.001). The effect of ΣValue on confidence was replicated in the simulations, with an increase of Δe when high value options are available to choose (Appendix 8—figure 1 and Appendix 8—figure 3A,D for more details). In the dislike frame, the fitted model also replicated this pattern of behaviour, including the adaptation to context which predicts higher Δe when both alternatives have low value. Interestingly, the replication of the effect of ΣValue on Δe with GLAM did not hold when the gaze bias was taken out of the model in like (βΣValue = −0.007, t(37196) = −1.495, p=0.13, ns) and dislike (βΣValue = −0.002, t(37196) = −0.413, p=0.679, ns) frames (Figure 6B). We also found that the effect of |ΔValue| on confidence was replicated by the simulated balance of evidence, increasing Δe when the difference between item values is higher (i.e. participants and the model simulations are more ‘confident’ when |ΔValue| is higher) (Appendix 8—figure 1).
Perceptual experiment
We conducted a set of similar analyses and model simulations in the Perceptual Experiment (Figure 6C). We found that ΣDots boosted Δe in the most frame (Most: βΣDots = 0.029, t(33596) = 4.71, p<0.001) and reduced Δe in the fewest frame (Fewest: βΣDots = −0.088, t(33596) = −14.41, p<0.001). As in the Value Experiment, this effect disappeared when the gaze bias was taken out of the model (Most: βΣDots = −0.0002, t(33596) = −0.04, p=0.96, ns; Fewest: βΣDots = −0.006, t(33596) = −1.03, p=0.29, ns) (see Appendix 8—figure 2 and Appendix 8—figure 3B,E for more details).
Overall, these results show how the model is capable of capturing the novel empirical effect on confidence we identified experimentally, giving computational support to the hypothesis that goal-relevant evidence is fed to second order processes like confidence. It also hints at a potential origin of the effects of the sum of evidence (i.e. ΣValue, ΣDots) on confidence: asymmetries in the accumulation process, in particular the multiplicative effect of attention on the accumulation of evidence, may enhance the differences between items that are more relevant for the frame. This consequently boosts the level of confidence that participants have in their decisions.
A model of optimal information acquisition
We then sought to understand why participants systematically accumulated evidence depending on the task at hand, instead of first integrating evidence using a task-independent strategy and then emitting a response appropriate to the task. We reasoned that this may reflect a response in line with models of rational information acquisition popular in economics. These include models of so-called rational inattention, according to which agents rationally choose which information to acquire considering the task, the incentives, and the cost of acquiring and processing information (Sims, 2003; Sims, 2010; Caplin and Dean, 2015; Hébert and Woodford, 2017). As opposed to DDM or GLAM, these models attempt to investigate not only what the consequences of information acquisition are, but also which information is acquired.
In this model, we consider an agent facing n available options. Each item i has value vi to the agent, which is unknown, and agents have a prior such that values follow an independent, identical distribution; for simplicity, we assume it to be Normal. Agents can acquire information in the form of noisy signals about the value of each item, with noise that is independently and identically distributed. They follow Bayes’ rule in updating their beliefs after receiving information. Once they finish acquiring information, they choose the item with the highest expected value.
Consider first the case in which an agent needs to pick the best item out of n possible ones. Suppose that she already received one signal for each item. Denote i1 the item for which the agent received the highest signal, which is also the item with the highest expected value; i2 the second highest, etc. (Because each of these is almost surely unique, let us for simplicity assume they are indeed unique). The agent can acquire one additional signal about any of the available items or select any probability distribution over signals. The following proposition shows that it is (weakly) optimal for the agent to acquire a second signal about the item that is currently best, that is, i1.
Denote Δ the set of all probability distributions over signals and the utility after acquiring a new signal about item i, that is,
Proposition 1. The optimal strategy when choosing the best option is to acquire one more signal about item i1 or i2, that is, either the item with the currently highest expected value or the one with second highest value. That is:
This proposition shows that agents have asymmetric optimal sampling strategies: they are not indifferent between which item to sample, but rather want to acquire extra signals about items that currently look best or second-best. (They are indifferent between the latter two.) When n > 2, these strategies are strictly better than acquiring signals about any other item.
How would this change if the agent instead needs to pick which item to eliminate, assuming that she gets the average utility of the items she keeps? In this case, the expected utility after acquiring a new signal about item i is:
Then, it is optimal to receive an additional signal about the least valuable item, $i_n$, or the next one, $i_{n-1}$.
Proposition 2. The optimal strategy when choosing which item to discard is to acquire one more signal about item $i_n$ or $i_{n-1}$, that is, either the one with the lowest or the one with the second lowest value. That is:
For a proof of both propositions, see Appendix 9.
Again, agents have asymmetric optimal sampling strategies, but now they want to sample again the items that currently look worst. The intuition behind both results is that when one has to choose the best item, it is more useful to acquire information that is likely to change the ranking at the top (i.e. between the best and second-best items) than information that changes the ranking at the bottom (e.g. between the fourth and fifth items), since these items will not be selected. Crucially, the reverse is true when one is tasked to select which item to eliminate.
This shows how in these simple tasks it is strictly more advantageous to acquire information in line with the current goal rather than adopting a goal-independent information-acquisition strategy.
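The asymmetry described by the two propositions can be checked numerically. The following is a minimal sketch, not the proof given in the appendix: it assumes a Normal prior and Normal signal noise with unit variances (illustrative values), a hypothetical set of posterior means after one signal per item, and the standard closed form for the expectation of the positive part of a Normal variable. All function names are introduced here for illustration only.

```python
import math


def normal_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def normal_cdf(x):
    # erfc keeps precision in the lower tail
    return 0.5 * math.erfc(-x / math.sqrt(2.0))


def expected_positive_part(gap, tau):
    """E[max(Z, 0)] for Z ~ Normal(gap, tau**2)."""
    d = gap / tau
    return tau * (normal_pdf(d) + d * normal_cdf(d))


def update_sd(prior_sd=1.0, noise_sd=1.0):
    """SD of the shift in an item's posterior mean caused by a second signal."""
    var_after_one = 1.0 / (1.0 / prior_sd**2 + 1.0 / noise_sd**2)
    var_after_two = 1.0 / (1.0 / prior_sd**2 + 2.0 / noise_sd**2)
    return math.sqrt(var_after_one - var_after_two)


def value_of_sampling(means, k, tau, frame):
    """Expected payoff if one more signal is acquired about item k."""
    others = means[:k] + means[k + 1:]
    if frame == "best":
        # The agent keeps the item with the highest updated mean.
        c = max(others)
        return c + expected_positive_part(means[k] - c, tau)
    if frame == "discard":
        # The agent drops the lowest updated mean and keeps the average of the rest.
        c = min(others)
        expected_min = c - expected_positive_part(c - means[k], tau)
        return (sum(means) - expected_min) / (len(means) - 1)
    raise ValueError("frame must be 'best' or 'discard'")


# Hypothetical posterior means, sorted from best (index 0) to worst (index 4).
means = [2.0, 1.5, 0.3, -0.4, -1.0]
tau = update_sd()
best_frame = [value_of_sampling(means, k, tau, "best") for k in range(len(means))]
discard_frame = [value_of_sampling(means, k, tau, "discard") for k in range(len(means))]
```

Under these assumptions, `best_frame` is maximal (and equal) for the two top-ranked items, whereas `discard_frame` is maximal (and equal) for the two bottom-ranked items, which mirrors the indifference between the two extreme items stated in each proposition.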
Our model suggests that in many ecological settings, in which there are more than two options, the optimal strategy involves acquiring asymmetric information depending on the goal. It is only when there are two options that individuals are indifferent about which information to acquire. We propose that the asymmetric strategies we observe even in this latter case might be a consequence of the fact that individuals have developed a strategy that is optimal for the more frequent, real-life cases in which n > 2, and continue to apply this same asymmetric strategy to binary choices, where it remains optimal.
Discussion
In this study, we investigated how framing affects the way in which information is acquired and integrated during value-based and perceptual choices. Using psychophysics together with computational and economic models, we have been able to adjudicate between two contrasting hypotheses. The first one, currently the dominant one in the field of neuroeconomics, proposes that attention modulates (either by biasing or boosting) a value integration that starts at the beginning of the deliberation process. Subsequently, at the time of the decision, the participant would give the appropriate response (in our task, accepting the option with the highest value or rejecting the one with the lowest) using the value estimate constructed during this deliberation phase. The second hypothesis suggests that, from the very start of the deliberation process, the task-frame (goal) influences the type of information that is integrated. In this second scenario, attention is not automatically attracted to high value items to facilitate their accumulation but has a more general role in prioritising the type of information that is useful for achieving the current behavioural goal. Importantly, these two hypotheses make very distinct predictions about the pattern of attention and suggest very different cognitive architectures underpinning the decision process.
Our results favour the second hypothesis: specifically, we show that, in both perceptual and value-based tasks, attention is allocated depending on the behavioural goal of the task. Although our study does not directly contradict previous findings (Krajbich et al., 2010; Krajbich and Rangel, 2011; Cavanagh et al., 2014; Smith and Krajbich, 2019), it adds nuance to the view that this is a process specifically tied to value integration (defined as a hedonic or reward attribute). Our findings speak in favour of a more general role played by attention in prioritising the information needed to fulfil a behavioural goal in both value and perceptual choices (Gottlieb, 2012; Kovach et al., 2014; Glickman et al., 2018). Importantly, the seeking of goal-relevant information is observed along the trial, opposing the assumption that attentional sampling is random except for the last fixation (Krajbich et al., 2010; Krajbich and Rangel, 2011; see Gluth et al., 2018; Gluth et al., 2020 for additional support for this idea). Pavlovian influences have been proposed to play a key role in the context of accept/reject framing manipulation (De Martino et al., 2006; Guitart-Masip et al., 2012; Guitart-Masip et al., 2014; Dayan, 2012). However, the fact that we found almost identical results in a follow-up perceptual study in which the choice was not framed in terms of ‘accept’ or ‘reject’ but using a different kind of instruction (i.e. ‘choose the option with fewer or more dots’) suggests that attention acts on a more fundamental mechanism of information processing that goes beyond simple Pavlovian influences.
We also measured the trial-by-trial fluctuations in confidence to gain a deeper insight in the dynamics of this process. We found that the role of confidence goes beyond that of simply tracking the probability of an action being correct, as proposed in standard signal detection theory. Instead, it is also influenced by the perceived sense of uncertainty in the evaluation process (Navajas et al., 2017; Vaghi et al., 2017), and contextual cues (Lebreton et al., 2019). In turn, confidence influences future responses and information seeking (Folke et al., 2017; Guggenmos et al., 2016; Fleming et al., 2018; Rollwage et al., 2018). In previous work (Folke et al., 2017), we reported how, in value-based choice, confidence was related not only to the difference in value between the two items, but also to the summed value (ΔValue and ΣValue using the current notation), and we found that confidence was higher if both items have a high value (Folke et al., 2017). Here, we replicate this effect in both experiments in the like and most conditions. However, this effect flips in the dislike or fewest frame: in these cases, confidence increases when the summed value or number of dots is smaller. This result is particularly striking since the frame manipulation should be irrelevant for the purpose of the decision and has little effect on the objective performance. This suggests that similarly to attention, the sense of confidence is also shaped by the behavioural goal that participants are set to achieve.
In both experiments, the incorporation of goal-relevant evidence to fit the GLAM resulted in a better model fit compared with the model in which the value or perceptual evidence was integrated independently of the frame. We then modified the GLAM to include a measure of confidence defined as balance of evidence (Δe) (Vickers, 1979; Kepecs et al., 2008; De Martino et al., 2013). In doing so, we confirmed that our model can replicate all the main relations between confidence, choice and RT. We then tested whether the model simulation also recovered the flip in the relationship between confidence and summed evidence (ΣValue or ΣDots) triggered by the frame manipulation. We found that the model captures this effect only if the attentional bias is included in the simulations. The boost in Δe when goal-relevant evidence in both alternatives is high can be attributed to the architecture of the model: gaze has a multiplicative effect on evidence accumulation. For example, consider a case with two items of value A1 = 2 and A2 = 1, and a discount factor for the unattended item u = 0.3. Assuming the item with higher value is gazed at more, we could express, in a very simplified way, the Δe for this choice as ΔeA = A1 − A2 × u = 2 − 1 × 0.3 = 1.7. Consider now two new items with identical ΔValue but a higher ΣValue, B1 = 10 and B2 = 9. Notice that since ΔValue is the same, this choice, in the absence of attentional effects, should be considered of identical difficulty to case A (A1 − A2 = B1 − B2 = 1), and therefore the agent should be neither more nor less confident. But, keeping the same attentional factors as for the first set, the Δe between the items increases: ΔeB = B1 − B2 × u = 10 − 9 × 0.3 = 7.3 (ΔeA < ΔeB). This effect would not be observed if attention affected evidence accumulation in an additive way (A1 − (A2 − u) = B1 − (B2 − u)).
Our empirical confidence data therefore provide further support for a multiplicative (Smith and Krajbich, 2019) rather than additive effect of attention on goal-relevant information. Overall, these data speak in favour of a coding scheme in which the goal sets, from the beginning of the task, the allocation of attention and, by doing so, influences first-order processes such as choice, but also second-order processes such as confidence. Further empirical data will be required to test this idea more stringently.
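The simplified two-item example can be written in general form. The derivation below is a sketch under the same simplifying assumptions as that example (the higher-valued item is the attended one, and the unattended item is discounted by a factor u); the symbols a and b for the attended and unattended values are introduced here for illustration only.

```latex
% Attended value a and unattended value b, expressed through the
% difference \Delta = a - b and the sum \Sigma = a + b:
a = \tfrac{1}{2}(\Sigma + \Delta), \qquad b = \tfrac{1}{2}(\Sigma - \Delta)

% Multiplicative discount of the unattended item:
\Delta e_{\text{mult}} = a - u\,b
  = \tfrac{1+u}{2}\,\Delta + \tfrac{1-u}{2}\,\Sigma

% Additive shift of the unattended item:
\Delta e_{\text{add}} = a - (b - u) = \Delta + u
```

With u = 0.3 and Δ = 1, the multiplicative expression gives 0.65 + 0.35 × 3 = 1.7 for Σ = 3 and 0.65 + 0.35 × 19 = 7.3 for Σ = 19, matching the two cases in the example, whereas the additive expression gives 1.3 in both cases. For u < 1, only the multiplicative form makes Δe grow with the summed value at a fixed value difference.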
The idea that the goal of the task plays a central role in shaping value-based decisions should not be surprising. Indeed, value-based decision is often called goal-directed choice. Nevertheless, there has been surprisingly little experimental work in which the behavioural goal has been directly manipulated as the key experimental variable for studying the relation between attention and value. Notable exceptions are two recent studies from Frömer et al., 2019 and Kovach et al., 2014. In the first study (Frömer et al., 2019), participants were shown a set of four items and asked, in half of the trials, to determine the best item and, in the other half, the worst item. In line with our findings, they found that behaviour and neural activity in the ‘value network’, including vmPFC and striatum, were determined by goal-congruency and did not simply reflect the expected reward. In the second study, Kovach et al., 2014 implemented a design similar to our value-based experiment in which participants were required to indicate the item to keep and the one to discard. They found, similarly to our findings in the value-based experiment, that attention was mostly allocated according to the task goal. However, in the first few hundred milliseconds, these authors found that attention was directed more prominently to the most valuable item in both conditions. We did not replicate this last finding in our experiment (see Figure 3C and D and Appendix 2—figure 2, showing that fixations were randomly allocated during the early moments of the trial). One possible reason for this discrepancy is that the experiment by Kovach and colleagues presented both items on the screen at the beginning of the task, unlike our task, in which the items were presented in a gaze-contingent way (to avoid processing in the visual periphery).
This setting might have triggered an initial and transitory bottom-up attention grab from the most valuable (and often most salient) item before the accumulation process started.
To gain a deeper insight into our findings, we developed a normative model of optimal information acquisition rooted in economic decision theory. Our model shows that in many real-life scenarios in which the decision set is larger than two, the optimal strategy to gather and integrate information depends on the behavioural goal. Intuitively, this happens because new information is all the more useful the more likely it is to change the behavioural output, that is, the choice. When the agent needs to select the best item in a set, it is best to search for evidence that is more likely to affect the top of the ranking (e.g. is the best item still the best one?); information that changes the middle or the bottom of the ranking is instead less valuable (e.g. is the item ranked seventh now ranked sixth?) because it would not affect the behavioural output. When choosing which item to discard, instead, the optimal strategy involves acquiring the information most informative about the bottom of the ranking and not the top. We propose that even in the context of binary choice studied here, humans might still deploy this normative strategy (for multi-alternative choice), and that while it does not provide a normative advantage, it is not suboptimal. Further work in which the size of the set is increased would be required to test this idea more stringently. Notably, two recent pre-prints have also introduced models to explain how the attentional patterns in choice are generated assuming optimal information sampling (Jang et al., 2020; Callaway et al., 2020). Both models are based on Bayesian updates of value beliefs, with visual attention playing a role in selecting the information to sample. However, both studies were developed considering only a standard appetitive ‘like’ frame (the Krajbich et al., 2010 study was used as a benchmark in both cases).
The most far reaching conclusion of our work is that context and behavioural demand have a powerful effect on how information is accumulated and processed. Notably, our data show that this is a general effect that spans both more complex value-based choice and simpler perceptual choice. Our conclusion is that, given the limited computational resources of the brain, humans have developed a mechanism that prioritises the processing or recollection of the information that is most relevant for the behavioural response that is required. This has profound implications when we think about the widespread effect of contextual information on decision making that has been at the core of the research in psychology, behavioural economics and more recently neuroeconomics (Kahneman and Tversky, 1984; Kahneman and Tversky, 2000; Camerer et al., 2004; De Martino et al., 2006; Glimcher and Fehr, 2014). Most of these contextual or framing effects have been labelled as ‘biases’ because, once one strips away the context, the actual available options should remain identical. However, this perspective may not be putting enough emphasis on the fact that the decision maker has to construct low dimensional (and therefore imperfect) representations of the decision problem. As we have shown here, from the very beginning of the deliberation process, the context — even when it is simple (like/dislike, most/fewest) or irrelevant from the experimenter perspective — affects which information is processed, recalled, or attended to, with effects that spread into post-decision processing such as confidence estimation. This, as a consequence, will produce profoundly dissimilar representations according to the behavioural goal set by the context. With this shift of perspective, it may well be the case that many of the so-called ‘biases’ will be shown in a new light, given that participants are dealing with very different choices once the behavioural goal changes. 
This viewpoint might provide a more encouraging picture of the human mind, by suggesting that evolution has equipped us well to deal with ever-changing environments in the face of limited computational resources.
Materials and methods
Procedure
Value experiment
At the beginning of this experiment, participants were asked to report on a scale from £0 to £3 the maximum they would be willing to pay for each of 60 snack food items. They were informed that this bid would give them the opportunity to purchase a snack at the end of the experiment, using the Becker-DeGroot-Marschak (BDM) mechanism (Becker et al., 1964), which gives them incentives to report their true valuation. Participants were asked to fast for 4 hr prior to the experiment, with the expectation that they would be hungry and willing to spend money to buy a snack.
After the bid process, participants completed the choice task: in each trial, they were asked to choose between two snack items, displayed on-screen in equidistant boxes to the left and right of the centre of the screen (Figure 1A). After each binary choice, participants also rated their subjective level of confidence in their choice. Pairs were selected using the value ratings given in the bidding task: using a median split, each item was categorised as high- or low-value for the agent; these were then combined to produce 15 high-value, 15 low-value, and 30 mixed pairs, for a total of 60 pairs tailored to the participant’s preferences. Each pair was presented twice, inverting the position to have a counterbalanced item presentation.
The key aspect of our experimental setting is that all participants executed the choice process under two framing conditions: (1) a like frame, in which participants were asked to select the item that they liked the most, that is, the snack that they would prefer to eat at the end of the experiment and (2) a dislike frame in which participants were asked to select the item that they liked the least, knowing that this is tantamount to choosing the other item for consumption at the end of the experiment. See Figure 1A for a diagram of the task.
After four practice trials, participants performed a total of 6 blocks of 40 trials (240 trials in total). Like and dislike frames were presented in alternate blocks and the order was counterbalanced across participants (120 trials per frame). An icon in the top-left corner of the screen (‘thumbs up’ for like and ‘stop sign’ for dislike) reminded participants of the choice they were required to make; this was also announced by the investigator at the beginning of every block. The last pair in a block would not be first in the subsequent block.
Participants’ eye movements were recorded throughout the choice task and the presentation of food items was gaze-contingent: participants could only see one item at a time depending on which box they looked at; following Folke et al., 2017, this was done to reduce the risk that participants, while gazing at one item, would still process the other item in their visual periphery.
Once all tasks were completed, one trial was randomly selected from the choice task. The BDM bid value of the preferred item (the chosen one in the like frame and the unchosen one in the dislike frame) was compared with a randomly generated number between £0 and £3. If the bid was higher than the BDM generated value, an amount equivalent to the BDM value was subtracted from their £20 payment and the participant received the food item. If the bid was lower than the generated value, participants were paid £20 for their time and did not receive any snack. In either case, participants were required to stay in the testing room for an extra hour and were unable to eat any food during this time other than food bought in the auction. Participants were made aware of the whole procedure before the experiment began.
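The payment rule can be summarised in a few lines of code. This is a minimal sketch that assumes the 'BDM value' subtracted from the payment is the randomly generated number (as in the standard BDM procedure) and that a tie is treated as no purchase; neither detail is stated explicitly above, and the function name is hypothetical.

```python
import random


def resolve_bdm(bid, drawn_price, endowment=20.0):
    """Return (payment, receives_snack) for the randomly selected trial.

    bid: the participant's BDM bid for the preferred item (in pounds).
    drawn_price: the randomly generated number between 0 and 3 (in pounds).
    """
    if bid > drawn_price:
        # Assumption: the participant pays the drawn price, not the bid.
        return endowment - drawn_price, True
    # Assumption: ties are resolved as no purchase.
    return endowment, False


# Example draw; the uniform distribution is an assumption.
example_price = random.uniform(0.0, 3.0)
example_outcome = resolve_bdm(bid=2.0, drawn_price=example_price)
```

Because the price paid does not depend on the bid itself, overbidding or underbidding cannot improve the outcome, which is why the mechanism gives incentives to report true valuations.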
Perceptual experiment
The Perceptual Experiment had a design similar to the one implemented in the Value Experiment, except that the alternatives were visual stimuli instead of food items. In this task, participants had to choose between two circles filled with dots (for a schematic diagram see Figure 1), again in two frames. In the most frame, they had to pick the one with more dots; in the fewest frame, the one with fewer dots. The total number of dots presented in the circles could have three numerosity levels (50, 80 and 110 dots). For each pair in those three levels, the dot difference between the circles varied across 10 percentage levels (ranging from 2% to 20% in 2% steps). To increase the difficulty of the task, in addition to the target dots (blue-green coloured), distractor dots (orange coloured) were also shown. The number of distractor dots was 80% of that of target dots (40, 64 and 88 for the three numerosity levels, respectively). Pairs were presented twice and counterbalanced for item presentation. After 40 practice trials (the first 20 with feedback, the last 20 without), participants completed 3 blocks of 40 trials in the most frame and the same number in the fewest frame; they faced blocks with alternating frames, with a presentation order counterbalanced across participants. On the top left side of the screen, a message indicating Most or Fewest reminded participants of the current frame. Participants reported their confidence level in making the correct choice at the end of each trial. As in the previous experiment, the presentation of each circle was gaze-contingent. Eye-tracking information was recorded for each trial. Participants received £7.50 for 1 hr in this study.
Both tasks were programmed using Experiment Builder version 2.1.140 (SR Research).
Exclusion criteria
Value experiment
We excluded individuals that met any of the following criteria:
1. Participants used less than 25% of the BDM value scale.
2. Participants gave exactly the same BDM value for more than 50% of the items.
3. Participants used less than 25% of the choice confidence scale.
4. Participants gave exactly the same confidence rating for more than 50% of their choices.
5. Participants did not comply with the requirements of the experiment (i.e. participants who consistently chose the preferred item in the dislike frame or whose average blink time was over 15% of the duration of the trials).
Perceptual experiment
Since the assessment of the value scale is irrelevant for the Perceptual Experiment, we excluded participants according to criteria 3, 4, and 5.
Participants
Value experiment
Forty volunteers gave their informed consent to take part in this research. Of these, 31 passed the exclusion criteria and were included in the analysis (16 females, 17 males, aged 20–54, mean age of 28.8). One participant was excluded for using less than 25% of the bidding scale (criterion 1). A second participant was excluded according to criterion 2, as they frequently gave the same bid value. A further four participants were excluded under criterion 4. Three participants were excluded under criterion 5. In the latter case, one participant’s eye-tracking data showed the highest number of blink events and this participant made choices without fixating any of the items; the other two did not comply with the frame manipulation. To ensure familiarity with the snack items, all the participants in the study had lived in the UK for 1 year or more (average 17 years).
Perceptual experiment
Forty volunteers were recruited for the second experiment. Thirty-two participants (22 females, 10 males, aged 19–50, mean age of 26.03) were included in the behavioural and regression analyses. Three participants were excluded for repetition of the confidence rating (criterion 4). Five participants were removed under criterion 5: four of them had performance close to chance level or did not follow the frame manipulation, and one participant presented difficulties for eye-tracking. Due to instability in parameter estimation (problems of MCMC convergence), four additional participants were removed from the GLAM modelling analysis.
All participants signed a consent form, and both studies were done following the approval given by the University College London, Division of Psychology and Language Sciences ethics committee.
Eye-tracking
Value and perceptual experiments
An Eyelink 1000 eye-tracker (SR Research) was used to collect the visual data. Left eye movements were sampled at 500 Hz. Participants rested their heads on a head support in front of the screen. Display resolution was 1024 × 768 pixels. To standardise the environmental setting and the level of detectability, the lighting was monitored in the room using a dimmer lamp and light intensity was maintained at 4 ± 0.5 lx at the position of the head-mount when the screen was black.
Eye-tracking data were analysed initially using Data Viewer (SR Research), from which reports were extracted containing details of eye movements. We defined two interest-areas (IA) for left and right alternatives: two squares of 350 × 350 pixels in Value Experiment and two circles of 170 pixels of radius for Perceptual Experiment. The data extracted from the eye-tracker were taken between the appearance of the elements on the screen (snack items or circle with dots in experiments 1 and 2, respectively) and the choice button press (confidence report period was not considered for eye data analysis).
The time participants spent fixating on each IA was defined as the dwelling time (DT). From it, we derived a difference in dwelling time (ΔDT) for each trial by subtracting the DT of the left IA from the DT of the right IA. The starting and ending IA of each saccade were recorded. This information was used to determine the number of times participants alternated their gaze between IAs, that is, ‘gaze shifts’. The total number of gaze shifts between IAs was extracted for each trial, producing the gaze shift frequency (GSF) variable.
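As an illustration of these definitions, the sketch below computes DT, ΔDT and GSF for a single trial. The input format (an ordered list of interest-area labels with fixation durations) is a hypothetical simplification: the measures described above were derived from Data Viewer reports, with gaze shifts identified from the start and end IA of each saccade rather than from consecutive fixations.

```python
def gaze_measures(fixations):
    """Compute dwelling times, their difference and gaze shifts for one trial.

    fixations: ordered list of (interest_area, duration_ms) tuples, where
    interest_area is "left", "right", or anything else for fixations that
    fall outside both interest areas (these are ignored).
    """
    dwell = {"left": 0.0, "right": 0.0}
    gaze_shifts = 0
    previous_ia = None
    for interest_area, duration_ms in fixations:
        if interest_area not in dwell:
            continue
        dwell[interest_area] += duration_ms
        # A gaze shift is a change of interest area between fixations.
        if previous_ia is not None and interest_area != previous_ia:
            gaze_shifts += 1
        previous_ia = interest_area
    return {
        "dt_left": dwell["left"],
        "dt_right": dwell["right"],
        "delta_dt": dwell["right"] - dwell["left"],  # right minus left
        "gsf": gaze_shifts,
    }


# Hypothetical trial: left, right, right, left.
example_trial = [("left", 300), ("right", 450), ("right", 200), ("left", 150)]
example_measures = gaze_measures(example_trial)
```

For this hypothetical trial, the right item is looked at for 650 ms and the left item for 450 ms, so ΔDT is 200 ms, and the gaze alternates twice between interest areas.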
Data analysis: behavioural data
Behavioural measures during like/dislike and most/fewest frames were compared using statistical tests available in SciPy. The scikit-learn toolbox in Python was used to perform logistic regressions on choice data. Fixation time series analysis was performed following the methodology of Kovach et al., 2014. We segmented the time series of all the trials into samples of 10 ms. We aligned all the trial time series to the beginning of the trial, when participants could start exploring the gaze-contingent alternatives. We considered an analysis window of 2000 ms after the presentation of stimuli for all the trials. Note that not all the trials have the same duration and no temporal normalisation was performed in this analysis. For each time sample, we obtained the gaze position and the difference in evidence (i.e. ΔValue or ΔDots) for all trials across participants, and the Pearson correlation was then calculated. Permutation testing was used to assess the difference between the time series in like/dislike and most/fewest frames. Instantaneous fixations (across trials and frames) were shuffled 200 times to create a null distribution of the difference of correlation coefficients between frames. False discovery rate (FDR) was used to correct the p-values obtained from the permutation test for multiple tests (α ≤ 0.01). All the hierarchical analyses were performed using the lme4 package (Bates et al., 2015) for R, integrated in a Jupyter notebook using the rpy2 package (https://rpy2.readthedocs.io/en/latest/). For choice models, we predicted the log odds ratio of selecting the item appearing on the right. Fixed-effects confidence intervals were calculated by multiplying standard errors by 1.96. Additionally, we predicted confidence using a linear mixed-effects model. Predictors were all z-scored at participant level. Matplotlib/Seaborn packages were used for visualisation.
Data analysis: attentional model - GLAM
To gain further insight into potential variations in the evidence accumulation process due to the change in frames, we used the Gaze-weighted Linear Accumulator Model (GLAM) developed by Thomas et al., 2019. GLAM is part of the family of linear stochastic race models in which different alternatives (i, i.e. left or right) accumulate evidence (Ei) until a decision threshold is reached by one of them, determining the chosen alternative. The accumulator for an individual option is described by the following expression:

$$E_i(t) = E_i(t-1) + \nu R_i + \epsilon_t, \quad \epsilon_t \sim \mathcal{N}(0, \sigma^2) \qquad (5)$$

with a drift term (ν) controlling the speed of relative evidence (Ri) integration and i.i.d. noise terms with normal distribution (zero-centered and standard deviation σ). Ri is a term that expresses the amount of evidence that is accumulated for item i at each time point t. This is calculated as follows. We denote by gi the relative gaze term, calculated as the proportion of time that participants observed item i:

$$g_i = \frac{DT_i}{\sum_j DT_j} \qquad (6)$$

with DT as the dwelling time for item i during an individual trial. Let ri denote the value for item i reported during the initial stage of the experiment. We can then define the average absolute evidence for each item (Ai) during a trial:

$$A_i = g_i\, r_i + (1 - g_i)\, \gamma\, r_i \qquad (7)$$

This formulation considers a multiplicative effect of the attentional component over the item value, capturing different rates of integration when the participant is observing item i or not (unbiased and biased states, respectively). The parameter γ is the gaze bias parameter: it controls the weight that the attentional component has in determining absolute evidence. Thomas et al., 2019 interpret γ as follows: when γ = 1, biased and unbiased states do not differ (i.e. the same r is added to the average absolute evidence regardless of whether the item is attended or not); when γ < 1, the absolute evidence is discounted for the biased condition; when γ < 0, there is a leak of evidence when the item is not fixated. Following Thomas et al., 2019, in our analysis we allowed γ to take negative values, but our results do not change if γ is restricted to [0, 1] (Appendix 6—figure 2). Finally, the relative evidence of item i, Ri*, is given by:

$$R_i^* = A_i - \max_{j \neq i} A_j = A_i - A_j \qquad (8)$$

Since our experiment considers a binary choice, the original formulation of the model (Thomas et al., 2019), proposed for more than two alternatives, reduces to subtracting the average absolute evidence of the other item. Therefore, for the binary case, the Ri* for one item will be the additive inverse of the other; for example, if the left item has the lower value, we would have Rleft* < 0 and Rright* > 0. Additionally, in their proposal for GLAM, Thomas et al., 2019 noted that the range of Ri* will depend on the values that the participant reported; for example, evidence accumulation may appear smaller if the participant valued all the items similarly, since Ri* may be lower in magnitude. This may not represent the actual evidence accumulation process, since participants may be sensitive to marginal differences in relative evidence. To account for both of these issues, a logistic transformation is applied over Ri* using a scaling parameter τ:

$$R_i = \frac{1}{1 + \exp(-\tau R_i^*)} \qquad (9)$$

In this case, Ri will always be positive and the magnitude of the difference between Rleft and Rright will be controlled by τ; for example, a higher τ will imply a bigger difference in relative evidence (and hence accumulation rate) between the left and right items. In the case that τ = 0, the participant will not present any sensitivity to differences in relative evidence.
Given that Ri represents an average of the relative evidence across the entire trial, the drift rate in Ei can be assumed to be constant, which enables the use of an analytical solution for the first-passage time density. Unlike aDDM (Krajbich et al., 2010), GLAM does not deal with the dynamics of the attentional allocation process in choice. Details of these expressions are available in Thomas et al., 2019. In summary, we have four free parameters in the GLAM: ν (drift term), γ (gaze bias), τ (evidence scaling), and σ (normally distributed noise standard deviation).
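To make the chain from gaze and value to drift concrete, the sketch below computes the relative evidence of both items in a binary trial as described above: gaze-weighted absolute evidence, subtraction of the other item's evidence, and the logistic transformation scaled by τ. It is an illustrative re-implementation for the two-item case, not the code of the GLAM toolbox; the function name and the example values are hypothetical.

```python
import math


def relative_evidence(r_left, r_right, g_left, gamma, tau):
    """Return (R_left, R_right) for a binary trial.

    r_left, r_right: item values (bids or dot counts).
    g_left: proportion of dwelling time on the left item (0 to 1).
    gamma: gaze bias; gamma = 1 means gaze has no effect.
    tau: scaling of the logistic transformation.
    """
    g_right = 1.0 - g_left
    # Average absolute evidence: full value while attended,
    # value discounted by gamma while unattended.
    a_left = g_left * r_left + (1.0 - g_left) * gamma * r_left
    a_right = g_right * r_right + (1.0 - g_right) * gamma * r_right

    def logistic(x):
        return 1.0 / (1.0 + math.exp(-tau * x))

    # Binary case: each item's relative evidence is its absolute
    # evidence minus that of the other item.
    return logistic(a_left - a_right), logistic(a_right - a_left)


# Hypothetical trial: the left item is more valuable and looked at longer.
example_R = relative_evidence(r_left=2.0, r_right=1.0, g_left=0.7, gamma=0.3, tau=1.0)
```

In this sketch, setting `gamma=1.0` makes the output independent of `g_left`, and setting `tau=0.0` returns 0.5 for both items, which corresponds to the limiting cases described in the text.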
The model fit with GLAM was implemented at a participant level in a Bayesian framework using PyMC3 (Salvatier et al., 2016). Uniform priors were used for all the parameters:
Value experiment
We fitted the model for each individual participant and for like and dislike frames, separately. To model participants’ behaviour in the like frame, we used as input for GLAM the RTs and choices, plus BDM bid values and relative gaze for left and right alternatives for each trial. The original GLAM formulation (as presented above) assumes that evidence is accumulated in line with the preference value of a particular item (i.e. ‘how much I like this item’). When information about visual attention is included in the model, the multiplicative model in GLAM assumes that attention will boost the evidence accumulation already defined by value. Our proposal is that evidence accumulation is a flexible process in which attention is attracted to items based on the match between their value and task-goal (accept or reject) and not based on value alone, as most of the previous studies have assumed. Since in the dislike frame the item with the lower value becomes relevant to fulfil the task, we considered the opposite value of the items (ri,dislike = 3 − ri,like; e.g. an item with value 3, the maximum value, becomes value 0) as an input for the GLAM fit. For both conditions, model fit was performed only on even-numbered trials using Markov chain Monte Carlo sampling with the No-U-Turn Sampler (NUTS) implementation: four chains were sampled, with 1000 tuning samples and 2000 posterior samples used to estimate the model parameters. Convergence was diagnosed using the Gelman-Rubin statistic (|R̂ − 1| < 0.05) and by corroborating that the effective sample size (ESS) was high (ESS > 100) for the four parameters (ν, γ, σ, and τ). Considering all the individual models, we found divergences in less than 3% of the estimated parameters. Model comparison was performed using Watanabe-Akaike Information Criterion (WAIC) scores available in PyMC3, calculated for each individual participant fit.
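The frame-dependent input and the split between fitting and simulation trials can be sketched as follows. The helper names are hypothetical, and the sketch assumes that trials are numbered from 1 within each participant and frame, which is not stated explicitly in the text.

```python
MAX_BID = 3.0  # upper end of the BDM scale, in pounds


def goal_relevant_value(bid, frame):
    """Value entered into the model for a given frame."""
    if frame == "like":
        return bid
    if frame == "dislike":
        # The least-liked item becomes the most goal-relevant one.
        return MAX_BID - bid
    raise ValueError("frame must be 'like' or 'dislike'")


def split_even_odd(trials):
    """Split an ordered list of trials into (even-numbered, odd-numbered).

    Assumption: the first element of the list is trial number 1.
    """
    even_numbered = trials[1::2]  # trials 2, 4, 6, ... used for fitting
    odd_numbered = trials[0::2]   # trials 1, 3, 5, ... held out for simulation
    return even_numbered, odd_numbered


fit_trials, held_out_trials = split_even_odd(list(range(1, 7)))
```

Holding out the odd-numbered trials means that the simulations used to check the model are generated for trials that did not contribute to parameter estimation.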
To check whether the model replicates the behavioural effects observed in the data (Palminteri et al., 2017), simulations of choice and response time (RT) were performed using each participant’s odd-numbered trials, each one repeated 50 times. For each trial, the value and relative gaze for the left and right items were used together with the individually estimated parameters. Random choice and RT (within the range between the minimum and maximum RT observed for each particular participant) were set for 5% of the simulations, replicating the contaminating process included in the model as described by Thomas et al., 2019.
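The contamination step described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `contaminate`, the coding of choices as 0 (left) and 1 (right), and the uniform sampling of RTs between the participant's minimum and maximum are choices made for the example, not details taken from the GLAM toolbox.

```python
import numpy as np


def contaminate(choices, rts, rt_min, rt_max, rate=0.05, n_items=2, rng=None):
    """Replace a random subset of simulated trials with random responses.

    choices: simulated choices coded 0..n_items-1 (assumed coding).
    rts: simulated response times, same length as choices.
    rt_min, rt_max: RT range observed for the participant.
    rate: proportion of trials to contaminate (5% in the text).
    """
    rng = np.random.default_rng() if rng is None else rng
    choices = np.array(choices, copy=True)
    rts = np.array(rts, dtype=float, copy=True)
    # Each trial is independently flagged as contaminated with probability `rate`
    mask = rng.random(choices.shape[0]) < rate
    n_flagged = int(mask.sum())
    choices[mask] = rng.integers(0, n_items, size=n_flagged)
    rts[mask] = rng.uniform(rt_min, rt_max, size=n_flagged)
    return choices, rts, mask


# Hypothetical usage: 10,000 simulated trials that all chose the right item.
rng = np.random.default_rng(42)
sim_choices = np.ones(10_000, dtype=int)
sim_rts = np.full(10_000, 2500.0)
new_choices, new_rts, flagged = contaminate(sim_choices, sim_rts, 800.0, 9000.0, rng=rng)
```

Because trials are flagged independently, the realised contamination rate fluctuates around 5% rather than matching it exactly; drawing a fixed number of trials would be an equally reasonable design.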
Additionally, we simulated the accumulation process in each trial to obtain a measure of balance of evidence (Vickers, 1970; Vickers, 1979). The purpose of this analysis was to replicate the effect of ΣValue on confidence (see Results for details) and to check whether it arises from the accumulation process and its interaction with attention. Balance of evidence in accumulator models has been used previously as an approximation to the generation of confidence in perceptual and value-based decision experiments (Vickers, 1979; Smith and Vickers, 1988; De Martino et al., 2013). Consequently, using the value of the items and the gaze ratio from odd-numbered trials, we simulated two accumulators (Equation 5), one for each alternative. Our simulations used the GLAM parameters obtained from each participant’s fit. Once the boundary was reached by one of the stochastic accumulators (fixed boundary = 1), we extracted the simulated RT and choice. The absolute difference between the accumulators when the boundary was reached (Δe = |Eright(tfinal) − Eleft(tfinal)|) gave the balance of evidence for that trial. In total, 37,200 trials were simulated (10 repetitions of each trial completed by the participants). A linear regression model predicting simulated Δe from |ΔValue|, simulated RT and ΣValue was fitted to the pooled participants’ data. This model was chosen since it was the most parsimonious model obtained to predict participants’ confidence in the Value Experiment (Appendix 4—figure 1). The best model includes GSF as a predictor, but since GLAM does not consider gaze dynamics we removed it from the model. Δe simulations using a GLAM without gaze influence (i.e. equal gaze time for each alternative) were also generated, to check whether a gaze difference was required to reproduce the ΣValue effect on confidence. The parameters fitted for individual participants were also used in the no-gaze-difference simulation.
The same linear regression model (Δe ∼ |ΔValue| + simulated RT + ΣValue) was used with the data simulated with no-gaze difference.
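A balance-of-evidence simulation of this kind can be sketched as below. The drift construction (gaze-weighted absolute evidence, relative evidence against the competitor, logistic scaling by τ) follows our reading of the GLAM description in Thomas et al., 2019 and should be treated as an assumption, since Equation 5 is not reproduced in this section. The function names, the 1 ms time step implied by the parameter scale, and the example parameter values are also illustrative.

```python
import numpy as np


def glam_scaled_evidence(values, gaze, gamma, tau):
    """Scaled relative evidence for each item (assumed GLAM form)."""
    values = np.asarray(values, dtype=float)
    gaze = np.asarray(gaze, dtype=float)
    # Full weight while an item is fixated, gamma-discounted otherwise
    absolute = gaze * values + (1.0 - gaze) * gamma * values
    # Each item's evidence relative to its best competitor
    relative = np.array(
        [absolute[i] - np.max(np.delete(absolute, i)) for i in range(absolute.size)]
    )
    # Logistic scaling controlled by tau
    return 1.0 / (1.0 + np.exp(-tau * relative))


def simulate_balance_of_evidence(values, gaze, v, gamma, sigma, tau,
                                 boundary=1.0, max_steps=200_000, rng=None):
    """Race two noisy accumulators; return (choice, rt_in_steps, delta_e)."""
    rng = np.random.default_rng() if rng is None else rng
    drift = v * glam_scaled_evidence(values, gaze, gamma, tau)
    # Independent Gaussian noise is added to each accumulator at every step
    increments = drift + rng.normal(0.0, sigma, size=(max_steps, 2))
    paths = np.cumsum(increments, axis=0)
    crossed = np.nonzero(paths.max(axis=1) >= boundary)[0]
    if crossed.size == 0:
        return None  # no accumulator reached the boundary within max_steps
    t = int(crossed[0])
    choice = int(np.argmax(paths[t]))          # 0 = left, 1 = right (assumed coding)
    delta_e = float(abs(paths[t, 1] - paths[t, 0]))
    return choice, t + 1, delta_e


# Hypothetical trial: left item valued 3, right item valued 1, 60% gaze on left.
result = simulate_balance_of_evidence(
    values=[3, 1], gaze=[0.6, 0.4],
    v=5e-5, gamma=0.3, sigma=0.0075, tau=2.8,
    rng=np.random.default_rng(1),
)
```

Setting `gaze=[0.5, 0.5]` for every trial gives the no-gaze-difference variant described above, with all other parameters left unchanged.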
Perceptual experiment
In the Perceptual Experiment, we repeated the same GLAM analysis done in the Value Experiment. Due to instabilities in the parameter fits for some participants, we excluded four extra participants. Twenty-eight participants were included in this analysis. Additionally, the GLAM fit in this case was done after removing outlier trials, that is, trials with RT higher than 3 standard deviations (within participant) or higher than 20 s. Overall, less than 2% of the trials were removed. For the most frame, relative gaze and perceptual evidence (number of dots) for each alternative were used to fit choice and RT. As in the dislike case, we reassigned the perceptual evidence in the fewest frame (ri,fewest = 133 − ri,most + 40, considering that 133 is the highest number of dots presented and 40 the lowest), so that the options with higher perceptual evidence in the most frame have the lower evidence in the fewest frame. The same MCMC parameters used to fit the model for each participant in the Value Experiment were used in this case (again, only even-numbered trials were used to fit the model). As in the Value Experiment, model convergence was assessed using the Gelman-Rubin statistic (R̂) and ESS. Overall, we observed divergences in less than 2% of parameter estimations across participants. Behavioural out-of-sample simulations (using the odd-numbered trials) and balance of evidence simulations (33,600 trials simulated in the Perceptual Experiment) were considered in this analysis. We tested the effect of ΣDots on confidence with a linear regression model similar to the one used in the Value Experiment. Pooled participants’ data for |ΔDots|, simulated RT and ΣDots were used to predict Δe. Δe simulations using a GLAM without gaze asymmetry were also calculated in this case. All the figures and analyses were done in Python using the GLAM toolbox and custom scripts.
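The re-mapping of evidence in the dislike and fewest frames amounts to reflecting each value within its range. A minimal sketch, assuming a hypothetical helper `goal_relevant_evidence` and the ranges stated in the text (0 to 3 for binned values, 40 to 133 for dots):

```python
import numpy as np


def goal_relevant_evidence(evidence, lowest, highest, frame):
    """Return the evidence that is relevant for the goal of the given frame.

    In 'like' and 'most' frames the evidence is used as is; in 'dislike'
    and 'fewest' frames it is reflected within [lowest, highest].
    """
    evidence = np.asarray(evidence, dtype=float)
    if frame in ("like", "most"):
        return evidence
    if frame in ("dislike", "fewest"):
        return highest - evidence + lowest
    raise ValueError(f"unknown frame: {frame!r}")


# Value Experiment: r_dislike = 3 - r_like
print(goal_relevant_evidence([3, 0, 2], lowest=0, highest=3, frame="dislike"))
# Perceptual Experiment: r_fewest = 133 - r_most + 40
print(goal_relevant_evidence([133, 40, 100], lowest=40, highest=133, frame="fewest"))
```

The reflection preserves the spacing between alternatives, so |ΔValue| and |ΔDots| are unchanged while the sign of the difference flips.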
Appendix 1
Task framing differences
Value experiment
We examined how the frame manipulation impacted overall performance (Appendix 1—figure 1A). We defined ‘accuracy’ as the proportion of trials in which participants’ reported values (BDM bids) correctly predicted their binary decision, that is, trials in which they selected the item with the highest value in the like frame and the one with the lowest value in the dislike frame. Overall accuracy did not differ significantly between frames (MeanLike = 0.77; MeanDislike = 0.75, t(30) = 1.71; p=0.1). We also found that participants had slightly slower reaction times (RTs) in the dislike frame (MeanLike = 2858.2 ms, MeanDislike = 3152.7 ms; t(30) = −2.52; p<0.05). Participants reported lower confidence in the dislike frame (Mean∆Confidence = 0.19; t(30) = 4.49; p<0.001) and shifted their gaze (gaze shift frequency, GSF) between items more during dislike trials (Mean∆|GSF| = −0.110; t(30) = −2.99; p<0.01). Overall, these results suggest that the subjects may have found the dislike condition slightly less intuitive. Although this did not affect their performance, it slightly reduced their confidence and increased RT and GSF.
As observed in previous studies (Folke et al., 2017; De Martino et al., 2013), we found that choice accuracy was modulated by confidence: decisions in which participants reported high confidence were more accurately predicted by the value estimate collected before the experiment, that is, the slope of the logistic curve is steeper in the case of high confidence (Figure 1B, Results section). In this study, this effect is replicated in both like (low confidence: β = 0.769; high confidence: β = 1.633) and dislike (low confidence: β = −0.642; high confidence: β = −1.363) frames. Note that the inversion of the sign of the slopes in like vs dislike frames indicates that participants were performing the task correctly (∆βLike-Dislike: t(30) = 8.14, p<0.001), selecting the item with lower value during the dislike frame (Figure 1C, Results section). Choice accuracy (steepness of the slopes) was not significantly different between like and dislike frames (∆|βLike-Dislike|: t(30) = 1.58, p=0.124).
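The accuracy measure defined above can be computed as in the following sketch. The function name `framed_accuracy` is hypothetical, and excluding trials in which both items have the same reported value is an assumption made for the example; the text does not say how ties were handled.

```python
import numpy as np


def framed_accuracy(value_left, value_right, chose_right, frame):
    """Proportion of trials in which reported values predict the choice."""
    value_left = np.asarray(value_left, dtype=float)
    value_right = np.asarray(value_right, dtype=float)
    chose_right = np.asarray(chose_right, dtype=bool)
    if frame == "like":
        right_is_target = value_right > value_left   # pick the higher value
    elif frame == "dislike":
        right_is_target = value_right < value_left   # pick the lower value
    else:
        raise ValueError(f"unknown frame: {frame!r}")
    # Assumption: trials with equal values carry no prediction and are dropped
    informative = value_left != value_right
    return float(np.mean(chose_right[informative] == right_is_target[informative]))


# Hypothetical data: four trials, the last one with tied values.
left = [1, 2, 3, 2]
right = [2, 1, 1, 2]
picked_right = [True, False, True, True]
acc_like = framed_accuracy(left, right, picked_right, "like")
acc_dislike = framed_accuracy(left, right, picked_right, "dislike")
```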
Perceptual experiment
We repeated the same analysis for the behavioural performance in most and fewest frames (Appendix 1—figure 1B). In contrast to the Value Experiment, we observed a slight reduction in accuracy in participant responses for the fewest frame (MeanMost = 0.77, MeanFew = 0.74, t(31) = 2.46; p<0.05); unlike the Value Experiment, however, we did not find differences in RTs (MeanMost = 4029.57 ms, MeanFew = 3975.59 ms; t(31) = 0.32; p=0.75). During the fewest frame participants reported lower confidence (Mean∆Confidence=0.24; t(31) = 5.62; p<0.001) and shifted their gaze more between alternatives (Mean∆|GSF| = -0.17; t(31) = -4.15; p<0.001), as observed in the Value Experiment.
Participants also reported higher confidence in trials that better discriminated the number of dots (Figure 1E, Results section). This effect was replicated in both most (low confidence: β = 1.142; high confidence: β = 2.164) and fewest (low confidence: β = −1.118; high confidence: β = −2.010) frames. The inversion of the sign of the slopes in most vs fewest frames also shows that participants were performing correctly (∆βMost-Few: t(31) = 22.22, p<0.001); the magnitude of the slopes was not significantly different between the two frames (∆|βMost-Few|: t(31) = 0.79, p=0.434; Figure 1F, Results section). This pattern of results mirrors the pattern seen in the Value Experiment.
Appendix 2
Choice regression models
In the Value Experiment: ∆Value: difference in value; ΣValue: summed value; ∆DT: difference in dwell time; GSF: gaze shift frequency. In the Perceptual Experiment, similar models were compared, with ∆Value replaced by ∆Dots and ΣValue by ΣDots.
Value experiment
Using a logistic hierarchical regression model, we investigated which factors modulated choice-proportion, defined here as the probability of choosing the item on the right side of the screen. We report here the results of the most parsimonious model (i.e. the model with the lowest BIC; Appendix 2—figure 1) fitted to the like and dislike frames independently (Figure 2B, Results section). In Appendix 2—table 1, we present the parameters for each factor included in the model. In the like frame, the difference in value between the right and the left item (∆Value) had a positive influence on choice-proportion, that is, participants selected the items that had higher value. This is reversed in the dislike frame: ∆Value is now a negative predictor of choice, that is, participants selected the items that had lower value. In both conditions, confidence enhanced the effect of ∆Value, as shown by the interaction between ∆Value and confidence in the like and dislike frames. These results confirm the findings presented in Figure 1B (Results section) while controlling for other relevant variables. Unsurprisingly, confidence and summed value (ΣValue, the added value of both alternatives) showed no main effect on choice-proportion. As discussed in the Results section, gaze allocation (difference in dwell time, ∆DT) is directed to the chosen item in both frames, that is, the parameters for ∆DT are positive in the like and dislike frames (Appendix 2—table 2).
Perceptual experiment
As in the Value Experiment, we used a logistic hierarchical regression to determine the relevant factors modulating perceptual choice (choosing the circle with dots on the right side of the screen) (Figure 2D, Results section). We found that the most parsimonious model for choice was the same one used in the Value Experiment, with like and dislike replaced by most and fewest frames (Appendix 2—figure 1B). In the most frame, the difference in the number of dots between the right alternative and the left one (∆Dots) had a positive influence on choice; that is, participants tended to select the circle with more dots. As expected, this pattern was reversed in the fewest frame: ∆Dots was a negative predictor of choice. As in the Value Experiment, confidence modulated the effect of ∆Dots in most and fewest frames. The sum of dots presented in both circles during a trial (ΣDots) did not have a significant effect in either frame, as expected. However, as discussed in the Results section, confidence was found to be a negative predictor of choice in most and fewest frames. This means participants had a bias to report higher confidence when they chose the left circle. As in the Value Experiment, participants spent more time fixating the chosen alternative in both frames, with the ∆DT effect being positive in most and fewest frames (Appendix 2—table 3).
Kovach et al., 2014 implemented a design similar to our value-based experiment, in which participants were required to indicate the item to keep and the one to discard. They found, similarly to our findings in the Value Experiment, that attention was mostly allocated according to the task goal. However, in the first few hundred milliseconds, these authors found that attention was directed more prominently to the most valuable item in both conditions. We did not replicate this last finding in our experiment. One possible reason for this discrepancy is that the experiment by Kovach and colleagues presented both items on the screen at the beginning of the task, unlike our task, in which the items were presented in a gaze-contingent way (to avoid processing in the visual periphery). This setting might have triggered an initial and transitory bottom-up attention grab by the most valuable (and often most salient) item before the accumulation process started.
Appendix 3
Fixation analysis
In the main text, we reported the analysis of the last fixation and how its allocation to the (chosen) goal-relevant alternative is modulated by value/number of dots. This result confirmed the findings in Krajbich et al., 2010 and extended them to the dislike frame and the perceptual realm. To give a more complete view of the fixation properties, we additionally performed an analysis similar to that of Krajbich et al., 2010 for first and middle fixations.
It is important to note that in our Value and Perceptual Experiments, participants cannot see the options at the beginning of each trial, since the presentation is gaze contingent. Therefore, an initial exploration is required to identify the alternatives involved in the decision. In Krajbich et al., 2010 both options are visible from the beginning of the trial; however, participants’ initial fixation is still randomly allocated.
For the analysis of middle fixations, if blank fixations were recorded between fixations to the same item, then those fixations were assigned to that item (e.g. ‘Right’, ‘Blank’, ‘Right’ was considered as ‘Right’, ‘Right’, ‘Right’). Trials without middle fixations (i.e. only a first and a last fixation) were removed from the analysis. Trials with no item fixations for more than 40 ms at the beginning of the trial were also removed. In the following figures, results from Krajbich et al., 2010 are presented together with our findings, as a reference.
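The reassignment of blank fixations can be sketched as follows. The function names and the labels ‘Left’, ‘Right’ and ‘Blank’ are illustrative, and treating a run of several consecutive blanks in the same way as a single blank is an assumption; the text only describes the single-blank case.

```python
def fill_blank_fixations(sequence, blank="Blank"):
    """Assign blank fixations to an item when they are flanked by that item."""
    cleaned = list(sequence)
    i = 0
    while i < len(cleaned):
        if cleaned[i] != blank:
            i += 1
            continue
        start = i
        # Advance to the end of the current run of blanks
        while i < len(cleaned) and cleaned[i] == blank:
            i += 1
        before = cleaned[start - 1] if start > 0 else None
        after = cleaned[i] if i < len(cleaned) else None
        # Only fill when the same item is fixated before and after the blanks
        if before is not None and before == after:
            cleaned[start:i] = [before] * (i - start)
    return cleaned


def middle_fixations(sequence):
    """Drop the first and last fixation; an empty result means no middle fixations."""
    return list(sequence)[1:-1]


print(fill_blank_fixations(["Right", "Blank", "Right"]))
print(fill_blank_fixations(["Left", "Blank", "Right"]))
```

Trials for which `middle_fixations` returns an empty list correspond to the trials removed from the middle-fixation analysis.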
Appendix 4
Confidence regression models
In the Value Experiment: |∆Value|: absolute difference in value; RT: reaction time; ΣValue: summed value; ∆DT: difference in dwell time; GSF: gaze shift frequency. In the Perceptual Experiment, similar models were compared, with ∆Value replaced by ∆Dots and ΣValue by ΣDots.
Appendix 5
GLAM – model comparison and out-of-sample simulations
Appendix 6
GLAM – parameter comparison
The results from the regression models presented in the Results section show that the nature of evidence integrated during the accumulation process depends on the frame in which participants make their choices. The Gaze-weighted Linear Accumulator Model (GLAM) predicts well participants’ behaviour once frame-relevant evidence is employed to fit the model. Here we show the parameters obtained from this process. Four free parameters are fitted in GLAM: ν (drift term), γ (gaze bias), τ (evidence scaling), and σ (normally distributed noise standard deviation) (Thomas et al., 2019). For Value and Perceptual Experiments, we fitted the model in both frames and in each participant separately. The parameters were fitted using the even-numbered trials and in both studies the model fit was estimated using the WAIC score (used with Bayesian Models) (Appendix 5—figure 1).
Value experiment
To explore variations in the process of evidence accumulation characterised by GLAM, we compared the parameters obtained from the individual fits in like and dislike frames (Appendix 6—figure 1A). No significant variation between frames was found for the gaze bias (Mean γLike = −0.14, Mean γDislike = 0.03, ∆γLike-Dislike = −0.17, t(30) = −1.66; p=0.11, ns), the scaling parameter (τLike = 2.81, τDislike = 2.69, ∆τLike-Dislike = 0.115, t(30) = 0.313; p=0.75, ns), or the noise term (Mean σLike = 0.0075, Mean σDislike = 0.0074, ∆σLike-Dislike = 0.00012, t(30) = 0.342; p=0.734, ns). We observed a significantly higher value of the drift term, ν, during the like frame (νLike = 5.60 × 10⁻⁵, νDislike = 4.53 × 10⁻⁵, ∆νLike-Dislike = 1.06 × 10⁻⁵, t(30) = 3.44; p<0.01). This means that evidence is accumulated faster during the like frame, which gives some insight into how the frame manipulation changes the evidence accumulation process.
Perceptual experiment
We also compared the parameters obtained from the GLAM individual fits in the Perceptual Experiment (Appendix 6—figure 1B). No significant variation between frames was found for the scaling parameter (τMost = 0.34, τFew = 0.13, ∆τMost-Few = 0.212, t(27) = 1.43; p=0.16, ns) or the drift term (Mean νMost = 3.8 × 10⁻⁵, Mean νFew = 3.99 × 10⁻⁵, ∆νMost-Few = −1.92 × 10⁻⁶, t(27) = −0.465; p=0.645, ns). The gaze bias is stronger (i.e. γ is lower) during the fewest frame (γMost = 0.48, γFew = 0.26, ∆γMost-Few = 0.22, t(27) = 2.61; p<0.05). The σ parameter also differs significantly between frames, with higher noise in the most frame (σMost = 0.0073, σFew = 0.0066, ∆σMost-Few = 0.0007, t(27) = 2.26; p<0.05). In summary, the accumulation process seems to be noisier and less affected by visual attention in the most frame. In both frames, the finding that γ < 1 indicates that gaze modulates the accumulation of evidence.
Appendix 7
Attentional drift diffusion model
The attentional Drift Diffusion Model (aDDM) has been extensively used in the literature to characterise the effect of attention on choice (Krajbich et al., 2010). Unlike GLAM, the aDDM considers the dynamics of fixations during trials to fit the model. To further support our idea that goal-relevant evidence is accumulated, we fitted both Value and Perceptual datasets with the aDDM, as implemented by Tavares et al., 2017 (aDDM toolbox, https://github.com/goptavares/aDDM-Toolbox).
The aDDM model assumes that evidence is accumulated dynamically in a variable called the relative decision value (RDV) signal. RDV starts at 0 and it evolves over time, accumulating evidence until a barrier is reached (+1 or −1) which will define the alternative to be selected (right or left). Every time step, RDV changes according to μΔt + εt, with μ the deterministic change (slope term) and ε the Gaussian noise term. The fixation to the two alternatives will define the value of μ: when the left option is fixated μ = d(rleft − θrright) and μ = d(rright − θrleft) for the right option. Therefore, the aDDM model considers three free parameters: d, σ, and θ. The parameter d is a positive constant characterising the speed of integration; σ is the standard deviation for a zero-mean Gaussian distribution for noise, and θ is the attentional parameter that controls the size of the attentional bias (range between 0 and 1). If θ = 1, the model is reduced to a standard drift-diffusion model (DDM) without attentional bias.
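A single aDDM trial can be sketched as follows. This is a minimal illustration, not the aDDM toolbox implementation. To track one RDV signal, the sketch assumes that positive RDV favours the left item, so the drift magnitude given in the text for right fixations enters with a negative sign; this sign convention, the function names, the representation of fixations as (item, number of time steps) pairs, and the parameter values are all assumptions made for the example.

```python
import math

import numpy as np


def addm_drift(r_left, r_right, fixating_left, d, theta):
    """Per-step drift of the RDV (assumed convention: positive favours left)."""
    if fixating_left:
        return d * (r_left - theta * r_right)
    # While fixating right, the RDV drifts towards the right (negative) barrier
    return -d * (r_right - theta * r_left)


def simulate_addm_trial(r_left, r_right, fixations, d, sigma, theta,
                        barrier=1.0, rng=None):
    """Simulate one trial given a fixation sequence.

    fixations: list of (item, n_steps) pairs, with item 'left' or 'right'.
    Returns (choice, n_steps_elapsed); choice is None if no barrier is hit.
    """
    rng = np.random.default_rng() if rng is None else rng
    rdv, elapsed = 0.0, 0
    for item, n_steps in fixations:
        mu = addm_drift(r_left, r_right, item == "left", d, theta)
        for _ in range(n_steps):
            rdv += mu + rng.normal(0.0, sigma)  # deterministic change plus noise
            elapsed += 1
            if abs(rdv) >= barrier:
                return ("left" if rdv > 0 else "right"), elapsed
    return None, elapsed


# With theta = 1 the drift no longer depends on which item is fixated (standard DDM).
drift_when_left = addm_drift(3, 1, True, d=0.005, theta=1.0)
drift_when_right = addm_drift(3, 1, False, d=0.005, theta=1.0)

# Noise-free hypothetical trial: constant drift of 0.025 per step towards 'left'.
choice, steps = simulate_addm_trial(3, 1, [("left", 1000)], d=0.01, sigma=0.0, theta=0.5)
```

With a 10 ms time step, as used for the estimation, the number of elapsed steps multiplied by 10 gives the simulated decision time in milliseconds.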
Group model fitting
The models were fitted to choice and RT data independently for like and dislike frames in our Value Experiment and for most and fewest frames in the Perceptual Experiment. The odd-numbered trials of the pooled data from 31 participants in the value-based case and 32 participants in the perceptual case were used to fit the models. The model considers the available evidence (item value and number of dots) and the sequence of fixations for each trial. As in GLAM, we fitted the parameters in dislike and fewest frames considering a version of the input values/evidence that accounted for the change in the objective of the task (i.e. reporting the item not preferred or the alternative with fewer dots, respectively). For comparison, we also fitted another model using the evidence ‘by default’ (i.e. BDM bid values or number of dots in the circles). To account for the different ranges of item valuation used by the participants, we normalised the value reports by binning at a participant level. In the Value Experiment, the data were separated into six bins using quantile-based discretisation. In the Perceptual Experiment, given the distribution of the evidence (i.e. three numerosity levels and smaller dot differences between the two alternatives), we separated the dots data into eight bins. The maximum likelihood estimation (MLE) procedure was carried out in iterative steps, searching over a grid of the three model parameters. The initial grid was set to [0.001, 0.005, 0.01] for d, [0.01, 0.05, 0.1] for σ and [0.01, 0.5, 1] for θ. The likelihood for choice and RT in odd-numbered trials, conditional on the pattern of fixations, was calculated for each combination of parameters in the grid (see Tavares et al., 2017 for the details of the algorithm to simulate aDDM trials). The time step used for the estimation of the aDDM was 10 ms. The set of parameters with the lowest negative log-likelihood (NLL) was used as the centre of the grid for the next iteration.
Therefore, the grid to search in the next iteration (t+1) was defined as [dt − Δdt/2, dt, dt + Δdt/2], [θt − Δθt/2, θt, θt + Δθt/2], and [σt − Δσt/2, σt, σt + Δσt/2], considering the respective constraints on each parameter value. The iterative process finished once the improvement in the NLL of the proposed parameter solution was smaller than 0.05% (|minNLLt+1 − minNLLt| < 0.0005*minNLLt). Convergence was reached after two iterations in our models. In our results (Appendix 7—table 1), we found that for both the dislike and fewest conditions, the model fitted using goal-relevant evidence performed better than the model using default value or number of dots, as indicated by a lower NLL value.
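The iterative grid refinement can be sketched as below. This is a minimal illustration with a toy objective standing in for the aDDM likelihood. The function `refine_grid_search`, the use of the mean spacing of the initial grid as the starting step Δ, the halving of the step at each iteration, and the parameter bounds are assumptions made for the example.

```python
import itertools

import numpy as np


def refine_grid_search(nll, grids, bounds, rel_tol=0.0005, max_iter=20):
    """Iteratively re-centre and shrink a 3-point grid around the best NLL.

    nll: callable taking the parameters as keyword arguments.
    grids: dict of parameter name -> initial grid values.
    bounds: dict of parameter name -> (lower, upper) constraint.
    """
    names = list(grids)
    grids = {k: np.asarray(grids[k], dtype=float) for k in names}
    # Assumption: the starting step is the mean spacing of the initial grid
    steps = {k: float(np.mean(np.diff(grids[k]))) for k in names}
    previous = None
    for _ in range(max_iter):
        combos = list(itertools.product(*(grids[k] for k in names)))
        scores = [nll(**dict(zip(names, combo))) for combo in combos]
        i = int(np.argmin(scores))
        best, best_nll = dict(zip(names, combos[i])), scores[i]
        # Stop when the relative improvement falls below 0.05%
        if previous is not None and abs(best_nll - previous) < rel_tol * abs(previous):
            break
        previous = best_nll
        for k in names:
            half = steps[k] / 2.0
            lower, upper = bounds[k]
            # New grid centred on the current best, clipped to the constraints
            grids[k] = np.clip([best[k] - half, best[k], best[k] + half], lower, upper)
            steps[k] = half
    return best, best_nll


def toy_nll(d, sigma, theta):
    """Hypothetical smooth objective; it is not an aDDM likelihood."""
    return 1000 + 1e7 * (d - 0.0035) ** 2 + 1e5 * (sigma - 0.06) ** 2 + 1e3 * (theta - 0.4) ** 2


initial = {"d": [0.001, 0.005, 0.01], "sigma": [0.01, 0.05, 0.1], "theta": [0.01, 0.5, 1.0]}
limits = {"d": (1e-6, 1.0), "sigma": (1e-6, 1.0), "theta": (0.0, 1.0)}
best_params, best_value = refine_grid_search(toy_nll, initial, limits)
```

Because the centre of each new grid is the previous best point, the minimum NLL cannot increase from one iteration to the next, which makes the relative-improvement stopping rule well defined.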
Out-of-sample group simulations
To test the capacity of the model to predict out-of-sample, the aDDM with the best-fitting parameters obtained from odd-numbered trials was used to predict the behaviour observed in the even-numbered trials. We generated 40,000 simulated trials for the Value Experiment and 48,000 for the Perceptual Experiment. Fixations, latencies and inter-fixation transitions were sampled from empirical distributions obtained from the pooled even-numbered trials across participants, following the procedure used by Tavares et al., 2017.
Appendix 8
GLAM – balance of evidence simulations
Appendix 9
Normative model – proof of propositions 1 and 2
All the uses of µ in this proof: is the mean of the belief on the value of item i after the agent has acquired one signal about item i; is the mean of the belief on the value of item i after the agent has acquired two signals about item i.
We begin by proving Proposition 1. Recall that qualities are distributed independently according to a Normal distribution and that the agent knows it, thus holds a correct prior belief. Recall also that the agent has taken a sample, , with independently and identically distributed with . Because the prior belief is Normal, and because also the signal is Normally distributed around the true value, standard arguments give us that the posterior belief about is also Normal. Denote by and the mean and the variance, respectively, of this posterior belief about , for each . Note that is the same for all (since, with Normal distributions, the variance of the posterior only depends on the variance of the prior and of the signal).
The agent can now acquire a second signal about only one of the items and needs to decide which item. Note that, after a second signal about item is acquired, this will further change the belief about . Denote by the mean of this belief: that is, is the mean of the belief about after the agent has acquired two signals about it.
Recall that indicates the utility that the agent expects to have after acquiring the second signal about item . Recall also that we denote by the item for which the agent has received the highest first signal, the second-highest, etc. Suppose first that the second signal acquired is not about the best item. Then, there are two possibilities. First, we may have that , that is, after the second signal, the posterior mean about the quality of , is below that of . In that case the agent will choose , and receive an expected quality of . If instead , then the agent chooses i and has an expected quality . It follows that, for , we have
For similar reasons,
When the agent needs to decide which item to acquire a second signal about, however, the second signal has not been observed yet: we thus need to compute the expectation of . In order to compute this, the agent needs to form a belief about what will be the value of before acquiring the second signal about (but after acquiring the first signal). Such belief must again be normally distributed, and have mean . (This is because, of course, the expectation that the agent holds about the posterior mean before receiving the signal must be centered at the prior mean, which in this case is .) Denote by the variance of this belief; again, this is the same for all is. Thus,
We are now ready to prove the following claims.
Claim 1.
Proof. Recall that, for we have and . This means that the belief about , for , coincides with for values above , but has a mass point at equal to the probability that is below . If we denote by the Probability Density Function of , it follows that we have
Recall also that . The belief about coincides with above , but has a mass point at equal to the probability that is below Then,
Note that, by construction, we have
It follows that
and
But we also know that
Together with Equation A1 and A2, this proves the claim.■
Claim 2. If N>2, for all .
Proof. Recall that, for , we have , where . It follows that the beliefs about both and (held before the second signal is acquired) has support . Denote by the Cumulative Density Function (CDF) of this belief. To prove the claim, we show that First Order Stochastically Dominates for all , while the converse is not true: that is, we aim to show that for all in the support, , strictly for some x. (Recall that a distribution F first order stochastically dominates another distribution G if for all x, the probability that F returns at least x is not below the probability that G returns x or more.) This implies
Let and note that we have and that for all . Since coincides with whenever that lies above and since , it follows that, for all , we have the probability that assigns to being or higher is the same that assigns to being or higher. Then, Because CDFs are increasing and , then , thus for all . Moreover, notice that we must have
That is, assigns to values below a lower probability than does. It follows that for all in the support ,we have , strictly for some. Thus, First Order Stochastically Dominates for all , while the converse is not true. The claim follows.
The two claims together prove Proposition 1.
Proposition 2
The proof of Proposition 2 is identical once we replace by for . Intuitively, the problem of maximising the expected utility of the remaining items is strategically equivalent to the problem of choosing the lowest item, which, in turn, is symmetric to the problem of choosing the best item.
Data availability
Data and code used in this study are available at the Brain Decision Modelling Lab GitHub https://github.com/BDMLab/Sepulveda_et_al_2020 (copy archived at https://archive.softwareheritage.org/swh:1:rev:a04585ff20b1389c713709b543a1af420bd300c1/).
References
- Fitting linear Mixed-Effects models using lme4. Journal of Statistical Software 67:i01. https://doi.org/10.18637/jss.v067.i01
- Measuring utility by a single-response sequential method. Behavioral Science 9:226–232. https://doi.org/10.1002/bs.3830090304
- Confidence modulates exploration and exploitation in value-based learning. Neuroscience of Consciousness 2019:niz004. https://doi.org/10.1093/nc/niz004
- [Book] Advances in behavioral economics. The roundtable series in behavioral economics. Russell Sage Foundation; Princeton University Press, New York; Princeton, N.J.
- Revealed preference, rational inattention, and costly information acquisition. American Economic Review 105:2183–2203. https://doi.org/10.1257/aer.20140117
- Eye tracking and pupillometry are indicators of dissociable latent decision processes. Journal of Experimental Psychology: General 143:1476–1488. https://doi.org/10.1037/a0035813
- Instrumental vigour in punishment and reward: vigour in punishment and reward. European Journal of Neuroscience 35:1152–1168. https://doi.org/10.1111/j.1460-9568.2012.08026.x
- Neural mediators of changes of mind about perceptual decisions. Nature Neuroscience 21:617–624. https://doi.org/10.1038/s41593-018-0104-6
- Explicit representation of confidence informs future value-based decisions. Nature Human Behaviour 1:0002. https://doi.org/10.1038/s41562-016-0002
- Attentional selection mediates framing and Risk-Bias effects. Psychological Science 29:2010–2019. https://doi.org/10.1177/0956797618803643
- [Book] Neuroeconomics: Decision Making and the Brain. Academic Press (an imprint of Elsevier).
- Action versus Valence in decision making. Trends in Cognitive Sciences 18:194–202. https://doi.org/10.1016/j.tics.2014.01.003
- [Report] Rational Inattention and Sequential Information Sampling, Technical Report W23787. National Bureau of Economic Research.
- Choices, values, and frames. American Psychologist 39:341–350. https://doi.org/10.1037/0003-066X.39.4.341
- [Book] Choices, Values, and Frames. Cambridge University Press, Russell Sage Foundation.
- Two systems drive attention to rewards. Frontiers in Psychology 5:46. https://doi.org/10.3389/fpsyg.2014.00046
- Visual fixations and the computation and comparison of value in simple choice. Nature Neuroscience 13:1292–1298. https://doi.org/10.1038/nn.2635
- The attentional drift-diffusion model extends to simple purchasing decisions. Frontiers in Psychology 3:193. https://doi.org/10.3389/fpsyg.2012.00193
- Contextual influence on confidence judgments in human reinforcement learning. PLOS Computational Biology 15:e1006973. https://doi.org/10.1371/journal.pcbi.1006973
- The idiosyncratic nature of confidence. Nature Human Behaviour 1:810–818. https://doi.org/10.1038/s41562-017-0215-1
- The importance of falsification in computational cognitive modeling. Trends in Cognitive Sciences 21:425–433. https://doi.org/10.1016/j.tics.2017.03.011
- Metacognitive failure as a feature of those holding radical beliefs. Current Biology 28:4014–4021. https://doi.org/10.1016/j.cub.2018.10.053
- Probabilistic programming in Python using PyMC3. PeerJ Computer Science 2:e55. https://doi.org/10.7717/peerj-cs.55
- Implications of rational inattention. Journal of Monetary Economics 50:665–690. https://doi.org/10.1016/S0304-3932(03)00029-1
- [Book] Rational Inattention and Monetary Economics. In: Friedman B, Woodford M, editors. Handbook of Monetary Economics. Elsevier. pp. 155–181. https://doi.org/10.1016/B978-0-444-53238-1.00004-1
- Gaze amplifies value in decision making. Psychological Science 30:116–128. https://doi.org/10.1177/0956797618810521
- The accumulator model of two-choice discrimination. Journal of Mathematical Psychology 32:135–168. https://doi.org/10.1016/0022-2496(88)90043-0
- Value-based decision making: an interactive activation perspective. Psychological Review 127:153–185. https://doi.org/10.1037/rev0000164
- The attentional drift diffusion model of simple perceptual Decision-Making. Frontiers in Neuroscience 11:468. https://doi.org/10.3389/fnins.2017.00468
- Gaze Bias differences capture individual choice behaviour. Nature Human Behaviour 3:625–635. https://doi.org/10.1038/s41562-019-0584-8
Article and author information
Author details
Funding
Chilean National Agency for Research and Development (Graduate student scholarship - DOCTORADO BECAS CHILE/2017 - 72180193)
- Pradyumna Sepulveda
Wellcome Trust (Sir Henry Dale Fellowship (102612 /A/13/Z))
- Benedetto De Martino
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
This study was funded by a Sir Henry Dale Fellowship (102612/A/13/Z) awarded to Benedetto De Martino by the Wellcome Trust. Pradyumna Sepulveda was funded by the Chilean National Agency for Research and Development (ANID)/Scholarship Program/DOCTORADO BECAS CHILE/2017–72180193. We thank Antonio Rangel for his valuable comments on an earlier version of the manuscript and Mariana Zurita for the help in the proofreading of the manuscript.
Ethics
Human subjects: All participants signed a consent form and both studies were done following the approval given by the University College London, Division of Psychology and Language Sciences ethics committee (project ID number 1825/003).
Copyright
© 2020, Sepulveda et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.