Transdiagnostic compulsivity is associated with reduced reminder setting, only partially attributable to overconfidence

eLife Assessment

This important work addresses the relationship between the transdiagnostic compulsivity dimension and confidence as well as confidence-related behaviours like reminder setting. The relationship between confidence and compulsive disorders has recently received a lot of attention and has been considered to be a key cognitive change. The authors paired an elegant experimental design and pre-registration to give convincing evidence of the relationship between compulsivity, reminder setting, and confidence. In the revised version they thoroughly addressed the reviewer's comments, in particular adding new analyses clarifying how their findings relate to prediction error based learning as well as presenting additional recovery analyses and psychometric curves further strengthening the manuscript.

https://doi.org/10.7554/eLife.98114.4.sa0

Significance of the findings:

Important: Findings that have theoretical or practical implications beyond a single subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
eLife digest
Introduction
Results
Discussion
Materials and methods
Appendix 1
Data availability
References
Article and author information
Metrics

Abstract

In the current study, we explored the behavioural and cognitive correlates of the transdiagnostic trait ‘compulsive behaviour and intrusive thought’ (CIT) in humans. CIT is associated with impaired metacognition, which in turn has been associated with cognitive offloading behaviours such as external reminder setting that play a key role in fulfilling cognitive goals. In an online study (N=600), we investigated individual differences in compulsivity, metacognition, and external reminder usage. Compulsive individuals had reduced preference for external reminders. This was partially, but not fully, attributable to their relative overconfidence. In contrast to previous studies, we found no evidence for an impaired confidence-action link: compulsive individuals used their metacognition to guide offloading just as much as their non-compulsive counterparts. Given the compensatory nature of cognitive offloading, our findings imply that compulsive individuals are at increased risk of inadequate external memory support. Along with transdiagnostic variation in the general population, this finding could also have implications for clinical conditions, such as obsessive-compulsive disorder (OCD).

eLife digest

You have just been prescribed a new course of antibiotics; will you schedule alarms to make sure you take your treatment as you should – three times a day, every day, for the next week? Or will you trust yourself to remember to do so unprompted?

You may find it easy to make this choice, yet it is in fact a rather complex task. Research has shown that the use of reminders (a process known as cognitive offloading) is guided, in part, by how confident we are about our ability to remember. Accurately assessing our own cognitive skills, however, can be shaped by a range of psychological factors. People with high levels of compulsivity, for example, tend to struggle with judging their own abilities. This trait, commonly present in a range of mental health conditions such as obsessive-compulsive disorder, is characterized by repetitive behaviors and intrusive thoughts. Here, Boldt et al. investigate whether differences in compulsivity can impact how and when people choose to set reminders.

To do so, an online study was conducted on 600 adults from the general population. Before completing a highly demanding memory task, participants first answered questionnaires assessing traits including compulsivity and anxiety. They were also asked to predict how well they would perform on the test.

When going through the memory task, participants could choose to use reminders to help themselves at the start of each trial. By doing so, however, they knew they would earn fewer points for each accurate answer given.

The results showed that individuals who scored higher on compulsivity tended to set fewer reminders. This was partly because they were more confident in their memory than other participants, but also because compulsivity itself seemed to directly reduce reminder use.

Taken together, these findings suggest that people who are highly compulsive may not adequately use memory aids even when they might need them. Although none of the participants had a clinical diagnosis, the results could inform future studies of conditions such as obsessive-compulsive disorder, as well as guide the design of interventions to support memory and daily functioning.

Introduction

In recent studies of clinically relevant individual differences, there has been a paradigm shift towards the study of transdiagnostic traits, challenging the traditional, diagnostic approach. Using factor analysis, temporally stable (see Fox et al., 2023; Sookud et al., 2024), transdiagnostic phenotypes can be extracted from extensive symptom datasets (Wise et al., 2023). These traits are not confined to a single clinical diagnosis but instead can span a range of conditions, at the same time addressing the diagnostic heterogeneity within conditions, such as obsessive-compulsive disorder (OCD; e.g. Gillan et al., 2016; Wise and Dolan, 2020). There are obvious practical benefits of these methodologies, such as their potential to reduce the clinical burden by making the treatment of comorbid conditions more efficient and effective (Harvey, 2025). At the same time, they contribute valuable insights into mental health conditions by increasing statistical power and opening new avenues of inquiry (Dalgleish et al., 2020).

In the present study, our focus lies on the latter with the goal to investigate the downstream cognitive and behavioural correlates associated with transdiagnostic compulsivity. This symptom dimension represents a clinical concept characterised by an inability to regulate repetitive behaviours that are harmful to oneself, commonly observed in a variety of conditions, particularly OCD, schizophrenia, addiction, and eating disorders. Previous research links transdiagnostic compulsivity to impairments in metacognition, defined as thinking about one’s own thoughts, encompassing a broad spectrum of self-reflective signals, such as feelings of confidence (e.g. Rouault et al., 2018; Seow and Gillan, 2020; Benwell et al., 2022; Fox et al., 2023; Fox et al., 2024; Hoven et al., 2023a). Other studies have shown that metacognitive signals such as feelings of confidence guide cognitive offloading strategies like setting external reminders as memory aids (e.g. Gilbert, 2015; Boldt and Gilbert, 2019). Here, we aim to bridge these two literatures by investigating compulsivity, metacognition, and cognitive offloading within a single experimental paradigm. While compulsivity and cognitive offloading have both separately been linked to metacognition, the relationship between the two – with metacognition as a potential mediating factor – has not previously been systematically examined. This matters because cognitive offloading plays an integral role in our daily lives and is a key contributor to our effectiveness as cognitive agents (Gilbert et al., 2023).

Metacognition guides reminder setting

Reminders constitute an example of cognitive offloading, defined as the use of physical action to reduce the cognitive demands of a task. By offloading memory demands this way, we not only increase the likelihood of successfully completing tasks (Boldt and Gilbert, 2019), but we may also free up cognitive resources for other activities (Dupont et al., 2023). Choosing between setting a reminder and relying on memory is not a trivial matter. Prior research has emphasised the role of metacognition in determining when individuals resort to cognitive offloading (Gilbert et al., 2023; Gilbert, 2015; Boldt and Gilbert, 2019; Sachdeva and Gilbert, 2020; Risko and Gilbert, 2016): People tend to set more reminders when they feel less confident. In other words, people tend to set reminders when they think that they will forget, and this effect holds even after taking into account actual memory ability (e.g. Boldt and Gilbert, 2019). The link between confidence and offloading is observed both for situational fluctuations in confidence due to varying task difficulties (state variable; Boldt and Gilbert, 2022) and for a general predisposition towards over- or underconfidence (trait variable; Boldt and Gilbert, 2019).

Metacognition, compulsivity, and checking behaviours

Given the known metacognitive impairments associated with compulsivity, changes in reminder-setting behaviour are plausible. More specifically, individuals characterised by transdiagnostic compulsivity have been consistently found to exhibit overconfidence (Rouault et al., 2018; Seow and Gillan, 2020; Benwell et al., 2022; Fox et al., 2023; Fox et al., 2024; Hoven et al., 2023a). If we consider the link between reminder setting and confidence, this implies a reduced likelihood of utilising external aids, such as reminders. However, while transdiagnostic compulsivity is liked to overconfidence, the opposite pattern of underconfidence is more common in patients with OCD, a compulsive disorder (as reviewed in Hoven et al., 2019). Recent research suggests that metacognitive impairments in transdiagnostic compulsivity and OCD may originate from different mechanisms (Hoven et al., 2023b; Hoven et al., 2023c), advising caution against broad generalisations between these groups. It should also be noted that the composite measure of transdiagnostic compulsivity includes questionnaire items linked not only with OCD but also other clinical conditions such as eating disorders (Tasca et al., 2011; Gillan et al., 2016). This results in an overlap between transdiagnostic compulsivity and other traits such as rigid perfectionism.

Despite opposite trends in metacognitive monitoring performance (under- versus overconfidence), individuals high in transdiagnostic compulsivity and those with a diagnosis of OCD show similar impairments in metacognitive control, characterised by a disrupted connection between confidence and future actions (Seow and Gillan, 2020; Vaghi et al., 2017). Metacognitive impairments are also central to explanations of compulsive behaviours, notably in OCD patients. In such patients, compulsivity can manifest in the form of checking behaviours, e.g., checking that doors are locked or that appliances are switched off (Den Ouden et al., 2022). Whilst checking behaviours are also present in other compulsive disorders (e.g. ‘body checking’ in eating disorders; Mountford et al., 2006), in OCD, these checks are often repetitive and ritualised and are typically associated with obsessive thoughts. However, the exact function that checking compulsions serve is unclear; patients commonly report that they have the aim of reducing anxiety generally, preventing a feared consequence from taking place or that they are performed automatically and without thinking (Starcevic et al., 2011). Understanding these motivators has been challenging as studies rely on self-report of often highly individual real-world behaviours.

Some research argues that OCD patients’ checking arises from low memory confidence despite intact memory (Tolin et al., 2001). Our study has the potential to shed some light on the link between confidence and checking: While checking behaviours can be seen as a way of ensuring that a necessary action was performed in the past, reminder setting is a way of ensuring that a necessary action will be performed in the future. In other words, a reminder can serve as a future checkpoint that allows us to revisit a task at an appropriate time to complete it, perhaps by setting an alarm on our phone, jotting down a note, or strategically placing a related object somewhere visible. Given these insights, one might expect an increased reliance on reminders among OCD patients as they strive to establish more checkpoints. By contrast, seeing as transdiagnostic compulsivity is associated with increased confidence, this could be associated with the opposite pattern: a decreased reliance on reminders.

Three possible mechanisms for changes in reminder setting

If, as hypothesised, compulsivity is linked with altered reminder setting, this could be attributed to at least three underlying mechanisms. First is the Metacognitive Control Mechanism: Previous research has found that more compulsive individuals tend to have impaired metacognitive control (Seow and Gillan, 2020), meaning they use metacognitive signals to a lesser extent to guide future behaviour. Compulsivity is a hallmark symptom of OCD, and similar deficits in metacognitive control have been observed in a case-control studies comparing OCD patients with healthy controls examining how confidence and action are correlated (Vaghi et al., 2017; though see also Hoven et al., 2023b; Marzuki et al., 2022). In the context of our study, a Metacognitive Control Mechanism would be reflected in a disrupted relationship between confidence levels and their tendency to set reminders (i.e. the interaction between the bias to be over- or underconfident and transdiagnostic ‘compulsive behaviour and intrusive thought’ (CIT) in a regression model predicting a bias to set reminders).

Second, more compulsive individuals might conceivably differ in their reminder-setting strategies due to an altered level of confidence. We call this the Metacognitive Monitoring Mechanism, which suggests that the issue arises when forming the confidence signal, rather than in its behavioural application (for clarification on metacognitive monitoring vs. control in cognitive offloading, see Boldt and Gilbert, 2022). Prior evidence exists for overconfidence in compulsivity (Rouault et al., 2018; Seow and Gillan, 2020; Benwell et al., 2022; Fox et al., 2023; Fox et al., 2024; Hoven et al., 2023a), which would therefore result in fewer reminders.

Lastly, there could be a direct link between compulsivity and reminder usage, independent of any metacognitive influence. We refer to this as the Direct Mechanism and it constitutes any possible influences that affect reminder setting in highly compulsive CIT participants outside of metacognitive mechanisms, such as perfectionism and the wish to control the task without external aids. Our study aims to differentiate between these three mechanisms. Back when we preregistered our hypotheses, only a limited number of studies about confidence and transdiagnostic CIT were available. This resulted in us hypothesising to find support for the Metacognitive Control Mechanism and that highly compulsive individuals would offload more due to an increased need for checkpoints. Both of these hypotheses turned out to be incorrect.

Anxious-depressed transdiagnostic phenotype

As well as investigating individual differences in compulsivity, we also measured an anxious-depression (AD) factor. Based on the previous findings, we predicted opposite influence of these two factors on confidence. Whereas compulsivity has been linked to increased confidence, AD individuals typically display relative underconfidence (Rouault et al., 2018; Seow and Gillan, 2020; Benwell et al., 2022). By taking a transdiagnostic approach, we were able to jointly investigate the influence of these two factors of confidence which could potentially cancel out if they were investigated separately.

Online reminder-setting task

In the present preregistered study, we asked 600 participants drawn from the general population to complete several individual differences questionnaires. These responses were then weighted to produce both a ‘CIT’ factor and an ‘AD’ factor (Gillan et al., 2016; Wise and Dolan, 2020). Participants’ scores on these factors were then correlated with their behaviour in a reminder-setting task, which was a modified, 20 min version of the online reminder setting task developed by Gilbert et al., 2020; Figure 6.

Participants performed a highly demanding, short-term memory task. On some trials, they relied on internal memory alone (which typically resulted in poor accuracy); on other trials, they could set external reminders (which dramatically improved accuracy). The key manipulation was the number of points associated with the two strategies. Correct responses always earned 10 points if participants used internal memory, but a lower number of points between 2 and 9 if they used external reminders. The latter number of points varied from trial to trial, and participants were required each time to decide which strategy they preferred (e.g. 10 points for each correct response with internal memory or 6 points for each correct response with external reminders). The ‘optimal indifference point’ (OIP) was that point value at which an unbiased individual would be indifferent between the two strategies based on their objective accuracy in the two conditions. The ‘actual indifference point’ (AIP) was the point at which they were actually indifferent, based on all of their decisions. By comparing these two values, we obtained a ‘reminder bias’: the extent to which an individual had a pro- or anti-reminder bias relative to their individually calculated optimal strategy. Note that this is different from the absolute rate of reminder usage, because the same absolute rate might reflect inadequate use of reminders in a person with poor memory and excessive reminder usage in a person with good memory ability. Along with the reminder bias, we also calculated a metacognitive bias, which represents participants’ over- or underconfidence in memory ability, relative to objective performance. Our study controlled for age, gender, educational attainment, as well as cognitive ability (ICAR5; Kirkegaard and Bjerrekær, 2016), and working memory.

Previewing our results, in line with previous evidence, we found that confidence varied positively with the CIT factor and negatively with the AD factor. However, contrary to our initial expectations, more compulsive individuals offloaded less rather than more, and there was no evidence for disruption in the link between metacognition and offloading. Instead, we discovered an incomplete mediation effect: while a significant proportion of the reduced reminder setting could be attributed to overconfidence, not all the variance was accounted for by this variable. Even after controlling for it, compulsivity still predicted reduced reminder setting. This constitutes a combination of the Metacognitive Monitoring Mechanisms and the Direct Mechanism.

Results

Here, we present the results of a preregistered online study on the relationship between reminder setting, metacognition, and transdiagnostic compulsivity. We excluded 69 out of a total of 669 participants based on our six preregistered criteria described in the Materials and methods section, leaving us with a final sample of 600 participants. All participants completed a previously validated reminder setting task in combination with 49 items from six mental health questionnaires. Three hundred and seventy-five participants identified as male, 218 as female and 7 as other. Participants were on average 32.9 years of age (min = 18; max = 76). Figure 1 shows the included (black) and excluded (red) data, with higher average performance for included participants when reminders were used (96.1%) compared to when people had to do the task unaided (59.2%).

Figure 1

Download asset Open asset

Average accuracy as a function of whether a reminder was used.

‘No Reminder’: forced internal condition; ‘Reminder’: forced external condition. Each pair of dots linked by a line indicates one participant. The red data points are excluded participants. The box plots indicate the median surrounded by the interquartile range (25th and 75th percentile). The whiskers show the minimum and maximum. The preregistered exclusion criteria for the accuracies with or without reminder are indicated as horizontal dotted lines (10% and 70%, respectively).

We calculated six key measures for each participant:

The first relevant measure is the OIP. The OIP describes the reward value (2–9 points) at which an unbiased, reward-maximising participant should be indifferent between the two strategies: using reminders or relying on their own memory. The OIP is calculated from their accuracy with and without reminders. Imagine a participant who achieves 60% accuracy when using their own memory or 100% accuracy when using reminders. In this case the OIP would be 6, because scoring 6 points per item with reminders (100% accuracy) would earn the same number of points as scoring 10 points with internal memory (60% accuracy). For any reward above 6, it would be optimal to choose external reminders; for any reward below 6, it would be optimal to choose internal memory.
In contrast, the second relevant measure is the AIP, which is the number of points at which participants showed indifference between the two strategies. This measure is calculated by fitting a psychometric function to participants’ choices at different levels of reward for targets when reminders were used. Please note that all choices were used to calculate the AIP, as participants only found out whether or not they would use a reminder after the decision was made.
Together, these variables can be used to calculate the third measure, the reminder bias, which is the difference between the OIP and the AIP and therefore reflects participants’ tendency to over- or underuse reminders, relative to the optimal strategy. Note that the optimal strategy is calculated individually for each participant and will depend on their own level of performance when using internal memory and external reminders.
Fourth, we calculated a metacognitive bias, reflecting participants’ over- or underconfidence. This is calculated by subtracting objective accuracy (percentage of targets remembered when using internal memory) from the percentage that they predicted that they would be able to remember.
Fifth and sixth, based on the questionnaire ratings, we calculated how much someone scored on the transdiagnostic CIT and AD factors. Our analyses focus on the relationship between these key measures.

Replication and sanity checks

In the following section, we aim, where the design allows it, to replicate four previous effects for this task. First, with Hypothesis 1, we predicted that the reminder bias and metacognitive bias are negatively correlated, replicating previous findings (as reviewed in Gilbert et al., 2023). This effect tests the above-mentioned link between metacognition and cognitive offloading: the less confident someone feels, the more they use reminders. There was indeed a significant negative correlation, r=–0.2, p<0.001 (Figure 2). Second, in replication of previous findings (e.g. Gilbert et al., 2020; Sachdeva and Gilbert, 2020; Kirk et al., 2021; Engeler and Gilbert, 2020), Hypothesis 2 expressed our expectation to find an excessive use of reminders reflected in significantly higher OIPs compared to AIPs. In other words, we expected the reminder bias to be greater than zero, which was indeed the case, m=0.52, t(599) = 5.1, p<0.001, d=0.21. Third, with Hypothesis 3, we expected to replicate that participants would be underconfident in their own memory (e.g. Engeler and Gilbert, 2020), expressed in an average, negative metacognitive bias. Our data supported this hypothesis, m=–3.64, t(599) = –3.1, p=0.001, d=–0.13. Fourth, Hypothesis 4 predicts that as in previous studies, we would find evidence for compensatory reminder use. Keeping in mind that the OIP reflects the cut-off at which participants should be indifferent between offloading and not offloading and the AIP the cut-off they actually displayed, then looking at these two measures together should show that participants with poorer memory and greater benefit from reminders (lower OIP) tend to use them more (lower AIP). Indeed, the OIP and AIP were positively correlated, suggesting participants who benefited most from reminders were more likely to use them, r=0.36, p<0.001. Taken together, we found that participants showed the usual hallmarks of this offloading task, using their confidence to strategically decide when to offload, general tendencies for setting reminders and for underconfidence, and compensatory reminder use.

Figure 2

Download asset Open asset

People’s tendency to set reminders above or below the optimal offloading strategy (reminder bias) plotted against people’s tendency towards over- or underconfidence (metacognitive bias).

The solid line indicates the fitted relationship between both variables. The dashed lines represent the 95% confidence interval around it. Each circle represents a single participant.

Testing our key hypotheses

Elevated confidence in CIT and reduced confidence in AD

We predicted that the metacognitive bias would correlate negatively with AD (Hypothesis 8a; more AD individuals tend to be underconfident). For CIT, we preregistered a non-directional, significant link with metacognitive bias (Hypothesis H6a). We found support for both hypotheses, both for AD, β=–0.23, SE = 0.05, t=–4.99, p<0.001, and CIT, β=0.15, SE = 0.05, t=3.11, p=0.002, controlling for age, gender, and educational attainment (Figure 3; see also Appendix 1—table 1). Note that for CIT, this effect was positive, and more compulsive individuals tend to be overconfident.

Figure 3

Download asset Open asset

Standardised regression weights for the ‘anxious-depression’ (AD) factor and the ‘compulsive behaviour and intrusive thought’ (CIT) factor predicting metacognitive bias.

Error bars indicate 95% confidence intervals. Asterisks indicate significance: ‘***’: <0.001; ‘**’: <0.01; ‘*’: <0.05.

We furthermore preregistered to also test this for raw confidence (percentage of circles participants predicted they will remember, rather than the accuracy-corrected metacognitive bias score; Hypotheses H8b and H6b). Indeed, the same patterns were found for both AD, β=–0.29, SE = 0.04, t=–6.43, p<0.001, and CIT, β=0.12, SE = 0.05, t=2.76, p=0.006 (see Appendix 1—table 2). Including scores from the cognitive ability test as an additional covariate (Hypotheses H8c and H6c, respectively) furthermore did not change the results, AD, β=–0.20, SE = 0.05, t=–4.46, p<0.001; CIT, β=0.12, SE = 0.05, t=2.57, p=0.011 (see Appendix 1—table 3). Taken together, these results suggest that concordant with our hypotheses, compulsivity was linked to inflated confidence and anxiety to deflated confidence.

Contrary to expectations, compulsivity reduced pro-offloading bias

We expected to find a positive link between CIT factor scores and reminder bias. In other words, we predicted that more compulsive individuals would show a greater pro-offloading bias, relative to the optimal strategy (Hypothesis H5a). However, our results showed the exact opposite effect with a significantly reduced reminder bias in compulsive individuals, β=–0.14, SE = 0.05, t=–2.91, p=0.004, controlling for age, gender, and educational attainment (Figure 4; see also Appendix 1—table 4). This trend persisted when, instead, we predicted the absolute number of reminders chosen by the participant (Hypothesis H5b), β=–0.09, SE = 0.05, t=–1.94, p=0.053 (see Appendix 1—table 5), as well as when predicting the AIP (Hypothesis H5c), β=0.10, SE = 0.05, t=2.25, p=0.025 (see Appendix 1—table 6).

Figure 4

Download asset Open asset

Previous studies have found reduced working memory in OCD (Harkin and Kessler, 2011), which could potentially lead to increased reminder use in compulsivity. However, the reduced reminder bias persisted if d’ from the 2-back task was included as an additional covariate (Hypothesis H5d), β=–0.12, SE = 0.05, t=–2.57, p=0.010 (see Appendix 1—table 7). Finally, we predicted that our results would persist independent of whether or not the scores from the cognitive ability test were included as an additional covariate (Hypothesis H5e), which was indeed the case, β=–0.14, SE = 0.05, t=–2.85, p=0.005 (see Appendix 1—table 8). It should be noted that all our regression models included both CIT and AD as predictors to separate out the potentially competing influences of these predictors, as well as age, gender, and educational attainment as demographic covariates.

We furthermore preregistered to conduct the same tests for the AD factor but without any directional hypotheses. AD was not significantly linked to any changes in reminder bias, β=0.07, SE = 0.05, t=1.46, p=0.15 (see Appendix 1—table 4), absolute number of reminders, β=0.06, SE = 0.05, t=1.33, p=0.18 (see Appendix 1—table 5), or AIP, β=–0.08, SE = 0.05, t=–1.76, p=0.08, (see Appendix 1—table 6) controlling for age, gender, and educational attainment. This null effect did not change when working memory, β=0.06, SE = 0.05, t=1.23, p=0.22 (see Appendix 1—table 7), or scores from the cognitive ability test were included as additional covariates, β=0.07, SE = 0.05, t=1.41, p=0.16 (see Appendix 1—table 8).

Taken together, these results suggest that compulsive individuals are less biased towards offloading, in contrast to our hypothesised direction of the effect, but consistent with the observation of increased confidence in their ability on this task.

No evidence for impaired confidence-offloading link

We predicted to find support for the Metacognitive Control Mechanism, meaning that CIT would act as a moderator on the link between confidence and offloading (Hypothesis H7a). In other words, we expected to find that the correlation between the metacognitive and the reminder bias to be weakened in highly compulsive individuals. However, the interaction between metacognitive bias and compulsivity in a model predicting the reminder bias was not significant, β=–0.01, SE = 0.04, t=–0.18, p=0.86, controlling for age, gender, and educational attainment (see Appendix 1—table 9). This means that in our task, confidence and offloading were linked just as much as in their low compulsive counterparts. These results remained the same even if working memory performance (d’ from the 2-back task) was included as an additional covariate (Hypothesis H7b), β=–0.01, SE = 0.04, t=–0.26, p=0.79 (see Appendix 1—table 10), or if scores from the cognitive ability test were included as an additional covariate (Hypothesis H7c), β=–0.01, SE = 0.04, t=–0.18, p=0.86 (see Appendix 1—table 11).

Contrary to our initial hypotheses, we found that increased CIT was associated with decreased rather than increased bias towards offloading. Seeing as CIT was also associated with increased confidence, and high confidence predicts low bias towards offloading, we tested whether the relationship between CIT and offloading was mediated via confidence: whilst parts of the reduction of reminders could be traced back to overconfidence, β=–0.19, SE = 0.04, t=–4.66, p<0.001, there was still a significant proportion of variance that was linked to compulsivity independently of this effect, β=–0.10, SE = 0.05, t=–2.14, p=0.032. Figure 5 summarises this incomplete mediation effect. In previous sections, we have already reported the total effect of CIT on the reminder bias, β=–0.14, SE = 0.05, t=–2.91, p=0.004. Equally, we have already reported the effect of CIT on the mediator (the metacognitive bias), β=0.15, SE = 0.05, t=3.11, p=0.002.

Figure 5

Download asset Open asset

Diagram of the mediation analysis testing for the influence of the ‘compulsive behaviour and intrusive thought’ (CIT) factor on reminder bias, both directly and indirectly through the metacognitive bias.

Standardised regression coefficients are given for each path. The value in parentheses indicates the influence of CIT on reminder bias controlling for the influence of the metacognitive bias. Asterisks indicate significance: ‘***’: <0.001; ‘**’: <0.01; ‘*’: <0.05.

We validated this outcome through an exploratory causal mediation analysis. The indirect influence of the CIT factor on reminder bias going through the metacognitive bias was calculated to be (0.15) * (–0.19)=–0.0285. To determine the significance of this influence, we implemented bootstrapping procedures. We computed unstandardised indirect effects for each of the 1000 bootstrapped samples, followed by the calculation of the 95% confidence interval, identifying the indirect effect at the 2.5th and 97.5th percentiles. The bootstrapped unstandardised indirect effect (the average causal mediation effect) computed to be –0.0256, with the 95% confidence interval ranging from –0.05 to –0.01. This indicated that the effect was statistically significant at p=0.002.

Finally, we preregistered to run the same analysis for the AD factor without hypothesising about any specific direction for any potential effects. However, we did not find evidence for a moderation effect (an interaction between AD scores and metacognitive bias when predicting the reminder bias), β=–0.04, SE = 0.04, t=–0.94, p=0.35, controlling for age, gender, and educational attainment (see Appendix 1—table 12).

In summary, whilst we found no support for the Metacognitive Control Mechanism (as would be reflected in a disrupted link between confidence and offloading), we did find support for both the Metacognitive Monitoring Mechanism (reduced pro-reminder bias as a downstream consequence of overconfidence) and the Direct Mechanism (independent contribution of CIT on offloading). Appendix 1 furthermore lists several additional analyses, both planned and exploratory.

Discussion

In the current study, we explored the behavioural and cognitive correlates of two transdiagnostic traits: ‘CIT’ and ‘AD’. We focused on changes in cognitive offloading and metacognition related to transdiagnostic compulsivity. Our results replicated that more compulsive individuals were relatively overconfident, while those who were more AD were relatively underconfident. Contrary to expectations, we observed a decreased bias towards reminders among more compulsive participants. This reduction in bias was only partially accounted for by their relative overconfidence. This partial mediation can be interpreted through both a Metacognitive Monitoring Mechanism (differences in the formation of the confidence signal rather than its behavioural application) and a Direct Mechanism (no metacognitive involvement). We found no support for a Metacognitive Control Mechanism, which would centre on how confidence is used to adapt behaviour (Nelson & Narens, 1990; Boldt and Gilbert, 2022).

Perfectionism and the need to control as potential explanations

Contrary to our hypothesis, our study revealed an inverse relationship between transdiagnostic compulsivity and offloading: the reminder bias was reduced in more compulsive individuals. One possible interpretation is perfectionism: Some compulsive individuals may avoid using reminders altogether due to rigid, perfectionistic beliefs about needing to remember everything without relying on external aids, and using reminders could trigger their anxiety or feed into their obsessions about being forgetful or unreliable. This interpretation aligns with findings, suggesting that perfectionism serves as a transdiagnostic maintaining and risk factor for various mental health conditions, including compulsive disorders like eating disorders and OCD (Egan et al., 2011).

No effect of anxiety on offloading

Interestingly, we found no significant influence of the AD transdiagnostic phenotype on offloading. This aligns with a recent study by Kirk et al., 2021, which also found no effect of anxiety on offloading. However, their study, which used the ‘trait’ component of the STAI to measure anxiety (Spielberger et al., 1983), found no relative underconfidence among anxious participants either. Our transdiagnostic approach likely revealed this confidence effect by separating the counteracting influences of AD and CIT factors. This distinction underscores the value of a transdiagnostic approach.

Our findings align with those reported in a recent study by Mohr et al., 2024. The authors observed that while high-AD participants were underconfident in a perceptual task, this underconfidence did not lead to increased information-seeking behaviour. Future research should explore whether this is due to their pessimism regarding the effectiveness of confidence-modulated strategies (i.e. setting reminders or seeking information) or whether it stems from apathy. Another possibility is that the relevant downstream effects of anxiety were not measured in our study and instead may lie in reminder-checking behaviours.

No evidence for an impaired confidence-action link in compulsivity

Contrary to Seow and Gillan, 2020, and Vaghi et al., 2017, our study did not find the impaired confidence-action link (Metacognitive Control Mechanism) reported for transdiagnostic compulsivity and OCD patients. This may be because of differences between tasks – prior work used a reinforcement learning task with a clear learning element from trial to trial. Alternatively, it is possible our study was underpowered, as our sample size was designed to detect overconfidence in compulsivity, not the more nuanced but still psychometrically robust confidence-action link (Loosen et al., 2022), which would have required a far larger sample size. Recent studies also failed to find decreased action-confidence coupling with relatively small groups of OCD patients and controls (Hoven et al., 2023b; Marzuki et al., 2022). Indeed, both our paradigm and the earlier predictive-inference task tested for an interaction effect, which is more challenging to power adequately. Future research should consider using more direct measures that ideally aim to manipulate confidence directly.

Implications

Participants in our current study were recruited from the general population through Prolific, meaning that the variance likely represents primarily subclinical sources. Consequently, caution should be exercised when extrapolating these results to clinical populations. For example, a recent study indicated that metacognitive impairments in OCD originate from different mechanisms than those observed in transdiagnostic compulsivity (Hoven et al., 2023c). Given its metacognitive impairments and the prevalent symptom of checking, OCD still remains a particularly relevant patient group for studying reminder setting, and future studies need to explore this area further. Due to their underconfidence, OCD patients might engage in more frequent reminder setting. This behaviour could serve as a compensatory mechanism, especially since OCD patients often face challenges with working memory (Harkin and Kessler, 2011) and prospective memory (Harris et al., 2010; Racsmany et al., 2011). However, it could also worsen their checking symptoms as more reminders mean more opportunities to check.

On the other hand, it is possible that the observed underconfidence in OCD populations may actually reflect the impact of an uncontrolled anxiety factor, effectively neutralising the influence of compulsivity on confidence. This confounding issue could explain the inconsistent findings regarding confidence bias in both compulsivity and OCD. If this was the case, then future research should investigate which influences on confidence – the reductions caused by the AD factor or the increases caused by the CIT factor – are the driving force behind any changes in reminder setting in OCD.

A pivotal question remains: will the overall reduction in reminder setting, referred to as a ‘direct effect’ in this study, also be observed in OCD patients and other compulsive disorders? Such findings could support the hypothesis that an inherent aspect of compulsivity leads to the decreased use of external aids, potentially due to perfectionism or a need for control.

Limitations

Our results are based on a well-validated paradigm which our lab has previously used in other, published studies (as reviewed in Gilbert et al., 2023). However, reliance on a single behavioural task also means that our results might not generalise onto cognitive offloading more broadly or even reminder setting in other contexts. As a first step, future work should aim to replicate our findings in the context of other experimental designs.

Another limitation is that in the present study, we focused solely on measuring two transdiagnostic factors: CIT and AD. We omitted the third factor, ‘social withdrawal’. By doing so, we were able to reduce the number of items from 6 clinical questionnaires to 49 (Wise and Dolan, 2020), thereby shortening the required time for completion – an essential consideration for online research (Sauter et al., 2020). Nevertheless, this focused approach could introduce variability in capturing these transdiagnostic phenotypes. A recent preprint from Hopkins et al., 2022 supports this approach. They used machine learning to select 71 items capable of reliably measuring all three factors, suggesting that future transdiagnostic studies might similarly adopt more concise item sets.

Conclusion

With the present study, we investigated the downstream cognitive and behavioural effects of two transdiagnostic traits, CIT and AD. In particular, we were interested in the effect these factors have on metacognition and cognitive offloading, operationalised as prospective confidence and reminder setting, respectively. We replicated the finding that more compulsive individuals tend to be relatively overconfident, whereas AD individuals tend to be relatively underconfident. Contrary to our hypotheses, however, we found that compulsivity was linked reduced offloading, and that this effect was only in part explained by overconfidence.

Fulfilling delayed intentions (i.e. prospective memory) is a vital process for daily living and behavioural independence. However, this process is also highly fallible (e.g. Crawford et al., 2003). External memory aids are highly effective and commonplace tools that compensate for these memory failures (e.g. Jones et al., 2021; Scullin et al., 2022). Our findings suggest that compulsive individuals are at particular risk of inadequate external memory support and would potentially benefit from interventions that target cognitive offloading strategies.

Materials and methods

Key resources table

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Software, algorithm	R	R Development Core Team, 2024	4.4.2; RRID:SCR_001905
Software, algorithm	RStudio	RStudio Team, 2020	2024.09.1+394; RRID:SCR_000432
Software, algorithm	diagram	Soetaert, 2020	1.6.5; RRID:SCR_026982	R package
Software, algorithm	effectsize	Ben-Shachar et al., 2020	0.8.9; RRID:SCR_026983	R package
Software, algorithm	lmerTest	Kuznetsova et al., 2017	3.1-3; RRID:SCR_015656	R package
Software, algorithm	lme4	Bates et al., 2015	1.1-35.5; RRID:SCR_015654	R package
Software, algorithm	mediation	Tingley et al., 2014	4.5.0; RRID:SCR_026984	R package
Software, algorithm	plyr	Wickham, 2011	1.8.9; RRID:SCR_026985	R package
Software, algorithm	pwr	Champely, 2020	1.3-0; RRID:SCR_025480	R package
Software, algorithm	quickpsy	Linares and López-Moliner, 2016	0.1.5.1; RRID:SCR_026986	R package

Task and procedure

Request a detailed protocol

For the present, preregistered study, we used a novel variant of an online cognitive-offloading task (‘optimal reminders task’; cf. Gilbert et al., 2020). This task allowed us to measure how people set reminders in relation to their confidence. All procedures, hypotheses, and planned analyses were preregistered at https://osf.io/kztf8 prior to the commencement of data collection.

On every trial, participants were instructed to move several numbered, yellow circles to the bottom of a square in consecutive order (see Figure 6A). Whenever a circle was removed, a new one appeared up to a total of 15 circles. The source of difficulty of this task stems from the ‘special’ circles, which constitute the delayed intentions people have to fulfil. These circles flashed in a colour (blue, orange, or magenta) when they first appeared on screen before fading to yellow. Participants’ task was to drag these circles to their colour-corresponding side once the time had come to remove the respective special circle (top, left, or right). There were six special circles per trial. On some trials, participants had to rely on their own memory to complete the task and remember the target locations of the special circles. On other trials, they set spatial reminders, indicating the locations to which the special circles must be moved to. More specifically, they were taught to move the special circle next to the border through which it would have to be moved out of the square later.

Figure 6

Download asset Open asset

Overview of the intention offloading paradigm.

(A) Example sequence of events within a single trial. Trajectories of movement made by a fictive participant are shown as black arrows. The blue coloured circle corresponds to the left boundary of the square and indicates that this circle must be moved to this side rather than the bottom. (B) Example of an offloading decision which participants were required to make before each trial. (C) After each decision, they were informed whether or not they would perform the upcoming trial with reminders. The cell’s shading indicates the participant’s original choice. (D) Confidence was rated once before the introduction of the offloading strategy on a scale ranging from 0% to 100%. (E) Sequence of events within the task. All aspects of the task were performed online in the web browser.

Every trial began with a decision: participants could choose to do the task without reminders and earn 10 points for every special circle they remembered to move to the correct border, or they could choose to use reminders but earn less for each special circle (Figure 6B). Critically, this lesser amount was varied between 2 and 9 points, allowing us to calculate the participants’ indifference point when trading off the benefit of reminders with their reduced reward. This AIP could then be contrasted against their OIP, calculated from participants’ accuracy with or without reminders, see below for further details. Since our task included only 4 trials each with or without reminders, we counterbalanced the assignment of odd or even target values to these conditions.

Together, there were three key conditions in our task presented intermixed throughout the experiment: the Forced Internal condition (FI; 4 trials) in which participants had to remember the circles unaided, the Forced External condition (FE; 4 trials) in which they had to use reminders, and the Choice Only controlling for age, gender, and educational attainment (CO; 8 trials) in which they were free to choose whichever strategy they preferred but the trial ended after only six circles and without any special circles. To give participants the impression of maximum agency over the task, we only told them that their choice would be overwritten whenever there was a mismatch with the pseudorandomly assigned condition (25.3% of all trials; SD = 5.8; see Figure 6C). This way, participants were unable to tell which condition they were currently in and whether it would be a partial trial. Participants used reminders on average on 49.9% of trials (SD = 16.5).

Participants were asked to rate their confidence once during the experiment, being asked to indicate the ‘percentage of the special circles [they] can correctly drag to the instructed side of the square’ (Figure 6D). Importantly, this confidence judgement was given after the first practice trials and before the offloading strategy was introduced to ensure participants answered this question with regard to their own perceived memory capabilities. Average confidence was 55.6% (proportion of trials on which participants predicted to remember to move the special circles; SD = 24.2).

In addition to the reminder task, we included items from six individual differences questionnaires shortened to include only the items required to reliably measure the CIT and AD factors (Wise and Dolan, 2020). These questionnaires were presented in random order: 4 items from the Apathy Evaluation Scale (AES; Marin et al., 1991), 8 items from the Zung Depression Scale (SDS; Zung, 1965), 4 items from the Eating Attitudes Test (EAT-26; Garner et al., 1982), 12 items from the Barratt Impulsiveness Scale (BIS-11; Patton et al., 1995), 11 items from the Obsessive Compulsive Inventory – Revised (OCIR; Foa et al., 2002), and 11 items from the ‘trait’ part of the State-Trait Anxiety Inventory (STAI; Spielberger et al., 1983). A list of all included items can be found in Appendix 1.

We also included a catch item in the BIS-11 (‘I competed in the 1917 Summer Olympics Games.’) to ensure participants were paying attention to the task, as well as three covariates aimed at measuring cognitive ability (a 5-item version of the International Cognitive Ability Resource; ICAR5; Kirkegaard and Bjerrekær, 2016; Condon and Revelle, 2014); educational attainment mapped onto a 1–9 scale and based on the ISCED 2011 categories (see Appendix 1); and working memory, assessed using 100 consecutive letters from the 2-back task (e.g. Kirchner, 1958). The logic behind including the latter covariate was that whilst our key dependent variables already corrected for working memory (more specifically: unaided prospective memory performance), this could tap into additional working memory components not measured already and potentially impacted in compulsivity based on the finding that they have often been found to be impaired in OCD (Harkin and Kessler, 2011). Together, these elements resulted in a total duration of approximately 35 min. The sequence of events within the task is shown in Figure 6E.

Participants

Ethical approval for this study was received from the local Ethics Committee at University College London (UCL) under the reference number 1584/003. Informed consent was obtained from all participants prior to the study. Participants were invited on prolific.co to participate for £3.90. Based on points won during the main task, the upper 50% of participants were furthermore rewarded with a bonus payment of £1. We restricted our search to the Prolific standard sample, allowed participants from all countries, with a minimum of 18 years. All participants had to be fluent in English and were required to have an approval rate of over 90% based on Prolific’s criteria. Moreover, we required participants to not have participated in one of the four pilots prior to this study.

All analyses were conducted with R in RStudio (R Development Core Team, 2024; RStudio Team, 2020) together with the following R packages: effectsize (Ben-Shachar et al., 2020), lmerTest (Kuznetsova et al., 2017), lme4 (Bates et al., 2015), mediation (Tingley et al., 2014), plyr (Wickham, 2011), pwr (Champely, 2020), and quickpsy (Linares and López-Moliner, 2016). We calculated our sample size based on the link between confidence and transdiagnostic compulsivity as reported in two recent studies (Rouault et al., 2018; Seow and Gillan, 2020). To be able to detect a link between these variables of β=0.23, p<0.001, as in Rouault et al., 2018, we required N=288 participants (two-sided testing, power = 0.8, CL = 0.95). To be able to detect a link of β=6.74, p<0.001, as in Seow and Gillan, 2020, we required N=291 participants (two-sided testing, power = 0.8, CL = 0.95). In both cases, the power calculation was based on a partial regression approach, excluding the effect in question from the model and comparing the explained variance compared to the full model. Since we are furthermore aiming to test a moderation effect of compulsivity on the link between the metacognitive bias and the reminder bias, we decided to collect a larger sample of N=600 after exclusions.

We preregistered six exclusion criteria, based on which we excluded and replaced 69 participants: Nine participants were excluded due to a higher hit rate on forced internal than forced external trials, 22 participants were excluded due to less than 70% accuracy on FE trials, and 3 participants due to less than 10% accuracy on FI trials. We furthermore preregistered to exclude participants with a negative correlation between value and reminder choice (1=reminder, 0=no reminder), as this would indicate participants did not understand the instructions: in order to maximise points in our task, participants should preferentially choose reminders when this strategy brings a higher number of points. Based on this, we excluded 40 participants. No participants were excluded based on scoring lower or higher three times the median absolute deviation calculated separately based on both the reminder bias and the metacognitive bias. Finally, we excluded 9 participants because they failed to answer with ‘Do not agree at all’ to the catch item. Figure 1 visualises the exclusions shown in red. In total, we excluded 10.3% of all participants. There were an additional 26 participants excluded for technical reasons, raising the exclusion rate to 13.7%.

Key dependent variables

Request a detailed protocol

Our task allowed us to calculate several dependant variables relevant in the context of our study question. The first is the OIP, the optimal indifference point. The OIP describes the number of points at which an unbiased, reward-maximising participant is indifferent between the two strategies (reminders or no reminders) and is calculated as:

O I P = (10 * A C C_{F I}) / A C C_{F E}

where ACC_FE is the accuracy measured during trials in which the participants had to solve the task using reminders (FE condition), and ACC_FI is the accuracy measured during trials in which participants had to solve the task without reminders (FI condition). In contrast, the AIP is the the AIP is the actual indifference point, which is the point cut-off at which participants actually were indifferent and is operationalised as the threshold parameter from fitting a psychometric function to the choice data (target values predicting the decision whether or not to use reminders). Fitting was done using the quickpsy package in R, and more detail is given in Appendix 1. It should be noted that the OIP has a slightly finer resolution due to the number of special circles per trial.

Setting the OIP and the AIP in relation, we can calculate the reminder bias, reflecting participants’ tendency to use reminders corrected for their actual performance and calculated as the difference between both indifference points:

b i a s_{r e m} = O I P - A I P

Positive values reflect that people set more reminders relative to the optimal strategy. The fourth measure is the metacognitive bias, reflecting participants’ over- or underconfidence relative to their performance and was calculated as:

b i a s_{m e t a} = c o n f i d e n c e - A C C_{F I}

Negative values can be interpreted as underconfidence.

Crucially, our study relies on the key assumption that the metacognitive bias can predict the reminder bias, but ACC_FI contributes to both biases. To avoid circularity, we therefore split the accuracy data to avoid potentially inflating the correlation. More specifically, we included only the even trials to calculate the ACC_FI for the OIP, whereas we included only the odd trials to calculate the ACC_FI for the metacognitive bias. All available trials from the FE condition were used to calculate the OIP.

It should be noted that we had incorrectly stated in the preregistration that accuracy from forced external trials would contribute to the calculation of the metacognitive bias. However, the metacognitive bias is a judgement given about the unaided memory performance, in fact confidence is measured before participants were even introduced to the offloading strategy (see above). We therefore used only the internal trials in calculating the metacognitive bias.

Finally, the transdiagnostic scores for the ‘CIT’ factor and the ‘AD’ factor were calculated from participants’ ratings to the individual differences questionnaires by multiplying them with the item weights from Wise and Dolan, 2020, prior to summing them. The items composing the CIT and AD scores, respectively, were non-overlapping with 24 items forming the AD score and 25 items forming the CIT score.

Preregistered hypotheses and statistical analyses

Request a detailed protocol

We preregistered eight hypotheses (see Table 1), half of which were sanity checks (H1-H4) aimed to establish whether our task would generally lead to the same patterns as previous studies using a similar task (as reviewed in Gilbert et al., 2023). H1 was a replication of the central finding of the link between confidence and offloading. More specifically, we entered the unconfounded metacognitive bias and reminder bias into a Pearson correlation analysis. We expected to find a negative relationship between the two measures, which we planned to test for significance using a one-sided test. We furthermore expected to find that people would use more reminders than optimal. This pro-reminder bias would be reflected in a positive reminder bias (H2). We planned to test this using a one-sided paired t-test. Relatedly, we expected to find people to be generally underconfident (i.e. expecting to remember fewer special circles than they actually did when doing the task without reminders). Such underconfidence would be reflected in a negative metacognitive bias (H3), which we again planned to test using a one-sided paired t-test. Furthermore, we expected that those who required more reminders would also be the ones to use them more, as reflected in a positive correlation between the AIP and OIP, again as a one-sided test (H4). We decided to use Spearman’s rho due to the data most likely being distributed around the extremes of the scale. For H2-H4 (as well as H5, H6, and H8, see below), we used the biases and indifference points calculated from all available trials as there was no circularity issue.

Table 1

List of preregistered hypotheses together with the empirical support our study found.

White background indicates sanity check hypotheses, and grey background indicates key hypotheses. OIP = optimal indifference point. AIP = actual indifference point. CIT = compulsive behaviour and intrusive thought.

Number	Hypothesis	Support?
H1	The reminder bias and metacognitive bias are negatively correlated.	Yes
H2	Participants use reminders excessively.	Yes
H3	Participants are underconfident in their own memory.	Yes
H4	OIP and AIP are positively correlated.	Yes
H5a	Positive link between CIT and reminder bias.	No (significant negative effect)
H5b	Positive link between CIT and absolute number of reminders chosen.	No (negative effect but significance not reached)
H5c	Positive link between CIT and AIP.	No (significant negative effect)
H5d	Positive link between CIT and reminder bias even if working memory is included as a covariate.	No (significant negative effect)
H5e	Positive link between CIT and reminder bias even if cognitive ability is included as a covariate.	No (significant negative effect)
H6a	A significant link exists between CIT and metacognitive bias (preregistered as a two-sided test, so either more or less confident).	Yes (positive)
H6b	A significant link exists between CIT and raw confidence.	Yes (positive)
H6c	A significant link exists between CIT and metacognitive bias even if cognitive ability is included as a covariate.	Yes (positive)
H7a	CIT acts as a moderator on the link between confidence and offloading. In other words, we expect to find that the correlation between the metacognitive and the reminder bias to be weakened in highly compulsive individuals.	No
H7b	CIT acts as a moderator on the link between confidence and offloading even if working memory is included as a covariate.	No
H7c	CIT acts as a moderator on the link between confidence and offloading even if cognitive ability is included as a covariate.	No
H8a	A significant negative link exists between AD and metacognitive bias (i.e. more anxious-depressed individuals tend to be underconfident).	Yes
H8b	A significant negative link exists between AD and raw confidence.	Yes
H8c	A significant negative link exists between AD and metacognitive bias even if cognitive ability is included as a covariate.	Yes

Hypotheses H5-H8 were the key hypotheses of our study. Here, we address them out of order in the interest of an improved logical flow. Hypothesis H6 predicted that more compulsive individuals would show an effect in confidence bias, reflected in a reliable predictor of the CITs scores on the metacognitive bias from the following regression model:

b i a s_{m e t a} \sim C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

Though we did not preregister a direction for this effect, in the light of recent findings, it has now become clear that compulsivity would most likely be linked to overconfidence (Rouault et al., 2018; Seow and Gillan, 2020; Benwell et al., 2022; Fox et al., 2023; Fox et al., 2024; Hoven et al., 2023a). The same model was used to test hypothesis H8, predicting that more AD individuals tend to be underconfident. This would be reflected in AD scores being negatively linked to the metacognitive bias. The model above represents the main models designed to test hypotheses H6a and H8a. We furthermore also tested these hypotheses but predicted raw confidence (percentage of circles participants predicted they would remember; H6b and H8b, respectively), as well as extending the main model with the scores from the cognitive ability test (ICAR5) as an additional covariate (H6c and H8c, respectively). For this, as well as all following regression models, we z-transformed all non-binary variables prior to fitting the models.

With H5, we predicted that more compulsive individuals would show a bias towards more offloading, reflected in a positive regression coefficient when using the CIT score as a predictor of the reminder bias. This hypothesis was not a replication; consequently, we decided to carry out the test two-sided. Throughout this section, whenever not explicit specified, we plan to carry out a test two-sided. Due to the diametrically opposing effects of CIT and AD, both transdiagnostic scores need to be entered into the model, alongside our demographic covariates age, gender, and educational attainment:

b i a s_{r e m} \sim C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

We fitted several different versions of this model: the main model predicted the reminder bias (H5a), but we also fit one with the absolute number of reminders chosen (H5b) or the AIP (H5c). To understand whether any differences in offloading behaviour could stem from differences in working memory capacity not already captured by our correction for unaided task performance, we furthermore extended the main model by also including the d’ from a 2-back task as a covariate (H5d). Finally, we fit an extended version of the main model with scores from the cognitive ability test (ICAR5) as an additional covariate to capture cognitive ability (H5e). We ran the same analysis but for the AD factor. We included this test as a preregistered analysis but did not specify any directional hypotheses.

Our final hypothesis, H7, aimed to differentiate between the Metacognitive Monitoring Mechanism, the Metacognitive Control Mechanism, and the Direct Mechanism. We tested how compulsivity would affect the relationship between confidence and offloading. More specifically, we predicted that CIT scores would act as a moderator variable between the metacognitive and the reminder bias, and that highly compulsive individuals would have a weaker link. We tested this by fitting the following regression model to the data:

b i a s_{r e m} \sim b i a s_{m e t a} * C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

To avoid circularity, we used the unconfounded metacognitive bias and reminder bias for this analysis. The moderation of CIT is reflected in its interaction term with the bias_meta predictor. A significant interaction term can be interpreted as support for the Metacognitive Control Mechanism. In addition to this main model (H7a), we furthermore also tested whether this effect would persist if working memory (2-back d’; H7b) or educational attainment (H7c) were included as additional covariates. We ran the same analysis but for the AD factor. We included this test as a preregistered analysis but did not specify any directional hypotheses.

It should be noted that whilst not explicitly preregistered, our planned models also allow testing for a mediation effect (metacognitive bias acting as a mediator on the effect of the CIT score on the reminder bias). This is done by comparing the effect of CIT on the reminder bias when the effect of the metacognitive bias is accounted for (Hypothesis 7) to when it is not (Hypothesis 5). In addition, we included a causal mediation analysis (not preregistered) using the mediation package in R. This analysis involved testing of the indirect effect using bootstrapping. More specifically, we computed unstandardised indirect effects for each of our 1000 bootstrapped samples and based on those the 95% confidence interval. To keep the information entering into the mediation analysis constant, we re-fitted the models from our sections on H5 and H6/H8 but with the unconfounded metacognitive bias and reminder bias, respectively. Furthermore, we had to treat the covariate ‘gender’ as a continuous variable as the mediation package would otherwise not have been able to fit the data. We expect that this difference is unlikely to cause any issues with the interpretation of our effects. A significant mediation effect can be interpreted as support for the Metacognitive Monitoring Mechanism. A significant direct effect can be interpreted as support for the Direct Mechanism.

Appendix 1

Appendix 1—table 1

Predicting metacognitive bias.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	0.07	0.05	1.30	0.193
AD	–0.23	0.05	–4.99	<0.001
CIT	0.15	0.05	3.11	0.002
Age	–0.02	0.04	–0.55	0.586
gender1 (m vs. f)	–0.17	0.08	–2.06	0.040
gender2 (m vs. o)	–0.29	0.38	–0.78	0.438
education	–0.0001	0.04	–0.005	0.996

Appendix 1—table 2

Predicting confidence.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	0.09	0.05	1.81	0.070
AD	–0.29	0.04	–6.43	<0.001
CIT	0.12	0.05	2.76	0.006
Age	–0.14	0.04	–3.44	<0.001
gender1 (m vs. f)	–0.24	0.08	–2.93	0.004
gender2 (m vs. o)	–0.23	0.37	–0.63	0.528
education	0.04	0.04	1.01	0.311

Appendix 1—table 3

Predicting metacognitive bias with ICAR5 scores as an additional covariate.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	0.08	0.05	1.68	0.094
AD	–0.20	0.05	–4.46	<0.001
CIT	0.12	0.05	2.57	0.011
Age	–0.03	0.04	–0.66	0.507
gender1 (m vs. f)	–0.22	0.08	–2.61	0.009
gender2 (m vs. o)	–0.45	0.37	–1.21	0.226
education	0.04	0.04	0.91	0.364
ICAR5	–0.20	0.04	–4.84	<0.001

Appendix 1—table 4

Predicting reminder bias.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	–0.01	0.05	–0.24	0.813
AD	0.07	0.05	1.46	0.146
CIT	–0.14	0.05	–2.91	0.004
Age	0.07	0.04	1.69	0.092
gender1 (m vs. f)	0.005	0.08	0.06	0.955
gender2 (m vs. o)	0.88	0.38	2.32	0.021
education	–0.06	0.04	–1.42	0.157

Appendix 1—table 5

Predicting absolute number of reminders.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	–0.03	0.05	–0.68	0.496
AD	0.06	0.05	1.33	0.183
CIT	–0.09	0.05	–1.94	0.053
Age	0.18	0.04	4.38	<0.001
gender1 (m vs. f)	0.07	0.08	0.86	0.393
gender2 (m vs. o)	0.73	0.38	1.93	0.054
education	–0.10	0.04	–2.58	0.010

Appendix 1—table 6

Predicting actual indifference point (AIP).

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	0.02	0.05	0.45	0.657
AD	–0.08	0.05	–1.76	0.079
CIT	0.10	0.05	2.25	0.025
Age	–0.17	0.04	–3.95	<0.001
gender1 (m vs. f)	–0.04	0.08	–0.45	0.652
gender2 (m vs. o)	–0.75	0.38	–1.99	0.047
education	0.09	0.04	2.24	0.025

Appendix 1—table 7

Predicting reminder bias with 2-back d’ as an additional covariate.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	–0.01	0.05	–0.23	0.821
AD	0.06	0.05	1.23	0.219
CIT	–0.12	0.05	–2.57	0.010
Age	0.07	0.04	1.78	0.076
gender1 (m vs. f)	0.004	0.08	0.05	0.961
gender2 (m vs. o)	0.86	0.38	2.25	0.025
education	–0.06	0.04	–1.56	0.120
2-back d’	0.10	0.04	2.41	0.016

Appendix 1—table 8

Predicting reminder bias with ICAR5 scores as an additional covariate.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	–0.01	0.05	–0.26	0.796
AD	0.07	0.05	1.41	0.160
CIT	–0.14	0.05	–2.85	0.005
Age	0.07	0.04	1.70	0.091
gender1 (m vs. f)	0.01	0.08	0.09	0.927
gender2 (m vs. o)	0.90	0.38	2.33	0.020
education	–0.06	0.04	–1.45	0.147
2-back d’	0.01	0.04	0.32	0.751

Appendix 1—table 9

Predicting reminder bias with metacognitive bias as an additional covariate (i.e. testing for a moderation effect).

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other; MetaBias = metacognitive bias.

	β	SE	t	p
Intercept	–0.01	0.05	–0.25	0.802
Metacognitive bias	–0.19	0.04	–4.66	<0.001
AD	0.03	0.05	0.67	0.506
CIT	–0.10	0.05	–2.14	0.032
Age	0.13	0.04	3.22	0.001
gender1 (m vs. f)	0.003	0.08	0.04	0.969
gender2 (m vs. o)	0.99	0.37	2.65	0.008
education	–0.08	0.04	–2.03	0.043
CIT X MetaBias	–0.01	0.04	–0.18	0.857

Appendix 1—table 10

Predicting reminder bias with metacognitive bias and 2-back d’ as additional covariates.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other; MetaBias = metacognitive bias.

	β	SE	t	p
Intercept	–0.01	0.05	–0.26	0.797
Metacognitive Bias	–0.17	0.04	–4.27	<0.001
AD	0.03	0.05	0.55	0.584
CIT	–0.09	0.05	–1.90	0.058
Age	0.14	0.04	3.28	0.001
gender1 (m vs. f)	0.004	0.08	0.06	0.953
gender2 (m vs. o)	0.97	0.37	2.60	0.010
education	–0.09	0.04	–2.13	0.034
2-back d’	0.08	0.04	1.95	0.052
CIT X MetaBias	–0.01	0.04	–0.26	0.793

Appendix 1—table 11

Predicting reminder bias with metacognitive bias and ICAR5 scores as additional covariates.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other; MetaBias = metacognitive bias.

	β	SE	t	p
Intercept	–0.01	0.05	–0.27	0.789
Metacognitive bias	–0.19	0.04	–4.57	<0.001
AD	0.03	0.05	0.64	0.521
CIT	–0.10	0.05	–2.11	0.035
Age	0.13	0.04	3.22	0.001
gender1 (m vs. f)	0.005	0.08	0.06	0.949
gender2 (m vs. o)	1.00	0.37	2.66	0.008
education	–0.08	0.04	–2.03	0.043
ICAR5	0.009	0.04	0.22	0.829
CIT X MetaBias	–0.01	0.04	–0.18	0.859

Appendix 1—table 12

Predicting reminder bias with metacognitive bias and ICAR5 scores as additional covariates.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other; MetaBias = metacognitive bias.

	β	SE	t	p
Intercept	–0.02	0.05	–0.37	0.712
Metacognitive bias	–0.19	0.04	–4.65	<0.001
AD	0.03	0.05	0.67	0.501
CIT	–0.10	0.05	–2.15	0.032
Age	0.13	0.04	3.15	0.002
gender1 (m vs. f)	0.004	0.08	0.04	0.966
gender2 (m vs. o)	0.99	0.37	2.65	0.008
education	–0.08	0.04	–1.91	0.057
AD X MetaBias	–0.04	0.04	–0.94	0.349

Questionnaire items

Apathy Evaluation Scale

Response scores:

Not at all characteristic (1)
Slightly characteristic (2)
Somewhat characteristic (3)
Very characteristic (4)

AES_2. I get things done during the day. (Reverse)
AES_7. I approach life with intensity. (Reverse)
AES_17. I have initiative. (Reverse)
AES_18. I have motivation. (Reverse)

Barrett’s Impulsivity Scale

Response scores:

Rarely/never (1)
Occasionally (2)
Often (3)
Almost always/Always (4)

Items:

BIS_1. I plan tasks carefully. (Reverse)
BIS_6. I have ‘racing’ thoughts.
BIS_9. I concentrate easily. (Reverse)
BIS_13. I plan for job security. (Reverse)
BIS_14. I say things without thinking.
BIS_15. I like to think about complex problems. (Reverse)
BIS_17. I act ‘on impulse’.
BIS_20. I am a steady thinker. (Reverse)
BIS_22. I buy things on impulse.
BIS_25. I spend or charge more than I earn.
BIS_26. I often have extraneous thoughts when thinking.
BIS_check. I competed in the 1917 Summer Olympics Games.

Eating Attitudes Test

Response scores:

Always (3)
Usually (2)
Often (1)
Sometimes (0)
Rarely (0)
Never (0)

Items:

EAT_1. I am terrified about being overweight.
EAT_11. I am preoccupied with a desire to be thinner.
EAT_12. I think about burning up calories when I exercise.
EAT_14. I am preoccupied with the thought of having fat on my body.

Obsessive Compulsive Inventory

Response scores:

Not at all (0)

A little (1)

Moderately (2)

A lot (3)

Extremely (4)

Items:

OCI_1. I have saved up so many things that they get in the way.
OCI_2. I check things more often than necessary.
OCI_4. I feel compelled to count while I am doing things.
OCI_6. I find it difficult to control my own thoughts.
OCI_7. I collect things I don’t need.
OCI_9. I get upset if others change the way I have arranged things.
OCI_11. I sometimes have to wash or clean myself simply because I feel contaminated.
OCI_12. I am upset by unpleasant thoughts that come into my mind against my will.
OCI_13. I avoid throwing things away because I am afraid I might need them later.
OCI_16. I feel that there are good and bad numbers.
OCI_18. I frequently get nasty thoughts and have difficulty in getting rid of them.

Self-rating Depression Scale (SDS)

Response scores:

A little of the time (1)
Some of the time (2)
Good part of the time (3)
Most of the time (4)

Items:

SDS_11. My mind is as clear as it used to be. (Reverse)
SDS_12. I find it easy to do the things I used to. (Reverse)
SDS_13. I am restless and can’t keep still.
SDS_14. I feel hopeful about the future. (Reverse)
SDS_16. I find it easy to make decisions. (Reverse)
SDS_17. I feel that I am useful and needed. (Reverse)
SDS_18. My life is pretty full. (Reverse)
SDS_20. I still enjoy the things I used to do. (Reverse)

State Trait Anxiety Inventory

Response scores:

Almost never (1)

Sometimes (2)

Often (3)

Almost always (4)

Items:

STAI_1. I feel pleasant. (Reverse)
STAI_3. I feel satisfied with myself. (Reverse)
STAI_5. I feel like a failure.
STAI_8. I feel that difficulties are piling up so that I cannot overcome them.
STAI_9. I worry too much over something that really doesn’t matter.
STAI_10. I am happy. (Reverse)
STAI_12. I lack self-confidence.
STAI_13. I feel secure. (Reverse)
STAI_16. I am content. (Reverse)
STAI_19. I am a steady person. (Reverse)
STAI_20. I get in a state of tension or turmoil as I think over my recent concerns and interests.

Educational attainment questions

Educational attainment was based on the ISCED 2011 categories and included the following options mapped onto 1–9 in response to the question ‘What is the highest level of education you have completed to this date?’:

‘Early childhood education or no formal education (e.g. early childhood education and development, play school, reception, pre-primary, pre-school, educación inicial)’
‘Primary education (e.g. primary education, elementary education, basic education; typically ends around age 10–12 years)’
‘Lower secondary education (e.g. lower grades of secondary school, junior secondary school, middle school, junior high school)’
‘Upper secondary education (e.g. upper grades of secondary school, senior secondary school, senior high school; typically ends around age 17–18 years)’
‘Post-secondary non-tertiary education (e.g. technician diploma, primary professional education, préparation aux carrières administratives; usually designed for direct labour market entry)’
‘Short-cycle tertiary education (e.g. junior college, higher technical education, community college education, technician or advanced/higher vocational training, associate degree, bac+2; practically based, occupationally specific and prepare for the labour market but can also be a pathway to other tertiary education programmes)’
‘Bachelor’s or equivalent level’
‘Master’s or equivalent level’
‘Doctoral or equivalent level’

Psychometric curve fitting

We used the quickpsy package in R to fit psychometric curves to each participant’s choice data to derive their AIP, which was operationalised as the threshold parameter when predicting reminder choices from target values. We set the initial parameter ranges from 2 to 9 for the threshold parameter and from 1 to 500 for the slope parameter, based on the task’s properties and pilot data. Such a restriction of the threshold parameter was intended to increase the comparability between AIP and OIP, and hence improved the calculation of the reminder bias. Apart from those parameter ranges, we used only default settings of the quickpsy() function.

Each participant has only 16 trials (2 for each target value) contribute to the curve fitting. To understand the robustness of the AIP based on such limited data, we conducted a parameter recovery analysis. We simulated 16 trials based on each psychometric function and re-ran the curve fitting based on those simulated choices. There was close correspondence between the actual and recovered threshold parameters (or AIPs) with a correlation of r=0.94, p<0.001 (see also Appendix 1—figure 1). In contrast, the slope parameter – which was not central to any of our analyses – exhibited greater variability during the initial fitting. This increased uncertainty likely contributed to slightly poorer recovery in the simulation (r=0.23, p<0.001).

Appendix 1—figure 1

Download asset Open asset

The actual indifference point (AIP) is shown on the x-axis against its recovered estimates on the y-axis.

Each marker represents one participant’s estimates.

Appendix 1—figure 2

Download asset Open asset

Psychometric functions linking target values to offloading choices.

The average choice data is shown as dots. Panels show the individual curves for participants 1–20.

Appendix 1—figure 3

Download asset Open asset

Appendix 1—figure 4

Download asset Open asset

Appendix 1—figure 5

Download asset Open asset

Appendix 1—figure 6

Download asset Open asset

Appendix 1—figure 7

Download asset Open asset

Appendix 1—figure 8

Download asset Open asset

Appendix 1—figure 9

Download asset Open asset

Appendix 1—figure 10

Download asset Open asset

Appendix 1—figure 11

Download asset Open asset

Appendix 1—figure 12

Download asset Open asset

Appendix 1—figure 13

Download asset Open asset

Appendix 1—figure 14

Download asset Open asset

Appendix 1—figure 15

Download asset Open asset

Appendix 1—figure 16

Download asset Open asset

Appendix 1—figure 17

Download asset Open asset

Appendix 1—figure 18

Download asset Open asset

Appendix 1—figure 19

Download asset Open asset

Appendix 1—figure 20

Download asset Open asset

Appendix 1—figure 21

Download asset Open asset

Appendix 1—figure 22

Download asset Open asset

Appendix 1—figure 23

Download asset Open asset

Appendix 1—figure 24

Download asset Open asset

Appendix 1—figure 25

Download asset Open asset

Appendix 1—figure 26

Download asset Open asset

Appendix 1—figure 27

Download asset Open asset

Appendix 1—figure 28

Download asset Open asset

Appendix 1—figure 29

Download asset Open asset

Appendix 1—figure 30

Download asset Open asset

Appendix 1—figure 31

Download asset Open asset

Planned additional analyses

We preregistered several analyses that whilst planned were not thought out in detail and thus of a more exploratory nature.

First, we investigated the relationship between the transdiagnostic phenotypes and unaided task performance (accuracy on trials in which participants were not allowed to use a reminder). We predicted task performance (accuracy on FI trials) from the two transdiagnostic factors and our demographic covariates (age, gender, and education).

A C C_{F I} \sim C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

Neither AD, β=–0.02, SE = 0.05, t=–0.39, p=0.69; nor CIT, β=–0.06, SE = 0.05, t=–1.33, p=0.18, were significant predictors, suggesting there was no evidence to support meaningful performance differences for these two transdiagnostic phenotypes. The full results can be found in Appendix 1—table 13 as well as in Appendix 1—figure 32.

Appendix 1—table 13

Predicting internal accuracy.

All continuous variables are z-transformed. SE = standard error; m=male; f=female; o=other.

	β	SE	t	p
Intercept	0.01	0.05	0.27	0.784
AD	–0.02	0.05	–0.39	0.693
CIT	–0.06	0.05	–1.33	0.183
Age	–0.15	0.04	–3.50	<0.001
gender1 (m vs. f)	–0.04	0.08	–0.52	0.606
gender2 (m vs. o)	0.15	0.38	0.40	0.687
education	0.05	0.04	1.29	0.199

Appendix 1—figure 32

Download asset Open asset

The distribution of the unaided task performance (internal accuracy) as a function of ‘compulsive behaviour and intrusive thought’ (CIT; left panel) and ‘anxious depression’ (AD; right panel).

Second, we planned to investigate whether participants would show a response ‘stickiness’ in their reminder use as reported by Scarampi and Gilbert, 2020, and whether such response perseverance would correlate with their CIT (e.g. Shahar et al., 2021) and AD scores. We conducted this analysis based on a subset of participants: only participants whose first trial was not a partial trial were included (choice condition, as no strategy was performed that could later be repeated). Furthermore, only participants who indicated with their strategy choice that they would not have chosen the strategy they were randomly assigned were included. This resulted in a subset of N=157 participants. We then calculated the proportion of the remaining trials in which participants chose to repeat this strategy and compared the resulting proportion to 50%. Participants repeated this strategy on only 31.6% of the remaining trials, significantly lower than 50%, t(156) = –8.43, p<0.001, d=0.67. We need to keep in mind that the strategy in question was the one they did not choose on the first trial. Most people have an overall preference for or against reminders in this task independent of the manipulation of reward. A person who shows numbers lower than 50% might thus have repeatedly chosen the same strategy they had overwritten on that first trial, reflecting stable biases for their preferred response strategies. We furthermore tested whether this response perseverance was predicted by CIT or AD (and demographic covariates) fitting the following regression model to the data:

p e r s e v e r a n c e \sim C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

We indeed found that the effect was modulated by CIT: Compulsive individuals showed more response perseverance, β=0.20, SE = 0.09, t=2.17, p=0.03, whereas there was no significant effect on AD, β=–0.08, SE = 0.09, t=–0.85, p=0.40. As a conclusion, we can say that the more compulsive individuals show a tendency to repeat that first forced/overwritten trial later in the remaining experiment. However, caution should be advised to not over-interpret this effect as only a small subset of our sample was included in the analysis.

Exploratory analyses

During our analysis, several additional questions arose, which we aimed to address with exploratory analyses.

First, we aimed to understand whether highly compulsive individuals would approach our task differently, potentially even struggling with it. For instance, some OCD patients prefer ordered sequences, and it might therefore have been aversive for our compulsive individuals to move the target circles out of the numbered order to set reminders. Additionally, they might have been put off by the scattered nature of the visual display and might have spent time rearranging circles, e.g., in a grid-like fashion. We therefore tested whether the transdiagnostic phenotypes were reliable predictors of response times (RTs), depending on condition with the following linear mixed model with random intercepts for participants:

R T \sim c o n d i t i o n * (C I T + A D) + a g e + g e n d e r + e d u c a t i o n + ε

where condition denoted whether or not participants did the task with (FE) or without (FI) reminders. CIT did not significantly predict RT, β=–0.01, SE = 0.02, t(1153)=–0.42, p=0.67. AD, on the other hand, was a significant predictor of RT, β=–0.04, SE = 0.02, t(1228)=–2.05, p=0.04. There was furthermore a crucially significant interaction between CIT and condition, β=0.08, SE = 0.02, t(9580)=3.36, p<0.001. In other words, when reminders were possible, compulsive individuals were slower (0.04); but when reminders were not possible, compulsive individuals were faster (–0.04). There was no such interaction effect for the AD factor, β=–0.02, SE = 0.02, t(9577)=–0.82, p=0.41.

We followed up this analysis by fitting two additional linear mixed models with random intercepts for participants to gain insight into what actions highly compulsive individuals might have been performing during the reminder trials that might have led to the increase in RT. The first model predicted the trial-wise number of times that participants rearranged a circle they had previously already moved:

c i r c l e s_{m o v e d a g a i n} \sim c o n d i t i o n * C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

Compulsive individuals showed a tendency to this more often; however, this effect did not reach our level of significance, β=0.03, SE = 0.02, t(1245)=1.80, p=0.07.

The second model focused on a smaller subset of trials (m=4.2 trials per participant; min = 4, max = 12) in which more than one circle was moved and expressed the extent to which these circles were moved in their numbered order:

c i r c l e s_{m o v e d i n o r d e r} \sim c o n d i t i o n * C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

Numerically and contrary to expectation, highly compulsive individuals showed a reduced tendency to move circles in their numbered order. However, this effect did not reach our level of significance, β=–0.06, SE = 0.04, t(929.5)=–1.61, p=0.107. Taken together, high CIT individuals took significantly longer on reminder trials, but we cannot say with certainty why, and this will therefore need to be the focus of future studies.

Second, we asked whether the transdiagnostic phenotypes affected the compensatory nature of reminder use (cf. Hypothesis 4), meaning people who need reminders more tend to be the ones who use them more. The motivation for this analysis was the compulsive individual’s tendency towards a reduced reminder bias. To this end, we fit a regression model to predict the AIP from the OIP, the transdiagnostic phenotypes, and the demographic covariates:

A I P \sim O I P * (C I T + A D) + a g e + g e n d e r + e d u c a t i o n + ε

The key effects of interest were the interaction terms between the transdiagnostic phenotypes and the OIP.

Indeed, this seems to be the case reflected numerically in the interaction between CIT and OIP when predicting AIP, in other words, there was a stronger link between AIP and OIP in low compulsive individuals, compared to high compulsive individuals. However, this effect did not reach our required level of significance, β=–0.08, SE = 0.04, t=–1.75, p=0.08. There was no such interaction effect for the AD factor, β=0.01, SE = 0.04, t=0.34, p=0.73. In a follow-up analysis, we then investigated the influence of compulsivity on accuracy in the FE condition by fitting the following regression model:

A C C_{F E} \sim C I T + A D + a g e + g e n d e r + e d u c a t i o n + ε

Compulsive individuals were found to be less accurate on reminder trials, β=–0.12, SE = 0.5, t=–2.53, p=0.01, pointing towards a picture of not only impaired reminder setting but also impaired reminder use. We note that whilst this is an interesting finding, it certainly needs follow-up in future studies to understand the mechanisms at play.

Data availability

All data and analysis scripts are available for download at https://osf.io/b9rxz/.

The following data sets were generated

1. Boldt A
2. Fox CA
3. Gillan C
4. Gilbert S
(2025) Open Science Framework
ID b9rxz. Compulsivity, confidence and reminder setting.

https://osf.io/b9rxz/

References

1. Bates D
2. Mächler M
3. Bolker B
4. Walker S
(2015) Fitting linear mixed-effects models using lme4
Journal of Statistical Software 67:1–48.

https://doi.org/10.18637/jss.v067.i01
- Google Scholar
(2020) effectsize: estimation of effect size indices and standardized parameters
Journal of Open Source Software 5:2815.

https://doi.org/10.21105/joss.02815
- Google Scholar
1. Benwell CSY
2. Mohr G
3. Wallberg J
4. Kouadio A
5. Ince RAA
(2022) Psychiatrically relevant signatures of domain-general decision-making and metacognition in the general population
Npj Mental Health Research 1:10.

https://doi.org/10.1038/s44184-022-00009-4
- PubMed
- Google Scholar
1. Boldt A
2. Gilbert SJ
(2019) Confidence guides spontaneous cognitive offloading
Cognitive Research 4:45.

https://doi.org/10.1186/s41235-019-0195-y
- PubMed
- Google Scholar
1. Boldt A
2. Gilbert SJ
(2022) Partially overlapping neural correlates of metacognitive monitoring and metacognitive control
The Journal of Neuroscience 42:3622–3635.

https://doi.org/10.1523/JNEUROSCI.1326-21.2022
- PubMed
- Google Scholar
Software
1. Champely S
(2020) Pwr: basic functions for power analysis, version 1.3-0
R Package.

https://github.com/heliosdrm/pwr
1. Condon DM
2. Revelle W
(2014) The international cognitive ability resource: development and initial validation of a public-domain measure
Intelligence 43:52–64.

https://doi.org/10.1016/j.intell.2014.01.004
- Google Scholar
(2003) The prospective and retrospective memory questionnaire (PRMQ): normative data and latent structure in a large non-clinical sample
Memory 11:261–275.

https://doi.org/10.1080/09658210244000027
- PubMed
- Google Scholar
(2020) Transdiagnostic approaches to mental health problems: Current status and future directions
Journal of Consulting and Clinical Psychology 88:179–195.

https://doi.org/10.1037/ccp0000482
- PubMed
- Google Scholar
1. Den Ouden L
2. Suo C
3. Albertella L
4. Greenwood LM
5. Lee RSC
6. Fontenelle LF
7. Parkes L
8. Tiego J
9. Chamberlain SR
10. Richardson K
11. Segrave R
12. Yücel M
(2022) Transdiagnostic phenotypes of compulsive behavior and associations with psychological, cognitive, and neurobiological affective processing
Translational Psychiatry 12:10.

https://doi.org/10.1038/s41398-021-01773-1
- PubMed
- Google Scholar
(2023) Value-based routing of delayed intentions into brain-based versus external memory stores
Journal of Experimental Psychology. General 152:175–187.

https://doi.org/10.1037/xge0001261
- PubMed
- Google Scholar
(2011) Perfectionism as a transdiagnostic process: a clinical review
Clinical Psychology Review 31:203–212.

https://doi.org/10.1016/j.cpr.2010.04.009
- PubMed
- Google Scholar
1. Engeler NC
2. Gilbert SJ
(2020) The effect of metacognitive training on confidence and strategic reminder setting
PLOS ONE 15:e0240858.

https://doi.org/10.1371/journal.pone.0240858
- PubMed
- Google Scholar
1. Foa EB
2. Huppert JD
3. Leiberg S
4. Langner R
5. Kichic R
6. Hajcak G
7. Salkovskis PM
(2002)
The obsessive-compulsive inventory: development and validation of a short version

Psychological Assessment 14:485–496.
- PubMed
- Google Scholar
1. Fox CA
2. Lee CT
3. Hanlon AK
4. Seow TXF
5. Lynch K
6. Harty S
7. Richards D
8. Palacios J
9. O’Keane V
10. Stephan KE
11. Gillan CM
(2023) An observational treatment study of metacognition in anxious-depression
eLife 12:e87193.

https://doi.org/10.7554/eLife.87193
- PubMed
- Google Scholar
1. Fox CA
2. McDonogh A
3. Donegan KR
4. Teckentrup V
5. Crossen RJ
6. Hanlon AK
7. Gallagher E
8. Rouault M
9. Gillan CM
(2024) Reliable, rapid, and remote measurement of metacognitive bias
Scientific Reports 14:14941.

https://doi.org/10.1038/s41598-024-64900-0
- PubMed
- Google Scholar
(1982) The eating attitudes test: psychometric features and clinical correlates
Psychological Medicine 12:871–878.

https://doi.org/10.1017/s0033291700049163
- PubMed
- Google Scholar
1. Gilbert SJ
(2015) Strategic use of reminders: influence of both domain-general and task-specific metacognitive confidence, independent of objective memory ability
Consciousness and Cognition 33:245–260.

https://doi.org/10.1016/j.concog.2015.01.006
- PubMed
- Google Scholar
1. Gilbert SJ
2. Bird A
3. Carpenter JM
4. Fleming SM
5. Sachdeva C
6. Tsai P-C
(2020) Optimal use of reminders: metacognition, effort, and cognitive offloading
Journal of Experimental Psychology. General 149:501–517.

https://doi.org/10.1037/xge0000652
- PubMed
- Google Scholar
1. Gilbert SJ
2. Boldt A
3. Sachdeva C
4. Scarampi C
5. Tsai PC
(2023) Outsourcing memory to external tools: a review of “intention offloading”
Psychonomic Bulletin & Review 30:60–76.

https://doi.org/10.3758/s13423-022-02139-4
- PubMed
- Google Scholar
1. Gillan CM
2. Kosinski M
3. Whelan R
4. Phelps EA
5. Daw ND
(2016) Characterizing a psychiatric symptom dimension related to deficits in goal-directed control
eLife 5:e11305.

https://doi.org/10.7554/eLife.11305
- PubMed
- Google Scholar
1. Harkin B
2. Kessler K
(2011) The role of working memory in compulsive checking and OCD: A systematic classification of 58 experimental findings
Clinical Psychology Review 31:1004–1021.

https://doi.org/10.1016/j.cpr.2011.06.004
- PubMed
- Google Scholar
1. Harris LM
2. Vaccaro L
3. Jones MK
4. Boots GM
(2010) Evidence of impaired event-based prospective memory in clinical obsessive–compulsive checking
Behaviour Change 27:84–92.

https://doi.org/10.1375/bech.27.2.84
- Google Scholar
Website
1. Harvey A
(2025) Cognitive Behavioural Processes across Psychological Disorders: A transdiagnostic approach to research and treatment
Accessed May 16, 2025.

https://doi.org/10.1093/med:psych/9780198528883.001.0001
Preprint
1. Hopkins AK
2. Gillan C
3. Roiser JP
4. Wise T
5. Sidarus N
(2022) Optimising the Measurement of Anxious-Depressive, Compulsivity and Intrusive Thought and Social Withdrawal Transdiagnostic Symptom Dimensions
PsyArXiv.

https://doi.org/10.31234/osf.io/q83sh
- Google Scholar
(2019) Abnormalities of confidence in psychiatry: an overview and future perspectives
Translational Psychiatry 9:268.

https://doi.org/10.1038/s41398-019-0602-7
- PubMed
- Google Scholar
1. Hoven M
2. Luigjes J
3. Denys D
4. Rouault M
5. van Holst RJ
(2023a) How do confidence and self-beliefs relate in psychopathology: a transdiagnostic approach
Nature Mental Health 1:337–345.

https://doi.org/10.1038/s44220-023-00062-8
- Google Scholar
Preprint
1. Hoven M
2. Mulder T
3. Denys D
4. van Holst R
5. Luigjes J
(2023b) OCD patients show lower confidence and higher error sensitivity while learning under volatility compared to healthy and highly compulsive samples from the general population
PsyArXiv.

https://doi.org/10.31234/osf.io/37nad
- Google Scholar
(2023c) Differences in metacognitive functioning between obsessive-compulsive disorder patients and highly compulsive individuals from the general population
Psychological Medicine 53:7933–7942.

https://doi.org/10.1017/S003329172300209X
- PubMed
- Google Scholar
(2021) Preserving prospective memory in daily life: a systematic review and meta-analysis of mnemonic strategy, cognitive training, external memory aid, and combination interventions
Neuropsychology 35:123–140.

https://doi.org/10.1037/neu0000704
- PubMed
- Google Scholar
1. Kirchner WK
(1958) Age differences in short-term retention of rapidly changing information
Journal of Experimental Psychology 55:352–358.

https://doi.org/10.1037/h0043688
- PubMed
- Google Scholar
(2021) Trait anxiety does not correlate with metacognitive confidence or reminder usage in a delayed intentions task
Quarterly Journal of Experimental Psychology 74:634–644.

https://doi.org/10.1177/1747021820970156
- PubMed
- Google Scholar
1. Kirkegaard EOW
2. Bjerrekær JD
(2016) ICAR5: design and validation of a 5-item public domain cognitive ability test
Open Differential Psychology 01:e0711.

https://doi.org/10.26775/ODP.2016.07.11
- Google Scholar
(2017) lmerTest package: tests in linear mixed effects models
Journal of Statistical Software 82:1–26.

https://doi.org/10.18637/jss.v082.i13
- Google Scholar
1. Linares D
2. López-Moliner J
(2016) quickpsy: an R package to fit psychometric functions for multiple groups
The R Journal 8:122.

https://doi.org/10.32614/RJ-2016-008
- Google Scholar
Preprint
(2022) Consistency within change: evaluating the psychometric properties of a widely-used predictive-inference task
PsyArXiv.

https://doi.org/10.31234/osf.io/qkf7j
- Google Scholar
(1991) Reliability and validity of the apathy evaluation scale
Psychiatry Research 38:143–162.

https://doi.org/10.1016/0165-1781(91)90040-v
- PubMed
- Google Scholar
(2022) Atypical action updating in a dynamic environment associated with adolescent obsessive-compulsive disorder
Journal of Child Psychology and Psychiatry, and Allied Disciplines 63:1591–1601.

https://doi.org/10.1111/jcpp.13628
- PubMed
- Google Scholar
(2024) Information search under uncertainty across transdiagnostic psychopathology and healthy ageing
Translational Psychiatry 14:353.

https://doi.org/10.1038/s41398-024-03065-w
- PubMed
- Google Scholar
(2006) Body checking in the eating disorders: associations between cognitions and behaviors
The International Journal of Eating Disorders 39:708–715.

https://doi.org/10.1002/eat.20279
- PubMed
- Google Scholar
(1995) Factor structure of the barratt impulsiveness scale
Journal of Clinical Psychology 51:768–774.

https://doi.org/10.1002/1097-4679(199511)51:6<768::AID-JCLP2270510607>3.0.CO;2-1
- Google Scholar
1. Racsmany M
2. Demeter G
3. Csigo K
4. Harsanyi A
5. Nemeth A
(2011) An experimental study of prospective memory in obsessive-compulsive disorder
Journal of Clinical and Experimental Neuropsychology 33:85–91.

https://doi.org/10.1080/13803395.2010.493147
- PubMed
- Google Scholar
Software
1. R Development Core Team
(2024) R: A language and environment for statistical computing
R Foundation for Statistical Computing, Vienna, Austria.

https://www.R-project.org
1. Risko EF
2. Gilbert SJ
(2016) Cognitive offloading
Trends in Cognitive Sciences 20:676–688.

https://doi.org/10.1016/j.tics.2016.07.002
- Google Scholar
1. Rouault M
2. Seow T
3. Gillan CM
4. Fleming SM
(2018) Psychiatric symptom dimensions are associated with dissociable shifts in metacognition but not task performance
Biological Psychiatry 84:443–451.

https://doi.org/10.1016/j.biopsych.2017.12.017
- PubMed
- Google Scholar
Software
1. RStudio Team
(2020) RStudio: integrated development for R
RStudio.

http://www.rstudio.com
1. Sachdeva C
2. Gilbert SJ
(2020) Excessive use of reminders: metacognition and effort-minimisation in cognitive offloading
Consciousness and Cognition 85:103024.

https://doi.org/10.1016/j.concog.2020.103024
- PubMed
- Google Scholar
(2020) Building, hosting and recruiting: a brief introduction to running behavioral experiments online
Brain Sciences 10:1–11.

https://doi.org/10.3390/brainsci10040251
- PubMed
- Google Scholar
1. Scarampi C
2. Gilbert SJ
(2020) The effect of recent reminder setting on subsequent strategy and performance in a prospective memory task
Memory 28:677–691.

https://doi.org/10.1080/09658211.2020.1764974
- PubMed
- Google Scholar
1. Scullin MK
2. Jones WE
3. Phenis R
4. Beevers S
5. Rosen S
6. Dinh K
7. Kiselica A
8. Keefe FJ
9. Benge JF
(2022) Using smartphone technology to improve prospective memory functioning: a randomized controlled trial
Journal of the American Geriatrics Society 70:459–469.

https://doi.org/10.1111/jgs.17551
- PubMed
- Google Scholar
1. Seow TXF
2. Gillan CM
(2020) Transdiagnostic phenotyping reveals a host of metacognitive deficits implicated in compulsivity
Scientific Reports 10:2883.

https://doi.org/10.1038/s41598-020-59646-4
- PubMed
- Google Scholar
(2021) Assigning the right credit to the wrong action: compulsivity in the general population is associated with augmented outcome-irrelevant value-based learning
Translational Psychiatry 11:564.

https://doi.org/10.1038/s41398-021-01642-x
- PubMed
- Google Scholar
Software
1. Soetaert K
(2020) Diagram: functions for visualising simple graphs (networks), plotting flow diagrams, version 1.6.5
R Package.

https://CRAN.R-project.org/package=diagram
Preprint
1. Sookud S
2. Martin I
3. Gillan C
4. Wise T
(2024) Impaired goal-directed planning in transdiagnostic compulsivity is explained by uncertainty about learned task structure
PsyArXiv.

https://doi.org/10.31234/osf.io/zp6vk
- Google Scholar
Book
(1983)
Manual for the State-Trait Anxiety Inventory

Consulting Psychologists Press.
- Google Scholar
1. Starcevic V
2. Berle D
3. Brakoulias V
4. Sammut P
5. Moses K
6. Milicevic D
7. Hannan A
(2011) Functions of compulsions in obsessive-compulsive disorder
The Australian and New Zealand Journal of Psychiatry 45:449–457.

https://doi.org/10.3109/00048674.2011.567243
- PubMed
- Google Scholar
(2011) Testing a maintenance model for eating disorders in a sample seeking treatment at a tertiary care center: a structural equation modeling approach
Comprehensive Psychiatry 52:678–687.

https://doi.org/10.1016/j.comppsych.2010.12.010
- PubMed
- Google Scholar
Software
1. Tingley D
2. Yamamoto T
3. Hirose K
4. Keele I
5. Imai K
(2014) Mediation: R package for causal mediation analysis
Journal of Statistical Software.

http://www.jstatsoft.org/v59/i05/
1. Tolin DF
2. Abramowitz JS
3. Brigidi BD
4. Amir N
5. Street GP
6. Foa EB
(2001) Memory and memory confidence in obsessive-compulsive disorder
Behaviour Research and Therapy 39:913–927.

https://doi.org/10.1016/s0005-7967(00)00064-4
- PubMed
- Google Scholar
1. Vaghi MM
2. Luyckx F
3. Sule A
4. Fineberg NA
5. Robbins TW
6. De Martino B
(2017) Compulsivity reveals a novel dissociation between action and confidence
Neuron 96:348–354.

https://doi.org/10.1016/j.neuron.2017.09.006
- PubMed
- Google Scholar
1. Wickham H
(2011) The split-apply-combine strategy for data analysis
Journal of Statistical Software 40:1–29.

https://doi.org/10.18637/jss.v040.i01
- Google Scholar
1. Wise T
2. Dolan RJ
(2020) Associations between aversive learning processes and transdiagnostic psychiatric symptoms in a general population sample
Nature Communications 11:4179.

https://doi.org/10.1038/s41467-020-17977-w
- PubMed
- Google Scholar
(2023) Identifying transdiagnostic mechanisms in mental health using computational factor modeling
Biological Psychiatry 93:690–703.

https://doi.org/10.1016/j.biopsych.2022.09.034
- PubMed
- Google Scholar
1. Zung WW
(1965) A self-rating depression scale
Archives of General Psychiatry 12:63–70.

https://doi.org/10.1001/archpsyc.1965.01720310065008
- PubMed
- Google Scholar

Article and author information

Author details

Annika Boldt

Institute of Cognitive Neuroscience, University College London, London, United Kingdom

Contribution
Conceptualization, Resources, Data curation, Software, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing

For correspondence
a.boldt@ucl.ac.uk

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-6913-5099
Celine Ann Fox

School of Psychology, Trinity College Dublin, Dublin, Ireland

Contribution
Conceptualization, Resources, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-1740-3765
Claire M Gillan

School of Psychology, Trinity College Dublin, Dublin, Ireland

Contribution
Conceptualization, Resources, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0001-9065-403X
Sam Gilbert

Institute of Cognitive Neuroscience, University College London, London, United Kingdom

Contribution
Conceptualization, Resources, Software, Supervision, Funding acquisition, Methodology, Writing – review and editing

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-3839-7045

Funding

Wellcome Trust

https://doi.org/10.35802/206480

Annika Boldt

Ireland's Research Frontiers (19/FFP/6418)

Claire M Gillan

European Research Council (ERC-H2020-HABIT)

Claire M Gillan

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. For the purpose of Open Access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Acknowledgements

The Wellcome Trust, 206480/Z/17/Z, Annika Boldt. Research Ireland’s Frontiers, 19/FFP/6418, Claire Gillan. European Research Council (ERC), ERC-H2020-HABIT, Claire Gillan.

Ethics

Both informed consent and consent to publish were obtained. Ethical approval for this study was received from the local Ethics Committee at University College London (UCL) under the reference number 1584/003.

Version history

Sent for peer review: May 2, 2024
Preprint posted: June 20, 2024
Reviewed Preprint version 1: July 29, 2024
Reviewed Preprint version 2: January 24, 2025
Reviewed Preprint version 3: February 26, 2025
Version of Record published: May 29, 2025

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.98114. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.