Abstract
Generalising information from ourselves to others, and others to ourselves allows for both a dependable source of navigation and adaptability in interpersonal exchange. Disturbances to social development in sensitive periods can cause enduring and distressing damage to lasting healthy relationships. However, identifying the mechanisms of healthy exchange has been difficult. We introduce a theory of self-other generalisation tested with data from a three-phase social value orientation task – the Intentions Game. We involved individuals with (n=50) and without (n=53) a diagnosis of borderline personality disorder and assessed whether infractions to self-other generalisation may explain prior findings of disrupted social learning and instability. Healthy controls initially used their preferences to predict others and were influenced by their partners, leading to self-other convergence. In contrast, individuals with borderline personality disorder maintained distinct self-other representations when learning about others. This allowed for equal predictive performance compared to controls despite reduced updating sensitivity. Furthermore, we explored theory-driven individual differences underpinning contagion. Overall, the findings provide a clear explanation of how self-other generalisation constrains and assists learning and how childhood adversity is associated with separation of internalised beliefs. Our model makes clear predictions about the mechanisms of social information generalisation concerning both joint and individual reward.
Highlights
Humans use self-to-other transfer to constrain initial predictions about the social behaviour of others
Information is transferred from other-to-self following observation, calibrated by the precision of beliefs
Joint vs individualistic reward is prioritised when learning about others.
Those diagnosed with BPD do not engage in self-other information transfer, instead keeping self and other representationally distinct.
Introduction
Social animals have evolved sophisticated mechanisms for cooperation, exchanging information to enable both individual and group regulation (Emerson, 1956; Wheeler, 1911). In humans, such exchanges provide crucial insights about others as well as oneself, fostering the development of representations encoding self and others that is fundamental for adaptive social orientation and interaction. Disruptions to this process can impair the formation of stable social bonds and result in rigid interpersonal beliefs (Fairbairn, 1952; Young et al., 2006).
To effectively navigate what is sometimes called relational uncertainty, individuals generalize information from themselves to others (self-to-other transfer) and from others to themselves (other-to-self transfer). The relational self theory (Anderson & Chen, 2002) posits that when uncertain about others’ behaviours, people often rely on their own preferences as initial hypotheses, a process termed self-insertion (Allport, 1924; Kreuger & Clement, 1994). Conversely, when uncertain about their own states, individuals use external social cues to adjust their self-representations, a process known as social contagion (Deutsch & Gerard, 1955; Toelch & Dolan, 2015; Moutoussis et al, 2016). This bidirectional generalization has been widely observed across various domains including economic decision-making, morality, and social group adaptation (Devaine & Daunizeu, 2017; Garvert et al., 2015; Panizza et al., 2021; Suzuki et al., 2016; Yu et al., 2021).
Understanding healthy interpersonal dynamics can be clarified by examining their disruptions. Individuals with Borderline Personality Disorder (BPD) provide a compelling case study due to their profound interpersonal instability, emotional dysregulation, and heightened sensitivity to social contexts (Gunderson et al., 2018). BPD is strongly associated with adverse childhood experiences (Afifi et al., 2011), impaired mentalizing abilities (Fonagy & Luyten, 2009), and maladaptive representations of self and other (Hanegraaf et al., 2021). This phenotype has been explained through disrupted and unstable social inference during observation (Henco et al., 2020; Hula et al., 2018, Story et al., 2024a; Siegel et al., 2020). However, the precise mechanisms linking disrupted social cognition in BPD remain elusive, particularly regarding whether individuals with BPD differ specifically in their use of self-to-other and other-to-self information transfer.
This paper seeks to address this gap by testing explicitly how disruptions in self-other generalization processes may underpin interpersonal disruptions observed in BPD. Specifically, our hypotheses were: (i) healthy controls will demonstrate evidence for both self-insertion and social contagion, integrating self and other information during interpersonal learning; and (ii) individuals with BPD will exhibit diminished self-other integration, reflected in stronger evidence for observations that assume distinct selfother representations.
We tested these hypotheses by designing a dynamic, sequential, three-phase Social Value Orientation (Murphy & Ackerman, 2014) paradigm—the Intentions Game—that would provide behavioural signatures assessing whether BPD differed from healthy controls in these generalization processes (Figure 1A). We coupled this paradigm with a lattice of models (M1-M4) that distinguish between self-insertion and social contagion (Figure 1B), and performed model comparison:
M1. Both self-to-other (self-insertion) and other-to-self (social contagion) occur before and after learning
M2. Self-to-other transfer only occurs
M3. Other-to-self transfer only occurs
M4. Neither transfer process, suggesting distinct self-other representations
We additionally ran exploratory analysis of parameter differences and model predictions between groups following from prior work demonstrating changes in prosociality (Hula et al., 2018), social concern (Henco et al., 2020), belief stability (Story et al., 2024a), and belief updating (Story, 2024b) in BPD to understand whether discrepancies in self-other generalisation influences observational learning. By clearly articulating our hypotheses, we aim to clarify the theoretical contribution of our findings to existing literature on social learning, BPD, and computational psychiatry.
Results
Healthy participants (CON; n=53) and participants diagnosed with BPD (n=50), matched on age, gender, education, and social deprivation indices (Table 1), were invited to participate in a three-phase social value orientation paradigm—the Intentions Game (Figure 1A)—with virtual partners. In phase 1, participants made forced choices between two options for splitting points with an anonymous partner. In phase 2 participants learned to predict the decisions of a new anonymous partner using the same forced-choice setup, receiving feedback on the accuracy of their successive predictions. Notably, using a novel server architecture (Burgess et al., 2023), partners in phase 2 were configured to be approximately 50% different from the participants in terms of their choices, ensuring that all participants had to learn about their partners. Phase 3 mirrored the first, with participants informed that they were matched with a third anonymous partner, unrelated to those in phases 1 and 2. Detailed descriptions of the task can be found in the methods section and Figure 1. All participants also self-reported their trait paranoia, childhood trauma, trust beliefs, and trait mentalizing (see Methods).
Psychometric and Behavioural Results
Participants with BPD, compared to CON, retrospectively reported significant childhood trauma, epistemic disruptions (including mistrust and credulity), elevated referential and persecutory beliefs, and demonstrated ineffective trait mentalizing (Table 1). The groups did not differ in trait measures of certainty regarding self and others’ mental states, nor in epistemic trust.
We analysed the ‘types’ of choices participants made in each phase (Supplementary Table 1). The interpretation of a participant’s choice depends on both values in a choice. For example, a participant could make prosocial (self=5; other=5) versus individualistic (self=10; other=5) choices, or prosocial (self=10; other=10) versus competitive (self=10; other=5) choices. There were 12 of each pair in phases 1 and 3 (individualistic vs. prosocial; prosocial vs. competitive; individualistic vs. competitive).
In phase 1, both CON and BPD participants made prosocial over competitive choices with similar frequency (CON=9.67[3.62]; BPD=9.60[3.57]; t=-0.11, p=0.91). However, CON participants made significantly fewer prosocial choices when individualistic choices were available (CON=2.87[4.01]; BPD=5.22[4.54]; t=2.75, p=0.007). Both groups favoured individualistic over competitive choices with similar frequency (CON=11.03[1.95]; BPD=10.34[2.63]; t=-1.52, p=0.13). For a reaction time assessment see Supplementary Text 1).
Each group showed good predictive accuracy (CON=77.2%[13.9%]; BPD=72.7%[15.6%]). There was no difference in overall predictive accuracy between BPD and CON (linear estimate=2.44, 95%CI: −0.67, 5.54; t=1.56; p=0.12), nor on a trial-by-trial basis (linear estimate=0.26, 95%CI: −0.06, 0.59; z=1.61, p=0.11). All participants showed an effect of time on accuracy, such that participants became more accurate in predicting their partner over the course of phase 2 (linear estimate=0.013, 95%CI: 0.008, 0.017; z=6.01; p<0.001). Server matching between participant and partner in phase 2 was successful, with participants being approximately 50% different to their partners with respect to the choices each would have made on each trial in phase 2 (mean similarity=0.49, SD=0.12).
In phase 3, both CON and BPD participants continued to make equally frequent prosocial versus competitive choices (CON=9.15[3.91]; BPD=9.38[3.31]; t=-0.54, p=0.59); CON participants continued to make significantly less prosocial versus individualistic choices (CON=2.03[3.45]; BPD=3.78 [4.16]; t=2.31, p=0.02). Both groups chose equally frequent individualistic versus competitive choices (CON=10.91[2.40]; BPD=10.18[2.72]; t=-0.49, p=0.62).

Demographics of participants.
CTQ=Childhood Trauma Questionnaire, MZQ = Mentalisation Questionnaire, RGPTSB=Revised Green Paranoid Thoughts Scale (Persecutory Subscale), RGPTSA=Revised Green Paranoid Thoughts Scale (Referential Subscale), CAMSQ=Certainty About Mental States Questionnaire. ETMCQ=Epistemic Trust, Mistrust and Credulity Questionnaire, M=Male, F=Female, O=Other. For continuous variables, all means are stated with corresponding standard deviation in brackets. Significant differences are highlighted in bold.

Task and Model Space.
(A) Participants were invited to play a three-phase, repeated social value orientation paradigm—the Intentions Game—with virtual partners. Phase 1 of the Intentions Game lasted 36 trials and asks participants to make a forced choice between two options as to how to split points with an anonymous virtual partner. An example of a prosocial-individualistic pair of options could be (self=5, other=5) or (self=10, other=5) – if the participant chooses option 1 they could be viewed as less individualistic and more prosocial as the outcomes to the other do not change, but the self would earn less. In phase 2, lasting 54 trials, participants were asked to predict the decisions of a new anonymous partner using the same two-forced choice set-up and the same option pairs; participants were given feedback on whether they were correct or incorrect in their prediction. We used Amazon Web Services to create a novel server architecture to match participants and (virtual) partners (Burgess et al., 2023). Partners in phase 2 were matched to be approximately 50% different from the participant with respect to their choices in phase 1 to ensure all participants needed to learn about their phase 2 partner, and to provide a mechanism to examine whether beliefs about partners had an effect on the self. Phase 3 was identical to phase 1, although participants were informed that they were matched with a third anonymous partner, unconnected to the partners in phase 1 and 2. At the end of the game, if participants collected over 1000 points overall, they were entered into a lottery to win a bonus. (B) We created four models that may explain the data and to test theories of social generalization. Model M1 assumes participants are subject to both self-insertion and social-contagion, that is, participants used their own preferences as a prior about their partner in phase 2, and partner behaviour subsequently influenced participant’s preferences in phase 3. Model M4 assumes participants are subject to neither self-insertion nor social contagion, instead forming a novel prior around the phase 2 partner rather than using their own preferences and failing to be influenced by their partner after observation. Models M2 and M3 suggest participants are only explained by either self-insertion or social-contagion, not both. (C) We assume that participants choices in phase 1 are governed by both a median (
Computational Analysis
Over all three phases, we assumed participants and their partners used a Fehr-Schmidt utility function (Fehr & Schmidt, 1999) to calculate the utility of two options
We then constructed four models to explain how participants used their own preferences (
Model M4 (Figure 1D), on the other hand, suggests that participants do not engage in these generalization processes: predictions about others are not grounded in the self, and observing others does not alter self-preferences. Models M2 and M3 allow for either self-insertion or social contagion to occur independently. Consistent with prior research, we also constructed a model that assumes the same insertion and contagion processes as M1, but along a single prosocial-competitive axis (‘Beta model’; Barnby et al., 2022). The ‘Beta model’ is equivalent to M1 in its architecture (both self-insertion and social contagion are hypothesized to occur) but differs in its utility function: participants might only consider a single dimension of relative reward allocation, which is typically emphasized in previous studies (e.g., Hula et al., 2018).
All computational models were fitted and compared using a Hierarchical Bayesian Inference (HBI) algorithm which allows hierarchical parameter estimation while assuming random effects for group and individual model responsibility (Piray et al., 2019; see Methods for more information). We report individual and group-level model responsibility, in addition to exceedance probabilities between-groups to assess model dominance.

Parameter and model specification.
Grey shading = parameters relevant to representations of the self (ppt). Orange shading = parameters relevant to representations of the other (par). Free = parameters are random variables to fit through model inversion. Derived = parameter is calculated from latent values within the model. SD = standard deviation.
Model Comparison – BPD Participants Hold Disintegrated Self-Other Beliefs
We found that CON participants were best fit at the group level by M1 (Frequency = 0.59, Exceedance Probability = 0.98), whereas BPD participants were best fit by M4 (Frequency = 0.54, Exceedance Probability = 0.86; Figure 2A). This suggests CON participants are best fit by a model that fully integrates self and other when learning, whereas those with BPD are best explained as holding disintegrated and separate representations of self and other that do not transfer information back and forth.
We first explore parameters between separate fits (see Methods). Later, in order to assuage concerns about drawing inferences from different models, we examined the relationships between the relevant parameters when we forced all participants to be fit to each of the models (in a hierarchical manner, separated by group). In sum, our model comparison is supported by convergence in parameter values when comparisons are meaningful (see Supplementary Materials). We refer to both types of analysis below.
Generative Accuracy and Recovery
We simulated data for each participant using their individual parameters from the winning model within each group and refitted our models using this simulated data. Model comparison yielded very similar results (Figure 3A): CON synthetic participants best fit at the group level by M1 (Frequency = 0.58, Exceedance Probability = 0.98) and BPD synthetic participants best fit by M4 (Frequency = 0.57, Exceedance Probability = 0.85). The simulated data closely matched the actions of participants across all three phases (median accuracy = 0.8, SD = 0.12). In phase 2, the model-predicted total correct scores were not significantly different from observed scores (Figure 3E). Both model responsibility and common parameters within each dominant model were significantly associated (model confusion ρ = 0.46–0.97, p < 0.001; parameter recovery ρ = 0.70–0.94, p < 0.001; Figure 3C).

Beliefs between groups and within phases.
(A) We used randomeffects hierarchical model fitting and comparison to jointly estimate group level and individual level parameters based on real data from participants (Piray et al., 2021). CON participants were best fit by M1, whereas BPD participants were best fit by M4 on a group level. Looking within each model by simulating the beliefs of each participant reveals that – as expected – CON participants use the median of their self-preferences (black distribution) as a basis for their prior beliefs about partners (light orange distribution), and that the precision of their posterior beliefs about partners (dark orange distribution) and the precision of their own self preferences leads to a shifted model of the self (grey distribution). BPD participants on the other hand have a disintegrated prior over their partner which is not subject to their own self representation. Likewise, there is no change in self-preferences following learning, and thus an absence of the light grey distribution. For illustration, we focus on beliefs over relative preferences (β) and use real individual participants as exemplars for illustration. (B) Across models we extracted the common parameters that generate the behaviour of both CON and BPD participants – that is, their median and standard deviation over both α (absolute reward preferences) and β (relative value preferences), the flexibility over participants’ prior beliefs about their partners over each dimension, and the absolute change in posterior beliefs in phase 2 over each dimension (
Phase 1 – BPD Participants Are More Certain About Themselves
We first examined self-representations of participants in phase 1. CON participants (under model M1) and BPD participants (under M4) were equally prosocial (CON mean[
These differences were replicated when considering parameters between groups when we fit all participants to the same models (M1-M4; see Table S2).
Phase 2 – BPD Participants Use Disintegrated and Neutral Priors
We next assessed how participants generated their prior beliefs about a partner in phase 2. CON participants were best fit by M1 which assumes the same median belief participants use in phase 1 is identical to their median prior belief about their partners. In contrast, BPD participants were best fit by M4 and so generated a new median prior belief about their partners. Assessing by individual models show this was driven by expectations about a partner’s prosocial-competitive preferences (relative reward; see Table S2).
Prior work predicts those with BPD should focus more intently on social information rather than private information that only concerns one party (Henco et al., 2020). In BPD participants, only new beliefs about the relative reward preferences – mutual outcomes for both player - of partners differed (see Fig 2E): new median priors were larger than median preferences in phase 1 (mean[
Models of moral preference learning (Story et al., 2024a) predict that BPD vs non-BPD participants have more rigid beliefs about their partners. We found that BPD participants were equally flexible around their prior beliefs about a partner’s relative reward preferences (Δμ[
We checked that conclusions about self-insertion did not depend on the different models, we found that
Analysing belief updating on a more granular trial-by-trial basis using M1 for CON and M4 for BPD revealed preference type and between-group differences in belief refinement over the course of phase 2 (Figure 2D). We examined this by analysing the Kullback-Leibler divergence (DKL) – expected informational surprise - on each trial in Phase 2.
Across both groups and belief types informational surprise reduced over time (linear estimate[DKL] = −0.007, 95%CI: −0.008, −0.005; t = −7.60, p < 0.001). Beliefs about a partner’s relative reward preferences were updated more than absolute reward preferences (linear estimate= 0.54, 95%CI: 0.47, 0.62; t = 14.00, p < 0.001). These interacted, updating over relative vs. absolute beliefs reduced over the course of phase 2 (linear estimate = −0.013, 95%CI: −0.015, −0.011; t = −10.81, p < 0.001). These findings were supported under M1-M4 only assumptions (see Table S3).
BPD informational surprise is consistently restricted over beliefs about absolute reward versus CON. CON participants remained more flexible than BPD participants along both types of preference (linear estimate [DKL(
Assessing this same relationship under M1- and M2-only assumptions reveals a replication of this group effect for absolute reward, but the effect is reversed for relative reward (see Table S3). This accords with the context of each model, where under M1 and M2, BPD participants had larger phase 2 prior flexibility over relative reward (leading to larger initial surprise), which was better accounted for by a new central tendency under M4 during model comparison. When comparing both groups under M1-M4 informational surprise over absolute reward was consistently restricted in BPD (Table S3), suggesting a diminished weight of this preference when forming beliefs about an other.
We explored how beliefs and choices were associated with reaction times, showing that belief updates and reaction times were coupled over the course of phase 2 and related to participant-partner similarity (Figure S9).
Phase 3 – BPD Participants Are Less Influenced by Partners
Prior work predicts that human economic preferences are shaped by observation (Panizza, et al., 2021; Suzuki et al. 2016; Yu et al, 2021). Associative models also predict that social contagion may be exaggerated in BPD (Story et al., 2024b). In the dominant model for the BPD group—M4—participants are not influenced in their phase 3 choices following exposure to their partner in phase 2. To further confirm this we also analysed absolute change in median participant beliefs between phase 1 and 3 under the assumption that M1 and M3 was the dominant model for both groups (that allow for contagion to occur). This analysis aligns with our primary model comparison using M1 for CON and M4 for BPD (Figure 2C). CON participants altered their median beliefs between phase 1 and 3 more than BPD participants (M1: linear estimate = 0.67, 95%CI: 0.16, 1.19; t = 2.57, p = 0.011; M3: linear estimate = 1.75, 95%CI: 0.73, 2.79; t = 3.36, p < 0.001). Relative reward was overall more susceptible to contagion versus absolute reward (M1: linear estimate = 1.40, 95%CI: 0.88, 1.92; t = 5.34, p<0.001; M3: linear estimate = 2.60, 95%CI: 1.57, 3.63; t = 4.98, p < 0.001). There was an interaction between group and belief type under M3 (M3: linear estimate = 2.13, 95%CI: 0.09, 4.18, t = 2.06, p=0.041) but not M1. There was a main effect of belief type on precision under M3 (linear estimate = 0.47, 95%CI: 0.07, 0.87, t = 2.34, p = 0.02) but not M1; relative reward preferences became more precise across the board. Derived model estimates of preference change between phase 1 and 3 strongly correlated between M1 and M3 along both belief types (see Table S2 and Fig S11). As a whole, humans are more susceptible to changing relative preferences more than absolute reward preferences, and this is disrupted in BPD.

Model Accuracy.
(A) We used random-effects hierarchical model fitting and comparison to jointly estimate group level and individual level parameters on simulated data (Piray et al., 2019). CON participants were best fit by M1, whereas BPD participants were best fit by M4 (B) Server matching between participant and partner in phase two was successful, with participants being approximately 50% different to their partners with respect to the choices each would have made on each trial in phase 2 (mean similarity=0.49, SD=0.12). Model accuracy across the task was very high (mean accuracy=0.8, SD=0.12). Model accuracy within each phase was very high (mean accuracy[phase1]=0.83, SD[phase1]=0.16; mean accuracy[phase2]=0.77, SD[phase2]=0.14; mean accuracy[phase3]=0.82, SD[phase3]=0.17). Loglikelihood values were also well above what would be expected had the model fitted the data by chance (median=-40.68, SD=22.7; chance value=-87.33). Choice probabilities generated by the model on each trial were also well above chance thresholds (median=0.91, SD=0.24; chance value=0.5). (C) The spearman association between the responsibility allocated for each participant during real and recovered model comparison was highly correlated on the diagonal. There was some correlation between M1-M2 but this was due to M2 being a nested model of M1, sharing similar free parameters; this was not worrying in light of excellent model identifiability overall in the synthetic comparison. Associations between real and recovered parameters from the dominant model within each BPD and CON participants was very high with few cross correlations on the off-diagonal. In both confusion and parameter recovery matrices, white spaces indicate insignificant associations at the p > 0.01 level. (D) (top panel) The relationship between uncertainty over the self and uncertainty over the other with respect to the change in the precision (left) and median-shift (right) in phase 3 relative reward preferences (
Exploratory Psychometric and Intentional Attribution Analysis
Childhood trauma, persecution, and poor mentalising in BPD are all predicted to disrupt the integration of information from others (Fonagy & Luyten, 2009). Therefore we explored whether social contagion may be restricted as a result of childhood trauma, paranoia, and less effective trait mentalizing. We collected psychometric data from participants prior to entering the task and asked participants to attribute explicit intentions to their partner after phase 2. All analyses were corrected for False Discovery Rate (FDR; p[fdr]) and we provide correction for group status (Table S4).
We assessed conditional psychometric associations with social contagion under the assumption of M3 for all participants. We conducted partial correlation analyses to estimate relationships conditional on all other associations and retained all that survived bootstrapping (5000 reps), permutation testing (5000 reps), and subsequent FDR correction. When not controlled for group status, RGPTSB and CTQ scores were both moderately associated with MZQ scores (RGPTSB r = 0.41, 95%CI: 0.23, 0.60, p[fdr]=0.043; CTQ r = 0.354 95%CI: 0.13, 0.56, p[fdr]=0.02). This was not affected by group correction. CTQ scores were moderately and negatively associated with shifts in individualistic reward preferences (Δ
Prior work has predicted that partner-participant preference disparity influences mental state attributions (Barnby et al., 2022; Panizza et al., 2021). We tested parameter influences on explicit intentional attributions in Phase 2 while controlling for group status. Attributions included the degree to which they believed their partner was motived by harmful intent (HI) and self-interest (SI). According with prior work (Barnby et al., 2022), greater disparity of absolute preferences before learning was associated with reduced attributions of SI (ρ[|
Uncertainty in one’s expectations is associated with subsequent mental state attributions of intentional harm (Barnby et al., 2022). Greater prior uncertainty (before interaction) over a partner’s relative preferences was associated with increased HI (ρ[
Discussion
We built and tested a theory of interpersonal generalisation in a population of matched participants with (BPD) and without (CON) a diagnosis of borderline personality disorder using the Intentions Game, a three-phase social value orientation task. We compared four hypotheses, instantiated in formal computational models, to determine whether those with a diagnosis of BPD displayed disrupted self-insertion and social contagion. Both groups demonstrated equivalent behavioural accuracy but employed different strategies. CON participants used a process of self-other generalization to predict and align with their partners, while BPD participants maintained distinct representations of self and other, particularly over joint reward outcomes. As a whole, all participants were more sensitive to updates about joint versus absolute outcomes, with BPD participants particularly concerned with how outcomes relatively affected self and other. Our exploratory findings also indicate that retrospectively reported childhood trauma and persecutory beliefs were linked to reduced trait mentalising, which was subsequently associated with diminished shifts in participants’ relative reward preferences. Collectively, our results integrate prior findings in BPD and provide a formal account of social information generalisation in humans, alongside a concise social paradigm to test these processes.
The data replicate models of social generalisation that have focused on individual processes of self-insertion and contagion, extending these theories by demonstrating both processes in conjunction. Models of self-insertion directly map participant preferences onto prior beliefs about others, which has been used to explain increased reaction times in observational learning of others’ snack food preferences (Tarantola et al., 2017), as well as improved predictive accuracy when matched with individuals of similar social values (Barnby et al., 2022). Both findings are replicated in this study. Although we did not explicitly model reaction times, we observed an interaction between reaction time reductions over time and interpersonal similarity at baseline. In tandem, computational models of social contagion have focused on intertemporal discounting (Moutoussis et al., 2016) – with behavioural studies also focusing on effort-based reward (Devaine & Daunizeu, 2017) and moral preferences (Yu et al., 2021) - and explain shifts in self-preferences as a function of uncertainty regarding self and others. In both the dominant (M1) and sub-dominant (M3) models that best explained data in healthy participants, shifts in self-beliefs were also influenced by representational uncertainty of self and other: greater self-uncertainty and reduced other uncertainty led to larger shifts in social preferences.
The data also align with prior research on social impression formation, which suggests that humans form rapid evaluations of others that are refined over time (Bone et al., 2021; Moutoussis et al., 2024). This initial ‘heating’ and subsequent ‘cooling’ of beliefs corresponds to the computational complexity employed: model-based strategies are typically used early in interactions, transitioning to simpler, model-free computations once a partner’s behaviour becomes predictable (Gęsiarz & Crockett, 2015; Guennouni & Speekenbrink, 2022). Our findings support this framework, demonstrating initial variability early in interactions followed by steady updating.
Disruptions in self-to-other generalization provide an explanation for previous computational findings related to task-based mentalizing in BPD. Studies tracking observational mentalizing reveal that individuals with BPD, compared to those without, place greater emphasis on social over internal reward cues when learning (Henco et al., 2020; Fineberg et al., 2018). Those with BPD have been shown to exhibit reduced belief adaptation (Siegel et al., 2020) along with ‘splitting’ of latent social representations (Story et al., 2024a). BPD is also shown to be associated with overgeneralisation in self-to-other belief updates about individual outcomes when using a one-sided reward structure (where participant responses had no bearing on outcomes for the partner; Story et al., 2024b). Our analyses show that those with BPD are equal to controls in their generalisation of absolute reward (outcomes that only affect one player) but disintegrate beliefs about relative reward (outcomes that affect both players) through adoption of a new, neutral belief. We interpret this together in two ways: 1. There is a strong concern about social relativity when those with BPD form beliefs about others, 2. The absence of self-insertion when predicting relative outcomes may predispose to brittle or ‘split’ beliefs. In other words, those with BPD assume ambiguity about the social relativity preferences of another (i.e. how prosocial or punitive) and are quicker to settle on an explanation to resolve this. Although self-insertion may be counter-intuitive to rational belief formation, it has important implications for sustaining adaptive, trusting social bonds via information moderation.
Those with a diagnosis of BPD also show reduced permeability in other-to-self generalising. While prior research has predominantly focused on how those with BPD use information to form impressions, it has not typically been examined whether these impressions affect the self. In interactive trust paradigms, neural responses to monetary offers from others to the self were substantially blunted in individuals with BPD compared to those without (King-Casas et al., 2008). Similarly, in non-social reward tasks, those with BPD show reduced neural feedback-related negativity amplitudes, which obstructs feedback-related self-change (Stewart et al., 2019; Vega et al., 2013). Our results suggest a mechanistic basis for social contagion, indicating that self-rigidity prevents observed social behaviours from generalizing to the self, potentially exacerbated by childhood trauma, paranoia, and impaired mentalizing capabilities. Resistance to social influence may serve as a protective response but can also contribute to the pervasive loneliness experienced by individuals with BPD, even in the absence of social isolation (Liebke et al., 2017).
Notably, despite differing strategies, those with BPD achieved similar accuracy to CON participants. While all participants were more concerned with relative vs. absolute reward, those with BPD changed their strategy contingent on this focus. Practically this difference in BPD is captured either through disintegrated priors with a new median (M4) or very noisy, but integrated priors over partners (M1) if we assume M1 can account for the full population. In either case, the algorithm underlying the computational goal for BPD participants is far higher in uncertainty, whether through a neutral central tendency (M4) or large variance (M1) prior over relative reward in phase 2, and emphasises a less stable or reliable expectation about others. It is important to assess this mechanism alongside momentary assessments of mood to understand whether more entropic learning processes contribute to distressing mood fluctuation.
Clinical implications of our work underscore the importance of consistency and stability in clinical support for individuals with a diagnosis of BPD. Encouragingly, we found that those with BPD were not entirely impermeable to observed behaviour, suggesting that consistent external models of trust could be internalized over time. Restoring a stable sense of self through social learning and effective mentalizing (Nolte et al., 2023), along with a consistent focus on differentiating self from other (de Meulemeester et al., 2021), are central to mentalization-based therapies (Bateman & Fonagy, 2010; Smits et al., 2024) and other evidence-based treatments for BPD. We hope that our paradigm and model can offer insights into the effectiveness of these and other therapies in driving mechanistic psychological change. A key task for future work will be to assess whether generalisation principles may be restored in within-individuals with a diagnosis of BPD following intervention.
More broadly, our model bridges formal theories of associative learning and social cognition. Reinforcement learning approaches have effectively organized theories around uncertainty navigation in non-social contexts (Piray et al., 2021; Zika, 2023). However, humans do not function in isolation. Bayesian models of internal and external social beliefs are better suited to capture the dynamic nature of time, context, and uncertainty during interactions (FeldmanHall & Nassar, 2021; Velez & Gweon, 2021), where joint reward rather than individual reward may be particularly salient (Barnby et al., 2023). Our paradigm is concise, visually engaging, includes straightforward rules and instructions, and allows for tight experimental control over partner similarity. Our model and paradigm effectively capture core social psychological principles grounded in general computational approaches to learning and uncertainty, elucidating key aspects of human social interaction and exchange.
We note some limitations to our study. Primarily, we focused on the ability of individuals to integrate their self-concept into beliefs about others. It is also possible that humans possess strong, salient representations of others (or groups of others) that serve as dominant templates for learning. This may be particularly relevant for individuals with BPD, who will often have interpersonal experiences of abuse, neglect, or other forms of distress. The use of a salient, negative other-prior as a basis for learning was not measured in this study, but it may explain the ambivalent prior observed in phase 2, where a mixture of self and notional other influences belief formation, leading to rigid belief updating. Individuals with BPD may integrate priors from different sources as a mixture. We can simulate this by modelling a framework that incorporates priors based on both self and a strong memory impression of a notional other (Figure S3). However, a strength of our data is that we observed impression formation independent of valence—impressions were formed regardless of whether a partner was more or less prosocial or selfish than the participant (Figure S4). This supports our hypothesis that a vulnerable self-model and lack of self-insertion contribute to the formation of overly precise beliefs during learning as a means of rapidly reducing uncertainty. Even if a mixture model better explains the ambivalent prior in phase 2, it would still support a general hypothesis about the fractured concept of self and other in BPD.
Another strength of our work is demonstrating processes of self-insertion and contagion under minimal interaction conditions: simple observation alone was sufficient to elicit both processes. However, this is also a limitation. While we predict that these processes will apply in more naturalistic settings, this has yet to be tested, and it remains unclear whether these effects will persist in richer conditions, particularly when higher affective arousal and challenges to mentalising are present. Lastly, the action space and parameters governing choice in our study were quite simple—two actions influenced by two parameters. This was a deliberate computational choice to avoid overly complex action spaces that may be difficult to fit to real human data, and which might fail to capture how these mechanisms operate in the context of increasing action and model complexity. As a whole, our findings open new possibilities for testing how social uncertainty across the lifespan (e.g. in adolescence; Sebastian et al., 2008), and in the context of ill-health, may explain the formation and maintenance of healthy social bonds as well as their disruption.
Finally, a limitation may be that behaviour in tasks based on economic preferences may not have clinical validity. This issue is central to the field of computational psychiatry, much of which is based on generalising from tasks like that within this paper and discussing correlations with psychometric measures. Extrapolating economic tasks into the real world has been the topic of discussion for the many reviews on computational psychiatry (e.g. Montague et al., 2012; Hitchcock et al., 2022; Huys et al., 2016). We note a strength of this work is the use of model comparison to understand algorithmic differences between those with BPD and matched healthy controls. Nevertheless, we wish to further pursue how latent characteristics captured in our models may directly relate to real-world affective change.
Materials and Methods
Participants
We used a case-control, between-subjects design with 103 participants: a control group from the general population (N = 53) and a clinical group diagnosed with BPD (N = 50). Both groups were recruited for a larger study investigating social exchanges in BPD and Anti-Social Personality Disorder (approved by the Research Ethics Committee for Wales, 12/WA/0283). The control and clinical groups were matched on age, sex, years in education, and the English Indices of Deprivation based on the 2019 census (IoD2019; Ministry of Housing, Communities & Local Government, 2019). Participants received £70 compensation for completing questionnaires and online tasks which included the Intentions Game. They also received a performance bonus if they were entered into the lottery for surpassing 1000 points over the course of the game.
Participants for the control group were recruited through an advertisement on the Call For Participants website (https://www.callforparticipants.com), local community services and adult schools. Inclusion criteria required control participants to have no pre-existing or current diagnoses of mental health disorders, neurological disorders, or traumatic brain injuries. Additionally, control participants must not have been currently in therapy or taking medication for any psychiatric disorders.
The majority of BPD participants were recruited through referrals by psychiatrists, psychotherapists, and trainee clinical psychologists within personality disorder services across 9 NHS Foundation Trusts in the London, and 3 NHS Foundation Trusts across England (Devon, Merseyside, Cambridgeshire). Four BPD participants were also recruited by self-referral through the UCLH website, where the study was advertised. To be included in the study, all participants needed to have, or meet criteria for, a primary diagnosis of BPD (or emotionally-unstable personality disorder or complex emotional needs) based on a professional clinical assessment conducted by the referring NHS trust (for self-referrals, the presence of a recent diagnosis was ascertained through thorough discussion with the participant, whereby two of the four also provided clinical notes). The patient participants also had to be under the care of the referring trust or have a general practitioner whose details they were willing to provide. Individuals with psychotic or mood disorders, recent acute psychotic episodes, severe learning disability, or current or past neurological disorders were not eligible for participation and were therefore not referred by the clinical trusts.
Psychometric Measures
Green et al. Paranoid Thought Scale (GPTS)
The GPTS assesses paranoid thoughts, including ideas of social reference (scale A) and persecution (scale B), in both general and clinical populations (Green et al., 2008). Each item is scored from 0 (not at all) to 5 (totally) concerning endorsement of each item. We retained items from the GPTS that were consistent with the revised version outlined in Freeman et al., 2021 (Revised GPTS; R-GPTS). The R-GPTS has demonstrated excellent psychometric properties (Freeman et al., 2021), making it a reliable and valid tool for assessing trait paranoid thoughts in non-clinical and clinical populations.
Childhood Trauma Questionnaire (CTQ)
The Childhood Trauma Questionnaire is used to screen for maltreatment history (Bernstein et al., 2003). Each item is scored from 1 (never true) to 5 (very often true). The CTQ has showed good internal consistency reliability across the five scales (Sacchi et al., 2018) and good construct validity based on significant associations with stress responsivity (McMahon et al., 2022), and dissociation (Nobakht et al., 2021).
Certainty About Mental States Questionnaire (CAMSQ)
The CAMSQ assesses one’s certainty in classifying the mental states of oneself and others at an abstract level (Müller et al., 2023), e.g. ‘I know what other people think of me’ and ‘I know my feelings’. Each subscale is scored from 1 (never) to 7 (always). In US and German samples, the CAMSQ showed high internal consistency for Self-Certainty (ω = .90/.88) and Other-Certainty (ω = .91/.89) subscales, and high two-week test-retest reliability for Self-Certainty (r = .85), Other-Certainty (r = .78), and Other-Self-Discrepancy (r = .82) scores (Müller et al., 2023).
Mentalisation Questionnaire (MZQ)
The MZQ is a 15-item questionnaire assessing an individual’s trait mentalizing, i.e., one’s ability to understand and interpret their own and others’ mental states (Hausberg et al., 2012). The MZQ demonstrated good internal consistency (α = .81) and test-retest reliability (r = .76), and was sensitive to change over a 6-month follow-up period and showed good criterion-related validity, distinguishing individuals with BPD from those without BPD (Hausberg et al., 2012). A higher score reflects worse trait mentalizing.
Epistemic Trust, Mistrust and Credulity Questionnaire
The ETMCQ is a 15-item measure calibrated to assess trust (e.g. ‘I usually ask people for advice when I have a personal problem), mistrust (e.g. ‘I’d prefer to find things out for myself on the internet rather than asking people for information), and credulity (e.g. ‘I am often considered naïve because I believe almost anything that people tell me’; Campbell et al., 2021). Each item is scored from 1(Strongly Disagree) to 7(Strongly Agree).
Paradigm, procedure and server architecture
The Intentions Game is a repeated social-value orientation paradigm with three phases.
In Phase 1 of the Intentions Game, participants take on the role of the decider with an anonymous partner over 36 trials. In each trial, participants choose between two options to distribute points between themselves and their partners. Participants make 12 choices each between prosocial and competitive (e.g. Option 1=[10,10], Option 2 = [10,5]) individualistic and competitive (e.g. Option 1=[10,5], Option 2=[8,1]), and prosocial and individualistic options (e.g. Option 1=[5,5], Option 2=[10,5]). Phase 1 choices allowed experimenters to classify participants’ social preferences as prosocial (preferring equal outcomes), individualistic (maximising own payoff), or competitive (maximising relative payoff difference at the cost of lower self-gain).
We included a task environment that balanced each type of choice pair (see Supplementary Table 1).
In phase 2 of the game, participants were matched with a new anonymous partner and played the role of the recipient over 54 trials. In this phase, the participants predicted which of the two options their partner would choose on each trial. Trial numerical values for self and other were identical to Phase 1. Partners’ decisions were determined via a dynamic algorithm (Burgess et al., 2023) to ensure partners were approximately ~50% different from the participants’ based on participants’ choices in phase 1. To surmise this architecture, we implemented a version of the client-server paradigm hosted on an Amazon Web Service (AWS) LightSail server, where the webbased behavioural task (implemented with JavaScript in Gorilla.sc) acted as the client and exchanged information with a remote AWS server. The server received all anonymised behavioural data following phase 1. The Application Programming Interface (API) to interact with the server used a customizable R script (v4.3) to process the received data from the participant, and additional R scripts were used to process and generate output for the participant. A function within the backend scripts first used Bayesian inference to approximate a participant’s parameters for phase 1. It then simulated what choices the participant would have made in phase 2 had the participant been in the role of the partner. The algorithm then sought to find parameters that would be at least 50% dissimilar from participant parameters with respect to the generated choices of those parameters. This allowed the task behaviour of phase 2 to be dynamically updated in response to participant choices in phase 1. This facilitated tight control over the state of the task and enabled advanced computations to be performed on participant data beyond the capabilities of a web browser.
Participants were incentivised in phase 2 to predict accurately, as accurate predictions would contribute to their total point scores (total correct answers were multiplied by 10 and added to their points) and determined their entry into the lottery to win an extra £20 Amazon voucher. After participants had made their predictions, they were given feedback informed on whether their predictions were accurate.
At the end of phase 2, participants were asked to rate (1) the extent to which they thought their partner was driven by the desire to earn points in this task overall (self-interest) and (2) the extent to which they thought their partner was driven by the desire to reduce the participant’s points in this task overall (attribution of harmful intent). The answers were presented using two separate sliders from 0 to 100; the sliders were initialised to be invisible until the participants made the first click.
Phase 3 was identical to phase 1 except that participants were matched with a new anonymous partner. Participants would take on the decider role similar to phase 1 which allowed experimenters to estimate whether the observation of their partner in phase 2 had an influence on participants in phase 3.
Behavioural Analysis
All analysis was conducted in R (v. 4.3.3) on a macbook pro (M2 Max; OS=Ventura13.5). All individual numeric values extraneous of statistical tests are reported with their mean and standard deviation (mean=XX, SD=YY). All statistical tests where dependent variables mapped one value to one participant (e.g. trait psychometric scores) were conducted as linear models, with the regression coefficient, 95% confidence interval (95%CI), t-value and p-value reported like so (linear estimate=XX, 95%CI:AA,BB; t=CC, p=DD). When dependent variables mapped multiple values to each participant (e.g. trial-by-trial accuracy or reaction time) random-effects linear modelling was used. All correlations used Pearson estimates (r) unless distributions were non-normal, in which case Spearman-ranked correlations (r) were performed.
Model space and Computational Analysis
We apply four computational hypotheses (M1-M4) which could explain the data collected from the Intentions Game (Figure 1), centred around formal principles of self-insertion and social contagion. Self-insertion states that a self inserts their own preferences into their beliefs about others (Anderson & Chen, 2002; Kreuger & Clement, 1994); Social Contagion states that a self’s preferences will change when exposed to the preferences of an other (Deutsch & Gerard, 1955). In each case, cognitive representations of self and other are allowed to intermingle to form a new hybrid of the two for the purposes of computational efficiency and/or social bonding.
We note some important assumptions in our notation going forward. In dyadic social interaction, both parties are trying to estimate and predict the true state (θ) of the self (θS) and the other (θO). However, this estimation is inherently imperfect. Theories of social inference need to consider three sources of noisy estimation of this quantity: the self’s (s) metacognitive model of their own state,
All models assumed a constricted Fehr-Schmidt utility function was used by participants and partners to calculate the utility of two options
In phase 1, participants made binary choices ct, t = {1… T} about whether option 1 or option 2 should be chosen given the returns for each option pair, Rt = {Rt;1; Rt;2} =
Here, αppt describes the weight a participant places on their own payoff (in one reduced model we set αppt = 1), and βppt, the weight a participant places on their payoff relative to the payoff of their partner. Large positive or negative values of βppt indicate respectively that participants like or dislike earning more than their partner. We can therefore describe these terms α and β as reflecting preferences for absolute and relative payoffs, respectively. For efficiency we discretised states of αppt from 0-30 (increments of 0.125) and βppt from −30 to 30 (increments of 0.25).
Over this state space we can construct a belief that participants are estimated to hold which generate their choices, C. Herein, we refer to this belief as θppt, where θppt is a matrix over a fixed grid of αppt and βppt values. In the models, θppt is drawn from a normal distribution made from a central tendency,
When
In phase 2, over 54 trials, we then model the participants binary predictions
The partner decisions, Dt = {d1, d2 …, dT} are then used to update the participants beliefs about the partner, written as p(θpar|Df), starting with prior p(θpar|D0). Both M1 and M2 assume participants use their own central tendency,
In models M3 and M4, we assume participant’s may instead use a new central tendency (rather than their own) as prior beliefs over their partner. This are free parameters to be approximated,
In all cases, we assume participants update their beliefs about their partner’s social preferences given their partner’s decisions D along trials 1-54 according to Bayes rule:
We can then marginalise over
We assume that participants predict the partner’s decision in the next trial by calculating the probability determined by the utility differences ΔUα, β(Rt+1) as in phase 1, summed over the joint distribution of partner parameters,
And then performed probability matching, so that:
In the third phase participants are once again asked to make choices for themselves and a new anonymous partner over 36 trials with an assumed identical utility function as in phase 1. In model M1 and M3 we assume participants use a combination of their own preferences and the posterior beliefs about their partner to form a new distribution to select between the two options available on each trial. This draws from the same formulation used previously (Moutoussis et al. 2016). In essence, we state that participants know their true preferences in phase 1 but are unsure about them. The inferred partner beliefs
Where
All computational models were fitted using a Hierarchical Bayesian Inference (HBI) algorithm which allows hierarchical parameter estimation while assuming random effects for group and individual model responsibility (Piray et al., 2019). During fitting we added a small noise floor to distributions (2.22e−16) before normalisation for numerical stability. Parameters were estimated using the HBI in untransformed space drawing from broad priors (μM=0, σ2M = 6.5; where M={M1, M2, M3, M4}). This process was run independently for each group. Parameters were transformed into model-relevant space for analysis. All models and hierarchical fitting were implemented in Matlab (Version R2022B). All other analyses were conducted in R (version 4.3.3; arm64 build) running on Mac OS (Ventura 13.0). We extracted individual and group level responsibility, as well as the protected exceedance probability to assess model dominance per group.
To conduct model recovery we simulated synthetic participants (CON=53; BPD=50) using their fitted parameters from the dominant model of the group (CON=M1; BPD=M4). We then performed model fitting with an identical procedure to the real behavioural data. We tested associations between model responsibility and individual parameters for the real and recovered models, as well as the association between choices and predictions made by the model from simulation and the choices and predictions made by participants in each trial.
Differences between groups for individual-level parameters were estimated using hierarchical Bayesian t-tests (Bååth, 2014) and hierarchical general linear models in rStanArm. Differences in mean between groups (Δμ) are additionally reported with their corresponding posterior 95% High Density Interval (95%HDI). Belief updates were calculated as the Kullback-Leibler Divergence between probabilities (P) from trial t-1 to t, marginalised along all possible states, S={s1, s2, … , sn}: DKL(Pf||Pt-1) =
Exploratory Network Analysis
To understand the individual differences of trait attributes (MZQ, RGPTSB, CTQ) with other-to-self information transfer (Δ
Open data and code:
https://github.com/josephmbarnby/SocialTransfer_Barnby_etal_2024
Acknowledgements
We would like to greatly thank all participants who took part in the research.
Additional information
CRediT
JMB: Conceptualisation, Data Curation, Investigation, Formal Analysis, Methodology, Project Administration, Software, Supervision, Visualisation, Writing – Original Draft, Writing – Review and Editing. JN: Investigation, Methodology, Writing – Original Draft, Writing – Review and Editing. JG: Conceptualisation, Investigation, Project Administration, Resources, Writing – Review and Editing. MW: Project Administration. HB: Software, Writing – Review and Editing. LR: Resources, Writing – Review and Editing. GC: Validation, Writing - Review and Editing. JK: Supervision, Writing – Review and Editing. PRM: Resources, Writing – Review and Editing. PD: Conceptualisation, Formal Analysis, Writing - Review and Editing. TN: Conceptualisation, Project Administration, Resources, Supervision, Writing – Review and Editing. PF: Conceptualisation, Resources, Supervision, Writing – Review and Editing.
Funding
JMB is supported by a Wellcome Trust award (228268/Z/23/Z) and as a scholar within the FENS-Kavli Network of Excellence. Funding for PD was from the Max Planck Society and the Humboldt Foundation. PD is a member of the Machine Learning Cluster of Excellence, EXC number 2064/1 – Project number 39072764 and of the Else Kroner Medical Scientist College “ClinbrAIn: Artificial Intelligence for Clinical Brain Research”.
Supplementary Text 1
Reaction times in the Intentions Game
Phase 1
Examining reaction times (in milliseconds; ms) in phase 1 by choice type revealed that, compared to competitive choices, individualistic choices were made faster (linear estimate = −880.60, 95%CI: −1385.42, −376.2; t = −3.42, p < 0.001), and prosocial choices were made fastest (linear estimate = −1171.1, 95%CI: −1701.97, − 640.71; t = −4.32, p < 0.001) irrespective of the type of choice pair. Prosocial choices were made significantly faster than individualistic choices (linear estimate = −290.70, 95%CI: −548.50, −32.91; t = −2.21, p = 0.027).
Phase 2
All participants were slower at the start of phase 2 and sped up over time (linear estimate = −15.03, 95%CI: −21.06, −8.99; t = −4.88, p < 0.001). Baseline participant-partner similarity did not have an overall effect on reaction time but did interact with trial – as participant-partner similarity increased, reaction times early in phase 2 were significantly slower and this effect attenuated over time (linear estimate = −0.53, 95%CI: −0.75, −0.32; t = −4.91, p < 0.001; see Figure S9). Reaction time did not vary between groups: both BPD and CON participants displayed the same effect.
Reaction times and belief updates in phase 2 were significantly coupled, such that larger shifts in posterior beliefs along both axes were associated with larger reaction times (linear estimate [DKL(
Phase 3
Reaction times in phase 3 revealed that compared to competitive choices, individualistic choices were made faster (linear estimate = −528.50, 95%CI: −943.60, − 114.6; t=-2.50, p = 0.012), and prosocial choices were made fastest (linear estimate = −693.5, 95%CI: −1137.65, −250.39; t=-3.07, p = 0.002). Prosocial choices were no longer executed significantly faster than individualistic choices. All participants made faster choices in phase 3 compared to phase 1 (linear estimate = −242.02, 95% CI: − 332.64, −151.41; t=-5.24, p = 0.001).
Supplementary Materials

Group Level Parameter Values.
BPD participants were explained by M4 which has two extra free parameters than CON participants who were best explained by M1.

Individual Level Parameter Distributions Per Group.
BPD (purple) participants were explained by M4 which has two extra free parameters (alpha_par) and (beta_par) than CON participants (blue) who were best explained by M1.

Simulation of Phase 2 priors that may be drawn from a memory of an aversive other vs from the self alone.
We can imagine a scenario where a prosocial participant (typical of BPD and CON) has a strong impression of an other from memory who is particularly aversive (competitive). Using a mixture of the median belief of the self (

(top) Exemplar distribution from an individual with a diagnosis of BPD who was competitive in phase 1 and matched with a partner who was prosocial in phase 2. We note that irrespective of the valence of BPD participants’ preferences, there was still a neutral prior generated that was not integrated into the model of self. (bottom) distribution of individual-level parameter estimates for phase 1 beliefs (self; red) and phase 2 prior beliefs (other; grey) about partners for both prosocial-competitive (left) and individualistic (right) beliefs. As reported in the main text, BPD priors about their partner’s prosocial preferences were centred closely around 0 (Δμ[0 −

(top panels) Raw trial-wise probability of correct responses from real and model-simulated observations for each group. Probabilities were approximated by grouping by trial across each group, summing the total correct responses and dividing by 54. (bottom panel) Cumulative percentage of correct predictions in phase 2 for each group are shown as thick solid lines. Individual cumulative scores are depicted as thin translucent lines.

2D Distribution of participant and partner parameters estimated through Bayesian inference at the AWS server backend during the participant-partner matching protocol.
As a sanity check we also assessed the degree to which server-derived participant parameters (

Spearman Correlations Between Psychometric Scores at Baseline and Self/Other Parameters.
(Top) Psychometric correlations with parameters for self. (Bottom) Psychometric correlations with parameters for other. All correlations with p-values > 0.05 are omitted.

Uncorrected spearman’s ρ between psychometric measures and change absolute change in self-preferences from phase 1 to 3.
All beliefs metrics are extracted from M3 which assumes all participants engage in social contagion. Cred = Credulity. Delta = whether the shift in belief was along preferences for absolute (alpha) or relative (beta) reward.

Relationship between belief updates and reaction times.
(Top) Linear random effects relationship between reaction time (ms) and belief updating. Reaction times and belief updates in phase 2 were significantly coupled, such that larger shifts in posterior beliefs along both axes were associated with larger reaction times (linear estimate [DKL(

Phase 2 prior belief flexibility following forced hierarchical fit of Model M1 to all (FULL) participants and separate (SEP) groups.
(A) Bayesian general linear model estimates of the differences in the mean of

Uncorrected Psychometric Correlations.
(A) We conducted partial correlations between MZQ, CTQ, RGPTSB and changes in

Pearson correlation between parameters of equivalence across models M1-M4.
All models were hierarchically fitted (using the HBM package; Piray et al., 2019) without comparison to each group. We then compared prior flexibility over others in M1 and M2 (which allow for self-insertion) to the absolute difference in median shift of beliefs between phase 1 and phase 2 under M3 and M4 (which do not allow for self-insertion). We also correlated prior flexibility over others in phase 2 estimated under M1 and M2, as well as central tendency over new priors over others under M3 and M4. We find excellent convergence between approximated parameters of equivalent meaning across models.

Option pair rewards for each phase and their corresponding ‘type’.
Within phase order of trials were randomised. P=Prosocial, I=Individualistic, C=Competitive. S1 = reward to self for option 1. S2 = reward to self for option 2. O1 = reward to other for option 1. O2 = reward to other for option 2.


Model Parameters of M1-M4 Following Independent Hierarchical Fitting For All Participants.

Random-effect linear relationships between DKL, trial, group, and preferences type for each model (M1-M4) following Independent Hierarchical Fitting For All Participants.
Estimates are the scaled change in DKL as a result of each fixed effect. ID was used as a random variable to control for within-subject effects. Group effects (CON vs BPD) were analysed for the DKL within each preference type.

Bootstrapped results with their 95%CI with and without group status regressed against psychometric variables.
References
- Childhood adversity and personality disorders: Results from a nationally representative population-based studyJournal of psychiatric research 45:814–822Google Scholar
- The group fallacy in relation to social scienceAmerican Journal of Sociology 29:688–706Google Scholar
- The relational self: an interpersonal social-cognitive theoryPsychological review 109:619Google Scholar
- Bayesian first aid: A package that implements Bayesian alternatives to the classical*. test functions in RProceedings of useR :2Google Scholar
- Knowing me, knowing you: Interpersonal similarity improves predictive accuracy and reduces attributions of harmful intentCognition 225:105098Google Scholar
- Formalising social representation to explain psychiatric symptomsTrends in cognitive sciences 27:317–332Google Scholar
- A Standard Framework for Social Cognition: Interoperable algorithms for inference and representationPsyArXiv https://doi.org/10.31234/osf.io/cmgu7Google Scholar
- Mentalization based treatment for borderline personality disorderWorld psychiatry 9:11Google Scholar
- Childhood maltreatment, dissociation and borderline personality disorder: Preliminary data on the mediational role of mentalizing in complex post-traumatic stress disorderPsychology and Psychotherapy: Theory, Research and Practice Google Scholar
- Development and validation of a brief screening version of the Childhood Trauma QuestionnaireChild abuse & neglect 27:169–190Google Scholar
- Random orderings and stochastic theories of responses (1960)Economic Information, Decision, and Prediction: Selected Essays: Volume I Part I Economics of Decision Dordrecht: Springer Netherlands :172–217
- Computational mechanisms underlying social evaluation learning and associations with depressive symptoms during adolescencePsyArXiv Google Scholar
- Network analysis of multivariate data in psychological scienceNature Reviews Methods Primers 1:58Google Scholar
- Realizing Dynamic Cognitive Tasks with Cloud-based ComputationIn: 1st Annual Conference of the US Research Software Engineer Association (US-RSE 2023) Google Scholar
- Development and validation of a self-report measure of epistemic trustPloS one 16:e0250264Google Scholar
- A study of normative and informational social influences upon individual judgmentThe journal of abnormal and social psychology 51:629Google Scholar
- Learning about and from others’ prudence, impatience or laziness: The computational bases of attitude alignmentPLoS computational biology 13:e1005422Google Scholar
- Regenerate behavior and social homeostasis of termitesEcology 37:248–258Google Scholar
- Estimating psychological networks and their accuracy: A tutorial paperBehavior research methods 50:195–212Google Scholar
- Interpersonal problems in borderline personality disorder: associations with mentalizing, emotion regulation, and impulsivenessJournal of Personality Disorders 35:177–193Google Scholar
- Psychoanalytic studies of the personalityPsychology Press Google Scholar
- A theory of fairness, competition, and cooperationThe quarterly journal of economics 114:817–868Google Scholar
- The computational challenge of social learningTrends in Cognitive Sciences 25:1045–1057Google Scholar
- The structured clinical interview for DSM-III-R personality disorders (SCID-II). Part II: Multi-site test-retest reliability studyJournal of personality disorders 9:92–104Google Scholar
- A developmental, mentalization-based approach to the understanding and treatment of borderline personality disorderDevelopment and psychopathology 21:1355–1381Google Scholar
- The revised Green et al., Paranoid Thoughts Scale (R-GPTS): psychometric properties, severity ranges, and clinical cut-offsPsychological Medicine 51:244–253Google Scholar
- Learning-induced plasticity in medial prefrontal cortex predicts preference malleabilityNeuron 85:418–428Google Scholar
- Goal-directed, habitual and Pavlovian prosocial behaviorFrontiers in behavioral neuroscience 9:135Google Scholar
- Measuring ideas of persecution and social reference: the Green et al. Paranoid Thought Scales (GPTS)Psychological medicine 38:101–111Google Scholar
- Transfer of learned opponent models in zero sum gamesComputational Brain & Behavior 5:326–342Google Scholar
- Borderline personality disorderNature reviews disease primers 4:1–20Google Scholar
- A systematic review and meta-analysis of ‘Systems for Social Processes’ in borderline personality and substance use disordersNeuroscience & Biobehavioral Reviews 127:572–592Google Scholar
- Is a self-rated instrument appropriate to assess mentalization in patients with mental disorders? Development and first validation of the Mentalization Questionnaire (MZQ)Psychotherapy Research 22:699–709Google Scholar
- Aberrant computational mechanisms of social learning and decision-making in schizophrenia and borderline personality disorderPLoS computational biology 16:e1008162Google Scholar
- Computational psychiatry needs time and contextAnnual review of psychology 73:243–270Google Scholar
- A reduced self-positive belief underpins greater sensitivity to negative evaluation in socially anxious individualsComputational Psychiatry 5:21Google Scholar
- The paradoxical self: Awareness, solipsism and first-rank symptoms in schizophreniaPhilosophical Psychology 31:210–231Google Scholar
- A model of risk and mental state shifts during social interactionPLoS computational biology 14:e1005935Google Scholar
- Comparing the personality disorder interview for DSM–IV (PDI–IV) and SCID–II borderline personality disorder scales: An item–response theory analysisJournal of Personality Assessment 97:13–21Google Scholar
- The truly false consensus effect: an ineradicable and egocentric bias in social perceptionJournal of personality and social psychology 67:596Google Scholar
- Differential valuation and learning from social and nonsocial cues in borderline personality disorderBiological psychiatry 84:838–845Google Scholar
- The rupture and repair of cooperation in borderline personality disorderscience 321:806–810Google Scholar
- Loneliness, social networks, and social functioning in borderline personality disorderPersonality Disorders: Theory, Research, and Treatment 8:349Google Scholar
- Inter-rater reliability of the Structured Clinical Interview for DSM-IV Axis I disorders (SCID I) and Axis II disorders (SCID II)Clinical psychology & psychotherapy 18:75–79Google Scholar
- Interrater reliability and internal consistency of the structured clinical interview for DSM-IV axis II personality disorders (SCID-II), version 2.0Journal of personality disorders 11:279–284Google Scholar
- Attachment and borderline personality disorder as the dance unfolds: A quantitative analysis of a novel paradigmJournal of Psychiatric Research 175:470–478Google Scholar
- Conditional Logit Analysis of Qualitative Choice BehaviorIn:
- Zarembka Paul
- Social integration: Implications for the association between childhood trauma and stress responsivityPsychological trauma: theory, research, practice, and policy Google Scholar
- How people use social information to find out what to want in the paradigmatic case of inter-temporal preferencesPLoS computational biology 12:e1004965Google Scholar
- Impressions about harm are formed rapidly and then refined, modulated by serotoninSocial Cognitive and Affective Neuroscience 19:nsae078Google Scholar
- Development and validation of the Certainty About Mental States Questionnaire (CAMSQ): A self-report measure of mentalizing oneself and othersAssessment 30:651–674Google Scholar
- Measuring social value orientationJudgment and Decision making 6:771–781Google Scholar
- Validity, Reliability and Internal Consistency of Persian Versions of the Childhood Trauma Questionnaire, the Traumatic Exposure Severity Scale and the Peritraumatic Dissociative Experiences QuestionnaireJournal of Trauma & Dissociation 22:332–348Google Scholar
- The role of epistemic trust in mentalization-based treatment of borderline psychopathologyJournal of Personality Disorders 37:633–659Google Scholar
- How conformity can lead to polarised social behaviourPLoS Computational Biology 17:e1009530Google Scholar
- Hierarchical Bayesian inference for concurrent model fitting and comparison for group studiesPLoS computational biology 15:e1007043Google Scholar
- A model for learning based on the joint estimation of stochasticity and volatilityNature communications 12:6587Google Scholar
- Italian validation of the Childhood Trauma Questionnaire—Short Form on a college groupPsychological Trauma: Theory, Research, Practice, and Policy 10:563Google Scholar
- Development of the self-concept during adolescenceTrends in cognitive sciences 12:441–446Google Scholar
- A computational phenotype of disrupted moral inference in borderline personality disorderBiological Psychiatry: Cognitive Neuroscience and Neuroimaging 5:1134–1141Google Scholar
- Breaking the cycle with trauma-focused mentalization-based treatment: theory and practice of a trauma-focused group interventionFrontiers in psychology 15:1426092Google Scholar
- Neurophysiological activity following rewards and losses among female adolescents and young adults with borderline personality disorderJournal of abnormal psychology 128:610Google Scholar
- A social inference model of idealization and devaluationPsychological Review Google Scholar
- A computational signature of self-other mergence in Borderline Personality DisorderTranslational Psychiatry 14:473Google Scholar
- Behavioral contagion during learning about another agent’s risk-preferences acts on the neural representation of decision-riskProceedings of the National Academy of Sciences 113:3755–3760Google Scholar
- Prior preferences beneficially influence social and non-social learningNature Communications 8:817Google Scholar
- Contagion of temporal discounting value preferences in neurotypical and autistic adultsJournal of autism and developmental disorders :1–14Google Scholar
- Informational and normative influences in conformity from a neurocomputational perspectiveTrends in cognitive sciences 19:579–589Google Scholar
- Negative reward expectations in Borderline Personality Disorder patients: Neurophysiological evidenceBiological Psychology 94:388–396Google Scholar
- Learning from other minds: An optimistic critique of reinforcement learning models of social learningCurrent opinion in behavioral sciences 38:110–115Google Scholar
- The North American ants of the genus Camponotus MAYRAnnals of the New York Academy of Sciences 20:295–354Google Scholar
- Schema therapy: A practitioner’s guideguilford press Google Scholar
- How peer influence shapes value computation in moral decision-makingCognition 211:104641Google Scholar
- The relationship between latent state inference and (intolerance of) uncertaintyNeuroscience and Biobehavioral Reviews 152:105321Google Scholar
Article and author information
Author information
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Reviewed Preprint version 3:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.104008. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2025, Barnby et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
- views
- 651
- downloads
- 49
- citation
- 1
Views, downloads and citations are aggregated across all versions of this paper published by eLife.