Response to comment on ‘Criterion placement threatens the construct validity of neural measures of consciousness’

Department of Applied and Experimental Psychology, Vrije Universiteit Amsterdam, Netherlands
Institute for Brain and Behavior Amsterdam (iBBA), Vrije Universiteit Amsterdam, Netherlands
Department of Psychology, University of Amsterdam, Netherlands
Amsterdam Brain and Cognition, University of Amsterdam, Netherlands
Cognitive Psychology Unit, Institute of Psychology and Leiden Institute for Brain and Cognition, Leiden University, Netherlands
Department of Psychology, University of Lübeck, Germany
Center of Brain, Behavior and Metabolism, University of Lübeck, Germany

Aug 21, 2025

Abstract

In Fahrenfort et al., 2025, we show that non-perceptual criterion shifts influence neural measures of consciousness. We fully agree (and point out in our article) that subjective measures were already known to be sensitive to criterion confounds, and we are happy to read that Sandberg and Overgaard acknowledge this in their comment (Sandberg and Overgaard, 2025). However, we contest the claim that the main findings of our simulations and empirical studies had already been demonstrated. Several findings from our studies are novel, such as the fact that criterion effects reveal themselves as over- (or under-)estimations of both conscious and unconscious processing in tandem, and that this has tangible implications when analyzing real neural data. We also challenge the suggestion that our experimental manipulations are (too) radical compared to signal-to-noise variations that occur naturally between experiments.

Assumptions of subjective measures

In Fahrenfort et al., 2025, we state that the assumption underlying the Perceptual Awareness Scale (PAS) is that participants select [0] only if trials are ‘truly’ unseen. This statement is derived from the descriptions of the PAS itself, in which [0] is defined as: “No experience. No impression of the stimulus is experienced. All answers are experienced as mere guessing.” (Overgaard et al., 2006); “No experience” (Sandberg et al., 2010); or “No experience: No subjective experience of the stimulus, not even the ‘faintest sensation’ that anything was presented at all. Not even a feeling that something might have been presented” (Overgaard and Sandberg, 2021). Accepting the validity of this category clearly implies that it measures what the label refers to, namely that the stimulus is ‘truly’ not experienced (although the exact phrasing of this category may vary slightly between studies). This is indeed how it is used in the literature, for example when making claims about the existence of unconscious working memory, and in many other claims regarding unconscious processing (see references in our manuscript).

Sandberg and Overgaard now claim that it is a niche position to suggest that “awareness ratings should be treated as flawless insights into participants’ experience”, referring to their own care in highlighting that this implication is unwarranted (Sandberg and Overgaard, 2025). They further underpin this by pointing out that 32 researchers advise against relying on subjective measures alone to establish evidence of unconscious processing (Stockart et al., 2024, p. 14). This is indeed wise advice, and it is supported by our manuscript. We would like to note, however, that a relatively recent survey has shown that most consciousness researchers believe that subjective measures, and the PAS specifically, are the best measures to check whether a stimulus is consciously perceived (Francken et al., 2022, Figure 4). We fail to see how highlighting imperfections of subjective measures can serve as an argument in their defense, or how these concerns disappear when they are acknowledged. Similarly, we fail to see how the continued use of subjective measures (with or without the acknowledgement of their imperfection) builds support for their continued use.

Similarity of findings – different conclusions

A second criticism by Sandberg and Overgaard is that ‘the main findings [of our manuscript] have already been demonstrated in other experiments’. To support this idea, they reference a publication (Sandberg et al., 2022) in which they model behavioral data from subjective reports. We have trouble matching the results presented in that article to the data, simulations and analyses from our own publication. Their article is not about neural measures of consciousness, nor does it contain brain imaging data. As such, it bears little resemblance to our studies. Our article is specifically about the effect that criterion shifts have when data are sorted post hoc to create neural conditions that are claimed to correspond to ‘real’ subjective states (such as those expressed in the labels of the PAS, or in a ‘yes’/‘no’ response). Further, we use empirical data from two EEG experiments to support the claim that the effects we model in a signal detection framework are not merely theoretical artifacts but can have real implications for claims regarding the neural correlates of consciousness (NCC). The only correspondence seems to be that we agree on the somewhat unreliable nature of the PAS response, namely that the PAS is not exhaustive (i.e. it may or may not capture weak conscious experiences depending on the experimental context).

Further, Sandberg and Overgaard claim that they had already established that report criteria depend on experimental context (Skewes et al., 2021). However, that article is intrinsically very different from ours, because it provided false performance feedback, which also affected participants’ accuracy. In our experiments we only provided veridical feedback and specifically kept accuracy the same between conditions, so that the effects are criterion specific and not related to general effects of sensitivity or other cognitive effects. More importantly, the aim of our paper was not to show that report criteria depend on experimental context. Rather, we took this as a starting point to show the effect of this contextual change on neural correlates based on subjective measures and post-hoc sorting, including the PAS. Furthermore, we not only show that the experimental context influences report criteria, which should be well known (although we contest that this is widely accepted, given the lenient use of subjective measures in the literature), but we also show through simulation that the experimental context determines whether the relative confounding effect of criterion placement is larger in neural measures of conscious processing or in neural measures of unconscious processing.
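
The logic of a pure criterion shift under constant sensitivity can be illustrated with a minimal sketch based on an equal-variance signal detection model. This is not the simulation reported in Fahrenfort et al., 2025: the d′ value, the two criterion values, the trial count and the function name `simulate` are assumptions chosen purely for illustration, and the internal ‘evidence’ variable merely stands in for a neural measure.

```python
import random
from statistics import NormalDist, mean


def simulate(criterion, d_prime=1.5, n_trials=20000, seed=1):
    """Equal-variance SDT observer: report 'seen' when evidence exceeds the criterion.

    All parameter values are illustrative assumptions, not values from the article.
    """
    rng = random.Random(seed)
    seen, unseen = [], []  # evidence on target-present trials, sorted post hoc by report
    hits = false_alarms = 0
    for _ in range(n_trials):
        signal = rng.gauss(d_prime, 1.0)  # evidence on a target-present trial
        noise = rng.gauss(0.0, 1.0)       # evidence on a target-absent trial
        if signal > criterion:
            hits += 1
            seen.append(signal)
        else:
            unseen.append(signal)
        if noise > criterion:
            false_alarms += 1
    z = NormalDist().inv_cdf
    # Sensitivity estimated from behavior alone: d' = z(hit rate) - z(false alarm rate)
    d_est = z(hits / n_trials) - z(false_alarms / n_trials)
    return {"d_prime": d_est, "mean_seen": mean(seen), "mean_unseen": mean(unseen)}


liberal = simulate(criterion=0.25)       # 'seen' reported readily
conservative = simulate(criterion=1.25)  # 'seen' reported reluctantly

for label, result in (("liberal", liberal), ("conservative", conservative)):
    print(label, {key: round(value, 2) for key, value in result.items()})
```

Because both runs use the same random seed, they share identical evidence samples, so any difference between them reflects criterion placement alone. The estimated d′ stays close to 1.5 in both runs, whereas the mean evidence on target-present trials is higher under the conservative criterion for both the ‘seen’ and the ‘unseen’ category, which is the in-tandem shift described above.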

Assumptions of instructions

Next, Sandberg and Overgaard contend that we made changes to the PAS that we ourselves consider so substantial that it may be argued that we did not use the PAS at all (referencing our Discussion). However, this is not what we argued. Rather, we merely acknowledged Sandberg and Overgaard’s potential concern about our use of the PAS, and we did so already in the first version of the manuscript, prior to three peer reviews. Subsequently, after personal communication with them, we conceded that a particular sentence in the instructions that Sandberg and Overgaard highlight (“Only press 0 if you are 100% convinced that no square appeared and only press 3 if you are 100% convinced that a square appeared.”) might be misconstrued by participants as asking for a general confidence rating. To give them a fair hearing in our article, we expanded on this issue in an updated Discussion, and we asked them beforehand whether they agreed with the way we explicitly highlight this concern in the Discussion of our Version of Record (which they did).

However, this does not mean that we believe that our usage of the scale should not be characterized as the PAS. The PAS refers to verbal labels of a scale, i.e. it is a measurement instrument. In our experiment, the proper PAS labels were used throughout the experiment and repeatedly shown to remind participants of what we specifically asked of them. Thus, we disagree with the statement that we “did not use the PAS at all” and we do not believe that the single sentence that Sandberg and Overgaard highlighted changes this fact or would have meaningfully changed the outcome of the study.

The other criticism is that we used punishments in our experiment, and that “punishment can be used to disrupt the result of essentially any psychological test”. This might be true if we had provided participants with false feedback, which we did not. We only gave veridical feedback to experimentally create criterion shifts, the same shifts that Sandberg and Overgaard acknowledge also occur naturally in other contexts, given their admission that the PAS is not an exhaustive measure. We made explicit that our criterion manipulation was strong (“As such, the current experiment can be viewed as a caricature of actual experimental practice”, top of page 13), but we disagree that such changes do not occur across experimental contexts, or even that our experimental context is highly unusual (see references in our manuscript). Indeed, one can observe similar criterion shifts in experiments comparing experimental blocks with different base rates of stimulus occurrence (e.g., more vs. fewer targets) without punishment manipulations (Sánchez-Fuenzalida et al., 2025; Sánchez-Fuenzalida et al., 2023). These studies also reveal that such criterion shifts do not affect conscious experience. If a measurement instrument is strongly affected by naturally occurring contextual variations, we believe it is reasonable to use experimental manipulation to show how these confounds can materialize in real data.
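
How a base-rate manipulation can move the criterion without any change in sensitivity can be sketched with the textbook ideal-observer result for equal-variance signal detection, in which the accuracy-maximizing criterion for target probability p is c = d′/2 + ln((1 − p)/p)/d′. Treating participants as ideal observers is an assumption made only for illustration, as are the function name `optimal_criterion` and the parameter values; the cited studies do not necessarily model criterion shifts in this way.

```python
from math import log


def optimal_criterion(p_target, d_prime=1.5):
    """Accuracy-maximizing criterion on the evidence axis.

    Assumes equal-variance Gaussian evidence with noise mean 0 and signal mean d'.
    The ideal-observer assumption and the default d' are illustrative only.
    """
    if not 0.0 < p_target < 1.0:
        raise ValueError("p_target must lie strictly between 0 and 1")
    # Respond 'present' when the likelihood ratio exceeds the prior odds (1 - p) / p
    return d_prime / 2 + log((1 - p_target) / p_target) / d_prime


for p in (0.25, 0.50, 0.75):
    print(f"P(target) = {p:.2f} -> criterion = {optimal_criterion(p):.2f}")
```

With sensitivity held fixed, rarer targets call for a more conservative criterion and more frequent targets for a more liberal one, so a block-wise change in base rate is sufficient to shift reports in this idealized setting.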

Conclusions

In conclusion, we do not agree that our study primarily adds detail to what was already known about the limitations of subjective measures, as we explain here and in the manuscript. We also contest that our criticisms can be mitigated simply by acknowledging limitations of subjective measures. Claims based on post-hoc sorting of subjective reports are not likely to agree across experimental contexts, so that conclusions regarding the depth or extent of unconscious processing, or regarding the temporal or spatial profile of the NCC, vary considerably across the literature (Yaron et al., 2022). With respect to subjective measures, this is not due to mere ‘limitations’, but to a serious confound in neural activation patterns based on subjective measures, one that deserves considerable attention. We hope our manuscript contributes to raising awareness about this confound.

Data availability

This manuscript does not contain data.

References

Fahrenfort et al. (2025) Criterion placement threatens the construct validity of neural measures of consciousness
eLife 13:RP102335.
https://doi.org/10.7554/eLife.102335

Francken et al. (2022) An academic survey on theoretical foundations, common assumptions and the current state of consciousness science
Neuroscience of Consciousness 2022:niac011.
https://doi.org/10.1093/nc/niac011

Overgaard et al. (2006) Is conscious perception gradual or dichotomous? A comparison of report methodologies during a visual task
Consciousness and Cognition 15:700–708.
https://doi.org/10.1016/j.concog.2006.04.002

Overgaard M, Sandberg K (2021) The perceptual awareness scale-recent controversies and debates
Neuroscience of Consciousness 2021:niab044.
https://doi.org/10.1093/nc/niab044

Sánchez-Fuenzalida et al. (2023) Predictions and rewards affect decision-making but not subjective experience
PNAS 120:e2220749120.
https://doi.org/10.1073/pnas.2220749120

Sánchez-Fuenzalida et al. (2025) Confidence reports during perceptual decision making dissociate from changes in subjective experience
PsyArXiv (preprint).
https://doi.org/10.31234/osf.io/xa4fj

Sandberg et al. (2010) Measuring consciousness: is one measure better than the other?
Consciousness and Cognition 19:1069–1078.
https://doi.org/10.1016/j.concog.2009.12.013

Sandberg et al. (2022) A window of subliminal perception
Behavioural Brain Research 426:113842.
https://doi.org/10.1016/j.bbr.2022.113842

Sandberg K, Overgaard M (2025) Comment on ‘Criterion placement threatens the construct validity of neural measures of consciousness’
eLife 14:e106963.
https://doi.org/10.7554/eLife.106963

Skewes et al. (2021) Awareness and confidence in perceptual decision-making
Brain Multiphysics 2:100030.
https://doi.org/10.1016/j.brain.2021.100030

Stockart F, Schreiber M, Amerio P, Carmel D, Cleeremans A, Deouell L, Dienes Z, Elosegi P, Gayet S, Goldstein A, Halchin AM, Hesselmann G, Kimchi R, Lamy D, Loued-Khenissi L, Meyen S, Micher N, Pitts M, Salomon R, Sandberg K, Schnepf IA, Schurger A, Shanks D, Soto D, Tal A, Trübutschek D, Vadillo MA, van Gaal S, Yaron I, Zheng Z, Faivre N, Mudrik L (2024) Studying unconscious processing: contention and consensus
PsyArXiv (preprint).
https://doi.org/10.31234/osf.io/bkxzh

Yaron I, Melloni L, Pitts M, Mudrik L (2022) The ConTraSt database for analysing and comparing empirical studies of consciousness theories
Nature Human Behaviour 6:593–604.
https://doi.org/10.1038/s41562-021-01284-5

Article and author information

Author details

Johannes Jacobus Fahrenfort
1. Department of Applied and Experimental Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
2. Institute for Brain and Behavior Amsterdam (iBBA), Vrije Universiteit Amsterdam, Amsterdam, Netherlands
3. Department of Psychology, University of Amsterdam, Amsterdam, Netherlands
4. Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, Netherlands
Contribution
Writing – original draft, Writing – review and editing

For correspondence
j.j.fahrenfort@vu.nl

Competing interests
No competing interests declared

ORCID: 0000-0002-9025-3436

Philippa A Johnson

Cognitive Psychology Unit, Institute of Psychology and Leiden Institute for Brain and Cognition, Leiden University, Leiden, Netherlands

Contribution
Writing – review and editing

Competing interests
No competing interests declared

ORCID: 0000-0002-6125-3138

Niels Kloosterman
1. Department of Psychology, University of Lübeck, Lübeck, Germany
2. Center of Brain, Behavior and Metabolism, University of Lübeck, Lübeck, Germany
Contribution
Writing – review and editing

Competing interests
No competing interests declared

ORCID: 0000-0002-1134-7996

Timo Stein
1. Department of Psychology, University of Amsterdam, Amsterdam, Netherlands
2. Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, Netherlands
Contribution
Writing – review and editing

Competing interests
No competing interests declared

ORCID: 0000-0002-8484-0933

Simon van Gaal
1. Department of Psychology, University of Amsterdam, Amsterdam, Netherlands
2. Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, Netherlands
Contribution
Funding acquisition, Writing – review and editing

Competing interests
No competing interests declared

ORCID: 0000-0001-6628-4534

Funding

HORIZON EUROPE European Research Council (10.3030/715605)

Simon van Gaal

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.