In a recent Nature paper, Marek, Tervo-Clemmens (T-C) et al. evaluated the effects of sample size on univariate and multivariate Brain-Wide Association Studies (BWAS) in three large-scale neuroimaging datasets and came to the general conclusion that

image BWAS reproducibility requires samples with thousands of individuals.

But is this always the case? Let’s have a closer look at the replicability of functional connectivity-based multivariate BWAS in the Human Connectome Project, with N=500:

Replicability and predictive effect size at N=500

Interactive figures, no motion-bias (p>0.05 for all, tested with mlconfound). See analysis notebook.

image Appearantly, BWAS can be replicable with moderate sample sizes in many cases…

If you wonder how to interpret these findings in light of Marek, T-C et al.’s results:


1. Check out our commentary

“Multivariate BWAS can be replicable with moderate sample sizes”

by Tamas Spisak, Ulrike Bingel & Tor Wager, Nature, 615, E4–E7 (2023) doi: 10.1038/s41586-023-05745-x


2. Watch the video abstract


3. Read more below

Summary

  • Brain-Wide Association Studies (BWAS) correlate individual differences in phenotypic traits with measures of brain structure and function.
  • In a recent paper, Marek, Tervo-Clemmens (T-C) et al. evaluated the effects of sample size on univariate and multivariate BWAS in three large-scale neuroimaging datasets and came to the general conclusion that “BWAS reproducibility requires samples with thousands of individuals”.
  • Marek, T-C, et al. find that multivariate BWAS provide “inflated in-sample associations” that often fail to replicate (i.e., are underpowered) unless thousands of participants are included. This implies that effect size estimates from the discovery sample are necessarily inflated.
  • In our ‘Matter Arising’ commentary, we distinguish between the effect size estimation method (in-sample vs. cross-validated) and the sample (discovery vs. replication), and show that with appropriate cross-validation the in-sample “inflation” Marek, T-C, et al. report in the discovery sample can be entirely eliminated.
  • With additional analyses, we demonstrate that multivariate BWAS effects in high quality datasets can be replicable with substantially smaller sample sizes in many cases. Specifically, with a standard multivariate predictive model, functional connectivity-based BWAS is commonly replicable with N=75-500.

Take Home Messages and Recommendations

  • Replicability depends on effect size:
  • Multivariate BWAS (predictive modelling) can provide high effect sizes:
    • Go multivariate
    • Follow methodological recommendations
    • Aim at traits (cognition, personality) rather than states (e.g. emotion)
    • Incorporate within-person variation in symptoms or behavior to improve between person predictions
    • Focus on larger effects, that are usueful for making individual predictions about individuals
  • Validate Twice (internally and externally):
    • Perform internal validation during model discovery (e.g. cross-validation)
    • Finalize your model (including data preprocessing) and pre-register further validation
    • Externally validate on independent (prospective) data.
  • Large samples are still essential:
    • to maximize predictive performance
    • to evaluate
      • generalizability (e.g. out-of-center, out-of-context)
      • confounding bias (e.g. motion; can be tested with mlconfound)
      • fairness (across subpopulations)

Other papers in the topic

Authors Title Where
Nature editorial Cognitive neuroscience at the crossroads Nature
Nat. Neurosci. editorial Revisiting doubt in neuroimaging research Nat. Neurosci.
Monica D. Rosenberg and Emily S. Finn How to establish robust brain–behavior relationships without thousands of individuals Nat. Neurosci.
Bandettini P et al. The challenge of BWAS: Unknown Unknowns in Feature Space and Variance Med
Gratton C. et al. Brain-behavior correlations: Two paths toward reliability Neuron
Cecchetti L. and Handjaras G. Reproducible brain-wide association studies do not necessarily require thousands of individuals psyArXiv
Winkler A. et al. We need better phenotypes brainder.org
DeYoung C. et al. Reproducible between-person brain-behavior associations do not always require thousands of individuals psyArXiv
Gell M et al. The Burden of Reliability: How Measurement Noise Limits Brain-Behaviour Predictions bioRxiv
Tiego J. et al. Precision behavioral phenotyping as a strategy for uncovering the biological correlates of psychopathology OSF
Chakravarty MM. Precision behavioral phenotyping as a strategy for uncovering the biological correlates of psychopathology OHBM Aperture Neuro
White T. Behavioral phenotypes, stochastic processes, entropy, evolution, and individual variability: Toward a unified field theory for neurodevelopment and psychopathology OHBM Aperture Neuro
Bandettini P. Lost in transformation: fMRI power is diminished by unknown variability in methods and people OHBM Aperture Neuro
Thirion B. On the statistics of brain/behavior associations OHBM Aperture Neuro
Tiego J., Fornito A. Putting behaviour back into brain–behaviour correlation analyses OHBM Aperture Neuro
Lucina QU. Brain-behavior associations depend heavily on user-defined criteria OHBM Aperture Neuro
Valk SL., Hettner MD. Commentary on ‘Reproducible brain-wide association studies require thousands of individuals’ OHBM Aperture Neuro
Kong XZ., et al. Scanning reproducible brain-wide associations: sample size is all you need? Psychoradiology
J. Goltermann, et al. Cross-validation for the estimation of effect size generalizability in mass-univariate brain-wide association studies BioRxiv
Kang K., et al. Study design features that improve effect sizes in cross-sectional and longitudinal brain-wide association studies BioRxiv
Makowski C., et al. Reports of the death of brain-behavior associations have been greatly exaggerated BioRxiv
J. Wu et al. The challenges and prospects of brain-based prediction of behaviour Nat. Human Behaviour

image Dartmouth College

TRR289 CTBNS