Inconsistent outcome measurement in depression psychotherapy trials: A systematic historical and meta-analytic review over the past 50 years

ElsevierVolume 397, 15 March 2026, 120873Journal of Affective DisordersAuthor links open overlay panel, , , , , , , , , Highlights•

Over the past 50 years, PHQ-9 and HDRS administration increased in psychotherapy trials.

Similarly, the use of OMIs for depression in these trials grew more heterogeneous.

Particularly the CES-D and GDS predicted increased and decreased treatment effect estimates.

Meta-analyses incorporating various OMIs for depression should be interpreted cautiously.

Core outcome sets for depression could improve robustness of meta-analytic evidence.

AbstractBackground

Psychotherapy studies employ different outcome measurement instruments (OMIs) for depression, which may limit the interpretability of meta-analytic findings. This meta-epidemiological study aimed to examine trends in OMI use in randomised controlled trials (RCTs) for depression, consistency of measurement practices over time, and their influence on meta-analytic effect estimates.

Methods

A systematic literature search was conducted in four databases up to September 01, 2024. We included RCTs comparing psychotherapy to control conditions in adults with depression using OMIs for depression, present in at least ten comparisons. Change over time in OMI usage frequencies and heterogeneity (perplexity) was analysed with Poisson and linear regression models. Confounder-adjusted meta-regression and network meta-analysis examined the dependence of treatment effect estimates on different OMIs.

Results

We included 492 trials (1260 comparisons in ∼51,000 patients), published between 1977 and 2024, using 17 different OMIs for depression. The proportion of trials using the Patient Health Questionnaire (PHQ-9; b = 1.04, SE = 0.30, p < 0.001) and Hamilton Rating Scale for Depression (HDRS; b = 0.29, SE = 0.12, p = 0.021), but also heterogeneity in OMI use (b = 0.12, SE = 0.01, p < 0.001), increased significantly over time. Particularly the Centre for Epidemiological Studies Depression Scale (CES-D) and Geriatric Depression Scale (GDS) predicted increased and decreased treatment effect estimates.

Conclusions

Despite increases in PHQ-9 and HRSD administration, OMI heterogeneity in depression psychotherapy research has grown over the past 50 years. Since different OMIs, especially CES-D and GDS, affect treatment effect estimates, meta-analyses incorporating various OMIs for depression should be interpreted cautiously. Core outcome sets for depression could improve robustness of meta-analytic evidence.

AbbreviationsBDI-1

Beck Depression Inventory

BDI-2

Beck Depression Inventory-II

CES-D

Center for Epidemiologic Studies Depression Scale

DSM

Diagnostic and Statistical Manual of Mental Disorders

EPDS

Edinburgh Postnatal Depression Scale

GDS

Geriatric Depression Scale

HDRS

Hamilton Rating Scale for Depression

MADRS

Montgomery-Åsberg Depression Rating Scale

MDD

Major Depressive Disorder

MINI

Mini-International Neuropsychiatric Interview

OMI

Outcome Measurement Instrument

PHQ-9

Patient Health Questionnaire

PRISMA

Preferred Reporting Items for Systematic Reviews and Meta-Analyses

RCT

Randomised Controlled Trial

SCID

Structured Clinical Interview for DSM

SUCRA

Surface Under the Cumulative Ranking Curve

USA

United States of America

© 2025 The Authors. Published by Elsevier B.V.

Comments (0)

No login
gif