Internal validation methods to detect patient characteristics associated with differential item functioning in patient-reported outcome measures

Teresi, J. A., & Fleishman, J. A. (2007). Differential item functioning and health assessment. Quality of Life Research, 16, 33–42.

Article PubMed Google Scholar

Sprangers, M. A., & Schwartz, C. E. (1999). Integrating response shift into health-related quality of life research: A theoretical model. Social Science & Medicine, 48(11), 1507–1515. https://doi.org/10.1016/S0277-9536(99)00045-3

Article CAS Google Scholar

Sajobi, T. T., Sanusi, R. A., Mayo, N. E., Sawatzky, R., Kongsgaard Nielsen, L., Sebille, V., Liu, J., Bohm, E., Awosoga, O., Norris, C. M., Wilton, S. B., James, M. T., & Lix, L. M. (2024). Unsupervised item response theory models for assessing sample heterogeneity in patient-reported outcomes measures. Quality of Life Research, 33(3), 853–864. https://doi.org/10.1007/s11136-023-03560-5

Article PubMed Google Scholar

Jones, R. N. (2019). Differential item functioning and its relevance to epidemiology. Current Epidemiology Reports, 6, 174–183. https://doi.org/10.1007/s40471-019-00194-5

Article PubMed PubMed Central Google Scholar

Berger, M., & Tutz, G. (2016). Detection of uniform and nonuniform differential item functioning by item-focused trees. Journal of Educational and Behavioral Statistics, 41(6), 559–592. https://doi.org/10.3102/107699861665937

Article Google Scholar

Strobl, C., Malley, J., & Tutz, G. (2009). An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychological Methods, 14(4), 323. https://doi.org/10.1037/a0016973

Article PubMed PubMed Central Google Scholar

Wong, T. T. (2015). Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation. Pattern Recognition, 48(9), 2839–2846. https://doi.org/10.1016/j.patcog.2015.03.009

Article Google Scholar

Yadav, S., & Shukla, S. (2016). Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification. In 2016 IEEE 6th International Conference on Advanced Computing (IACC) (pp. 78–83). IEEE. https://ieeexplore.ieee.org/abstract/document/7544814/

Bollmann, S., Berger, M., & Tutz, G. (2018). Item-focused trees for the detection of differential item functioning in partial credit models. Educational and Psychological Measurement, 78(5), 781–804. https://doi.org/10.1177/0013164417722179

Article PubMed Google Scholar

Zhang, H., & Ye, Y. (2008). A tree-based method for modeling a multivariate ordinal response. Statistics and Its Interface, 1(1), 169.

Article PubMed PubMed Central Google Scholar

Geroldinger, A., Lusa, L., Nold, M., & Heinze, G. (2023). Leave-one-out cross-validation, penalization, and differential bias of some prediction model performance measures—A simulation study. Diagnostic and Prognostic Research, 7(1), Article 9. https://doi.org/10.1186/s41512-023-00146-0

Article PubMed PubMed Central Google Scholar

Krstajic, D., Buturovic, L. J., Leahy, D. E., & Thomas, S. (2014). Cross-validation pitfalls when selecting and assessing regression and classification models. Journal of Cheminformatics, 6, 1–15. https://doi.org/10.1186/1758-2946-6-10

Article Google Scholar

Kim, S. B., Huo, X., & Tsui, K. L. (2009). A finite-sample simulation study of cross validation in tree-based models. Information Technology and Management, 10, 223–233. https://doi.org/10.1007/s10799-009-0052-7

Article Google Scholar

Bramer, M. (2007). Avoiding overfitting of decision trees. Principles of Data Mining, 119–134.

Goltermann, J., Winter, N. R., Gruber, M., Fisch, L., Richter, M., Grotegerd, D., Dohm, K., Meinert, S., Leehr, E. J., Böhnlein, J., Kraus, A., Thiel, K., Winter, A., Flinkenflügel, K., Leenings, R., Barkhau, C., Ernsting, J., Berger, K., Minnerup, H., & Dannlowski, U. (2023). Cross-validation for the estimation of effect size generalizability in mass-univariate brain-wide association studies. bioRxiv. https://doi.org/10.1101/2023.03.29.534696

Article PubMed PubMed Central Google Scholar

Arlot, S., & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79. https://doi.org/10.1214/09-SS054

Article Google Scholar

James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning (Vol. 112, p. 18). springer.

Book Google Scholar

Hastie, T., Tibshirani, R., Friedman, J. H., & Friedman, J. H. (2009). The elements of statistical learning: Data mining, inference, and prediction (Vol. 2, pp. 1–758). springer.

Google Scholar

Coley, R. Y., Liao, Q., Simon, N., & Shortreed, S. M. (2023). Empirical evaluation of internal validation methods for prediction in large-scale clinical data with rare-event outcomes: A case study in suicide risk prediction. BMC Medical Research Methodology, 23(1), Article 33. https://doi.org/10.1186/s12874-023-01844-5

Article PubMed PubMed Central Google Scholar

Tutz, G., & Berger, M. (2016). Item-focussed trees for the identification of items in differential item functioning. Psychometrika, 81, 727–750. https://doi.org/10.1007/s11336-015-9488-3

Article PubMed Google Scholar

Hothorn, T., & Lausen, B. (2003). On the exact distribution of maximally selected rank statistics. Computational Statistics & Data Analysis, 43(2), 121–137. https://doi.org/10.1016/S0167-9473(02)00225-6

Article Google Scholar

Shih, Y. S. (2004). A note on split selection bias in classification trees. Computational Statistics & Data Analysis, 45(3), 457-466.19. https://doi.org/10.1016/S0167-9473(03)00064-1

Article Google Scholar

Shih, Y. S., & Tsai, H. W. (2004). Variable selection bias in regression trees with constant fits. Computational Statistics & Data Analysis, 45(3), 595–607. https://doi.org/10.1016/S0167-9473(03)00036-7

Article Google Scholar

Strobl, C., Boulesteix, A. L., & Augustin, T. (2007). Unbiased split selection for classification trees based on the Gini index. Computational Statistics & Data Analysis, 52(1), 483–501. https://doi.org/10.1016/j.csda.2006.12.030

Article Google Scholar

Blanchin, M., Guilleux, A., Hardouin, J. B., & Sébille, V. (2020). Comparison of structural equation modelling, item response theory and Rasch measurement theory-based methods for response shift detection at item level: A simulation study. Statistical Methods in Medical Research, 29(4), 1015–1029. https://doi.org/10.1177/09622802198845

Article PubMed Google Scholar

Halpin, P. F. (2024). Differential item functioning via robust scaling. Psychometrika, 1–26. https://doi.org/10.48550/arXiv.2207.04598

Bodawatte Gedara, M. L., Monchka, B. A., & Lix, L. M. (2025). IFTpredictor: Predictions Using Item-Focused Tree Models (Version 0.1.0) [R package]. Comprehensive R Archive Network (CRAN).

Tennenhouse, L. G., Marrie, R. A., Bernstein, C. N., & Lix, L. M. (2020). Machine-learning models for depression and anxiety in individuals with immune-mediated inflammatory disease. Journal of Psychosomatic Research, 134, Article 110126. https://doi.org/10.1016/j.jpsychores.2020.110126

Article PubMed Google Scholar

Osborne, R. H., Elsworth, G. R., Sprangers, M. A. G., Oort, F. J., & Hopper, J. L. (2004). The value of the Hospital Anxiety and Depression Scale (HADS) for comparing women with early onset breast cancer with population-based reference women. Quality of Life Research, 13, 191–206. https://doi.org/10.1023/B:QURE.0000015292.56268.e7

Article CAS PubMed Google Scholar

Pallant, J. F., & Tennant, A. (2007). An introduction to the Rasch measurement model: An example using the Hospital Anxiety and Depression Scale (HADS). British Journal of Clinical Psychology, 46(1), 1–18. https://doi.org/10.1348/014466506X96931

Article PubMed Google Scholar

Sajobi, T. T., Wang, M., Awosoga, O., Santana, M. J., Southern, D. A., Liang, Z., Galbraith, D., Wilton, S. B., Quan, H., Graham, M. M., James, M. T., Ghali, W. A., Knudtson, M. L., & Norris, C. M. (2018). Trajectories of health-related quality of life in coronary artery disease. Circulation: Cardiovascular Quality and Outcomes, 11(3), Article e003661. https://doi.org/10.1161/CIRCOUTCOMES.117.003661

Article PubMed Google Scholar

Kwan, A., Marzouk, S., Ghanean, H., Kishwar, A., Anderson, N., Bonilla, D., Vitti, M., Su, J., & Touma, Z. (2019). Assessment of the psychometric properties of patient-reported outcomes of depression and anxiety in systemic lupus erythematosus. Seminars in Arthritis and Rheumatism, 49(2), 260–266. https://doi.org/10.1016/j.semarthrit.2019.03.004

Article PubMed Google Scholar

Christensen, A. V., Dixon, J. K., Juel, K., Ekholm, O., Rasmussen, T. B., Borregaard, B., Mols, R. E., Thrysøe, L., Thorup, C. B., & Berg, S. K. (2020). Psychometric properties of the Danish Hospital Anxiety and Depression Scale in patients with cardiac disease: Results from the DenHeart survey. Health and Quality of Life Outcomes, 18, 1–13. https://doi.org/10.1186/s12955-019-1264-0

Article Google Scholar

McCartney, L., Johnstone, B., O’Brien, T., Kwan, P., Kalincik, T., Velakoulis, D., & Malpas, C. (2020). Psychometric properties of the Hospital Anxiety and Depression Scale in an inpatient video-monitoring epilepsy cohort. Epilepsy & Behavior, 103, Article 106631. https://doi.org/10.1016/j.yebeh.2019.106631

Article Google Scholar

Stafford, L., Berk, M., & Jackson, H. J. (2007). Validity of the Hospital Anxiety and Depression Scale and Patient Health Questionnaire-9 to screen for depression in patients with coronary artery disease. General Hospital Psychiatry, 29(5), 417–424. https://doi.org/10.1016/j.genhosppsych.2007.06.005

Article PubMed

View original article

QUALITY OF LIFE RESEARCH

Like

Share Bookmark

0 0 0 0 0 0 0

More from this channel

Internal validation methods to detect patient characteristics associated with differential item functioning in patient-reported outcome measures

Comments (0)