Internal validation methods to detect patient characteristics associated with differential item functioning in patient-reported outcome measures

Teresi, J. A., & Fleishman, J. A. (2007). Differential item functioning and health assessment. Quality of Life Research, 16, 33–42.

Article  PubMed  Google Scholar 

Sprangers, M. A., & Schwartz, C. E. (1999). Integrating response shift into health-related quality of life research: A theoretical model. Social Science & Medicine, 48(11), 1507–1515. https://doi.org/10.1016/S0277-9536(99)00045-3

Article  CAS  Google Scholar 

Sajobi, T. T., Sanusi, R. A., Mayo, N. E., Sawatzky, R., Kongsgaard Nielsen, L., Sebille, V., Liu, J., Bohm, E., Awosoga, O., Norris, C. M., Wilton, S. B., James, M. T., & Lix, L. M. (2024). Unsupervised item response theory models for assessing sample heterogeneity in patient-reported outcomes measures. Quality of Life Research, 33(3), 853–864. https://doi.org/10.1007/s11136-023-03560-5

Article  PubMed  Google Scholar 

Jones, R. N. (2019). Differential item functioning and its relevance to epidemiology. Current Epidemiology Reports, 6, 174–183. https://doi.org/10.1007/s40471-019-00194-5

Article  PubMed  PubMed Central  Google Scholar 

Berger, M., & Tutz, G. (2016). Detection of uniform and nonuniform differential item functioning by item-focused trees. Journal of Educational and Behavioral Statistics, 41(6), 559–592. https://doi.org/10.3102/107699861665937

Article  Google Scholar 

Strobl, C., Malley, J., & Tutz, G. (2009). An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychological Methods, 14(4), 323. https://doi.org/10.1037/a0016973

Article  PubMed  PubMed Central  Google Scholar 

Wong, T. T. (2015). Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation. Pattern Recognition, 48(9), 2839–2846. https://doi.org/10.1016/j.patcog.2015.03.009

Article  Google Scholar 

Yadav, S., & Shukla, S. (2016). Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification. In 2016 IEEE 6th International Conference on Advanced Computing (IACC) (pp. 78–83). IEEE. https://ieeexplore.ieee.org/abstract/document/7544814/

Bollmann, S., Berger, M., & Tutz, G. (2018). Item-focused trees for the detection of differential item functioning in partial credit models. Educational and Psychological Measurement, 78(5), 781–804. https://doi.org/10.1177/0013164417722179

Article  PubMed  Google Scholar 

Zhang, H., & Ye, Y. (2008). A tree-based method for modeling a multivariate ordinal response. Statistics and Its Interface, 1(1), 169.

Article  PubMed  PubMed Central  Google Scholar 

Geroldinger, A., Lusa, L., Nold, M., & Heinze, G. (2023). Leave-one-out cross-validation, penalization, and differential bias of some prediction model performance measures—A simulation study. Diagnostic and Prognostic Research, 7(1), Article 9. https://doi.org/10.1186/s41512-023-00146-0

Article  PubMed  PubMed Central  Google Scholar 

Krstajic, D., Buturovic, L. J., Leahy, D. E., & Thomas, S. (2014). Cross-validation pitfalls when selecting and assessing regression and classification models. Journal of Cheminformatics, 6, 1–15. https://doi.org/10.1186/1758-2946-6-10

Article  Google Scholar 

Kim, S. B., Huo, X., & Tsui, K. L. (2009). A finite-sample simulation study of cross validation in tree-based models. Information Technology and Management, 10, 223–233. https://doi.org/10.1007/s10799-009-0052-7

Article  Google Scholar 

Bramer, M. (2007). Avoiding overfitting of decision trees. Principles of Data Mining, 119–134.

Goltermann, J., Winter, N. R., Gruber, M., Fisch, L., Richter, M., Grotegerd, D., Dohm, K., Meinert, S., Leehr, E. J., Böhnlein, J., Kraus, A., Thiel, K., Winter, A., Flinkenflügel, K., Leenings, R., Barkhau, C., Ernsting, J., Berger, K., Minnerup, H., & Dannlowski, U. (2023). Cross-validation for the estimation of effect size generalizability in mass-univariate brain-wide association studies. bioRxiv. https://doi.org/10.1101/2023.03.29.534696

Article  PubMed  PubMed Central  Google Scholar 

Arlot, S., & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79. https://doi.org/10.1214/09-SS054

Article  Google Scholar 

James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning (Vol. 112, p. 18). springer.

Book  Google Scholar 

Hastie, T., Tibshirani, R., Friedman, J. H., & Friedman, J. H. (2009). The elements of statistical learning: Data mining, inference, and prediction (Vol. 2, pp. 1–758). springer.

Google Scholar 

Coley, R. Y., Liao, Q., Simon, N., & Shortreed, S. M. (2023). Empirical evaluation of internal validation methods for prediction in large-scale clinical data with rare-event outcomes: A case study in suicide risk prediction. BMC Medical Research Methodology, 23(1), Article 33. https://doi.org/10.1186/s12874-023-01844-5

Article  PubMed  PubMed Central  Google Scholar 

Tutz, G., & Berger, M. (2016). Item-focussed trees for the identification of items in differential item functioning. Psychometrika, 81, 727–750. https://doi.org/10.1007/s11336-015-9488-3

Article  PubMed  Google Scholar 

Hothorn, T., & Lausen, B. (2003). On the exact distribution of maximally selected rank statistics. Computational Statistics & Data Analysis, 43(2), 121–137. https://doi.org/10.1016/S0167-9473(02)00225-6

Article  Google Scholar 

Shih, Y. S. (2004). A note on split selection bias in classification trees. Computational Statistics & Data Analysis, 45(3), 457-466.19. https://doi.org/10.1016/S0167-9473(03)00064-1

Article  Google Scholar 

Shih, Y. S., & Tsai, H. W. (2004). Variable selection bias in regression trees with constant fits. Computational Statistics & Data Analysis, 45(3), 595–607. https://doi.org/10.1016/S0167-9473(03)00036-7

Article  Google Scholar 

Strobl, C., Boulesteix, A. L., & Augustin, T. (2007). Unbiased split selection for classification trees based on the Gini index. Computational Statistics & Data Analysis, 52(1), 483–501. https://doi.org/10.1016/j.csda.2006.12.030

Article  Google Scholar 

Blanchin, M., Guilleux, A., Hardouin, J. B., & Sébille, V. (2020). Comparison of structural equation modelling, item response theory and Rasch measurement theory-based methods for response shift detection at item level: A simulation study. Statistical Methods in Medical Research, 29(4), 1015–1029. https://doi.org/10.1177/09622802198845

Article  PubMed  Google Scholar 

Halpin, P. F. (2024). Differential item functioning via robust scaling. Psychometrika, 1–26. https://doi.org/10.48550/arXiv.2207.04598

Bodawatte Gedara, M. L., Monchka, B. A., & Lix, L. M. (2025). IFTpredictor: Predictions Using Item-Focused Tree Models (Version 0.1.0) [R package]. Comprehensive R Archive Network (CRAN).

Tennenhouse, L. G., Marrie, R. A., Bernstein, C. N., & Lix, L. M. (2020). Machine-learning models for depression and anxiety in individuals with immune-mediated inflammatory disease. Journal of Psychosomatic Research, 134, Article 110126. https://doi.org/10.1016/j.jpsychores.2020.110126

Article  PubMed  Google Scholar 

Osborne, R. H., Elsworth, G. R., Sprangers, M. A. G., Oort, F. J., & Hopper, J. L. (2004). The value of the Hospital Anxiety and Depression Scale (HADS) for comparing women with early onset breast cancer with population-based reference women. Quality of Life Research, 13, 191–206. https://doi.org/10.1023/B:QURE.0000015292.56268.e7

Article  CAS  PubMed  Google Scholar 

Pallant, J. F., & Tennant, A. (2007). An introduction to the Rasch measurement model: An example using the Hospital Anxiety and Depression Scale (HADS). British Journal of Clinical Psychology, 46(1), 1–18. https://doi.org/10.1348/014466506X96931

Article  PubMed  Google Scholar 

Sajobi, T. T., Wang, M., Awosoga, O., Santana, M. J., Southern, D. A., Liang, Z., Galbraith, D., Wilton, S. B., Quan, H., Graham, M. M., James, M. T., Ghali, W. A., Knudtson, M. L., & Norris, C. M. (2018). Trajectories of health-related quality of life in coronary artery disease. Circulation: Cardiovascular Quality and Outcomes, 11(3), Article e003661. https://doi.org/10.1161/CIRCOUTCOMES.117.003661

Article  PubMed  Google Scholar 

Kwan, A., Marzouk, S., Ghanean, H., Kishwar, A., Anderson, N., Bonilla, D., Vitti, M., Su, J., & Touma, Z. (2019). Assessment of the psychometric properties of patient-reported outcomes of depression and anxiety in systemic lupus erythematosus. Seminars in Arthritis and Rheumatism, 49(2), 260–266. https://doi.org/10.1016/j.semarthrit.2019.03.004

Article  PubMed  Google Scholar 

Christensen, A. V., Dixon, J. K., Juel, K., Ekholm, O., Rasmussen, T. B., Borregaard, B., Mols, R. E., Thrysøe, L., Thorup, C. B., & Berg, S. K. (2020). Psychometric properties of the Danish Hospital Anxiety and Depression Scale in patients with cardiac disease: Results from the DenHeart survey. Health and Quality of Life Outcomes, 18, 1–13. https://doi.org/10.1186/s12955-019-1264-0

Article  Google Scholar 

McCartney, L., Johnstone, B., O’Brien, T., Kwan, P., Kalincik, T., Velakoulis, D., & Malpas, C. (2020). Psychometric properties of the Hospital Anxiety and Depression Scale in an inpatient video-monitoring epilepsy cohort. Epilepsy & Behavior, 103, Article 106631. https://doi.org/10.1016/j.yebeh.2019.106631

Article  Google Scholar 

Stafford, L., Berk, M., & Jackson, H. J. (2007). Validity of the Hospital Anxiety and Depression Scale and Patient Health Questionnaire-9 to screen for depression in patients with coronary artery disease. General Hospital Psychiatry, 29(5), 417–424. https://doi.org/10.1016/j.genhosppsych.2007.06.005

Article  PubMed 

Comments (0)

No login
gif