Empirical assessment of fragility index based on a large database of clinical studies in the Cochrane Library
Affiliation
Department of Epidemiology and Biostatistics, University of ArizonaIssue Date
2022-11-02
Metadata
Show full item recordPublisher
John Wiley and Sons IncCitation
Xing, A., & Lin, L. (2022). Empirical assessment of fragility index based on a large database of clinical studies in the Cochrane Library. Journal of Evaluation in Clinical Practice.Rights
© 2022 John Wiley & Sons Ltd.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Rationale Aims and Objectives: The fragility index (FI) and fragility quotient (FQ) are increasingly used measures for assessing the robustness of clinical studies with binary outcomes in terms of statistical significance. The FI is the minimum number of event status modifications that can alter a study result's statistical significance (or nonsignificance), and the FQ is calculated as the FI divided by the study's total sample size. The literature has no widely recognized criteria for interpreting the fragility measures' magnitudes. This article aims to provide an empirical assessment for the FI and FQ based on a large database of clinical studies in the Cochrane Library. Methods: We explored the overall empirical distributions of the FI and FQ based on five common methods (Fisher's exact test, χ2 test, risk difference, odds ratio, and relative risk) for determining statistical significance of binary outcomes in clinical research. We also considered three different scenarios for the FI calculation and evaluated the relationship between p values and FIs or FQs using Spearman's (Formula presented.). Finally, we summarized empirical thresholds based on the overall distributions of the FI and FQ to facilitate their interpretations in future research. Results: For about 20% of studies with significant results, the statistical significance was changed after modifying the event status of only one participant. Studies with significant results were considered slightly fragile if the significance hinged on the statuses of about five events. Studies were extremely fragile if FI (Formula presented.) 1 or FQ (Formula presented.) 0.01. The FIs were strongly correlated with p values for significant studies, while Spearman's (Formula presented.) varied according to the total sample sizes of studies. Conclusions: The statistical significance of clinical studies could be changed after modifying a few events' statuses. Many studies' findings are fairly fragile. The distributions of the FI and FQ provide insights for appraising the robustness of evidence in clinical decision-making.Note
12 month embargo; first published: 02 November 2022EISSN
1365-2753PubMed ID
36322140Version
Final accepted manuscriptae974a485f413a2113503eed53cd6c53
10.1111/jep.13787

