Asian Spine J Search

CLOSE


Asian Spine J > Online first
Yu, Mohamed, Kurapatti, Song, Huang, Singh, Alasadi, Grewal, Nephade, Namiri, Corvi, Kim, and Cho: How statistically fragile are randomized controlled trial outcomes of balloon kyphoplasty for vertebral compression fractures?

Abstract

Randomized controlled trials (RCTs) are essential for guiding the treatment of vertebral compression fractures with kyphoplasty, yet the statistical robustness of their findings has not been thoroughly evaluated. This systematic review assessed the fragility of dichotomous outcomes in kyphoplasty RCTs using the fragility index (FI), reverse fragility index (rFI), and fragility quotient (FQ). A comprehensive search of PubMed, Embase, and MEDLINE identified eligible RCTs, yielding 282 dichotomous outcomes for analysis. The overall median FI was 4 (interquartile range [IQR], 4–5), corresponding to a median FQ of 0.020 (IQR, 0.013–0.040). Statistically significant outcomes (n=18) demonstrated greater fragility, with a median FI of 2 (IQR, 1–4) and FQ of 0.015 (IQR, 0.011–0.029), compared with nonsignificant outcomes (n=264; FI, 5 [IQR, 4–5]; FQ, 0.020 [IQR, 0.013–0.041]). Subgroup analyses revealed that pain and complication outcomes had median FIs of 4 and 5, respectively, with FQs ranging from 0.016 to 0.025. Cement leakage outcomes were the most fragile (FI, 4; FQ, 0.048). More than half of the analyzed outcomes involved a greater number of patients lost to follow-up than their corresponding FI, underscoring the need for cautious interpretation of kyphoplasty trial results and supporting the inclusion of fragility metrics in future trial reporting to better inform clinical decision-making.

Introduction

Kyphoplasty has become a principal intervention for vertebral compression fractures (VCFs), particularly amid growing concerns over the efficacy and safety of vertebroplasty [14]. This minimally invasive procedure involves balloon inflation within the fractured vertebra to restore height, followed by cement augmentation, thereby achieving the dual benefits of vertebral stabilization and deformity correction [5,6]. Between 2007 and 2014, the use of both vertebroplasty and kyphoplasty declined, although vertebroplasty showed a sharper decrease. This trend reflects evolving clinical guidelines, increasing scrutiny of vertebroplasty outcomes, and the perception of kyphoplasty as a safer and more effective alternative [7]. However, debate persists regarding kyphoplasty’s long-term benefits, cost-effectiveness, and comparative efficacy relative to non-surgical and emerging conservative treatments [6,810].
Central to this debate is the quality and statistical rigor of randomized controlled trials (RCTs), which form the cornerstone of evidence-based clinical practice [11]. Although p-values remain the conventional measure of statistical significance, excessive reliance on them can obscure the true robustness of trial results. Because p-values are highly sensitive to sample size, data variability, and methodological nuances, a statistically significant result may hinge on only a few event reversals, which challenges the stability of clinical conclusions [12]. Fragility metrics such as the fragility index (FI), reverse fragility index (rFI), and fragility quotient (FQ) provide a more nuanced understanding of the stability of study outcomes [13,14]. These metrics quantify the smallest number of event changes required to alter a result from significant to nonsignificant (or vice versa), thereby offering a tangible measure of a study’s statistical robustness. Although fragility analyses have been applied in select areas of spine surgery, the literature concerning kyphoplasty remains limited [1517]. Given kyphoplasty’s widespread use in managing VCFs, it is crucial to critically assess the reliability of the RCT evidence that supports its clinical application.
The objective of this study was to conduct a comprehensive fragility analysis of RCTs evaluating the efficacy, and safety of kyphoplasty. By applying fragility metrics, we aimed to assess the robustness of existing trial data and identify potential vulnerabilities in the evidence base that informs clinical practice. We hypothesized that RCTs investigating kyphoplasty will demonstrate considerable statistical fragility, emphasizing the need for cautious interpretation of findings and the development of more rigorous study designs.

Methods

Search strategy and study selection

A comprehensive literature search was conducted in PubMed, Embase, and MEDLINE to identify RCTs published between January 1, 2010, and July 16, 2024. The search adhered to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [18] and used combinations of keywords and related terms, including “kyphoplasty,” “spine surgery,” “balloon,” and “percutaneous” to capture relevant studies. The complete search strategy is provided in the Supplement 1.
RCTs reporting dichotomous outcomes and including at least one intervention arm involving endoscopic lumbar decompression were eligible for inclusion. Exclusion criteria included studies not published in English, lacking full-text availability, or those based on cadaveric, biomechanical, in vitro, or animal models.
Two reviewers independently screened all titles and abstracts after duplicate removal using an online systematic review tool. Inter-reviewer agreement was assessed during a pilot screening phase to ensure consistency. Full texts of potentially eligible articles were subsequently reviewed independently by the same two reviewers. Discrepancies at either stage were resolved through consensus, and unresolved cases were adjudicated by a third reviewer. Inter-reviewer agreement was quantified throughout the process; for title and abstract screening, the proportionate agreement was 78.8% with a Cohen’s κ of 0.33, indicating fair agreement. For full-text screening, agreement was 81.1% with a κ of 0.62, indicating substantial agreement. All conflicts were resolved by consensus or adjudication to minimize bias in study selection. Data extraction was performed in duplicate using standardized forms, and disagreements were resolved by consensus. All excluded studies were documented, and the final list of included trials was verified by the senior author (S.K.C.).
The methodological quality of the included trials was assessed using the revised Cochrane Risk of Bias tool for randomized trials (RoB 2) (Table 1) [19]. As the primary objective was to assess the statistical properties of reported outcomes rather than clinical effectiveness, the review did not meet the criteria for PROSPERO registration. Only publicly available studies were analyzed; therefore, ethical approval was not required.

Data collection

From each included study, the following information was extracted: first author, year of publication, journal, treatment and control interventions, key outcomes, number of patients lost to follow-up, and available p-values. A summary of study characteristics is provided in Table 2. Outcomes were organized into thematic categories such as complications, new vertebral fractures, cement leakage, and pain. RCTs directly comparing kyphoplasty with vertebroplasty were analyzed separately. Data extraction was independently performed by two reviewers using standardized forms to ensure accuracy and consistency. The complete dataset used for fragility analysis is available in the Supplement 1.

Statistical fragility assessment

To evaluate the stability of reported findings, all dichotomous outcomes were analyzed using a two-tailed Fisher’s exact test with a significance threshold of p<0.05. For statistically significant results, the FI was calculated as the minimum number of event reversals required to increase the p-value above 0.05. For nonsignificant outcomes, the rFI was calculated by determining the number of event changes needed to achieve statistical significance (Fig. 1).
The FQ was derived by dividing the FI or rFI by the total sample size, representing the proportion of patients whose outcome status would need to change to alter statistical conclusions. Subgroup analyses were conducted according to outcome type and statistical significance. Results of the fragility analysis are reported as medians with interquartile ranges (IQRs).

Results

The initial database searches yielded 550 studies after duplicate removal. Following title and abstract screening, 150 studies were excluded. The remaining 159 full-text articles were assessed, and 36 RCTs published across 22 journals met the inclusion criteria for final analysis. A PRISMA flow diagram summarizing the screening process and literature selection is displayed in Fig. 2.
Across the 36 included RCTs, a total of 282 dichotomous outcomes were identified (Table 3), comprising 18 statistically significant (p<0.05) and 264 nonsignificant results. For all outcomes combined, the median FI was 4 (IQR, 4–5), and the median FQ was 0.020 (IQR, 0.013–0.040), suggesting that a change in outcomes for approximately 2% of participants could alter the statistical conclusions of these trials. Among the 18 significant outcomes, the median FI was 2 (IQR, 1–4), with a corresponding FQ of 0.015 (IQR, 0.011–0.029). In contrast, the 264 nonsignificant outcomes demonstrated a median FI of 5 (IQR, 4–5) and a median FQ of 0.020 (IQR, 0.013–0.041).
Subgroup analyses of the reported outcomes are summarized in Table 4. Trials comparing kyphoplasty with vertebroplasty represented the most fragile subgroup, with a median FI of 5 (IQR, 4–5) and a median FQ of 0.013 (IQR, 0.010–0.029), suggesting that the reversal of just 1.3% of patients would alter the statistical significance of results. The next most fragile subgroup was complications, with a median FI of 5 (IQR, 4–5) and a median FQ of 0.016 (IQR, 0.013–0.020). Fracture-related outcomes had a median FI of 5 (IQR, 4–6) and a median FQ of 0.020 (IQR, 0.013–0.038), while pain outcomes had a median FI of 4 (IQR, 3–5) and a median FQ of 0.025 (IQR, 0.014–0.039). The least fragile subgroup was cement leakage, with a median FI of 4 (IQR, 3–5) and a median FQ of 0.048 (IQR, 0.029–0.061). Complications were the most frequently reported outcomes (n=142), whereas pain-related outcomes were the least reported (n=24) (Table 4). Notably, the number of patients lost to follow-up exceeded the FI in 148 of the 282 outcomes (52.48%) (Table 3), suggesting that the inclusion of these patients could have reversed the statistical significance in more than half of the reported outcomes.

Discussion

Recent evaluations of spine surgery literature have revealed a recurring pattern in which statistically significant findings often lack robustness, calling into question the reliability of reported treatment effects [15,20]. Although VCFs are common among older adults and kyphoplasty represents a substantial healthcare expenditure, few studies have examined the statistical fragility of kyphoplasty-related outcomes [7,21]. The present study addresses this gap by systematically evaluating the fragility of RCTs evaluating kyphoplasty outcomes. Among available fragility metrics, the FQ provides a distinct advantage by accounting for sample size, thereby addressing a key limitation of the FI and the rFI, both of which disregard the proportion of affected patients [22]. Accordingly, median FQ was selected as the principal measure of outcome stability in this analysis. Our results demonstrated that kyphoplasty RCTs, particularly those comparing kyphoplasty with vertebroplasty and those assessing complication rates, exhibited considerable statistical fragility, suggesting that even minimal changes in patient outcomes could reverse the reported significance of many findings.
The median FQ across all kyphoplasty RCT outcomes was 0.02 (IQR, 0.013–0.040), indicating that a reversal in outcomes for only 2% of participants would be sufficient to alter statistical significance. This degree of fragility is comparable to that reported in a review of 32 Food and Drug Administration investigational device exemption trials in spine surgery, which demonstrated an average FQ of 0.027 [23]. Similarly, Muthu and Ramakrishnan [15] examined 70 spine surgery RCTs and observed an even lower median FQ of 0.0148 (IQR, 0–0.033). In that study, 27.1% of outcomes lost statistical significance when reanalyzed using two-sided Fisher’s exact tests, resulting in an FI of 0 for those trials. Comparable trends have been identified in orthopedic literature, where even FQs as high as 9.7% in distal radius fracture RCTs have been characterized as fragile [24]. Collectively, these findings suggest that the fragility observed in kyphoplasty RCTs aligns with the broader pattern of statistical instability reported across spine and orthopedic surgery research.
When examining the significance of outcomes, statistically significant results demonstrated a 25% lower median FQ of 0.015 (IQR, 0.011–0.029) compared with nonsignificant outcomes, which had a median FQ of 0.02 (IQR, 0.013–0.041). Moreover, more than half of all analyzed outcomes originated from trials in which the number of patients lost to follow-up exceeded the number required to reverse the result based on FQ. For instance, a multicenter RCT by Beall et al. [25] involving 285 patients reported fewer cardiovascular-related unplanned readmissions for a PEEK (polyether ether ketone) implant compared with balloon kyphoplasty. In that trial, 32 patients were lost to follow-up, whereas the reversal of outcomes in just 1.1 patients (0.39%) would have nullified the statistical significance. This example illustrates the risk of overinterpreting significant findings in studies with high attrition, particularly when statistical fragility is not considered. Such limitations may contribute to overly optimistic conclusions that may influence clinical recommendations and could promote interventions whose true efficacy or safety remains uncertain.
Our findings also highlight the intrinsic relationship between trial design characteristics and statistical fragility. Smaller sample sizes inherently reduce the number of event reversals needed to alter statistical significance, resulting in artificially low FIs that may exaggerate treatment effects. Similarly, high attrition amplified this vulnerability. In more than half of the analyzed outcomes, the number of patients lost to follow-up exceeded the FI, indicating that complete follow-up alone could have reversed the study’s conclusion. These methodological limitations (limited sample size and incomplete follow-up) collectively contribute to the fragility of kyphoplasty RCTs and may partially explain the variability observed across outcome subgroups, particularly those related to pain and complications. Future trials should therefore ensure adequate sample sizes, minimize loss to follow-up, and incorporate fragility analysis into the reporting of dichotomous outcomes.
We also observed considerable variability in statistical fragility across different outcome types. Pain and cement leakage outcomes demonstrated the highest median FQs but were associated with the lowest median FIs, indicating that even these relatively stable endpoints were susceptible to reversal with only a few event changes. Complication outcomes, by contrast, exhibited particularly low FQs. Even for cement leakage, among the most robust subgroup, the median FQ was only 0.048 (IQR, 0.029–0.061), suggesting persistent vulnerability. Notably, trials comparing kyphoplasty with vertebroplasty showed the lowest FQs overall. This finding is concerning, as many studies advocating for kyphoplasty’s clinical superiority rely heavily on comparative outcomes such as postoperative pain, cement leakage, or fracture incidence [2628].
An illustrative example is the trial by Vogl et al. [29], which compared a cement-directed kyphoplasty system with vertebroplasty in 77 patients with painful osteoporotic VCFs. The authors reported that kyphoplasty resulted in significantly less cement leakage. However, our analysis indicated that a change in outcomes for only 2.9% of patients would have nullified this finding. This observation raises concern regarding the reliability of frequently cited differences between kyphoplasty and vertebroplasty. Replication of these results in larger, methodologically rigorous trials is essential to determine whether the observed outcome differences are clinically meaningful or merely statistically unstable. Accordingly, there is a pressing need for more robust RCTs comparing kyphoplasty and vertebroplasty, given the significant implications these findings hold for treatment selection, patient safety, and postoperative quality of life.
As investigations continue to explore the comparative merits of nonoperative care, kyphoplasty, and vertebroplasty for managing VCFs, the present findings underscore the need to strengthen the methodological foundation of future research [30]. Including fragility measures such as the FQ alongside conventional p-values, coupled with concerted efforts to minimize attrition, can help reduce the overestimation of treatment effects and enhance the interpretability of outcomes. These methodological improvements will improve the reliability of future kyphoplasty RCTs and contribute to more robust and transparent evidence across procedural outcome research in spine surgery.
This study has several important limitations. First, the analysis was confined to dichotomous outcomes, as fragility metrics are best validated for categorical data. Continuous variables such as pain intensity scores or functional assessments may display different patterns of fragility and should be evaluated in future investigations. Second, there is no universally accepted threshold defining what constitutes a “fragile” FI or FQ within spine surgery, which may limit interpretability. Nonetheless, our emphasis on the FQ and its relationship with patient attrition provides a clinically meaningful context for understanding result stability. Third, because the fragility analysis relied exclusively on p-values, it did not account for other critical aspects of trial quality, including randomization procedures, inclusion criteria, blinding methods, or variability in patient populations, all of which are integral to the broader reliability and validity of study findings.
Another concern relates to follow-up reporting. Although the number of patients lost to follow-up was recorded for each study, few trials detailed the reasons for attrition, making it difficult to assess how missing data may have influenced fragility estimates. This issue reflects the limitations of primary study reporting rather than the fragility analysis itself, but highlights the need for greater transparency in clinical trial methodology. Lastly, by focusing exclusively on dichotomous outcomes, this analysis did not incorporate other data types frequently used to assess kyphoplasty efficacy, which may restrict the generalizability of our conclusions. Future work to adapt fragility methods for continuous outcomes would be valuable.

Conclusions

This review evaluated the statistical robustness of outcomes reported in RCTs of kyphoplasty and found that many key endpoints, particularly those comparing kyphoplasty with vertebroplasty, were highly susceptible to reversal with minor changes in patient outcomes, especially in studies with substantial loss to follow-up. The median FQ of 0.02 is consistent with broader concerns regarding the stability of findings in spine surgery research. These results highlight the need for cautious interpretation of existing evidence. Improving methodological rigor through larger sample sizes, better follow-up adherence, and the routine incorporation of fragility metrics like FQ may enhance the reliability of kyphoplasty research and contribute to more informed clinical decision-making for VCF management.

Key Points

  • Many kyphoplasty randomized controlled trial outcomes demonstrate statistical fragility, with a median fragility quotient of 0.02.

  • Statistically significant outcomes were more fragile than nonsignificant outcomes.

  • In over half of outcomes, the number lost to follow-up exceeded the fragility index.

  • Comparative trials between kyphoplasty and vertebroplasty were among the most statistically fragile

Notes

Conflict of Interest

Samuel Kang-Wook Cho, MD, FAAOS, reports board or committee membership with AAOS, the American Orthopaedic Association, AOSpine North America, the Cervical Spine Research Society, the North American Spine Society, and the Scoliosis Research Society; intellectual property royalties from Globus Medical; paid consultancy for Stryker; and fellowship support from Cerapedics and Globus Medical. Otherwise, no potential conflict of interest relevant to this article was reported.

Author Contributions

Conceptualization: AY, KSM, JS. Data curation: AY, KSM, JJH, PS, YA, AG. Data analysis: AY, KSM, JJH, PS, YA, AG. Investigation: AY, KSM, JJH, PS, YA, AG. Methodology: AY. Writing–original draft: AY, KSM, MK, JJH, PS, YA, AG. Writing–review and editing: AY, KSM, MK, JS, JJH, PS, YA, AG, SKWC. Supervision: AY, JS, NN, JC, JSK, SKWC. Final approval of the manuscript: all authors.

Supplementary Materials

Supplementary materials can be available from https://doi.org/10.31616/asj.2025.0282.
Supplement 1. Detailed outcomes of included studies.
asj-2025-0282-Supplement-1.csv

Fig. 1
Demonstration of statistical significance reversal using a 2×2 contingency table with a resulting fragility index=3. p-values were calculated using a two-tailed Fisher exact test.
asj-2025-0282f1.jpg
Fig. 2
PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow chart of the study selection process.
asj-2025-0282f2.jpg
Table 1
Cochrane risk-of-bias assessment for randomized controlled trials
Study Year Domain 1: Risk of bias arising from randomization process Domain 2: Risk of bias due to deviations from the intended interventions Domain 3: Risk of bias due to missing outcome data Domain 4: Risk of bias in measurement of the outcome Domain 5: Risk of bias in selection of the reported result Overall risk of bias
Chen et al. [S7] 2010 Low Low Low Low Low Low
Liu et al. [S16] 2010 Low Low Low Low Low Low
Chen et al. [S8] 2011 Low Low Low Low Low Low
Boonen et al. [S5] 2011 Low Low Low Low Low Low
Vanni et al. [S27] 2012 Low Low Low Low Low Low
Rebolledo et al. [S21] 2013 Low Low Low Low Low Low
Van et al. [S26] 2013 Low Low Low Low Low Low
Bastian et al. [S3] 2013 Low Low Low Low Low Low
He et al. [S10] 2013 Low Low Low Low Low Low
Korovessis et al. [15] 2013 Low Low Low Low Low Low
Vogl et al. [S28] 2013 Low Low Low Low Low Low
Werner et al. [S31] 2013 Low Low Low Low Low Low
Rebolledo et al. [S21] 2013 Low Low Low Low Low Low
Dohm et al. [S9] (2014) 2014 Low Low Low Low Low Low
Korovessis et al. [S14] 2014 Low Low Low Low Low Low
Dohm et al. [S9] 2014 Low Low Low Low Low Low
Tutton et al. [S25] 2015 Low Low Low Low Low Low
Ruatti et al. [S22] 2016 Low Low Low Low Low Low
Petersen et al. [S19] 2016 Low Low Low Low Low Low
Yan et al. [S32] 2016 Low Low Low Low Low Low
Masoudi et al. [S17] 2017 Low Low Low Low Low Low
Beall et al. [S4] 2017 Low Low Low Low Low Low
Masoudi et al. [S17] 2017 Low Low Low Low Low Low
Jin et al. [S13] 2018 Low Low Low Low Low Low
Wang et al. [S30] 2018 Low Low Low Low Low Low
He et al. [S11] 2019 Low Low Low Low Low Low
Qi et al. [S20] 2019 Low Low Low Low Low Low
Tang et al. [S24] 2019 Low Low Low Low Low Low
Noriega et al. [S18] 2019 Low Low Low Low Low Low
Ban et al. [S2] 2020 Low Low Low Low Low Low
Wang et al. [S29] 2021 Low Low Low Low Low Low
Hong et al. [S12] 2023 Low Low Low Low Low Low
Shi et al. [S23] 2023 Low Low Low Low Low Low
Chai et al. [S6] 2023 Low Low Low Low Low Low
Alkhatatba et al. [S1] 2024 Low Low Low Low Low Low
Zhou et al. [S33] 2024 Low Low Low Low Low Low
Table 2
Characteristics of included studies
Author Year Journal Sample size Lost to follow-up (mo)
Chen et al. [S7] 2010 Injury 58 0
Liu et al. [S16] 2010 Osteoporosis International 100 1
Chen et al. [S16] 2011 Spine 49 0
Boonen et al. [S5] 2011 Journal of Bone and Mineral Research 300 72
Vanni et al. [S27] 2012 Journal of Craniovertebral Junction and Spine 300 0
Rebolledo et al. [S21] 2013 Bone and Joint Journal 44 12
Van et al. [S26] 2013 Spine 300 0
Bastian et al. [S3] 2013 American Journal of Neuroradiology 112 5
He et al. [S10] 2013 European Spine Journal 43 0
Korovessis et al. [S15] 2013 Spine 185 17
Vogl et al. [S28] 2013 Spine 104 0
Werner et al. [S31] 2013 The Journal of Bone and Joint Surgery American Volume 100 0
Rebolledo et al. [S21] 2013 The Bone & Joint Journal 44 8
Dohm et al. [S9] 2014 American Journal of Neuroradiology 404 79
Korovessis et al. [S14] 2014 Spine 47 0
Dohm et al. [S9] 2014 American Journal of Neuroradiology 408 27
Tutton et al. [S25] 2015 Spine 300 47
Ruatti et al. [S22] 2016 European Spine Journal 143 18
Petersen et al. [S19] 2016 European Journal of Orthopaedic Surgery and Traumatology 80 0
Yan et al. [S32] 2016 Osteoporosis International 108 0
Masoudi et al. [S17] 2017 Clinical Spine Surgery 76 6
Beall et al. [S4] 2017 Pain Physician 285 32
Masoudi et al. [S17] 2017 Clinical Spine Surgery 76 6
Jin et al. [S13] 2018 Medical Science Monitor 64 23
Wang et al. [S30] 2018 Oncology Letters 86 0
He et al. [S11] 2019 Journal of Musculoskeletal Neuronal Interactions 66 9
Qi et al. [S20] 2019 World Neurosurgery 76 0
Tang et al. [S24] 2019 Journal of the College of Physicians and Surgeons Pakistan 178 0
Noriega et al. [S18] 2019 The Spine Journal 152 11
Ban et al. [S2] 2020 Medical Science Monitor 60 0
Wang et al. [S29] 2021 BMC Musculoskeletal Disorders 72 0
Hong et al. [S12] 2023 Journal of Orthopaedic Surgery and Research 80 5
Shi et al. [S23] 2023 The Spine Journal 90 5
Chai et al. [S6] 2023 Pain Physician 135 17
Alkhatatba et al. [S1] 2024 Orthopedics 44 3
Zhou et al. [S33] 2024 Clinical Interventions in Aging 94 0
Table 3
Statistical fragility of overall outcomes
No. of outcomes FI, median (IQR) FQ, median (IQR) # of patients lost to follow-up >FI (%)
All RCT outcomes 282 4 (4–5) 0.020 (0.013–0.040) 52.48
Significant outcomes (p<0.05) 18 2 (1–4) 0.015 (0.011–0.029) 44.44
Nonsignificant outcomes (p≥0.05) 264 5 (4–5) 0.02 (0.013–0.041) 50.38

FI, fragility index; IQR, interquartile range; RCT, randomized controlled trial.

Table 4
Statistical fragility of subgroup outcomes
No. of outcomes FI, median (IQR) FQ, median (IQR)
Complications 142 5 (4–5) 0.016 (0.013–0.020)
Pain 24 4 (3–5) 0.025 (0.014–0.039)
Cement leakage 48 4 (3–5) 0.048 (0.029–0.061)
Fracture 40 5 (4–6) 0.020 (0.013–0.038)
Kyphoplasty vs. vertebroplasty 65 5 (4–5) 0.013 (0.010–0.029)

FI, fragility index; IQR, interquartile range.

References

1. Abduljabbar FH, Al-Jurayyan A, Alqahtani S, et al. Does balloon kyphoplasty deliver more cement safely into osteoporotic vertebrae with compression fractures compared with vertebroplasty?: a study in vertebral analogues. Global Spine J 2015;5:300–7. https://doi.org/10.1055/s-0035-1546818
crossref pmid pmc
2. Aboud N, Eckardt N, Kamp MA, Senft C, Schwarz F. Occurrence of adjacent segment fractures after surgical treatment of an osteoporotic vertebral fracture: a retrospective comparison between two different treatment methods. Arch Orthop Trauma Surg 2023;143:2333–9. https://doi.org/10.1007/s00402-022-04434-0
crossref pmid pmc
3. Cavka M, Delimar D, Rezan R, et al. Complications of percutaneous vertebroplasty: a pictorial review. Medicina (Kaunas) 2023;59:1536. https://doi.org/10.3390/medicina59091536
crossref pmid pmc
4. Al-Nakshabandi NA. Percutaneous vertebroplasty complications. Ann Saudi Med 2011;31:294–7. https://doi.org/10.5144/0256-4947.2011.294
crossref pmid pmc
5. Cloft HJ, Jensen ME. Kyphoplasty: an assessment of a new technology. AJNR Am J Neuroradiol 2007;28:200–3.
pmid pmc
6. Pumberger M, Schitz F, Burger J, Schomig F, Putzier M, Palmowski Y. Kyphoplasty restores the global sagittal balance of the spine independently from pain reduction. Sci Rep 2020;10:8894. https://doi.org/10.1038/s41598-020-65798-0
crossref pmid pmc
7. Laratta JL, Shillingford JN, Lombardi JM, et al. Utilization of vertebroplasty and kyphoplasty procedures throughout the United States over a recent decade: an analysis of the Nationwide Inpatient Sample. J Spine Surg 2017;3:364–70. https://doi.org/10.21037/jss.2017.08.02
crossref pmid pmc
8. Jin J, Shen W. Long-term therapeutic effect of percutaneous kyphoplasty combined with & without back muscle rehabilitation exercise in elderly patients: a comparative study. Pak J Med Sci 2022;38:1595–600. https://doi.org/10.12669/pjms.38.6.5873
crossref pmid pmc
9. Hopkins TJ, Eggington S, Quinn M, Nichols-Ricker CI. Cost-effectiveness of balloon kyphoplasty and vertebroplasty versus conservative medical management in the USA. Osteoporos Int 2020;31:2461–71. https://doi.org/10.1007/s00198-020-05513-x
crossref pmid pmc
10. Van Meirhaeghe J, Bastian L, Boonen S, Ranstam J, Tillman JB, Wardlaw D. A randomized trial of balloon kyphoplasty and nonsurgical management for treating acute vertebral compression fractures: vertebral body kyphosis correction and surgical parameters. Spine (Phila Pa 1976) 2013;38:971–83. https://doi.org/10.1097/brs.0b013e31828e8e22
crossref pmid pmc
11. Subbiah V. The next generation of evidence-based medicine. Nat Med 2023;29:49–58. https://doi.org/10.1038/s41591-022-02160-z
crossref pmid
12. Dahiru T. P - value, a true test of statistical significance?: a cautionary note. Ann Ib Postgrad Med 2008;6:21–6. https://doi.org/10.4314/aipm.v6i1.64038
crossref pmid pmc
13. Feinstein AR. The unit fragility index: an additional appraisal of “statistical significance” for a contrast of two proportions. J Clin Epidemiol 1990;43:201–9. https://doi.org/10.1016/0895-4356(90)90186-s
crossref pmid
14. Walsh M, Srinathan SK, McAuley DF, et al. The statistical significance of randomized controlled trial results is frequently fragile: a case for a fragility index. J Clin Epidemiol 2014;67:622–8. https://doi.org/10.1016/j.jclinepi.2013.10.019
crossref pmid pmc
15. Muthu S, Ramakrishnan E. Fragility analysis of statistically significant outcomes of randomized control trials in spine surgery: a systematic review. Spine (Phila Pa 1976) 2021;46:198–208. https://doi.org/10.1097/brs.0000000000003645
crossref pmid
16. Ortiz-Babilonia CD, Gupta A, Cartagena-Reyes MA, et al. The statistical fragility of trials comparing cervical disc arthroplasty and anterior cervical discectomy and fusion: a meta analysis. Spine (Phila Pa 1976) 2024;49:708–14. https://doi.org/10.1097/brs.0000000000004756
crossref pmid
17. Megafu EC, Megafu MN, Nguyen JT, et al. The fragility of statistical findings in cervical disc arthroplasty: a systematic review of randomized controlled trials. Arch Orthop Trauma Surg 2024;144:2609–17. https://doi.org/10.1007/s00402-024-05353-y
crossref pmid pmc
18. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021;372:n71. https://doi.org/10.1136/bmj.n71
crossref pmid pmc
19. Sterne JA, Savovic J, Page MJ, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 2019;366:l4898. https://doi.org/10.1136/bmj.l4898
crossref pmid pmc
20. Evaniew N, Files C, Smith C, et al. The fragility of statistically significant findings from randomized trials in spine surgery: a systematic survey. Spine J 2015;15:2188–97. https://doi.org/10.1016/j.spinee.2015.06.004
crossref pmid
21. Goz V, Errico TJ, Weinreb JH, et al. Vertebroplasty and kyphoplasty: national outcomes and trends in utilization from 2005 through 2010. Spine J 2015;15:959–65. https://doi.org/10.1016/j.spinee.2013.07.245
crossref pmid
22. Condon TM, Sexton RW, Wells AJ, To MS. The weakness of fragility index exposed in an analysis of the traumatic brain injury management guidelines: a meta-epidemiological and simulation study. PLoS One 2020;15:e0237879. https://doi.org/10.1371/journal.pone.0237879
crossref pmid pmc
23. Proal JD, Moon AS, Kwon B. The fragility index and reverse fragility index of FDA investigational device exemption trials in spinal fusion surgery: a systematic review. Eur Spine J 2024;33:2594–603. https://doi.org/10.1007/s00586-024-08317-3
crossref pmid
24. Megafu M, Megafu E. The fragility of statistical findings in distal radius fractures: a systematic review of randomized controlled trials. Injury 2022;53:3352–6. https://doi.org/10.1016/j.injury.2022.07.017
crossref pmid
25. Beall DP, Coe JD, McIlduff M, et al. Serious adverse events associated with readmission through one year after vertebral augmentation with either a polyetheretherketone implant or balloon kyphoplasty. Pain Physician 2017;20:521–8. https://doi.org/10.36076/ppj.20.5.521
crossref pmid
26. Schofer MD, Efe T, Timmesfeld N, Kortmann HR, Quante M. Comparison of kyphoplasty and vertebroplasty in the treatment of fresh vertebral compression fractures. Arch Orthop Trauma Surg 2009;129:1391–9. https://doi.org/10.1007/s00402-009-0901-1
crossref pmid
27. Hiwatashi A, Westesson PL, Yoshiura T, et al. Kyphoplasty and vertebroplasty produce the same degree of height restoration. AJNR Am J Neuroradiol 2009;30:669–73. https://doi.org/10.3174/ajnr.a1442
crossref pmid pmc
28. Dohm M, Black CM, Dacre A, Tillman JB, Fueredi G. A randomized trial comparing balloon kyphoplasty and vertebroplasty for vertebral compression fractures due to osteoporosis. AJNR Am J Neuroradiol 2014;35:2227–36. https://doi.org/10.3174/ajnr.a4127
crossref pmid pmc
29. Vogl TJ, Pflugmacher R, Hierholzer J, et al. Cement directed kyphoplasty reduces cement leakage as compared with vertebroplasty: results of a controlled, randomized trial. Spine (Phila Pa 1976) 2013;38:1730–6. https://doi.org/10.1097/brs.0b013e3182a14d15
crossref pmid
30. Halvachizadeh S, Stalder AL, Bellut D, et al. Systematic review and meta-analysis of 3 treatment arms for vertebral compression fractures: a comparison of improvement in pain, adjacent-level fractures, and quality of life between vertebroplasty, kyphoplasty, and nonoperative management. JBJS Rev 2021;9:e21.00045. https://doi.org/10.2106/jbjs.rvw.21.00045
crossref


ABOUT
ARTICLE CATEGORY

Browse all articles >

BROWSE ARTICLES
EDITORIAL POLICY
FOR CONTRIBUTORS
Editorial Office
Department of Orthopedic Surgery, Asan Medical Center, University of Ulsan College of Medicine
88, Olympic-ro 43-gil, Songpa-gu, Seoul 05505, Korea
Tel: +82-2-3010-3530    Fax: +82-2-3010-8555    E-mail: asianspinejournal@gmail.com                
Korean Society of Spine Surgery
82, Gumi-ro 173beon-gil, Bundang-gu, Seongnam-si, Gyeonggi-do, 13620, Korea
Tel: +82-31-966-3413    Fax: +82-2-831-3414    E-mail: office@spine.or.kr                

Copyright © 2026 by Korean Society of Spine Surgery.

Developed in M2PI

Close layer
prev next