Uncertainty Quantification of Central Canal Stenosis Deep Learning Classifier From Lumbar Sagittal T2-Weighted MRI

Jan 2026

Brenzikofer A.*, Maria Monzon*, Galbusera F., Manjaly Z.M., Cina A., Jutzeler C.R.

* These authors contributed equally


Abstract

Background: Accurate assessment of the severity of central canal stenosis (CCS) on lumbar spine MRI is critical for clinical decision-making. We evaluated deep learning models for automated CCS grading on sagittal T2-weighted MRI, focusing on uncertainty quantification to improve clinical reliability.

Methods: Using a retrospective cohort from the LumbarDISC dataset (1974 patients), we compared multiple deep learning architectures for three-level CCS classification (normal/mild, moderate, severe). We applied Monte Carlo (MC) dropout and test-time augmentation (TTA) to quantify prediction uncertainty and thereby assess model confidence.
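As a rough illustration of the two uncertainty techniques named in the Methods, the sketch below collects repeated predictions and summarizes their spread as predictive entropy. It is not the paper's implementation: the linear scorer `toy_logits`, the weights, the feature vector, and the list-based "augmentations" are all hypothetical stand-ins for a real network and real image transforms.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def toy_logits(features, weights, drop_p=0.0, rng=None):
    """Hypothetical linear 3-class scorer standing in for a real network.

    With drop_p > 0, inverted dropout is applied to the input features,
    so repeated calls produce different outputs.
    """
    if drop_p > 0.0:
        keep = 1.0 - drop_p
        features = [x / keep if rng.random() < keep else 0.0 for x in features]
    return [sum(w * x for w, x in zip(row, features)) for row in weights]

def mc_dropout_samples(features, weights, passes=30, drop_p=0.5, seed=0):
    """MC dropout: keep dropout active at inference and repeat the forward pass."""
    rng = random.Random(seed)
    return [softmax(toy_logits(features, weights, drop_p, rng)) for _ in range(passes)]

def tta_samples(features, weights, augmentations):
    """TTA: one deterministic forward pass per augmented copy of the input."""
    return [softmax(toy_logits(aug(features), weights)) for aug in augmentations]

def predictive_entropy(samples):
    """Entropy (in nats) of the mean predicted class distribution."""
    n = len(samples)
    mean = [sum(s[k] for s in samples) / n for k in range(len(samples[0]))]
    return -sum(p * math.log(p) for p in mean if p > 0.0)

# Illustrative values only; none of these come from the study.
WEIGHTS = [[1.0, -0.5, 0.2, 0.0], [0.1, 0.8, -0.3, 0.4], [-0.6, 0.2, 0.9, 0.3]]
FEATURES = [0.7, 0.1, 0.4, 0.9]
AUGMENTATIONS = [
    lambda x: x,                    # identity
    lambda x: x[::-1],              # toy stand-in for a flip
    lambda x: [0.9 * v for v in x]  # toy stand-in for an intensity change
]

mc = mc_dropout_samples(FEATURES, WEIGHTS)
tta = tta_samples(FEATURES, WEIGHTS, AUGMENTATIONS)
print("MC dropout entropy:", round(predictive_entropy(mc), 3))
print("TTA entropy:", round(predictive_entropy(tta), 3))
```

Entropy of the mean prediction is only one possible summary; variance across passes or mutual information are common alternatives, and the abstract does not state which measure the authors used.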

Results: The fine-tuned Spine Grading Network (SGN) achieved a balanced accuracy of 79.4% and a macro F1 score of 68.8%, with per-class accuracies of 71.3% for moderate and 78.5% for severe stenosis. MC dropout revealed an increase in uncertainty predominantly in moderate and severe cases, while TTA uncertainty was higher for mild stenosis.
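For readers less familiar with the reported metrics, here is a minimal sketch of how balanced accuracy and macro F1 are commonly computed for a three-class problem. It assumes that "per-class accuracy" means per-class recall, which the abstract does not confirm, and the labels below are made up for illustration.

```python
def class_counts(y_true, y_pred, label):
    """Return (true positives, false positives, false negatives) for one class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
    return tp, fp, fn

def balanced_accuracy(y_true, y_pred, labels):
    """Unweighted mean of per-class recall."""
    recalls = []
    for label in labels:
        tp, _, fn = class_counts(y_true, y_pred, label)
        recalls.append(tp / (tp + fn) if (tp + fn) else 0.0)
    return sum(recalls) / len(labels)

def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1, with F1 = 2TP / (2TP + FP + FN)."""
    scores = []
    for label in labels:
        tp, fp, fn = class_counts(y_true, y_pred, label)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(labels)

# Hypothetical labels: 0 = normal/mild, 1 = moderate, 2 = severe.
LABELS = [0, 1, 2]
y_true = [0, 0, 0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 1, 1, 2, 2, 2]

print("balanced accuracy:", balanced_accuracy(y_true, y_pred, LABELS))
print("macro F1:", round(macro_f1(y_true, y_pred, LABELS), 4))
```

Because both metrics average over classes without weighting by class size, they can diverge noticeably on imbalanced data, which is one plausible reason the reported balanced accuracy and macro F1 differ.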

Conclusion: Deep learning (DL)-based CCS grading demonstrates potential to assist radiologists by providing rapid, standardized evaluations. Incorporating uncertainty quantification offers a safeguard to flag ambiguous cases, thus supporting clinical trust and facilitating safer integration of AI tools into the interpretation of spine MRI.
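One simple way such a safeguard could work is to route any case whose uncertainty exceeds a cutoff to a radiologist. The sketch below assumes normalized predictive entropy as the uncertainty score and a cutoff of 0.5; both are hypothetical choices, as are the case identifiers and probabilities, and none of them are taken from the paper.

```python
import math

def normalized_entropy(probs):
    """Entropy of a class distribution scaled to [0, 1] by log(number of classes)."""
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))

def triage(cases, threshold=0.5):
    """Split (case_id, probs) pairs into auto-reported and flagged-for-review IDs.

    The threshold is a hypothetical value; in practice it would be tuned on
    validation data against an acceptable review workload.
    """
    auto, flagged = [], []
    for case_id, probs in cases:
        if normalized_entropy(probs) > threshold:
            flagged.append(case_id)
        else:
            auto.append(case_id)
    return auto, flagged

# Made-up averaged class probabilities (normal/mild, moderate, severe).
cases = [
    ("case_a", [0.96, 0.03, 0.01]),  # confident prediction
    ("case_b", [0.40, 0.35, 0.25]),  # ambiguous prediction
]

auto, flagged = triage(cases)
print("auto-reported:", auto)
print("flagged for review:", flagged)
```

A lower threshold sends more cases to manual review and a higher one fewer, so the cutoff trades reader workload against the risk of accepting an uncertain grade.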

Type

Journal article

Publication

JOR Spine

Last updated on Jan 2026

Medical-Imaging Uncertainty Quantification Spine

Authors

Maria Monzon (she/her)

Computer Vision & Medical AI Researcher

PhD candidate at ETH Zurich developing robust and trustworthy deep learning for medical image analysis — spine and cardiac MRI, multimodal biomedical data, and uncertainty quantification. Previously a computer-vision researcher at BASF, where I deployed models to production in regulated, GLP-certified environments. I care about efficient code and reproducible research.
