End2Reg: Learning Task-Specific Segmentation for Markerless Registration in Spine Surgery

Anonymized Affiliations

Registration result on one ex-vivo human specimen from the SpineDepth dataset: interactive view.


If the interactive viewer does not appear, please use Safari or Chrome. Use your mouse to rotate and zoom in. The visualization shows the intraoperative point cloud and the preoperative vertebral mesh registered with the network prediction, with colors indicating the point-to-point distance to the ground truth.

Abstract

Intraoperative navigation in spine surgery demands millimeter-level accuracy, which is currently achieved through radiation-intensive intraoperative imaging and bone-anchored markers that are invasive and disrupt surgical workflow. Markerless RGB-D registration methods offer a promising alternative. However, existing approaches rely on weak segmentation labels to isolate relevant anatomical structures, potentially propagating errors through the registration process. We present End2Reg, an end-to-end deep learning framework that jointly optimizes segmentation and registration, eliminating the need for segmentation labels and manual steps. The network learns task-specific segmentation masks optimized for registration, guided solely by the registration objective without explicit segmentation supervision. End2Reg achieves state-of-the-art performance on ex- and in-vivo benchmarks, reducing median Target Registration Error by 32% and mean Root Mean Square Error by 61%, while maintaining robust performance under partial occlusions. Ablation results confirm that end-to-end optimization significantly improves registration accuracy. Overall, End2Reg advances towards fully automatic, markerless intraoperative navigation.
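The two reported metrics can be sketched as follows. This is a minimal illustration, not the paper's evaluation code: the function names and the toy landmarks are hypothetical, and it assumes Target Registration Error is measured as the distance between landmarks mapped by the predicted and ground-truth rigid transforms (4x4 homogeneous matrices).

```python
import numpy as np

def target_registration_error(landmarks, T_pred, T_gt):
    """Per-landmark TRE: distance between each point mapped by the
    predicted vs. the ground-truth rigid transform (4x4 matrices)."""
    pts_h = np.c_[landmarks, np.ones(len(landmarks))]     # (N, 4) homogeneous
    diff = (pts_h @ T_pred.T - pts_h @ T_gt.T)[:, :3]     # displacement per point
    return np.linalg.norm(diff, axis=1)                   # (N,) errors

def rmse(errors):
    """Root Mean Square Error over a set of per-point errors."""
    return float(np.sqrt(np.mean(np.square(errors))))

# Toy example: the predicted transform is off by a 1 mm translation along x,
# so every landmark is displaced by exactly 1 mm.
T_gt = np.eye(4)
T_pred = np.eye(4)
T_pred[0, 3] = 1.0
landmarks = np.random.default_rng(0).normal(size=(5, 3))
tre = target_registration_error(landmarks, T_pred, T_gt)
```

The paper reports the median of the per-landmark TRE and the mean RMSE across cases; both reduce to aggregations over arrays like `tre` above.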

Model architecture.


The framework consists of a segmentation module and a registration module, which take as input the intraoperative RGB-D point cloud and the preoperative point cloud. The network is jointly optimized: the registration loss is backpropagated through both modules, with a Straight-Through Gumbel–Softmax estimator enabling gradients to pass through the discrete segmentation step. The network outputs a rigid transformation T that aligns the preoperative model to the intraoperative scene.
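The straight-through trick described above can be sketched in a few lines of PyTorch. This is a simplified illustration, not the paper's implementation: the per-point logits, the binary keep/discard decision, and the downstream loss are all hypothetical stand-ins for the actual segmentation and registration modules.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
num_points = 6

# Hypothetical per-point segmentation logits: [discard, keep] scores.
logits = torch.randn(num_points, 2, requires_grad=True)

# Straight-Through Gumbel-Softmax: the forward pass emits a hard one-hot
# sample, while the backward pass uses the soft probabilities, so gradients
# can flow through the discrete segmentation decision.
one_hot = F.gumbel_softmax(logits, tau=1.0, hard=True)    # (num_points, 2)
mask = one_hot[:, 1]                                      # 1.0 where a point is kept

# Stand-in for the registration objective: any loss computed on the masked
# points still backpropagates to the segmentation logits.
points = torch.randn(num_points, 3)
kept_centroid = (mask.unsqueeze(1) * points).sum(0) / mask.sum().clamp(min=1.0)
loss = kept_centroid.pow(2).sum()
loss.backward()
```

After `loss.backward()`, `logits.grad` is populated even though the mask applied in the forward pass was strictly binary; this is what lets the registration loss supervise the segmentation without any segmentation labels.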

Results on the SpineDepth dataset (ex-vivo)

Before and after: original point cloud and unsupervised segmentation output.


Interactive slider: visualization of three different specimens from the SpineDepth dataset.

Registration results


Results on the SpineAlign dataset (in-vivo)

Before and after: original point cloud and unsupervised segmentation output.


Interactive slider: visualization of two different specimens from the SpineAlign dataset.

Registration results


BibTeX

@article{end2reg,
      title  = {End2Reg: Learning Task-Specific Segmentation for Markerless Registration in Spine Surgery},
      author = {Anonymized}
}