If the interactive viewer does not appear, please use Safari or Chrome. Use your mouse to rotate and zoom in. The visualization shows the intraoperative point cloud and the preoperative vertebral mesh registered with the network prediction, with colors indicating the point-to-point distance to the ground truth.
Purpose: Intraoperative navigation in spine surgery demands millimeter-level accuracy. Current systems based on intraoperative radiographic imaging and bone-anchored markers, are invasive, radiation-intensive and workflow disruptive. Recent markerless RGB-D registration methods offer a promising alternative, but existing approaches rely on weak segmentation labels to isolate relevant anatomical structures, which can propagate errors throughout registration.
Methods: We present End2Reg an end-to-end deep learning framework that jointly optimizes segmentation and registration, eliminating the need for weak segmentation labels and manual steps. The network learns segmentation masks specifically optimized for registration, guided solely by the registration objective without direct segmentation supervision.
Results: The proposed framework achieves state-of-the-art performance on ex- and in-vivo benchmarks, reducing median Target Registration Error by 32% to 1.83 mm and mean Root Mean Square Error by 45% to 3.95 mm, respectively. An ablation study confirms that end-to-end optimization significantly improves registration accuracy.
Conclusion: The presented end-to-end RGB-D registration pipeline removes dependency on weak labels and manual steps, advancing towards fully automatic, markerless intraoperative navigation.
The framework consists of a segmentation module and a registration module, taking as input the intraoperative RGB-D point cloud and the preoperative point cloud. The network is jointly optimized: the registration loss is backpropagated through both modules, with a Straight-Through Gumbel–Softmax estimator enabling gradients to pass through the discrete segmentation step. The network outputs the rigid transformation T, aligning the preoperative model to the intraoperative scene.
Interactive slider: visualization of three different specimens from the SpineDepth dataset.
Interactive slider: visualization of two different specimens from the SpineAlign dataset.
@article{ my article
}