Abstract
KinectFusion enables a user holding and moving a standard Kinect camera to rapidly create detailed 3D reconstructions of an indoor scene. Only the depth data from Kinect is used to track the 3D pose of the sensor and reconstruct, geometrically precise, 3D models of the physical scene in real-time.
The capabilities of KinectFusion, as well as the novel GPU-based pipeline are described in full. Uses of the core system for low-cost handheld scanning, and geometry-aware augmented reality and physics-based interactions are shown.
Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction. These extensions are used to enable real-time multi-touch interactions anywhere, allowing any planar or non-planar reconstructed physical surface to be appropriated for touch.
Accompanying Video
ACM Digital Library
SIGGRAPH '11 ACM SIGGRAPH 2011 Talks, 2011
UIST '11 Proceedings of the 24th annual ACM symposium on User interface software and technology, 2011
ISMAR '11 Proceedings of the 2011 10th IEEE International Symposium on Mixed and Augmented Reality