Estimating depth from a sequence of posed RGB images is a fundamental computer vision task, with applications in       augmented reality, path

DoubleTake: Geometry Guided Depth Estimation

submited by
Style Pass
2024-06-28 10:00:02

Estimating depth from a sequence of posed RGB images is a fundamental computer vision task, with applications in augmented reality, path planning etc. Prior work typically makes use of previous frames in a multi view stereo framework, relying on matching textures in a local neighborhood. In contrast, our model leverages historical predictions by giving the latest 3D geometry data as an extra input to our network. This self-generated geometric hint can encode information from areas of the scene not covered by the keyframes and it is more regularized when compared to individual predicted depth maps for previous frames. We introduce a Hint MLP which combines cost volume features with a hint of the prior geometry, rendered as a depth map from the current camera location, together with a measure of the confidence in the prior geometry. We demonstrate that our method, which can run at interactive speeds, achieves state-of-the-art estimates of depth and 3D scene reconstruction in offline , incremental , and revisit evaluation scenarios.

Our key contribution is the injection of cheaply-available metadata into the feature volume. Each volumetric cell is then reduced in parallel with an MLP into a feature map before input into a 2D cost volume encoder-decoder. We also make use of an image encoder specifically used to enforce a strong image prior when propagating and correcting depth estimates from the cost volume throughout the frame in the cost volume encoder-decoder. This formulation is flexible and allows for three different operating modes: 1) incremental for online depth and reconstruction at 76.6ms per frame, 2) offline for high-quality offline depth and reconstruction at 13.8s per scene, and 3) revisit for depth estimation when revisiting locations after a long absence at 62.8ms per frame.

Leave a Comment