DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation

We present DepthInSpace, a self-supervised deep-learning method for depth estimation using a structured-light camera. The design of this method is motivated by the commercial use case of embedded depth sensors in nowadays smartphones. We first propose to use estimated optical flow from ambient information of multiple video frames as a complementary guide for training a single-frame depth estimation network, helping to preserve edges and reduce over-smoothing issues. Utilizing optical flow, we also propose to fuse the data of multiple video frames to get a more accurate depth map. In particular, fused depth maps are more robust in occluded areas and incur less in flying pixels artifacts. We finally demonstrate that these more precise fused depth maps can be used as self-supervision for fine-tuning a single-frame depth estimation network to improve its performance. Our models' effectiveness is evaluated and compared with state-of-the-art models on both synthetic and our newly introduced real datasets.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation

Graph Chatbot

Chat with Graph Search

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Aggregating Spatial and Photometric Context for Photometric Stereo

Robust machine learning for neuroscientific inference

Aggregating Spatial and Photometric Context for Photometric Stereo

Robust machine learning for neuroscientific inference

Advancing Self-Supervised Deep Learning for 3D Scene Understanding