We combine detection and tracking techniques to achieve robust 3D motion recovery of people seen from arbitrary viewpoints by a single, potentially moving camera. We first detect key postures, which can be done reliably, then use a motion model to infer 3D poses between consecutive detections, and finally refine these poses over the whole sequence using a generative model. We demonstrate our approach on golf motions filmed with a static camera and on walking motions acquired with a potentially moving one. We show that our approach, although monocular, is both metrically accurate, because it integrates information over many frames, and robust, because it can recover from a few misdetections.
Mohamed Farhat, Davide Bernardo Preso, Armand Baptiste Sieber
Dario Floreano, Fabrizio Schiano, Maxim Pavliv, Giuseppe Loianno
Pascal Fua, Pavan P Ramdya, Adám Gosztolai, Victor Lobato Rios, Helge Jochen Rhodin, Semih Günel, Daniel Eduardo Morales Garza, Marco Pietro Abrate