A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings

Predicting where a person is looking is a complex task. It requires understanding not only the person's gaze and the scene content, but also the 3D scene structure and the person's situation (are they manipulating an object? interacting with or observing others? attentive?), in order to detect obstructions in the line of sight or to apply the attention priors that humans typically rely on when observing others. In this paper, we hypothesize that such priors can be better identified and leveraged by exploiting explicitly derived multimodal cues such as depth and pose. We therefore propose a modular multimodal architecture that combines these cues using an attention mechanism. The architecture can naturally be used in privacy-sensitive settings such as surveillance and health, where personally identifiable information cannot be released. We perform extensive experiments on the public GazeFollow and VideoAttentionTarget datasets, obtaining state-of-the-art performance and demonstrating very competitive results in the privacy-sensitive setting.
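
The idea of combining per-modality cues with an attention mechanism can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract gives no architectural details, so the function name `fuse_modalities`, the use of scaled dot-product scores, the `query` vector, and the modality names are all assumptions made for illustration. The `drop` argument shows one hypothetical way a modular design could omit a modality (for example the RGB image) in a privacy-sensitive setting; whether the paper does exactly this is also an assumption.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    shifted = x - np.max(x)
    exp = np.exp(shifted)
    return exp / np.sum(exp)


def fuse_modalities(features, query, drop=()):
    """Attention-weighted fusion of per-modality embeddings (illustrative only).

    features: dict mapping a modality name to a 1-D embedding of size d
    query:    1-D vector of size d used to score each modality (hypothetical)
    drop:     modality names to exclude, e.g. ("image",) when RGB is withheld
    Returns the fused embedding and the attention weight of each kept modality.
    """
    names = [name for name in features if name not in drop]
    if not names:
        raise ValueError("at least one modality must remain after dropping")
    stacked = np.stack([features[name] for name in names])  # shape (m, d)
    # Scaled dot-product score of each modality against the query.
    scores = stacked @ query / np.sqrt(stacked.shape[1])
    weights = softmax(scores)                                # shape (m,)
    fused = weights @ stacked                                # shape (d,)
    return fused, dict(zip(names, weights))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d = 8
    # Random stand-ins for embeddings produced by modality-specific encoders.
    feats = {name: rng.normal(size=d) for name in ("image", "depth", "pose")}
    q = rng.normal(size=d)

    fused_all, w_all = fuse_modalities(feats, q)
    fused_private, w_private = fuse_modalities(feats, q, drop=("image",))
    print(sorted(w_all), sorted(w_private))
```

Because each modality enters only through its own embedding and a softmax weight, removing one modality changes the number of rows being fused but not the shape of the output, which is what makes this kind of fusion easy to reuse when some inputs are unavailable.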

Differentially private multi-agent constraint optimization

Optimization Algorithms for Decentralized, Distributed and Collaborative Machine Learning

PRO-Face C: Privacy-Preserving Recognition of Obfuscated Face via Feature Compensation
