Publications de Weipeng He | EPFL Graph Search

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction

Jean-Marc Odobez, Petr Motlicek, Weipeng He

This paper introduces a novel approach for extracting speaker embeddings from audio mixtures of multiple overlapping voices. This approach is based on a multi-task neural network. The network first extracts a latent feature for each direction. This feature ...

ISCA-INT SPEECH COMMUNICATION ASSOC2021

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training

Jean-Marc Odobez, Petr Motlicek, Weipeng He

Despite the recent success of deep neural network-based approaches in sound source localization, these approaches suffer the limitations that the required annotation process is costly, and the mismatch between the training and test conditions undermines th ...

IEEE2019

Deep Neural Networks for Multiple Speaker Detection and Localization

Jean-Marc Odobez, Petr Motlicek, Weipeng He

We propose to use neural networks for simultaneous detection and localization of multiple sound sources in human-robot interaction. In contrast to conventional signal processing techniques, neural network-based sound source localization methods require few ...

2018

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network

Jean-Marc Odobez, Petr Motlicek, Weipeng He

We propose a novel multi-task neural network-based approach for joint sound source localization and speech/non-speech classification in noisy environments. The network takes raw short time Fourier transform as input and outputs the likelihood values for th ...

ISCA-INT SPEECH COMMUNICATION ASSOC2018

Chattez avec Graph Search

Posez n’importe quelle question sur les cours, conférences, exercices, recherches, actualités, etc. de l’EPFL ou essayez les exemples de questions ci-dessous.

AVERTISSEMENT : Le chatbot Graph n'est pas programmé pour fournir des réponses explicites ou catégoriques à vos questions. Il transforme plutôt vos questions en demandes API qui sont distribuées aux différents services informatiques officiellement administrés par l'EPFL. Son but est uniquement de collecter et de recommander des références pertinentes à des contenus que vous pouvez explorer pour vous aider à répondre à vos questions.

Weipeng He

Deep Learning Approaches for Auditory Perception in Robotics

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training

Deep Neural Networks for Multiple Speaker Detection and Localization

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network

Graph Chatbot

Chattez avec Graph Search

Deep Learning Approaches for Auditory Perception in Robotics

Multi-task Neural Network for Robust Multiple Speaker Embedding Extraction

Adaptation of Multiple Sound Source Localization Neural Networks with Weak Supervision and Domain-Adversarial Training

Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network

Deep Neural Networks for Multiple Speaker Detection and Localization