Publication

CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION

Related publications (61)

Temporal Spiking Recurrent Neural Network for Action Recognition

Wei Wang, Siyuan Hao

In this paper, we propose a novel temporal spiking recurrent neural network (TSRNN) to perform robust action recognition in videos. The proposed TSRNN employs a novel spiking architecture which utilizes the local discriminative features from high-confidenc ...

2019

Exploring Neural Network Models for the Classification of Students in Highly Interactive Environments

Open-ended learning environments (OELEs) allow students to freely interact with the content and to discover important principles and concepts of the learning domain on their own. However, only some students possess the necessary skills for efficient and ef ...

International Educational Data Mining Society, 2019

Evaluating and Interpreting Deep Convolutional Neural Networks via Non-negative Matrix Factorization

Edo Collins

With ever greater computational resources and more accessible software, deep neural networks have become ubiquitous across industry and academia. Their remarkable ability to generalize to new samples defies the conventional view, which holds that complex, ...

EPFL, 2019

LAF-Net: Locally Adaptive Fusion Networks for Stereo Confidence Estimation

Seungryong Kim

We present a novel method that estimates confidence map of an initial disparity by making full use of tri-modal input, including matching cost, disparity, and color image through deep networks. The proposed network, termed as Locally Adaptive Fusion Networ ...

IEEE COMPUTER SOC, 2019

Processing Megapixel Images with Deep Attention-Sampling Models

François Fleuret, Angelos Katharopoulos

Existing deep architectures cannot operate on very large signals such as megapixel images due to computational and memory constraints. To tackle this limitation, we propose a fully differentiable end-to-end trainable model that samples and processes only a ...

Idiap, 2019

Implicit discourse relation classification with syntax-aware contextualized word representations

James Henderson

Automatically identifying implicit discourse relations requires an in-depth semantic understanding of the text fragments involved in such relations. While early work investigated the usefulness of different classes of input features, current state-of-the-a ...

2019

Zero-Learning Fast Medical Image Fusion

Sabine Süsstrunk, Fayez Lahoud

Clinical applications, such as image-guided surgery and noninvasive diagnosis, rely heavily on multi-modal images. Medical image fusion plays a central role by integrating information from multiple sources into a single, more understandable output. We prop ...

2019

Fast Language Adaptation Using Phonological Information

Hervé Bourlard, Philip Neil Garner, Sibo Tong

Phoneme-based multilingual connectionist temporal classification (CTC) model is easily extensible to a new language by concatenating parameters of the new phonemes to the output layer. In the present paper, we improve cross-lingual adaptation in the contex ...

ISCA-INT SPEECH COMMUNICATION ASSOC, 2018

Is lack of attention necessary for task-irrelevant perceptual learning?

Michael Herzog, Lukasz Grzeczkowski, Jessica Galliussi

Perceptual learning can occur for a feature irrelevant to the training task, when it is sub-threshold and outside of the focus of attention (task-irrelevant perceptual learning, TIPL); however, TIPL does not occur when the task-irrelevant feature is supra- ...

PERGAMON-ELSEVIER SCIENCE LTD, 2018

Generating Steganographic Text with LSTMs

Martin Jaggi, Tina Fang

Motivated by concerns for user privacy, we design a steganographic system ("stegosystem") that enables two users to exchange encrypted messages without an adversary detecting that such an exchange is taking place. We propose a new linguistic stegosystem ba ...

2017