Publications related to Subspace Regularized Dynamic Time Warping for Spoken Query Detection

Deep Learning Theory Through the Lens of Diagonal Linear Networks

In this PhD manuscript, we explore optimisation phenomena which occur in complex neural networks through the lens of

2

-layer diagonal linear networks. This rudimentary architecture, which consists of a two layer feedforward linear network with a diagonal ...

EPFL2024

On the number of regions of piecewise linear neural networks

Michaël Unser, Alexis Marie Frederic Goujon

Many feedforward neural networks (NNs) generate continuous and piecewise-linear (CPWL) mappings. Specifically, they partition the input domain into regions on which the mapping is affine. The number of these so-called linear regions offers a natural metric ...

2024

Fast and Future: Towards Efficient Forecasting in Video Semantic Segmentation

Evann Pierre Guy Courdier

Deep learning has revolutionized the field of computer vision, a success largely attributable to the growing size of models, datasets, and computational power.Simultaneously, a critical pain point arises as several computer vision applications are deployed ...

EPFL2024

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification

Informative sample selection in an active learning (AL) setting helps a machine learning system attain optimum performance with minimum labeled samples, thus reducing annotation costs and boosting performance of computer-aided diagnosis systems in the pres ...

Amsterdam2024

Performing and Detecting Backdoor Attacks on Face Recognition Algorithms

Alexander Carl Unnervik

The field of biometrics, and especially face recognition, has seen a wide-spread adoption the last few years, from access control on personal devices such as phones and laptops, to automated border controls such as in airports. The stakes are increasingly ...

EPFL2024

Network time series forecasting in photovoltaics power production

Jelena Simeunovic

Accurate forecasting of photovoltaic (PV) power production is crucial for the integration of more renewable energy sources into the power grid. PV power production is highly intermittent, due to the stochastic cloud behaviour and cloud dynamics. Previous w ...

EPFL2024

Driving and suppressing the human language network using large language models

Martin Schrimpf

Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict t ...

Berlin2024

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Seyed Mohammad Mahdi Johari

Recent advancements in deep learning have revolutionized 3D computer vision, enabling the extraction of intricate 3D information from 2D images and video sequences. This thesis explores the application of deep learning in three crucial challenges of 3D com ...

EPFL2024

Spectral Estimators for High-Dimensional Matrix Inference

Farzad Pourkamali

A key challenge across many disciplines is to extract meaningful information from data which is often obscured by noise. These datasets are typically represented as large matrices. Given the current trend of ever-increasing data volumes, with datasets grow ...

EPFL2024

Match Normalization: Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World

Mathieu Salzmann, Zheng Dang

In this work, we tackle the task of estimating the 6D pose of an object from point cloud data. While recent learning-based approaches have shown remarkable success on synthetic datasets, we have observed them to fail in the presence of real-world data. We ...

Ieee Computer Soc2024

Subspace Regularized Dynamic Time Warping for Spoken Query Detection

Graph Chatbot

Chat with Graph Search

Deep Learning Theory Through the Lens of Diagonal Linear Networks

On the number of regions of piecewise linear neural networks

Fast and Future: Towards Efficient Forecasting in Video Semantic Segmentation

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification

Performing and Detecting Backdoor Attacks on Face Recognition Algorithms

Network time series forecasting in photovoltaics power production

Driving and suppressing the human language network using large language models

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Spectral Estimators for High-Dimensional Matrix Inference

Match Normalization: Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World

Deep Learning Theory Through the Lens of Diagonal Linear Networks

Network time series forecasting in photovoltaics power production

Driving and suppressing the human language network using large language models

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Spectral Estimators for High-Dimensional Matrix Inference

On the number of regions of piecewise linear neural networks

Fast and Future: Towards Efficient Forecasting in Video Semantic Segmentation

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification

Performing and Detecting Backdoor Attacks on Face Recognition Algorithms

Match Normalization: Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World