Publications related to RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced Transformers

Aggregating Spatial and Photometric Context for Photometric Stereo

Photometric stereo, a computer vision technique for estimating the 3D shape of objects through images captured under varying illumination conditions, has been a topic of research for nearly four decades. In its general formulation, photometric stereo is an ...

EPFL, 2024

Random matrix methods for high-dimensional machine learning models

Antoine Philippe Michel Bodin

In the rapidly evolving landscape of machine learning research, neural networks stand out with their ever-expanding number of parameters and reliance on increasingly large datasets. The financial cost and computational resources required for the training p ...

EPFL, 2024

Deep Learning Theory Through the Lens of Diagonal Linear Networks

Scott William Pesme

In this PhD manuscript, we explore optimisation phenomena which occur in complex neural networks through the lens of 2-layer diagonal linear networks. This rudimentary architecture, which consists of a two layer feedforward linear network with a diagonal ...

EPFL, 2024

Infusing structured knowledge priors in neural models for sample-efficient symbolic reasoning

Mattia Atzeni

The ability to reason, plan and solve highly abstract problems is a hallmark of human intelligence. Recent advancements in artificial intelligence, propelled by deep neural networks, have revolutionized disciplines like computer vision and natural language ...

EPFL, 2024

Topics in statistical physics of high-dimensional machine learning

Hugo Chao Cui

In the past few years, Machine Learning (ML) techniques have ushered in a paradigm shift, allowing the harnessing of ever more abundant sources of data to automate complex tasks. The technical workhorse behind these important breakthroughs arguably lies in ...

EPFL, 2024

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Seyed Mohammad Mahdi Johari

Recent advancements in deep learning have revolutionized 3D computer vision, enabling the extraction of intricate 3D information from 2D images and video sequences. This thesis explores the application of deep learning in three crucial challenges of 3D com ...

EPFL, 2024

Coupling a recurrent neural network to SPAD TCSPC systems for real-time fluorescence lifetime imaging

Edoardo Charbon, Claudio Bruschini, Andrei Ardelean, Paul Mos, Yang Lin

Fluorescence lifetime imaging (FLI) has been receiving increased attention in recent years as a powerful diagnostic technique in biological and medical research. However, existing FLI systems often suffer from a tradeoff between processing speed, accuracy, ...

Berlin, 2024

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Volkan Cevher, Grigorios Chrysos, Fanghui Liu

Despite the widespread empirical success of ResNet, the generalization properties of deep ResNet are rarely explored beyond the lazy training regime. In this work, we investigate scaled ResNet in the limit of infinitely deep and wide neural networks, of wh ...

2024

Enabling Uncertainty Estimation in Iterative Neural Networks

Pascal Fua, Nikita Durasov, Doruk Oner, Minh Hieu Lê

Turning pass-through network architectures into iterative ones, which use their own output as input, is a well-known approach for boosting performance. In this paper, we argue that such architectures offer an additional benefit: The convergence rate of the ...

2024

Exploring High-Performance and Energy-Efficient Architectures for Edge AI-Enabled Applications

Joshua Alexander Harrison Klein

The desire and ability to place AI-enabled applications on the edge has grown significantly in recent years. However, the compute-, area-, and power-constrained nature of edge devices is stressed by the needs of the AI-enabled applications, due to a gener ...

EPFL, 2024
