Critical Parameters for Scalable Distributed Learning with Large Batches and Asynchronous Updates
In this thesis, we study two closely related directions: robustness and generalization in modern deep learning. Deep learning models based on empirical risk minimization are known to be often non-robust to small, worst-case perturbations known as adversari ...
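For context, the contrast drawn here is between the empirical risk and its worst-case (adversarial) counterpart; a common formalization (notation assumed here, not quoted from the thesis) is

$\min_{\theta} \frac{1}{n}\sum_{i=1}^{n} \ell(f_\theta(x_i), y_i) \quad\text{versus}\quad \min_{\theta} \frac{1}{n}\sum_{i=1}^{n} \max_{\|\delta_i\|\le\varepsilon} \ell(f_\theta(x_i+\delta_i), y_i),$

where the inner maximization models small, worst-case perturbations of each input.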
Within the context of contemporary machine learning problems, the efficiency of the optimization process depends on the properties of the model and the nature of the available data, which poses a significant problem as the complexity of either increases ad infinit ...
EPFL, 2023
Non-convex constrained optimization problems have become a powerful framework for modeling a wide range of machine learning problems, with applications in k-means clustering, large-scale semidefinite programs (SDPs), and various other tasks. As the perfor ...
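The generic template behind such formulations (written here in assumed notation, not taken from the abstract) is a non-convex objective minimized under explicit constraints,

$\min_{x \in \mathcal{X}} \; f(x) \quad \text{subject to} \quad A(x) = b,$

where $f$ may be non-convex and the constraint set encodes problem structure, e.g. the feasible set of an SDP relaxation or cluster-assignment constraints in k-means.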
We study the limit behaviour of sequences of non-convex, vectorial, random integral functionals, defined on $W^{1,1}$, whose integrands are ergodic and satisfy degenerate linear growth conditions. The latter involve suitable random, scale-dependent weight-funct ...
2023
We consider distributed optimization over several devices, each sending incremental model updates to a central server. This setting is considered, for instance, in federated learning. Various schemes have been designed to compress the model updates in orde ...
Taylor & Francis Ltd, 2022
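A minimal sketch of one common compression operator for this setting, assuming top-k sparsification of the local update before it is sent to the server; the function name and parameters are illustrative, and the schemes studied in the paper may differ.

import numpy as np

def top_k_compress(update, k):
    # Keep only the k largest-magnitude entries of a model update;
    # everything else is zeroed out before transmission.
    flat = update.ravel()
    if k >= flat.size:
        return update.copy()
    idx = np.argpartition(np.abs(flat), -k)[-k:]   # indices of the k largest magnitudes
    compressed = np.zeros_like(flat)
    compressed[idx] = flat[idx]
    return compressed.reshape(update.shape)

# Illustrative use: a device compresses its incremental update before sending it.
rng = np.random.default_rng(0)
local_update = rng.normal(size=1000)
sparse_update = top_k_compress(local_update, k=50)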
Stochastic gradient descent (SGD) and randomized coordinate descent (RCD) are two of the workhorses for training modern automated decision systems. Intriguingly, convergence properties of these methods are not well-established as we move away from the spec ...
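For reference, the two update rules being compared, written in assumed notation for an objective $f(x)=\frac{1}{n}\sum_{i=1}^{n} f_i(x)$, are

SGD: $x_{k+1} = x_k - \gamma_k \nabla f_{i_k}(x_k)$, with $i_k$ a randomly sampled component, and
RCD: $x_{k+1} = x_k - \gamma_k \nabla_{j_k} f(x_k)\, e_{j_k}$, with $j_k$ a randomly sampled coordinate and $e_{j_k}$ the corresponding unit vector.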
Recently there has been a surge of interest in understanding implicit regularization properties of iterative gradient-based optimization algorithms. In this paper, we study the statistical guarantees on the excess risk achieved by early-stopped unconstrain ...
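A minimal sketch of early-stopped, unconstrained gradient descent on least squares, where the iteration budget plays the role of the regularization parameter; the model and names here are only an illustration, not the paper's exact setting.

import numpy as np

def early_stopped_gd(X, y, step, max_iters):
    # Plain gradient descent on the least-squares loss, halted after
    # max_iters iterations; stopping earlier yields stronger implicit regularization.
    w = np.zeros(X.shape[1])
    for _ in range(max_iters):
        grad = X.T @ (X @ w - y) / len(y)
        w -= step * grad
    return w

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 50))
y = X @ rng.normal(size=50) + 0.5 * rng.normal(size=200)
w_early = early_stopped_gd(X, y, step=0.1, max_iters=20)    # heavily regularized iterate
w_late  = early_stopped_gd(X, y, step=0.1, max_iters=2000)  # close to the least-squares solution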
We propose an adaptive variance-reduction method, called AdaSpider, for minimization of L-smooth, non-convex functions with a finite-sum structure. In essence, AdaSpider combines an AdaGrad-inspired [Duchi et al., 2011, McMahan & Streeter, 2010], but a fai ...
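A rough sketch of the two ingredients named above, a SPIDER-type recursive gradient estimator combined with an AdaGrad-style step size built from accumulated estimator norms; the actual AdaSpider update rule is specified in the paper and may differ in its details.

import numpy as np

def adaspider_sketch(grads, x0, n, q=32, eta=1.0, T=1000):
    # grads(i, x) returns the gradient of the i-th component function at x.
    x_prev = x = x0.copy()
    acc = 0.0                              # accumulated squared estimator norms
    for t in range(T):
        if t % q == 0:                     # periodic full-gradient refresh
            v = np.mean([grads(i, x) for i in range(n)], axis=0)
        else:                              # recursive (SPIDER-type) correction
            i = np.random.randint(n)
            v = grads(i, x) - grads(i, x_prev) + v
        acc += np.linalg.norm(v) ** 2
        step = eta / np.sqrt(1.0 + acc)    # AdaGrad-style step size, no tuned smoothness constant
        x_prev, x = x, x - step * v
    return x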
We consider the problem of finding a saddle point for the convex-concave objective $\min_x \max_y f(x) + \langle Ax, y\rangle - g^*(y)$, where $f$ is a convex function with locally Lipschitz gradient and $g$ is convex and possibly non-smooth. We propose an ...
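For context, this template is the standard primal-dual form of $\min_x f(x) + g(Ax)$; one classical first-order scheme for it (shown only as a reference point, not as the method proposed in this work) alternates a gradient step in $x$ with a proximal step in $y$:

$x_{k+1} = x_k - \tau\left(\nabla f(x_k) + A^\top y_k\right), \qquad y_{k+1} = \operatorname{prox}_{\sigma g^*}\!\left(y_k + \sigma A(2x_{k+1} - x_k)\right).$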
We study supervised learning problems for predicting properties of individuals who belong to one of two demographic groups, and we seek predictors that are fair according to statistical parity. This means that the distributions of the predictions within th ...
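In assumed notation, statistical parity for a predictor $\hat{f}$ and a binary group attribute $G$ requires the distribution of predictions to coincide across the two groups:

$\mathbb{P}\!\left(\hat{f}(X) \in B \mid G = 0\right) = \mathbb{P}\!\left(\hat{f}(X) \in B \mid G = 1\right) \quad \text{for all measurable sets } B.$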