Publications de Volkan Cevher | EPFL Graph Search

Single-pass Detection of Jailbreaking Input in Large Language Models

Volkan Cevher, Grigorios Chrysos, Yongtao Wu, Elias Abad Rocamora

Defending aligned Large Language Models (LLMs) against jailbreaking attacks is a challenging problem, with existing approaches requiring multiple requests or even queries to auxiliary LLMs, making them computationally heavy. Instead, we focus on detecting ...

2025

How Gradient Descent Balances Features: A Dynamical Analysis For Two-Layer Neural Networks

Volkan Cevher, Fanghui Liu, Zhenyu Zhu

This paper investigates the fundamental regression task of learning k neurons (a.k.a. teachers) from Gaussian input, using two-layer ReLU neural networks with width m (a.k.a. students) and m, k = O(1), trained via gradient descent under proper initializati ...

2025

Accelerating Spectral Clustering under Fairness Constraints

Volkan Cevher

Fairness of decision-making algorithms is an increasingly important issue. In this paper, we focus on spectral clustering with group fairness constraints, where every demographic group is represented in each cluster proportionally as in the general populat ...

2025

Generalization of Noisy SGD in Unbounded Non-convex Settings

Volkan Cevher, Leello Tadesse Dadi

We study the generalization of iterative noisy gradient schemes on smooth non-convex losses. Formally, we establish time-independent information theoretic generalization bounds for Stochastic Gradient Langevin Dynamics (SGLD) that do not diverge as the ite ...

2025

IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic

Volkan Cevher, Luca Viano

This paper introduces the SOAR framework for imitation learning. SOAR is an algorithmic template that learns a policy from expert demonstrations with a primal dual style algorithm that alternates cost and policy updates. Within the policy updates, the SOAR ...

2025

CHAMELEON: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning

Volkan Cevher, Wanyun Xie

Training data mixtures greatly impact the generalization performance of large language models. Existing domain reweighting methods often rely on costly weight computations and require retraining when new data is introduced. To this end, we introduce a flex ...

2025

Faster Inference Of Flow-Based Generative Models Via Improved Data-Noise Coupling

Volkan Cevher, Leello Tadesse Dadi

Conditional Flow Matching (CFM), a simulation-free method for training continuous normalizing flows, provides an efficient alternative to diffusion models for key tasks like image and video generation. The performance of CFM in solving these tasks depends ...

2025

Best of Both Worlds: Regret Minimization versus Minimax Play

Volkan Cevher, Luca Viano

In this paper, we investigate the existence of online learning algorithms with bandit feedback that simultaneously guarantee O(1) regret compared to a given comparator strategy, and Õ(√ T) regret compared to any fixed strategy, where T is the number of rou ...

2025

Adversarial Training For Defense Against Label Poisoning Attacks

Volkan Cevher

As machine learning models grow in complexity and increasingly rely on publicly sourced data, such as the human-annotated labels used in training large language models, they become more vulnerable to label poisoning attacks. These attacks, in which adversa ...

2025

Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games

Volkan Cevher

Since Polyak's pioneering work, heavy ball (HB) momentum has been widely studied in minimization. However, its role in min-max games remains largely unexplored. As a key component of practical min-max algorithms like Adam, this gap limits their effectivene ...

2025