Category

Data management

Related publications (557)

A Practical Influence Approximation for Privacy-Preserving Data Filtering in Federated Learning

Boi Faltings, Ljubomir Rokvic, Panayiotis Danassis

Federated Learning by nature is susceptible to low-quality, corrupted, or even malicious data that can severely degrade the quality of the learned model. Traditional techniques for data valuation cannot be applied as the data is never revealed. We present ...
2023

Lithium tantalate photonic integrated circuits for volume manufacturing

Tobias Kippenberg

Dataset for the manuscript "Lithium tantalate photonic integrated circuits for volume manufacturing".  DOI: 10.1038/s41586-024-07369-1 Contains all raw data and code used to produce the Figures and Extended Data Figures in the manuscript.  ...
Zenodo2023

Center-aware Adversarial Augmentation for Single Domain Generalization

Mathieu Salzmann, Zhiye Wang

Domain generalization (DG) aims to learn a model from multiple training (i.e., source) domains that can generalize well to the unseen test (i.e., target) data coming from a different distribution. Single domain generalization (SingleDG) has recently emerge ...
IEEE COMPUTER SOC2023

An Annotated Corpus of Tonal Piano Music from the Long 19th Century

Martin Alois Rohrmeier, Fabian Claude Moss, Markus Franz Josef Neuwirth, Johannes Hentschel

We present a dataset of 264 annotated piano pieces of nine composers, composed in the long 19th century (https://doi.org/10.5281/zenodo.7483349). Annotations adhere to the DCML harmony annotation standard and include Roman numerals, phrase boundaries, and ...
Ohio State Univ, Sch Music2023

Pollution mask for the continuous corrected particle number concentration data in 1 min time resolution measured in the Swiss aerosol container using a whole air inlet during MOSAiC 2019/2020

Julia Schmale, Andrea Baccarini, Ivo Fabio Beck, Hélène Paule Angot

This dataset contains particle number concentrations and a pollution flag in 1 min time resolution. It is derived by the pollution detection algorithm (PDA, doi:10.5281/zenodo.5761101) based on the corrected particle number concentration data of the CPC377 ...
EPFL Infoscience2023

Dielectric metasurfaces and their applications for optical biosensing

Yasaman Jahani

Over the past century, our understanding of life has been focused on its most fundamental building blocks: molecules. Biological molecules, such as proteins found in blood, other body fluids, or tissues, are excellent guides to identifying a normal or abno ...
EPFL2023

Reinforcement Learning for Joint Design and Control of Battery-PV Systems

Christophe Ballif, Marine Dominique Cauz, Laure-Emmanuelle Perret Aebi

The decentralisation and unpredictability of new renewable energy sources require rethinking our energy system. Data-driven approaches, such as reinforcement learning (RL), have emerged as new control strategies for operating these systems, but they have n ...
2023

STREAMING TENSOR TRAIN APPROXIMATION

Daniel Kressner

Tensor trains are a versatile tool to compress and work with high-dimensional data and functions. In this work we introduce the streaming tensor train approximation (STTA), a new class of algorithms for approximating a given tensor ' in the tensor train fo ...
Philadelphia2023

impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers

Maud Ehrmann, Matteo Romanello

Text Reuse reveals meaningful reiterations of text in large corpora. Humanities researchers use text reuse to study, e.g., the posterior reception of influential texts or to reveal evolving publication practices of historical media. This research is often ...
2023

Distributed Optimization with Byzantine Robustness Guarantees

Lie He

As modern machine learning continues to achieve unprecedented benchmarks, the resource demands to train these advanced models grow drastically. This has led to a paradigm shift towards distributed training. However, the presence of adversaries—whether ma ...
EPFL2023

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.