Publication

Learning-based Image Coding: Early Solutions Reviewing and Subjective Quality Evaluation

Related publications (79)

Efficient Temporally-Aware DeepFake Detection using H.264 Motion Vectors

Sabine Süsstrunk, Yufan Ren, Peter Arpad Grönquist, Alessio Verardo, Qingyi He

Video DeepFakes are fake media created with Deep Learning (DL) that manipulate a person’s expression or identity. Most current DeepFake detection methods analyze each frame independently, ignoring inconsistencies and unnatural movements between frames. Som ...
2024

Data for Paper "Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning"

Anders Meibom, Devis Tuia, Guilhem Maurice Louis Banc-Prandi, Jonathan Paul Sauder

Example Data for DeepReefMap This dataset contains input videos in MP4 format taken with GoPro Hero 10 Cameras in Reefs in the Red Sea to demonstrate the DeepReefMap tool, which is described in the paper "Scalable Semantic 3D Mapping of Coral Reefs with De ...
EPFL Infoscience2024

Toward Automatic Typography Analysis: Serif Classification and Font Similarities

Mathieu Salzmann, Delphine Ribes Lemay, Nicolas Henchoz, Romain Simon Collaud, Syed Talal Wasim

Whether a document is of historical or contemporary significance, typography plays a crucial role in its composition. From the early days of modern printing, typographic techniques have evolved and transformed, resulting in changes to the features of typog ...
2024

The JPEG AI Standard: Providing Efficient Human and Machine Visual Data Consumption

Touradj Ebrahimi

The Joint Photographic Experts Group (JPEG) AI learning-based image coding system is an ongoing joint standardization effort between International Organization for Standardization (ISO), International Electrotechnical Commission (IEC), and International Te ...
IEEE COMPUTER SOC2023

Communication-efficient distributed training of machine learning models

Thijs Vogels

In this thesis, we explore techniques for addressing the communication bottleneck in data-parallel distributed training of deep learning models. We investigate algorithms that either reduce the size of the messages that are exchanged between workers, or th ...
EPFL2023

Dual-frequency spectral radar retrieval of snowfall microphysics: a physics-driven deep-learning approach

Alexis Berne, Gionata Ghiggi

The use of meteorological radars to study snowfall microphysical properties and processes is well established, in particular via a few distinct techniques: the use of radar polarimetry, of multi-frequency radar measurements, and of the radar Doppler spectr ...
COPERNICUS GESELLSCHAFT MBH2023

Learnable Wavelet Transform and Domain Adversarial Learning for Enhanced Bearing Fault Diagnosis

Olga Fink, Gaëtan Michel Frusque, Qi Li, Baorui Dai

The application of unsupervised domain adaptation (UDA)-based fault diagnosis methods has shown significant efficacy in industrial settings, facilitating the transfer of operational experience and fault signatures between different operating conditions, di ...
Research Publishing2023

Smart filter aided domain adversarial neural network for fault diagnosis in noisy industrial scenarios

Olga Fink, Gaëtan Michel Frusque, Tianfu Li, Qi Li, Baorui Dai

The application of unsupervised domain adaptation (UDA)-based fault diagnosis methods has shown significant efficacy in industrial settings, facilitating the transfer of operational experience and fault signatures between different operating conditions, di ...
2023

SAGTTA: SALIENCY GUIDED TEST TIME AUGMENTATION FOR MEDICAL IMAGE SEGMENTATION ACROSS VENDOR DOMAIN SHIFT

Devavrat Tomar

Test time augmentation has been shown to be an effective approach to combat domain shifts in deep learning. Despite their promising performance levels, the interpretability of the underlying used models is however low. Saliency maps have been widely used i ...
New York2023

VETIM: Expanding the Vocabulary of Text-to-Image Models only with Text

Sabine Süsstrunk, Radhakrishna Achanta, Mahmut Sami Arpa, Martin Nicolas Everaert

Text-to-image models, such as Stable Diffusion, can generate high-quality images from simple textual prompts. With methods such as Textual Inversion, it is possible to expand the vocabulary of these models with additional concepts, by learning the vocabula ...
BMVA2023

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.