Publications associées à Neural machine translation

Advancing Self-Supervised Deep Learning for 3D Scene Understanding

Recent advancements in deep learning have revolutionized 3D computer vision, enabling the extraction of intricate 3D information from 2D images and video sequences. This thesis explores the application of deep learning in three crucial challenges of 3D com ...

EPFL2024

Predicting Visual Stimuli From Cortical Response Recorded With Wide-Field Imaging in a Mouse

Silvestro Micera, Daniela De Luca

Neural decoding of the visual system is a subject of research interest, both to understand how the visual system works and to be able to use this knowledge in areas, such as computer vision or brain-computer interfaces. Spike-based decoding is often used, ...

Ieee-Inst Electrical Electronics Engineers Inc2024

Regularization Techniques for Low-Resource Machine Translation

Alejandro Ramírez Atrio

Neural machine translation (MT) and text generation have recently reached very high levels of quality. However, both areas share a problem: in order to reach these levels, they require massive amounts of data. When this is not present, they lack generaliza ...

EPFL2023

Dense Image-based Predictions for Comics Analysis

Deblina Bhattacharjee

Dense image-based prediction methods have advanced tremendously in recent years. Their remarkable development has been possible due to the ample availability of real-world imagery. While these methods work well on photographs, their abilities do not genera ...

EPFL2023

Linear Complexity Self-Attention With 3rd Order Polynomials

Grigorios Chrysos, Filippos Kokkinos

Self-attention mechanisms and non-local blocks have become crucial building blocks for state-of-the-art neural architectures thanks to their unparalleled ability in capturing long-range dependencies in the input. However their cost is quadratic with the nu ...

Ieee Computer Soc2023

Modeling Structured Data in Attention-based Models

Alireza Mohammadshahi

Natural language processing has experienced significant improvements with the development of Transformer-based models, which employ self-attention mechanism and pre-training strategies. However, these models still present several obstacles. A notable issue ...

EPFL2023

Novel Methods For Detection And Analysis Of Atypical Aspects In Speech

Julian David Fritsch

Atypical aspects in speech concern speech that deviates from what is commonly considered normal or healthy. In this thesis, we propose novel methods for detection and analysis of these aspects, e.g. to monitor the temporary state of a speaker, diseases tha ...

EPFL2023

Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages

Karl Aberer, Rémi Philippe Lebret, Negar Foroutan Eghlidi

Vision-Language Pre-training (VLP) has advanced the performance of many visionlanguage tasks, such as image-text retrieval, visual entailment, and visual reasoning. The pre-training mostly utilizes lexical databases and image queries in English. Previous w ...

Assoc Computational Linguistics-Acl2023

Interpretable Representation Learning and Evaluation for Abstractive Summarization

Andreas Thomas Marfurt

Abstractive summarization has seen big improvements in recent years, mostly due to advances in neural language modeling, language model pretraining, and scaling models and datasets. While large language models generate summaries that are fluent, coherent, ...

EPFL2023

Deep Learning Meets Sparse Regularization

Rahul Parhi

Deep learning (DL) has been wildly successful in practice, and most of the state-of-the-art machine learning methods are based on neural networks (NNs). Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of ...

2023

BMAT: An open-source BIDS managing and analysis tool

Meritxell Bach Cuadra, Francesco La Rosa, Maxence Charles F Wynen, Benoît Macq

Magnetic Resonance Imaging (MRI) is an established technique to study in vivo neurological disorders such as Multiple Sclerosis (MS). To avoid errors on MRI data organization and automated processing, a standard called Brain Imaging Data Structure (BIDS) h ...

ELSEVIER SCI LTD2022

Biochemistry of Aminoacyl tRNA Synthetase and tRNAs and Their Engineering for Cell-Free and Synthetic Cell Applications

Sebastian Maerkl, Ragunathan Bava Ganesh

Cell-free biology is increasingly utilized for engineering biological systems, incorporating novel functionality, and circumventing many of the complications associated with cells. The central dogma describes the information flow in biology consisting of t ...

FRONTIERS MEDIA SA2022

V-ATPase/TORC1-mediated ATFS-1 translation directs mitochondrial UPR activation in C. elegans

Johan Auwerx, Kristina Schoonjans, Felix Naef, Xiaoxu Li, Jia Liu, Hao Li, Wen Gao, Nagammal Neelagandan, Yang Li, Amélia Lalou, Yuan Liu

To adapt mitochondrial function to the ever-changing intra- and extracellular environment, multiple mitochondrial stress response (MSR) pathways, including the mitochondrial unfolded protein response (UPRmt), have evolved. However, how the mitochondrial st ...

Rockefeller University Press2022

Improved framework to estimate travel time and derived distributions in hydrological control volumes

Mitra Asadollahi

Crossing properties of soil saturation, defined as the duration and excursion of soil saturation below and above certain thresholds, are key variables to ecosystem functioning and evolution by primarily influencing the plant and soil microbes physiological ...

EPFL2022

Rectilinear translation four-bar flexure mechanism based on four Remote Center Compliance pivots

Simon Nessim Henein, Charles Baur, Loïc Benoît Tissot-Daguette, Hubert Pierre-Marie Benoît Schneegans, Quentin Gubler

This paper presents a novel planar four-bar linkage compliant mechanism (called 4-RCC) based on four flexure-based Remote Center of Compliance (RCC) pivots. With particular configurations and dimensions, the beam shortening of the RCC pivots can compensate ...

euspen2022

Continual Test-Time Domain Adaptation

Olga Fink, Qin Wang

Test-time domain adaptation aims to adapt a source pretrained model to a target domain without using any source data. Existing works mainly consider the case where the target domain is static. However, real-world machine perception systems are running in n ...

IEEE COMPUTER SOC2022

Latent Space Slicing for Enhanced Entropy Modeling in Learning-Based Point Cloud Geometry Compression

Touradj Ebrahimi, Davi Nachtigall Lazzarotto

The growing adoption of point clouds as an imaging modality has stimulated the search for efficient solutions for compression. Learning-based algorithms have been reporting increasingly better performance and are drawing the attention from the research com ...

IEEE2022

Using Animal Motion Capture to Learn Neural Representations

Semih Günel

Understanding behavior from neural activity is a fundamental goal in neuroscience. It has practical applications in building robust brain-machine interfaces, human-computer interaction, and assisting patients with neurological disabilities. Despite the eve ...

EPFL2022

Bridging the gap between model-driven and data-driven methods in the era of Big Data

Gael Lederrey

Data-driven and model-driven methodologies can be regarded as competitive fields since they tackle similar problems such as prediction. However, these two fields can learn from each other to improve themselves. Indeed, data-driven methodologies have been d ...

EPFL2022

Personalized Productive Engagement Recognition in Robot-Mediated Collaborative Learning

Barbara Bruno, Jauwairia Nasir

In this paper, we propose and compare personalized models for Productive Engagement (PE) recognition. PE is defined as the level of engagement that maximizes learning. Previously, in the context of robot-mediated collaborative learning, a framework of prod ...

2022