Future deep HI surveys will be essential for understanding the nature of galaxies and the content of the Universe. However, the large volume of these data will require distributed and automated processing techniques. We introduce LiSA, a set of Python modules for the denoising, detection and characterization of HI sources in 3D spectral data. LiSA was developed and tested on the Square Kilometer Array Science Data Challenge 2 (SDC2) dataset, and contains modules and pipelines for easy domain decomposition and parallel execution. LiSA contains algorithms for 2D-1D wavelet denoising using the starlet transform and flexible source finding using null-hypothesis testing. These algorithms are lightweight and portable, needing only a few user-defined parameters reflecting the resolution of the data. LiSA also includes two convolutional neural networks, developed to analyze data cubes, which separate HI sources from artifacts and predict the HI source properties. All of these components are designed to be as modular as possible, allowing users to mix and match different components to create their ideal pipeline. We demonstrate the performance of the different components of LiSA on the SDC2 dataset, where it finds 95% of HI sources with SNR > 3 and accurately predicts their properties. (C) 2022 The Author(s). Published by Elsevier B.V.
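To make the starlet-based denoising idea more concrete, the following is a minimal sketch of a 1D starlet (à trous, B3-spline) transform with hard thresholding, applied along a single spectral axis. It is not LiSA's API and does not reproduce its 2D-1D algorithm: the function names `starlet_transform` and `starlet_denoise`, the per-scale noise estimate from the median absolute deviation, and the `k_sigma` threshold are all assumptions chosen for illustration.

```python
import numpy as np

# B3-spline scaling kernel commonly used for the starlet transform.
B3_SPLINE = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0


def _smooth(signal, scale):
    """Convolve with the B3-spline kernel dilated by 2**scale (reflected edges)."""
    step = 2 ** scale
    pad = 2 * step
    padded = np.pad(signal, pad, mode="reflect")
    n = signal.size
    out = np.zeros(n)
    for tap, weight in enumerate(B3_SPLINE):
        start = tap * step  # corresponds to offsets -2*step .. +2*step
        out += weight * padded[start:start + n]
    return out


def starlet_transform(signal, n_scales):
    """Return an array of shape (n_scales + 1, n): wavelet bands, then the coarse band."""
    coarse = np.asarray(signal, dtype=float)
    bands = []
    for scale in range(n_scales):
        smoother = _smooth(coarse, scale)
        bands.append(coarse - smoother)  # detail lost by this smoothing step
        coarse = smoother
    bands.append(coarse)
    return np.stack(bands)


def starlet_denoise(signal, n_scales=4, k_sigma=3.0):
    """Hard-threshold each wavelet band at k_sigma times its estimated noise level.

    Assumption: the noise level of each band is estimated from the median
    absolute deviation of that band; this is an illustrative choice only.
    """
    coeffs = starlet_transform(signal, n_scales)
    for band in coeffs[:-1]:  # leave the coarse band untouched
        sigma = np.median(np.abs(band - np.median(band))) / 0.6745
        band[np.abs(band) < k_sigma * sigma] = 0.0
    # The starlet reconstruction is simply the sum of all bands.
    return coeffs.sum(axis=0)
```

Because the transform is undecimated and reconstruction is a plain sum of bands, thresholding can be applied independently per scale, which is one reason such a scheme needs only a few resolution-related parameters (here, the number of scales and the threshold).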