Êtes-vous un étudiant de l'EPFL à la recherche d'un projet de semestre?
Travaillez avec nous sur des projets en science des données et en visualisation, et déployez votre projet sous forme d'application sur Graph Search.
Darwin is a genomics co-processor that achieved a 15000x acceleration on long read assembly through innovative hardware and algorithm co-design. Darwins algorithms and hardware implementation were specifically designed for DNA analysis pipelines. This paper analyzes the feasibility of applying Darwins algorithms to the problem of protein sequence alignment. In addition to a behavioral analysis of Darwin when aligning proteins, we propose an algorithmic improvement to Darwins alignment algorithm, GACT, in the form of a multi-pass variant that increases its accuracy on protein sequence alignment. Concretely, our proposed multi-pass variant of GACT achieves on average 14% better alignment scores.
Anne-Florence Raphaëlle Bitbol, Nicola Dietler, Umberto Lupo
Rubén Laplaza Solanas, Anne-Clémence Corminboeuf, Puck Elisabeth van Gerwen, Alexandre Alain Schöpfer, Simone Gallarati
Anne-Florence Raphaëlle Bitbol, Damiano Sgarbossa, Umberto Lupo