Êtes-vous un étudiant de l'EPFL à la recherche d'un projet de semestre?
Travaillez avec nous sur des projets en science des données et en visualisation, et déployez votre projet sous forme d'application sur Graph Search.
With the increasing availability of both molecular and topo-climatic data, the main challenges facing landscape genomics — i.e. the combination of landscape ecology with population genomics — include processing large numbers of models and distinguishing between selection and demographic processes (e.g. population structure). Several methods address the latter, either by estimating a null model of population history or by simultaneously inferring environmental and demographic effects. Here we present Samβada, an approach designed to study signatures of local adaptation, with special emphasis on high performance computing of large-scale genetic and environmental datasets. Samβada identifies candidate loci using genotype-environment associations while also incorporating multivariate analyses to assess the effect of many environmental predictor variables. This enables the inclusion of explanatory variables representing population structure into the models in order to lower the occurrences of spurious genotype-environment associations. In addition, Samβada calculates Local Indicators of Spatial Association (LISA) for candidate loci to provide information on whether similar genotypes tend to cluster in space, which constitutes a useful indication of the possible kinship between individuals. To test the usefulness of this approach, we carried out a simulation study and analysed a dataset from Ugandan cattle to detect signatures of local adaptation with Samβada, BayEnv, LFMM and an FST outlier method (FDIST approach in Arlequin) and compare their results. Samβada — an open source software for Windows, Linux and Mac OS X available at http://lasig.epfl.ch/sambada — outperforms other approaches and better suits whole genome sequence data processing.