Concept

Computational phylogenetics

Computational phylogenetics is the application of computational algorithms, methods, and programs to phylogenetic analyses. The goal is to assemble a phylogenetic tree representing a hypothesis about the evolutionary ancestry of a set of genes, species, or other taxa. For example, these techniques have been used to explore the family tree of hominid species and the relationships between specific genes shared by many types of organisms. Traditional phylogenetics relies on morphological data obtained by measuring and quantifying the phenotypic properties of representative organisms, while the more recent field of molecular phylogenetics uses nucleotide sequences encoding genes or amino acid sequences encoding proteins as the basis for classification. Many forms of molecular phylogenetics are closely related to and make extensive use of sequence alignment in constructing and refining phylogenetic trees, which are used to classify the evolutionary relationships between homologous genes represented in the genomes of divergent species. The phylogenetic trees constructed by computational methods are unlikely to perfectly reproduce the evolutionary tree that represents the historical relationships between the species being analyzed. The historical species tree may also differ from the historical tree of an individual homologous gene shared by those species. Phylogenetic trees generated by computational phylogenetics can be either rooted or unrooted depending on the input data and the algorithm used. A rooted tree is a directed graph that explicitly identifies a most recent common ancestor (MRCA), usually an imputed sequence that is not represented in the input. Genetic distance measures can be used to plot a tree with the input sequences as leaf nodes and their distances from the root proportional to their genetic distance from the hypothesized MRCA. Identification of a root usually requires the inclusion in the input data of at least one "outgroup" known to be only distantly related to the sequences of interest.

Source officielle

https://en.wikipedia.org/wiki/Computational_phylogenetics

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Cours associés (3)

BIO-109: Introduction to life sciences (for IC)

Ce cours présente les principes fondamentaux à l'œuvre dans les organismes vivants. Autant que possible, l'accent est mis sur les contributions de l'Informatique aux progrès des Sciences de la Vie.

BIO-463: Genomics and bioinformatics

This course covers various data analysis approaches associated with applications of DNA sequencing technologies, from genome sequencing to quantifying gene evolution, gene expression, transcription fa

CS-473: System programming for Systems-on-chip

To efficiently program embedded systems an understanding of their architectures is required. After following this course students will be able to take an existing SoC, understand its architecture, and

Séances de cours associées (10)

Exploration de l'évolution: preuves moléculaires et mécanismes

Explore les preuves moléculaires et les mécanismes de l'évolution, y compris les arbres phylogénétiques et les comparaisons de protéines.

Exploration de l'évolution: aperçu moléculaire et comparaison des protéines

Explore l'évolution à travers des enzymes, des modèles d'ADN et des comparaisons de protéines, en discutant des connaissances moléculaires et des mécanismes évolutifs.

Construction d'arbres phylogénétiques

Couvre les étapes de la construction d'un arbre phylogénétique en utilisant des séquences d'ADN et la méthode UPGMA.

Afficher plus

Publications associées (29)

The microbial genomics of glacier-fed streams: adaptations to an extreme ecosystem

Massimo Bourquin

Glacier-fed streams are the cold, ultra-oligotrophic, and unstable streams that are fed by glacial meltwater. Despite these extreme conditions, they harbour a diverse and abundant microbial diversity that develops into biofilms, covering the boulders and s ...

EPFL2024

The time-domain Cartesian multipole expansion of electromagnetic fields

Farhad Rachidi-Haeri, Marcos Rubinstein, Elias Per Joachim Le Boudec, Nicolas Mora Parra, Chaouki Kasmi, Emanuela Radici

Time-domain solutions of Maxwell’s equations in homogeneous and isotropic media are paramount to studying transient or broadband phenomena. However, analytical solutions are generally unavailable for practical applications, while numerical solutions are co ...

Nature Publishing Group2024

Impact of phylogeny on structural contact inference from protein sequence data

Anne-Florence Raphaëlle Bitbol, Umberto Lupo, Nicola Dietler

Local and global inference methods have been developed to infer structural contacts from multiple sequence alignments of homologous proteins. They rely on correlations in amino acid usage at contacting sites. Because homologous proteins share a common ance ...

ROYAL SOC2023

Afficher plus

Source officielle

https://en.wikipedia.org/wiki/Computational_phylogenetics

À propos de ce résultat

Cours associés (3)

BIO-109: Introduction to life sciences (for IC)

BIO-463: Genomics and bioinformatics

CS-473: System programming for Systems-on-chip

Séances de cours associées (10)

Exploration de l'évolution: preuves moléculaires et mécanismes

Explore les preuves moléculaires et les mécanismes de l'évolution, y compris les arbres phylogénétiques et les comparaisons de protéines.

Exploration de l'évolution: aperçu moléculaire et comparaison des protéines

Explore l'évolution à travers des enzymes, des modèles d'ADN et des comparaisons de protéines, en discutant des connaissances moléculaires et des mécanismes évolutifs.

Construction d'arbres phylogénétiques

Couvre les étapes de la construction d'un arbre phylogénétique en utilisant des séquences d'ADN et la méthode UPGMA.

Afficher plus

Publications associées (29)

The microbial genomics of glacier-fed streams: adaptations to an extreme ecosystem

Massimo Bourquin

EPFL2024

The time-domain Cartesian multipole expansion of electromagnetic fields

Farhad Rachidi-Haeri, Marcos Rubinstein, Elias Per Joachim Le Boudec, Nicolas Mora Parra, Chaouki Kasmi, Emanuela Radici

Nature Publishing Group2024

Impact of phylogeny on structural contact inference from protein sequence data

Anne-Florence Raphaëlle Bitbol, Umberto Lupo, Nicola Dietler

ROYAL SOC2023

Afficher plus

Personnes associées (3)

Bernard Moret

Bernard M.E. Moret was born in Vevey, Switzerland, received baccalauréats in Latin-Greek and Latin-Mathematics, then did a Diploma in Electrical Engineering at EPFL. After working for 2 years for Omega and Swiss Timing on the development of real-time OS for sports applications, he left for the US. He received his PhD in Electrical Engineering from the U. of Tennessee in 1980 and joined the Department of Computer Science at the University of New Mexico (UNM) that fall. He served as Chairman of the department from 1991 till 1993 and eventually retired in summer 2006 to join the School of Computer and Communication Sciences at EPFL. (You can read about his work at UNM on his (archived) personal and laboratory web pages at UNM.) He was appointed group leader for phylogenetics at the Swiss Institute for Bioinformatics (SIB). From 2009 until his retirement, he was also in charge of the BS and MS programs in Computer Science and Associate Dean for Education. He founded the ACM Journal of Experimental Algorithmics (JEA) and served as its Editor-in-Chief for 7 years; he also helped found the IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), where he served as Associate Editor until 2008. He founded the annual Workshop on Algorithms in Bioinformatics (WABI) and chairs its steering committee, and he serves on the steering committee of the Workshop on Algorithm Engineering and Experiments (ALENEX). Until summer 2008, he chaired the Biodata Management and Analysis (BDMA) study section of the US National Institutes of Health (NIH); now he is a charter member of the NIH College of Reviewers. He led a team of over 50 biologists, computer scientists, and mathematicians in the CIPRES (Cyber Infrastructure for Phylogenetic Research) project, funded by the US National Science Foundation (NSF) for US$ 12 million over 5 years. He has published nearly 150 papers in computational biology, under funding from the US NSF, the Alfred P. Sloan foundation, the IBM Corporation, the US NIH, the Swiss NSF, and SystemsX.ch. He is a Fellow of the ISCB (International Society for Computational Biology). His Erdös number is 2 and (as of 2020) his h-index is 48.

Afficher plus

Concepts associés (13)

Maximum de parcimonie

Les méthodes de maximum de parcimonie, ou plus simplement méthodes de parcimonie ou encore parcimonie de Wagner, sont une méthode statistique non-paramétrique très utilisée, notamment pour l'inférence phylogénétique. Cette méthode permet de construire des arbres de classification hiérarchique après enracinement, lesquels permettent d'obtenir des informations sur la structure de parenté d'un ensemble de taxons. Sous l'hypothèse du maximum de parcimonie, l'arbre phylogénétique « préféré » est celui qui requiert le plus petit nombre de changements évolutifs.

Multiple sequence alignment

Multiple sequence alignment (MSA) may refer to the process or the result of sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In many cases, the input set of query sequences are assumed to have an evolutionary relationship by which they share a linkage and are descended from a common ancestor. From the resulting MSA, sequence homology can be inferred and phylogenetic analysis can be conducted to assess the sequences' shared evolutionary origins.

Matrice de similarité

Les matrices de similarité ou matrices de substitution sont des matrices utilisées en bioinformatique pour réaliser des alignements de séquences biologiques reliées évolutivement. Elles permettent de donner un score de similarité ou de ressemblance entre deux acides aminés. Ces matrices, M, sont des matrices 20 x 20 (pour les 20 acides aminés protéinogènes standards) qui recensent l'ensemble des scores M(a,b) obtenus lorsqu'on substitue l'acide aminé a à l'acide b dans un alignement.

Afficher plus

Personnes associées (3)

Bernard Moret

Afficher plus