Concept

Computational phylogenetics

Computational phylogenetics is the application of computational algorithms, methods, and programs to phylogenetic analyses. The goal is to assemble a phylogenetic tree representing a hypothesis about the evolutionary ancestry of a set of genes, species, or other taxa. For example, these techniques have been used to explore the family tree of hominid species and the relationships between specific genes shared by many types of organisms.

Traditional phylogenetics relies on morphological data obtained by measuring and quantifying the phenotypic properties of representative organisms, while the more recent field of molecular phylogenetics uses nucleotide sequences of genes or amino acid sequences of proteins as the basis for classification. Many forms of molecular phylogenetics are closely related to, and make extensive use of, sequence alignment in constructing and refining phylogenetic trees, which are used to classify the evolutionary relationships between homologous genes represented in the genomes of divergent species.

The phylogenetic trees constructed by computational methods are unlikely to perfectly reproduce the evolutionary tree that represents the historical relationships between the species being analyzed. The historical species tree may also differ from the historical tree of an individual homologous gene shared by those species.

Phylogenetic trees generated by computational phylogenetics can be either rooted or unrooted, depending on the input data and the algorithm used. A rooted tree is a directed graph that explicitly identifies a most recent common ancestor (MRCA), usually an imputed sequence that is not represented in the input. Genetic distance measures can be used to plot a tree with the input sequences as leaf nodes and their distances from the root proportional to their genetic distance from the hypothesized MRCA. Identification of a root usually requires the inclusion in the input data of at least one "outgroup" known to be only distantly related to the sequences of interest.
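As an illustration of the genetic distance measures mentioned above, the following minimal sketch computes pairwise distances from pre-aligned sequences. It assumes the simple p-distance (proportion of differing sites) followed by the Jukes-Cantor correction, which is one common choice among many; the function names and the toy alignment, including its "outgroup" entry, are hypothetical and not taken from the source.

```python
import math
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance d = -3/4 * ln(1 - 4p/3).

    The correction is undefined for p >= 0.75, so infinity is returned there.
    """
    if p >= 0.75:
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(seqs: dict) -> dict:
    """Map each sorted pair of names to its corrected distance."""
    names = sorted(seqs)
    return {
        (i, j): jukes_cantor(p_distance(seqs[i], seqs[j]))
        for i, j in combinations(names, 2)
    }


# Hypothetical toy alignment; "outgroup" stands in for a distantly related taxon.
toy = {
    "human": "ACGTACGTAC",
    "chimp": "ACGTACGTAT",
    "gorilla": "ACGTTCGTAT",
    "outgroup": "TCGATCGAAT",
}

if __name__ == "__main__":
    for pair, d in distance_matrix(toy).items():
        print(pair, round(d, 3))
```

In this toy data the outgroup is farther from every other sequence than those sequences are from each other, which is the property that makes an outgroup useful for placing the root.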

Official source

https://en.wikipedia.org/wiki/Computational_phylogenetics

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related courses (3)

BIO-109: Introduction to life sciences (for IC)

This course presents the fundamental principles at work in living organisms. Wherever possible, emphasis is placed on the contributions of computer science to advances in the life sciences.

BIO-463: Genomics and bioinformatics

This course covers various data analysis approaches associated with applications of DNA sequencing technologies, from genome sequencing to quantifying gene evolution, gene expression, transcription fa

CS-473: System programming for Systems-on-chip

To efficiently program embedded systems an understanding of their architectures is required. After following this course students will be able to take an existing SoC, understand its architecture, and

Related publications (29)

The microbial genomics of glacier-fed streams: adaptations to an extreme ecosystem

Massimo Bourquin

Glacier-fed streams are the cold, ultra-oligotrophic, and unstable streams that are fed by glacial meltwater. Despite these extreme conditions, they harbour a diverse and abundant microbial diversity that develops into biofilms, covering the boulders and s ...

EPFL, 2024

The time-domain Cartesian multipole expansion of electromagnetic fields

Farhad Rachidi-Haeri, Marcos Rubinstein, Elias Per Joachim Le Boudec, Nicolas Mora Parra, Chaouki Kasmi, Emanuela Radici

Time-domain solutions of Maxwell’s equations in homogeneous and isotropic media are paramount to studying transient or broadband phenomena. However, analytical solutions are generally unavailable for practical applications, while numerical solutions are co ...

Nature Publishing Group, 2024

Impact of phylogeny on structural contact inference from protein sequence data

Anne-Florence Raphaëlle Bitbol, Umberto Lupo, Nicola Dietler

Local and global inference methods have been developed to infer structural contacts from multiple sequence alignments of homologous proteins. They rely on correlations in amino acid usage at contacting sites. Because homologous proteins share a common ance ...

ROYAL SOC, 2023

Related people (3)

Bernard Moret

Bernard M.E. Moret was born in Vevey, Switzerland, received baccalauréats in Latin-Greek and Latin-Mathematics, then completed a Diploma in Electrical Engineering at EPFL. After working for 2 years for Omega and Swiss Timing on the development of real-time OS for sports applications, he left for the US. He received his PhD in Electrical Engineering from the University of Tennessee in 1980 and joined the Department of Computer Science at the University of New Mexico (UNM) that fall. He served as Chairman of the department from 1991 until 1993 and eventually retired in summer 2006 to join the School of Computer and Communication Sciences at EPFL. (You can read about his work at UNM on his (archived) personal and laboratory web pages at UNM.) He was appointed group leader for phylogenetics at the Swiss Institute for Bioinformatics (SIB). From 2009 until his retirement, he was also in charge of the BS and MS programs in Computer Science and Associate Dean for Education. He founded the ACM Journal of Experimental Algorithmics (JEA) and served as its Editor-in-Chief for 7 years; he also helped found the IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), where he served as Associate Editor until 2008. He founded the annual Workshop on Algorithms in Bioinformatics (WABI) and chairs its steering committee, and he serves on the steering committee of the Workshop on Algorithm Engineering and Experiments (ALENEX). Until summer 2008, he chaired the Biodata Management and Analysis (BDMA) study section of the US National Institutes of Health (NIH); now he is a charter member of the NIH College of Reviewers. He led a team of over 50 biologists, computer scientists, and mathematicians in the CIPRES (Cyber Infrastructure for Phylogenetic Research) project, funded by the US National Science Foundation (NSF) for US$ 12 million over 5 years. He has published nearly 150 papers in computational biology, under funding from the US NSF, the Alfred P. Sloan Foundation, the IBM Corporation, the US NIH, the Swiss NSF, and SystemsX.ch. He is a Fellow of the ISCB (International Society for Computational Biology). His Erdős number is 2 and (as of 2020) his h-index is 48.

Related lectures (10)

Exploring Evolution: Molecular Evidence and Mechanisms

Explores molecular evidence and mechanisms of evolution, including phylogenetic trees and protein comparisons.

Exploring Evolution: Molecular Insights and Protein Comparison

Explores evolution through enzymes, DNA models, and protein comparisons, discussing molecular insights and evolutionary mechanisms.

Phylogenetic Tree Construction

Covers the stages of building a phylogenetic tree using DNA sequences and the UPGMA method.
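For readers unfamiliar with the UPGMA method named in the lecture above, here is a minimal sketch of the general algorithm, not of the lecture's own material. It repeatedly merges the two closest clusters and averages distances weighted by cluster size. The function name `upgma`, the distance-dictionary layout, and the four-taxon example are assumptions made for illustration.

```python
from itertools import combinations


def upgma(names, dist):
    """Build a rooted tree by UPGMA.

    names: list of leaf labels (strings).
    dist:  dict mapping frozenset({a, b}) -> distance between leaves a and b.
    Returns (tree, merges): the tree as nested 2-tuples of leaf labels, and
    a list of (cluster, height) pairs in the order the merges happened.
    """
    sizes = {name: 1 for name in names}  # cluster -> number of leaves in it
    d = dict(dist)                       # working copy, extended as we merge
    merges = []
    while len(sizes) > 1:
        # Pick the closest pair of current clusters.
        a, b = min(combinations(sizes, 2), key=lambda p: d[frozenset(p)])
        height = d[frozenset((a, b))] / 2.0
        new = (a, b)
        size_a, size_b = sizes.pop(a), sizes.pop(b)
        # Distance from the merged cluster to every remaining cluster is the
        # size-weighted average of the two old distances.
        for c in sizes:
            d[frozenset((new, c))] = (
                size_a * d[frozenset((a, c))] + size_b * d[frozenset((b, c))]
            ) / (size_a + size_b)
        sizes[new] = size_a + size_b
        merges.append((new, height))
    (tree,) = sizes
    return tree, merges


# Hypothetical distances between four taxa.
example = {
    frozenset(("A", "B")): 2.0,
    frozenset(("A", "C")): 6.0,
    frozenset(("B", "C")): 6.0,
    frozenset(("A", "D")): 10.0,
    frozenset(("B", "D")): 10.0,
    frozenset(("C", "D")): 10.0,
}

if __name__ == "__main__":
    tree, merges = upgma(["A", "B", "C", "D"], example)
    print(tree)
    for cluster, height in merges:
        print(cluster, height)
```

UPGMA assumes a constant rate of change along all lineages, so the resulting tree is rooted and every leaf sits at the same distance from the root; when that assumption does not hold, the topology it returns can be misleading.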

Related concepts (13)

Maximum parsimony (phylogenetics)

In phylogenetics, maximum parsimony is an optimality criterion under which the preferred phylogenetic tree is the one that minimizes the total number of character-state changes (or minimizes the cost of differentially weighted character-state changes). Under the maximum-parsimony criterion, the optimal tree will minimize the amount of homoplasy (i.e., convergent evolution, parallel evolution, and evolutionary reversals). In other words, under this criterion, the shortest possible tree that explains the data is considered best.
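To make the criterion concrete, the sketch below counts the minimum number of unweighted character-state changes on a fixed rooted binary tree using Fitch's algorithm. This only scores a given tree; searching over tree topologies is a separate and much harder problem. The tree encoding (nested 2-tuples of leaf names), the function names, and the toy alignment are assumptions made for illustration.

```python
def fitch(tree, states):
    """Fitch's bottom-up pass for a single character.

    tree:   a leaf name (str) or a 2-tuple (left_subtree, right_subtree).
    states: dict mapping leaf name -> observed character state.
    Returns (candidate_state_set, minimum_number_of_changes).
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    common = left_set & right_set
    if common:
        # The children can agree on a state: no extra change needed here.
        return common, left_cost + right_cost
    # The children disagree: one change is required on one of the branches.
    return left_set | right_set, left_cost + right_cost + 1


def parsimony_score(tree, alignment):
    """Sum the per-site Fitch costs over all columns of an alignment."""
    length = len(next(iter(alignment.values())))
    return sum(
        fitch(tree, {name: seq[i] for name, seq in alignment.items()})[1]
        for i in range(length)
    )


# Hypothetical three-site alignment of four taxa.
alignment = {"A": "AAG", "B": "AAG", "C": "ATC", "D": "TTC"}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))

if __name__ == "__main__":
    print(parsimony_score(tree_1, alignment))
    print(parsimony_score(tree_2, alignment))
```

For this toy data, the tree that groups A with B and C with D needs fewer changes than the alternative, so it would be preferred under maximum parsimony.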

Multiple sequence alignment

Multiple sequence alignment (MSA) may refer to the process or the result of sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In many cases, the input set of query sequences is assumed to have an evolutionary relationship by which the sequences share a linkage and are descended from a common ancestor. From the resulting MSA, sequence homology can be inferred and phylogenetic analysis can be conducted to assess the sequences' shared evolutionary origins.

Substitution matrix

In bioinformatics and evolutionary biology, a substitution matrix describes the frequency at which a character in a nucleotide sequence or a protein sequence changes to other character states over evolutionary time. The information is often in the form of log odds of finding two specific character states aligned and depends on the assumed number of evolutionary changes or sequence dissimilarity between compared sequences. It is an application of a stochastic matrix.
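The log-odds form described above can be sketched as follows: for each pair of character states, the score is the base-2 logarithm of the observed frequency of that aligned pair divided by the frequency expected if the two states occurred independently. This is a simplified illustration under stated assumptions (gap-free pairwise alignments, symmetric counting, unobserved pairs left out rather than smoothed), not the procedure used to build any published matrix such as PAM or BLOSUM; the function name and input data are hypothetical.

```python
import math
from collections import Counter


def log_odds_matrix(aligned_pairs):
    """Estimate log2-odds substitution scores from aligned sequence pairs.

    aligned_pairs: iterable of (seq1, seq2), equal-length and gap-free.
    Returns a dict mapping (x, y) -> log2(q_xy / (p_x * p_y)) for every
    pair of states observed in some alignment column.
    """
    pair_counts = Counter()
    for s1, s2 in aligned_pairs:
        if len(s1) != len(s2):
            raise ValueError("aligned sequences must have equal length")
        for x, y in zip(s1, s2):
            # Count both orders so the resulting matrix is symmetric.
            pair_counts[(x, y)] += 1
            pair_counts[(y, x)] += 1
    total = sum(pair_counts.values())
    q = {pair: n / total for pair, n in pair_counts.items()}  # joint frequencies
    p = Counter()                                             # marginal frequencies
    for (x, _), freq in q.items():
        p[x] += freq
    return {(x, y): math.log2(q[(x, y)] / (p[x] * p[y])) for (x, y) in q}


if __name__ == "__main__":
    # Hypothetical data: one short pairwise alignment.
    scores = log_odds_matrix([("AACC", "AACA")])
    for pair in sorted(scores):
        print(pair, round(scores[pair], 3))
```

Positive scores mark pairs that are aligned more often than chance would predict, and negative scores mark pairs that are aligned less often. A practical implementation would need far more data and some handling of pairs that are never observed.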