Nucleic acid notation

The nucleic acid notation currently in use was first formalized by the International Union of Pure and Applied Chemistry (IUPAC) in 1970. This universally accepted notation uses the Roman characters G, C, A, and T to represent the four nucleotides commonly found in deoxyribonucleic acid (DNA). Given the rapidly expanding role of genetic sequencing, synthesis, and analysis in biology, some researchers have developed alternative notations to further support the analysis and manipulation of genetic data. These notations generally exploit size, shape, and symmetry to accomplish these objectives.

Degenerate base symbols in biochemistry are an IUPAC representation for a position in a DNA sequence that can have multiple possible alternatives. They should not be confused with non-canonical bases, because each particular sequence will in fact have one of the regular bases at that position. Degenerate symbols are used to encode the consensus sequence of a population of aligned sequences, for example in phylogenetic analysis to summarise multiple sequences into one, or in BLAST searches, even though IUPAC degenerate symbols are masked there (as they are not coded).

Under the commonly used IUPAC system, nucleobases are represented by the first letters of their chemical names: guanine, cytosine, adenine, and thymine. This shorthand also includes eleven "ambiguity" characters, one for every possible combination of two or more of the four DNA bases. The ambiguity characters were designed to encode positional variations in order to report DNA sequencing errors, consensus sequences, or single-nucleotide polymorphisms. The IUPAC notation, including ambiguity characters and suggested mnemonics, is shown in Table 1.

Despite its broad and nearly universal acceptance, the IUPAC system has a number of limitations, which stem from its reliance on the Roman alphabet. The poor legibility of upper-case Roman characters, which are generally used when displaying genetic data, may be chief among these limitations. The value of external projections in distinguishing letters has been well documented.
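The ambiguity characters described above can be illustrated with a small lookup table. The following is a minimal Python sketch, not a reference implementation: the letter assignments follow the standard IUPAC nucleotide codes, while the names `IUPAC`, `SYMBOL_FOR`, `consensus`, and `expand` are hypothetical helpers introduced only for this example. It assumes upper-case, ungapped, equal-length DNA sequences that contain only A, C, G, and T.

```python
from itertools import product

# Standard IUPAC nucleotide codes mapped to the bases each symbol can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

# Reverse lookup: a set of observed bases -> the single symbol that encodes it.
SYMBOL_FOR = {frozenset(bases): symbol for symbol, bases in IUPAC.items()}


def consensus(sequences):
    """Collapse aligned sequences into one string of IUPAC symbols.

    Assumes every sequence has the same length and uses only A, C, G, T;
    any other character (for example a gap) raises KeyError.
    """
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("sequences must be non-empty and of equal length")
    # zip(*sequences) walks the alignment column by column.
    return "".join(SYMBOL_FOR[frozenset(column)] for column in zip(*sequences))


def expand(degenerate):
    """List every concrete sequence that a degenerate sequence can represent."""
    return ["".join(bases) for bases in product(*(IUPAC[s] for s in degenerate))]


if __name__ == "__main__":
    print(len(IUPAC) - 4)                        # number of ambiguity characters
    print(consensus(["ACGT", "ACGA", "ATGC"]))   # one symbol per alignment column
    print(expand("AY"))                          # concrete sequences matching "AY"
```

The table has one entry for each of the 15 non-empty subsets of the four bases: four plain letters plus the eleven ambiguity characters. The `consensus` helper mirrors how a set of aligned sequences can be summarised into one, and `expand` shows that each degenerate position still corresponds to exactly one regular base in any particular sequence.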

Related courses (3)
BIO-203: Integrated labo in Life sciences I
Over two semesters, you use molecular biology, cell biology, and biochemistry to clone a cDNA into an expression plasmid in order to produce, purify, and charact
BIO-212: Biological chemistry I
Biochemistry is a key discipline for the Life Sciences. Biological Chemistry I and II are two tightly interconnected courses that aim to describe and understand in molecular terms the processes that m
BIOENG-110: General Biology
The aim of the course is to provide a general overview of the biology of cells and organisms. We will discuss this in the context of the life of cells and organisms, with an emphasis on
Related lectures (22)
DNA: PCR and Sequencing Techniques
Explores DNA engineering through PCR, Sanger sequencing, the Human Genome Project, recombinant DNA, bacterial DNA manipulation, and CRISPR/Cas9.
Biochemistry Basics
Introduces functional groups, compounds, and reactions in biochemistry, focusing on alcohols, aldehydes, ketones, and sugars.
The Molecules of Life: Nucleic Acids
Covers the structure and functions of nucleic acids, including genetic information storage and gene expression.
Related publications (45)

Landscapes of DNA Mechanics and Genomes

Thomas Antonin Zwahlen

The local physical properties - such as shape and flexibility - of the DNA double-helix are today widely believed to be influenced by nucleic acid sequence in a non-trivial way. Furthermore, there is strong evidence that these properties play a role in many ...
EPFL, 2023

Optimised set of oligonucleotides for bulk rna barcoding and sequencing

Bart Deplancke, Vincent Roland Julien Gardeux, Riccardo Dainese, Daniel Alpern

The present invention relates generally to the field of nucleic acid sequencing and provides oligonucleotide molecules and barcodes contained therein. These oligonucleotide molecules and barcodes molecules are useful in sequencing to identify and resolve e ...
2023

Interplay of the mechanical and structural properties of DNA nanostructures determines their electrostatic interactions with lipid membranes

Maartje Martina Cornelia Bastings, Cem Tekin, Diana Kornelia Morzy, Vincenzo Caroprese

Nucleic acids and lipids function in close proximity in biological processes, as well as in nanoengineered constructs for therapeutic applications. As both molecules carry a rich charge profile, and frequently coexist in complex ionic solutions, the electr ...
ROYAL SOC CHEMISTRY, 2023
Related concepts (8)
Ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, D-ribose, is a component of the ribonucleotides from which RNA is built, and so this compound is necessary for coding, decoding, regulation and expression of genes. It has a structural analog, deoxyribose, which is a similarly essential component of DNA. L-ribose is an unnatural sugar that was first prepared by Emil Fischer and Oscar Piloty in 1891.
Directionality (molecular biology)
Directionality, in molecular biology and biochemistry, is the end-to-end chemical orientation of a single strand of nucleic acid. In a single strand of DNA or RNA, the chemical convention of naming carbon atoms in the nucleotide pentose-sugar-ring means that there will be a 5′ end (usually pronounced "five-prime end"), which frequently contains a phosphate group attached to the 5′ carbon of the ribose ring, and a 3′ end (usually pronounced "three-prime end"), which typically is unmodified from the ribose -OH substituent.
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (Non-coding RNA) or by forming a template for production of proteins (messenger RNA). RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides.