Concept

Nucleic acid notation

The nucleic acid notation currently in use was first formalized by the International Union of Pure and Applied Chemistry (IUPAC) in 1970. This universally accepted notation uses the Roman characters G, C, A, and T to represent the four nucleotides commonly found in deoxyribonucleic acids (DNA). Given the rapidly expanding role of genetic sequencing, synthesis, and analysis in biology, some researchers have developed alternative notations to further support the analysis and manipulation of genetic data. These notations generally exploit size, shape, and symmetry to accomplish these objectives.

Degenerate base symbols in biochemistry are an IUPAC representation for a position in a DNA sequence that can have multiple possible alternatives. They should not be confused with non-canonical bases, because each particular sequence will in fact have one of the regular bases at that position. Degenerate symbols are used to encode the consensus sequence of a population of aligned sequences. They appear, for example, in phylogenetic analysis, to summarise multiple sequences into one, and in BLAST searches, although there the IUPAC degenerate symbols are masked (as they are not coded).

Under the commonly used IUPAC system, nucleobases are represented by the first letters of their chemical names: guanine, cytosine, adenine, and thymine. This shorthand also includes eleven "ambiguity" characters associated with every possible combination of the four DNA bases. The ambiguity characters were designed to encode positional variations in order to report DNA sequencing errors, consensus sequences, or single-nucleotide polymorphisms. The IUPAC notation, including ambiguity characters and suggested mnemonics, is shown in Table 1.

Despite its broad and nearly universal acceptance, the IUPAC system has a number of limitations, which stem from its reliance on the Roman alphabet. The poor legibility of upper-case Roman characters, which are generally used when displaying genetic data, may be chief among these limitations. The value of external projections in distinguishing letters has been well documented.
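As an illustration of how the ambiguity characters described above encode a set of possible bases, here is a minimal Python sketch. The code table itself is the standard IUPAC one (four bases plus eleven ambiguity characters); the names IUPAC_TO_BASES, BASES_TO_IUPAC, consensus, and matches are hypothetical and chosen only for this example. The consensus function assumes gap-free sequences of equal length and reports every base observed in a column, which is one simple convention among several (real tools often apply a frequency threshold instead).

```python
# Standard IUPAC nucleotide codes, each mapped to the bases it can stand for.
IUPAC_TO_BASES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

# Reverse lookup: a set of bases mapped back to its single IUPAC symbol.
BASES_TO_IUPAC = {frozenset(bases): code for code, bases in IUPAC_TO_BASES.items()}


def consensus(aligned):
    """Summarise aligned sequences into one string of IUPAC symbols.

    Assumption for this sketch: sequences are gap-free, equally long,
    and every base seen in a column is kept (no frequency threshold).
    """
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must have equal length")

    symbols = []
    for column in zip(*aligned):
        observed = frozenset()
        for symbol in column:
            # An ambiguity symbol in the input contributes all of its bases.
            observed |= frozenset(IUPAC_TO_BASES[symbol.upper()])
        symbols.append(BASES_TO_IUPAC[observed])
    return "".join(symbols)


def matches(pattern, sequence):
    """Check whether a plain A/C/G/T sequence fits a degenerate pattern."""
    if len(pattern) != len(sequence):
        return False
    return all(
        base.upper() in IUPAC_TO_BASES[code.upper()]
        for code, base in zip(pattern, sequence)
    )


if __name__ == "__main__":
    summary = consensus(["ACGT", "ACGA", "GCGC"])
    print(summary)                   # RCGH
    print(matches(summary, "GCGT"))  # True
    print(matches(summary, "CCGT"))  # False
```

In the example, the first column contains A and G, so it is summarised as R (purine), and the last column contains T, A, and C, so it becomes H (not G). Each individual sequence still carries one regular base at every position, which is the distinction from non-canonical bases drawn in the text.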

Official source

https://en.wikipedia.org/wiki/Nucleic_acid_notation

Related courses (1)

BIO-212: Biological chemistry I

Biochemistry is a key discipline for the Life Sciences. Biological Chemistry I and II are two tightly interconnected courses that aim to describe and understand in molecular terms the processes that m

Related lectures (10)

DNA: PCR and Sequencing Techniques

Explores DNA engineering through PCR, Sanger sequencing, the Human Genome Project, recombinant DNA, bacterial DNA manipulation, and CRISPR/Cas9.

Molecular Structure of Nucleic Acids

Explains the molecular structure of nucleic acids, covering nucleotides, pentose sugars, base composition, polymerization, and 3D helix formation.

Biochemistry Basics

Introduces functional groups, compounds, and reactions in biochemistry, focusing on alcohols, aldehydes, ketones, and sugars.

Related publications (25)

Stimulus-responsive assembly of nonviral nucleocapsids

Angela Steinauer

Controlled assembly of a protein shell around a viral genome is a key step in the life cycle of many viruses. Here we report a strategy for regulating the co-assembly of nonviral proteins and nucleic acids into highly ordered nucleocapsids in vitro. By fus ...

Nature Portfolio, 2024

Landscapes of DNA Mechanics and Genomes

Thomas Antonin Zwahlen

The local physical properties - such as shape and flexibility - of the DNA double-helix are today widely believed to be influenced by nucleic acid sequence in a non-trivial way. Furthermore, there is strong evidence that these properties play a role in many ...

EPFL, 2023

Optimised set of oligonucleotides for bulk RNA barcoding and sequencing

Bart Deplancke, Vincent Roland Julien Gardeux, Riccardo Dainese, Daniel Alpern

The present invention relates generally to the field of nucleic acid sequencing and provides oligonucleotide molecules and barcodes contained therein. These oligonucleotide molecules and barcodes molecules are useful in sequencing to identify and resolve e ...

2023

Related concepts (2)

Ribose

Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, D-ribose, is a component of the ribonucleotides from which RNA is built, and so this compound is necessary for coding, decoding, regulation and expression of genes. It has a structural analog, deoxyribose, which is a similarly essential component of DNA. L-ribose is an unnatural sugar that was first prepared by Emil Fischer and Oscar Piloty in 1891.

Directionality (molecular biology)

Directionality, in molecular biology and biochemistry, is the end-to-end chemical orientation of a single strand of nucleic acid. In a single strand of DNA or RNA, the chemical convention of naming carbon atoms in the nucleotide pentose-sugar-ring means that there will be a 5′ end (usually pronounced "five-prime end"), which frequently contains a phosphate group attached to the 5′ carbon of the ribose ring, and a 3′ end (usually pronounced "three-prime end"), which typically is unmodified from the ribose -OH substituent.