The nucleic acid notation currently in use was first formalized by the International Union of Pure and Applied Chemistry (IUPAC) in 1970. This universally accepted notation uses the Roman characters G, C, A, and T, to represent the four nucleotides commonly found in deoxyribonucleic acids (DNA).
Given the rapidly expanding role for genetic sequencing, synthesis, and analysis in biology, some researchers have developed alternate notations to further support the analysis and manipulation of genetic data. These notations generally exploit size, shape, and symmetry to accomplish these objectives.
Degenerate base symbols in biochemistry are an IUPAC representation for a position on a DNA sequence that can have multiple possible alternatives. These should not be confused with non-canonical bases because each particular sequence will have in fact one of the regular bases. These are used to encode the consensus sequence of a population of aligned sequences and are used for example in phylogenetic analysis to summarise into one multiple sequences or for BLAST searches, even though IUPAC degenerate symbols are masked (as they are not coded).
Under the commonly used IUPAC system, nucleobases are represented by the first letters of their chemical names: guanine, cytosine, adenine, and thymine. This shorthand also includes eleven "ambiguity" characters associated with every possible combination of the four DNA bases. The ambiguity characters were designed to encode positional variations in order to report DNA sequencing errors, consensus sequences, or single-nucleotide polymorphisms. The IUPAC notation, including ambiguity characters and suggested mnemonics, is shown in Table 1.
Despite its broad and nearly universal acceptance, the IUPAC system has a number of limitations, which stem from its reliance on the Roman alphabet. The poor legibility of upper-case Roman characters, which are generally used when displaying genetic data, may be chief among these limitations. The value of external projections in distinguishing letters has been well documented.
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Les étudiants appliquent des techniques de base en biologie moléculaire pour cloner un cDNA d'intérêt dans un plasmide d'expression afin de produire la protéine correspondante dans des cellules de mam
Biochemistry is a key discipline for the Life Sciences. Biological Chemistry I and II are two tightly interconnected courses that aim to describe and understand in molecular terms the processes that m
Le but du cours est de fournir un aperçu général de la biologie des cellules et des organismes. Nous en discuterons dans le contexte de la vie des cellules et des organismes, en mettant l'accent sur l
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally-occurring form, -ribose, is a component of the ribonucleotides from which RNA is built, and so this compound is necessary for coding, decoding, regulation and expression of genes. It has a structural analog, deoxyribose, which is a similarly essential component of DNA. -ribose is an unnatural sugar that was first prepared by Emil Fischer and Oscar Piloty in 1891.
Directionality, in molecular biology and biochemistry, is the end-to-end chemical orientation of a single strand of nucleic acid. In a single strand of DNA or RNA, the chemical convention of naming carbon atoms in the nucleotide pentose-sugar-ring means that there will be a 5′ end (usually pronounced "five-prime end"), which frequently contains a phosphate group attached to the 5′ carbon of the ribose ring, and a 3′ end (usually pronounced "three-prime end"), which typically is unmodified from the ribose -OH substituent.
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (Non-coding RNA) or by forming a template for production of proteins (messenger RNA). RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides.
The present invention relates generally to the field of nucleic acid sequencing and provides oligonucleotide molecules and barcodes contained therein. These oligonucleotide molecules and barcodes molecules are useful in sequencing to identify and resolve e ...
2023
, , ,
Nucleic acids and lipids function in close proximity in biological processes, as well as in nanoengineered constructs for therapeutic applications. As both molecules carry a rich charge profile, and frequently coexist in complex ionic solutions, the electr ...
The local physical properties - such as shape and flexibility - of the DNA double-helix is today widely believed to be influenced by nucleic acid sequence in a non-trivial way. Furthermore, there is strong evidence that these properties play a role in many ...