Coding regionThe coding region of a gene, also known as the coding sequence (CDS), is the portion of a gene's DNA or RNA that codes for protein. Studying the length, composition, regulation, splicing, structures, and functions of coding regions compared to non-coding regions over different species and time periods can provide a significant amount of important information regarding gene organization and evolution of prokaryotes and eukaryotes. This can further assist in mapping the human genome and developing gene therapy.
Fumaric acidFumaric acid is an organic compound with the formula HO2CCH=CHCO2H. A white solid, fumaric acid occurs widely in nature. It has a fruit-like taste and has been used as a food additive. Its E number is E297. The salts and esters are known as fumarates. Fumarate can also refer to the C4H2O42− ion (in solution). Fumaric acid is the trans isomer of butenedioic acid, while maleic acid is the cis isomer. It is produced in eukaryotic organisms from succinate in complex 2 of the electron transport chain via the enzyme succinate dehydrogenase.
Coenzyme Q10DISPLAYTITLE:Coenzyme Q10 Coenzyme Q is a coenzyme family that is ubiquitous in animals and most bacteria (hence its other name, ubiquinone). In humans, the most common form is coenzyme Q10 (which is also called CoQ10 (ˌkoʊkjuːˈtɛn) and ubiquinone-10. Coenzyme Q10 is a 1,4-benzoquinone, in which Q refers to the quinone chemical group and 10 refers to the number of isoprenyl chemical subunits (shown enclosed in brackets in the diagram) in its tail. In natural ubiquinones, there are from six to ten subunits in the tail.
Oxidative decarboxylationOxidative decarboxylation is a decarboxylation reaction caused by oxidation. Most are accompanied by α- Ketoglutarate α- Decarboxylation caused by dehydrogenation of hydroxyl carboxylic acids such as carbonyl carboxylic acid, malic acid, isocitric acid, etc. Pyruvate catalytic reaction catalyzed by pyruvate dehydrogenase system is a special decarboxylation method, namely oxidative decarboxylation, which is different from the common decarboxylation reaction, namely common decarboxylation.
Consensus sequenceIn molecular biology and bioinformatics, the consensus sequence (or canonical sequence) is the calculated sequence of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It represents the results of multiple sequence alignments in which related sequences are compared to each other and similar sequence motifs are calculated. Such information is important when considering sequence-dependent enzymes such as RNA polymerase.
Escherichia coliEscherichia coli (ˌɛʃəˈrɪkiə_ˈkoʊlaɪ ) is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus Escherichia that is commonly found in the lower intestine of warm-blooded organisms. Most E. coli strains are harmless, but some serotypes such as EPEC, and ETEC are pathogenic and can cause serious food poisoning in their hosts, and are occasionally responsible for food contamination incidents that prompt product recalls.
HistidineHistidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the deprotonated –COO− form under biological conditions), and an imidazole side chain (which is partially protonated), classifying it as a positively charged amino acid at physiological pH. Initially thought essential only for infants, it has now been shown in longer-term studies to be essential for adults also.
Sequence alignmentIn bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences. Aligned sequences of nucleotide or amino acid residues are typically represented as rows within a matrix. Gaps are inserted between the residues so that identical or similar characters are aligned in successive columns.
SequenceIn mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called elements, or terms). The number of elements (possibly infinite) is called the length of the sequence. Unlike a set, the same elements can appear multiple times at different positions in a sequence, and unlike a set, the order does matter. Formally, a sequence can be defined as a function from natural numbers (the positions of elements in the sequence) to the elements at each position.
Human genomeThe human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs.