Nucleic acid sequenceA nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used.
Deep homologyIn evolutionary developmental biology, the concept of deep homology is used to describe cases where growth and differentiation processes are governed by genetic mechanisms that are homologous and deeply conserved across a wide range of species. In 1822, the French zoologist Étienne Geoffroy Saint-Hilaire dissected a crayfish, discovering that its body is organised like a vertebrate's, but inverted belly to back (dorsoventrally): I just found that all the soft organs, that is to say, the principal organs of life are found in crustaceans, and so in insects, in the same order, in the same relationships and with the same arrangement as their analogues in the high vertebrate animals .
Enzyme catalysisEnzyme catalysis is the increase in the rate of a process by a biological molecule, an "enzyme". Most enzymes are proteins, and most such processes are chemical reactions. Within the enzyme, generally catalysis occurs at a localized site, called the active site. Most enzymes are made predominantly of proteins, either a single protein chain or many such chains in a multi-subunit complex. Enzymes often also incorporate non-protein components, such as metal ions or specialized organic molecules known as cofactor (e.
Structural bioinformaticsStructural bioinformatics is the branch of bioinformatics that is related to the analysis and prediction of the three-dimensional structure of biological macromolecules such as proteins, RNA, and DNA. It deals with generalizations about macromolecular 3D structures such as comparisons of overall folds and local motifs, principles of molecular folding, evolution, binding interactions, and structure/function relationships, working both from experimentally solved structures and from computational models.
Protein superfamilyA protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similarity is evident. Sequence homology can then be deduced even if not apparent (due to low sequence similarity). Superfamilies typically contain several protein families which show sequence similarity within each family.
Threading (protein sequence)In molecular biology, protein threading, also known as fold recognition, is a method of protein modeling which is used to model those proteins which have the same fold as proteins of known structures, but do not have homologous proteins with known structure. It differs from the homology modeling method of structure prediction as it (protein threading) is used for proteins which do not have their homologous protein structures deposited in the Protein Data Bank (PDB), whereas homology modeling is used for those proteins which do.
Native stateIn biochemistry, the native state of a protein or nucleic acid is its properly folded and/or assembled form, which is operative and functional. The native state of a biomolecule may possess all four levels of biomolecular structure, with the secondary through quaternary structure being formed from weak interactions along the covalently-bonded backbone. This is in contrast to the denatured state, in which these weak interactions are disrupted, leading to the loss of these forms of structure and retaining only the biomolecule's primary structure.
IsozymeIn biochemistry, isozymes (also known as isoenzymes or more generally as multiple forms of enzymes) are enzymes that differ in amino acid sequence but catalyze the same chemical reaction. Isozymes usually have different kinetic parameters (e.g. different KM values), or are regulated differently. They permit the fine-tuning of metabolism to meet the particular needs of a given tissue or developmental stage. In many cases, isozymes are encoded by homologous genes that have diverged over time.
Protein sequencingProtein sequencing is the practical process of determining the amino acid sequence of all or part of a protein or peptide. This may serve to identify the protein or characterize its post-translational modifications. Typically, partial sequencing of a protein provides sufficient information (one or more sequence tags) to identify it with reference to databases of protein sequences derived from the conceptual translation of genes. The two major direct methods of protein sequencing are mass spectrometry and Edman degradation using a protein sequenator (sequencer).
Regulatory sequenceA regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism. Regulation of gene expression is an essential feature of all living organisms and viruses. In DNA, regulation of gene expression normally happens at the level of RNA biosynthesis (transcription). It is accomplished through the sequence-specific binding of proteins (transcription factors) that activate or inhibit transcription.