Publication

Landscapes of DNA Mechanics and Genomes

Thomas Antonin Zwahlen
2023
EPFL thesis
Abstract

The local physical properties - such as shape and flexibility - of the DNA double-helix is today widely believed to be influenced by nucleic acid sequence in a non-trivial way. Furthermore, there is strong evidence that these properties play a role in many important processes, such as protein binding or nucleosome positioning. In order to address such biologically pertinent problems, we have developed mathematical and computational tools to be able to identify or predict sequences and sites in genome-sized data based on their mechanical properties. For this, we rely on the cgDNA coarse-grain model, which provides a detailed sequence dependent description of the statistical mechanics of DNA. As an application, we present a method inspired from information theory techniques to scan the genome of S. cerevisiae in search for mechanically exceptional sequences (or outliers). This method reveals a systematic bias for A/T base pair content and AA/TT dimer content in mechanical outlier sequences. Moreover, it shows an even drastically stronger preference for CG dimer content when CpG steps are methylated. Finally, a clustering analysis of exhaustive ensembles of DNA shape predictions of the cgDNA+ model reveals the importance of purine/pyrimidine content.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.
Related concepts (38)
Genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as regulatory sequences (see non-coding DNA), and often a substantial fraction of junk DNA with no evident function. Almost all eukaryotes have mitochondria and a small mitochondrial genome.
Base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA and RNA. Dictated by specific hydrogen bonding patterns, "Watson–Crick" (or "Watson–Crick–Franklin") base pairs (guanine–cytosine and adenine–thymine) allow the DNA helix to maintain a regular helical structure that is subtly dependent on its nucleotide sequence.
Genome size
Genome size is the total amount of DNA contained within one copy of a single complete genome. It is typically measured in terms of mass in picograms (trillionths (10−12) of a gram, abbreviated pg) or less frequently in daltons, or as the total number of nucleotide base pairs, usually in megabases (millions of base pairs, abbreviated Mb or Mbp). One picogram is equal to 978 megabases. In diploid organisms, genome size is often used interchangeably with the term C-value.
Show more
Related publications (81)

Development of multiplexed single-molecule imaging techniques to study chromatin invasion dynamics of chromatin effector proteins

Kristina Makasheva

DNA-binding proteins physically interact with the DNA and directly affect genomic functions. The eukaryotic genome is compacted into chromatin, limiting the DNA access to nuclear factors. In this Ph.D. thesis, I explored the dynamic mechanisms, that allow ...
EPFL2023

cgNA+: A sequence-dependent coarse-grain model of double-stranded nucleic acids

Rahul Sharma

DNA mechanics plays a crucial role in many biological processes, including nucleosome positioning and protein-DNA interactions. It is believed that nature employs epigenetic modifications in DNA to further regulate gene expression. Moreover, double-strande ...
EPFL2023

cgNA plus web : A Visual Interface to the cgNA plus Sequence-dependent Statistical Mechanics Model of Double-stranded Nucleic Acids

John Maddocks, Rahul Sharma, Alessandro Samuele Patelli

The sequence-dependent statistical mechanics of double-stranded nucleic acid, or dsNA, is believed to be essential in its biological functions. In turn, the equilibrium statistical mechanics behaviour of dsNA depends strongly both on sequence-dependent per ...
2023
Show more
Related MOOCs (6)
Neuroscience Reconstructed: Cell Biology
This course will provide the fundamental knowledge in neuroscience required to understand how the brain is organised and how function at multiple scales is integrated to give rise to cognition and beh
Neuroscience Reconstructed: Cell Biology
This course will provide the fundamental knowledge in neuroscience required to understand how the brain is organised and how function at multiple scales is integrated to give rise to cognition and beh
Neuroscience Reconstructed: Genetics and Brain Development
This course will provide the fundamental knowledge in neuroscience required to understand how the brain is organised and how function at multiple scales is integrated to give rise to cognition and beh
Show more

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.