Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
Sequence dependent mechanics of DNA is believed to play a central role in the functioning of the cell through the expression of genetic information. Nucleosome positioning, gene regulation, DNA looping and packaging within the cell are only some of the processes that are believed to be at least partially governed by mechanical laws. Therefore it is important to understand how the sequence of DNA affects its mechanical properties. For exploring the mechanical properties of DNA, various discrete and continuum models have been, and continue to be, developed. A large family of these models, including the model considered in this work, assume that bases or base pairs of DNA are rigid bodies. The most standard are rigid base pair models, with parameters either obtained directly from experimental data or from Molecular Dynamics (MD) simulations. The drawback of current experimental data, such as crystal structures, is that only small ensembles of configurations are available for a small number of sequences. In contrast, MD simulations allow a much more detailed view of a larger number of DNA sequences. However, the drawback is that the results of these simulations depend on the choice of the simulation protocol and force field parameters. MD simulations also have sequence length limitations and are currently too intensive for (linear) molecules longer than a few tens of base pairs. The only way to simulate longer sequences is to construct a coarse-grain model. The goal of this work is to construct a small parameter set that can model a sequence- dependent equilibrium probability distribution for rigid base configurations of a DNA oligomer with any given sequence of any length. The model parameter sets previously available were for rigid base pair models ignoring all the couplings beyond nearest neighbour interactions. However it was shown in previous work, that this standard model of rigid base pair nearest neighbour interactions is inconsistent with a (then) large scale MD simulation of a single oligomer [36]. In contrast we here show that a rigid base nearest neighbour, dimer sequence dependent model is a quite good fit to many MD simulations of different duration and se- quence. In fact a hierarchy of rigid base models with different interaction range and length of sequence-dependence is discussed, and it is concluded that the nearest neighbour, dimer based model is a good compromise between accuracy and complexity of the model. A full parameter set for this model is estimated. An interesting feature is that despite the dimer dependence of the parameter set, due to the phenomenon of frustration, our model predicts non local changes in the oligomer shape as a function of local changes in the sequence, down to the level of a point mutation.