Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
The ever-increasing availability of transcriptomic and metabolomic data can be used to deeply analyze and make ever-expanding predictions about biological processes, as changes in the reaction fluxes through genome-wide pathways can now be tracked. Currently, constraint-based metabolic modeling approaches, such as flux balance analysis (FBA), can quantify metabolic fluxes and make steady-state flux predictions on a genome-wide scale using optimization principles. However, relating the differential gene expression or differential metabolite abundances in different physiological states to the differential flux profiles remains a challenge. Here we present a novel method, named REMI (Relative Expression and Metabolomic Integrations), that employs genome-scale metabolic models (GEMs) to translate differential gene expression and metabolite abundance data obtained through genetic or environmental perturbations into differential fluxes to analyze the altered physiology for any given pair of conditions. REMI allows for gene-expression, metabolite abundance, and thermodynamic data to be integrated into a single framework, then uses optimization principles to maximize the consistency between the differential gene-expression levels and metabolite abundance data and the estimated differential fluxes and thermodynamic constraints. We applied REMI to integrate into the Escherichia coli GEM publicly available sets of expression and metabolomic data obtained from two independent studies and under wide-ranging conditions. The differential flux distributions obtained from REMI corresponding to the various perturbations better agreed with the measured fluxomic data, and thus better reflected the different physiological states, than a traditional model. Compared to the similar alternative method that provides one solution from the solution space, REMI was able to enumerate several alternative flux profiles using a mixed-integer linear programming approach. Using this important advantage, we performed a high-frequency analysis of common genes and their associated reactions in the obtained alternative solutions and identified the most commonly regulated genes across any two given conditions. We illustrate that this new implementation provides more robust and biologically relevant results for a better understanding of the system physiology.
Didier Trono, Evaristo Jose Planet Letschert, Julien Léonard Duc, Alexandre Coudray, Julien Paul André Pontis, Delphine Yvette L Grun, Cyril David Son-Tuyên Pulver, Shaoline Sheppard
Vassily Hatzimanikatis, Georgios Fengos, Maria Masid Barcon, Daniel Robert Weilandt, Zhaleh Hosseini, Pierre Guy Rémy Salvy