Systems biology approaches to understanding the Arabidopsis hypersensitive response

 

Vikas Agrawal1, Chu Zhang1, Prasad S. Dhurjati2 and Allan D. Shapiro1

 

1Department of Plant and Soil Sciences, 2Department of Chemical Engineering, University of Delaware, Newark, DE

 

Motivation  Cross-talk and feedback regulation in signal transduction networks are often quite complex. The signaling network governing Arabidopsis hypersensitive response to avirulent Pseudomonas syringae was analyzed using a systems biology approach. A systems approach uses quantitative data collection and mathematical modeling to understand higher order relationships governing network behavior. The goal was to create a framework for efficient design of experiments by simulating the time course progression of the most important components of the response in silico. These predictions were then compared to experimental data for validation (Agrawal, et al., Biotechnology Progress 2003).

Strategy  The most important components of the response were taken as model variables. These included death of individual cells (PCD), salicylic acid (SA) and reactive oxygen (ROS) accumulation, and level of apoplastic superoxide dismutase (SOD) (Zhang, et al., Molecular Plant-Microbe Interactions, in press). Initial estimates of kinetic parameters and time delays were made from experimental data and subsequently refined by global fitting of simulated to experimental data. This process resulted in a system of ten delay differential equations governed by expert-system type rules. This system was solved numerically using engineering software (MATLAB). A one-to-one correspondence was maintained between model variables and specific signaling components. The mathematical forms for relationships between model variables also corresponded one-to-one with experimentally observed relationships between signaling components. As such, the assumptions questioned by new data can be readily identified.

Results  1) In silico simulations of the time course of changes in levels of salicylic acid, PCD and hydrogen peroxide match experimental data. 2) We have used the model to prove that direct negative autoregulation of salicylic acid biosynthesis does not exist in this system. Including terms for this extra negative feedback loop made it impossible for simulated data to match experimental results. 3) Simulations also determined that NPR1-dependent negative feedback on PCD cannot affect the fraction of total PCD seen late in HR progression as a direct consequence of high levels of superoxide. 4) The dynamic profiles of apoplastic superoxide dismutase (SOD) activity and two putative gene induction events have been predicted. 5) ÒSensitivityÓ analysis has been used to predict which model components have the most significant influence on overall system dynamics. These predictions will aid in design of further experiments to test our knowledge of control of the HR.

Our current strategy to improve the explanatory power of the model via explicit modeling of cell-to-cell signaling events using population balance modeling will be described.

 

 

Comparison of large scale gene expression measurement technologies

 

Vikas Agrawal, Hassan Ghazal and Blake C. Meyers

 

Department of Plant and Soil Sciences, University of Delaware. Newark, DE

 

We are in the process of comparing four large scale gene expression technologies, based on expression data from more than 10 tissues/conditions in the model plant Arabidopsis thaliana. Data comprising 40 Agilent arrays, 40 Affymetrix arrays and 63 MPSS sequencing runs was obtained using the same mRNA samples for each library. Qiagen/Operon data is forthcoming.

Agilent technology is known to be more reproducible in technical replicates than MPSS than Affymetrix in turn. We have shown that MPSS technology has a dynamic range about 5 fold larger than that of Agilent and 10 times that of Affymetrix. Via global analysis of the data across the libraries we found that MPSS often underestimates the expression levels (relative to the other two) while Agilent overestimates them or vice versa.

We identified 45 genes that were within a 2-fold range in all technologies and all libraries. There were more than 2000 genes that were measured within a factor of 2 across all technologies in each specific library. The overall correlations across technologies for specific libraries were in the range of 0.65-0.75. The measurements of differential expression had a poorer correlation across technologies. There were 6 genes that were expressed above a concentration of 500 transcripts per million (TPM), 592 genes expressed below 10 TPM and 85 genes expressed below 4 TPM in all libraries as measured by all technologies.

Thus we have examined issues of reproducibility, correlation, dynamic range and estimation of the absolute level of transcripts. We are conducting QRT-PCR measurements that will place these comparisons on a stronger foundation. There are issues of the cost to benefit ratios that we have not presented in this study.