Subtractive Comparisons of Gene Expression in Defense-Related EST Libraries

The soybean EST database is a very large one and contains a wide range of expression information (from ca. 80 different libraries).  Several of the soybean EST libraries of particular relevance to the work proposed here include those for hypocotyl response to an incompatible race of P. sojae (Gm-c1084), cotyledon response to the glucan elicitor (Gm-c1076) and leaf response to an incompatible infection with Psg (Gm-c1074).

To complement our cataloging of soybean defense genes and results from preliminary analyses, we undertook a thorough comparative analysis of the defense-related libraries at the TIGR Soybean Gene Index.  Analyses available at TIGR include electronic comparisons of expression in two libraries of choice. The output of such an analysis is analogous to data generated in an experimental expression array. Thus, by comparing expression in the defense-related libraries to suitable control libraries, we were able to essentially do a subtraction electronically and define those genes selectively up-regulated by the treatment. To rule out false positives, we limited our search to those genes that were expressed at least at the 0.1% level and which were up-regulated when compared to the appropriate control library with a confidence of at least 95% or a "Group R Value" of at least 4.0. While hits in these analyses included genes for the phenylpropanoid pathways, due to our extensive characterization of these elsewhere, we have excluded them from this description. Within these various constraints, the analyses led to the definition of over 40 genes that were selectively up-regulated in one or all three of the defense-related libraries (Table 1). Expression is given as a percent of all ESTs in that library. The most significant increases in expression are shown in red, those with an intermediate increase in expression in green and those with no statistical difference from controls in grey. The expression levels marked with an asterisk denote responses that were the highest (ranked either 1 or 2) in this library when compared to all other (ca. 80) EST libraries.

While some of the genes in Table 1 are already on our list of genes of immediate interest, several potentially new candidates were suggested.

Note added December, 2004:  Please note that since this table was developed, the TC designations for many of the genes listed here have changed.  Searching for them at the TIGR database will give you their new designations.

Table 1. Electronic Expression Analysis of Up-regulated Genes in Defense-Related EST Libraries

Tentative

Contig

Type Gm-c1076 Gm-c1084 GM-c1074 Tentative

Contig

Type Gm-c1076 Gm-c1084 Gm-c1074
TC119828 PR-2 0.27 0.19 0.05 TC119617 Redox 0 0.02 0.15
TC135699 PR-2 0 0 0.15* TC132764 PCD 0.13* 0.02 0.15*
TC120222 PR-5 0.04 0 0.75* TC133073 PCD 0.02 0.23* 0.05
TC133074 PR-6 0.47* 0 0 TC119277 SIG 0.16 0.09 0.15
TC119739 PR-7 0.31 0.60* 0.10 TC119300 SIG 0.11 0 0.15
TC132334 PR-9 0.16 0.21* 0 TC132909 SIG 0.07 0.21* 0.15*
TC132128 PR-10 1.90* 0.34 0.61* TC120975 SIG 0 0.15* 0
TC132130 PR-10 0.51* 0.04 0.10* TC121307 STR 0.40* 0.04 0
TC132126 PR-10 0.42* 0.15 0.39 TC132755 TF 0.07 0.17* 0
TC132038 PR-12 0.04 0.58* 0 TC133162 TF 0.04 0.02 0.17
TC132042 PR-L 0.80* 0.02 0.02 TC119441 Wall 1.69* 0.19 0.19
TC119403 PR-L 0.16 1.00 0.02 TC131999 Wall 1.36* 0 0.05
TC131777 PR-L 0.02 0.23 0 TC119549 Wall 1.18* 0 0
TC120737 PR-L 0 0.19* 0 TC119541 Wall 1.07* 0.06 0.19
TC131708 Redox 0.65* 0 0.05 TC119067 Wall 0.31 0.23 0
TC119575 Redox 0.04 0.13 0.17* TC119174 Wall 0.25 0.3 0
TC133189 Redox 0.02 0.15* 0 TC119687 Wall 0.25 0 0
TC132499 Redox 0 0.85* 0 TC119062 Wall 0 0.55* 0.05
TC132488 Redox 0 0.23 0 TC119359 UNK 0.40 0 0.29

Abbreviations: PR, pathogenesis-related protein genes; PR-L, other PR-like protein genes; Redox, redox related genes; PCD, genes related to programmed cell death; SIG, signaling genes; STR, signal transduction genes; TF, transcription factors; Wall, cell wall related genes; UNK, unknown function. Control libraries used for comparison were: Gm-c1032 for Gm-c1076 (glucan elicitor), Gm-c1045 for Gm-c1084 (P. sojae infected), Gm-c1037 for Gm-c1074 (Pseudomonas infected).