Characterization of the Arabidopsis proteome by fluorescent protein tagging

 

Amitabh Mohanty1, Vitaly Citovsky2, Georgia Drakaki4, David Ehrhardt3, David Jackson1, Shijun Li3, Brigitte Paap2, Natasha Raikhel4, Sue Rhee3, Narasimha Chary Samboju4 and Guo-wei Tian2.

 

1Cold Spring Harbor Laboratory, Cold Spring Harbor, NY; 2Department of Biochemistry and Cell Biology, State University of New York, Stony Brook, NY; 3 Department of Plant Biology, Carnegie Institution of Washington, Stanford, CA; 4Department of Botany and Plant Sciences, University of California, Riverside, CA

 

Arabidopsis has 25,498 predicted genes, however, only about 70% of the genes have been assigned to any functional category, and less than 10% of the genes have been verified experimentally. There are only very limited reports of systematic analysis of genes of unknown function. Moreover, for the majority of the annotated proteins, the sub-cellular location and/or expression pattern is unknown. Thus, a large gap remains in our understanding of the function of a very significant portion of Arabidopsis gene products, many of which are plant specific and likely play fundamental roles in plant biology. Our consortium aims to fill this gap by systematically analyzing a large number of the functionally unassigned genes by Fluorescent Tagging of their Full-Length Protein products (FTFLP). We plan to characterize the Arabidopsis proteome by seeking three specific aims: 1. Selection and sub-cloning of Arabidopsis genes of unknown function with their potential native regulatory sequences. 2. Fluorescent tagging (using Yellow or Cyan fluorescent proteins) of the genes and insertion into Arabidopsis plants. 3. Analysis of the expression patterns and sub-cellular localization of the YFP/CFP-tagged proteins in planta. Based on the most recent Arabidopsis genome annotations, more than 8,000 genes are annotated as "unknown" or "putative" protein. We have selected a short list of ~800 for initial characterization.

As a pre-requisite we have constructed a reference localization map with ~20 Arabidopsis proteins with known sub-cellular localization patterns. This map will serve as a reference for co-localization studies. For each full-length gene, including upstream and downstream native regulatory sequences are being amplified by PCR from genomic DNA. We use these PCR fragments together with YFP or CFP fragments as templates for a second round of PCR (Triple template overlap PCR) for the insertion of the tag into the gene. Under the control of native promoters, the expression levels of some proteins could be too weak for visualization, so we are also using vectors with multiple copies of the 35S enhancer, to enhance the native expression of target genes. Our recent data from this project will be presented.