High resolution physical
mapping of the maize genome and sequencing a part thereof
Arvind K. Bharti1, Fusheng Wei2,
Ed Butler2, Yeisoo Yu2, Jose L. Goicoechea2, HyeRan
Kim2, Galina Fuks1, Will Nelson3, Jamie
Hatfield3, Heidrun Gundlach4, Wojciech M. Karlowski4,
Christina Raymond5, Sarah Towey5, David Jaffe5,
Chad Nusbaum5, Bruce Birren5, Klaus Mayer4,
Cari Soderlund3, Rod A. Wing2 and Joachim Messing1
1The Plant Genome
Initiative at Rutgers, Waksman Institute, Rutgers University, Piscataway, NJ; 2Arizona
Genomics Institute, 3Arizona Genomics Computational Laboratory, University
of Arizona, Tucson, AZ; 4Munich Information Center for Protein
Sequences, Institut fr Bioinformatik, Neuherberg,
Germany; 5MIT/Broad Institute Center for Genome Research, Cambridge,
MA
Due to its economic
significance, maize (>2 Gb) is likely to be the next cereal to be sequenced
after rice (0.4 Gb). Sequencing the maize genome will present a new challenge
not only because it is 5-times larger but also because it contains many gene
families, tandemly arrayed and nested repeat sequences. To establish a
framework to sequence maize, we are developing a high information-content
fingerprinting (HICF)/BAC sequence tagged connector (STC)-based physical map.
This map will be fully integrated with the genetic map and is complementary to
previous work by the NSF funded Maize Mapping Project (www.genome.arizona.edu/fpc/maize). MMP generated a genetically anchored BAC-based physical
map by fingerprinting three deep coverage BAC libraries (HindIII, EcoRI, and MboI) using a lower resolution ŇagaroseÓ method. The HICF
method builds upon the MMP physical map by generating HICF fingerprints of the
identical libraries, which allows for a more precise determination of a minimum
tiling path of BAC clones across the entire genome. The current success rate of
HICF is ~86%, which has already yielded 382,696 successful fingerprints.
Preliminary investigation suggests that many HICF mega-contigs are likely to
join two or more agarose-based contigs and thus should result in the reduction
of the total number of contigs. Additional contigs are expected to be anchored
based on alignments of STCs to the rice genome. To evaluate the MTP generated by
the HICF/agarose physical map, 56 BACs are being sequenced from a ~5 Mb
mega-contig, in addition to 100 randomly selected BACs (>10 Mb). All sequence information generated from this project,
from both BAC ends and whole BACs, is being annotated, which will provide
invaluable data on the sequence organization of the maize genome.