Www.genome.clemson.edu HICF-based physical mapping of Mimulus guttatus and M. lewisii Anna Blenda 1, John Willis 2, Todd Vision 3, Eric Fang 1, Barbara.

Slides:



Advertisements
Similar presentations
Sequencing the Maize Genome Maize Genome Sequencing Consortium
Advertisements

Maize Genetics, Genomics, Bioinformatics workshop
Cacao BAC Library Construction and Physical Mapping.
Fingerprinting and Markers for Floral Crop Improvement James W. Moyer Dept. of Plant Pathology North Carolina State University, Raleigh, NC
Recombinant DNA Technology
Lettuce genetic map viewer is written in PHP and uses GD library. The viewer interacts with tables in the relational mySQL database and creates graphical.
DNA polymorphisms Insertion-deletion length polymorphism – INDEL Single nucleotide polymorphism – SNP Simple sequence repeat length polymorphism – mini-
Bio 261 September 30, 2003 Todd Vision Genetic Mapping “Speciation QTL” in Mimulus From QTL to an understanding the evolution of species differences at.
Overview of research program Todd Vision
9 Genomics and Beyond Brief Chapter Outline
Bioinformatics for the Canadian Potato Genome Project David De Koeyer, Martin Lagüe and Rebecca Griffiths Wageningen September 18, 2004.
Recombinant DNA Introduction to Recombinant DNA technology
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Genetic and physical maps around the sex-determining M- locus of the dioecious plant asparagus Telgmann-Rauber et al
8/22/03 CS RA fair Comparative genome mapping Todd Vision Department of Biology University of North Carolina at Chapel Hill.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Mouse Genome Sequencing
Genetic Map to Physical Map This activity is intended to supplement the workshop session entitled “Integrating the Genetic and Physical Maps of Maize”.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.
Genome Annotation and Databases Genomic DNA sequence Genomic annotation BIO520 BioinformaticsJim Lund Reading Ch 9, Ch10.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Screening a Library Plate out library on nutrient agar in petri dishes. Up to 50,000 plaques or colonies per plate.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Tomato Overgo Project and Seed BAC Selection Cornell Team Ying Eileen Wang, 2005 PAG.
Information System for Comparative Analysis of Legume Genomes Anita Dalwani Advisors: Dr. Roger Innes, Dr. Haixu Tang.
Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy, Don Jones, Dorrie Main.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Revision – Concept map.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Theobroma cacao Integrated Physical and Genetic Map 2 BAC Libraries 250 Genetic Markers.
© 2010 by The Samuel Roberts Noble Foundation, Inc. 1 The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA 2 National Center.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
INTRODUCTION ● Expressed sequence tags offer a low cost approach to gene discovery ● For a range of non-model organisms, ESTs represent the only sequence.
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino 2, A.
Bioinformatics Scheme of the sequencing project (Martínez & Figueras, 2007) Construction Bookseller Bases determination Fragments assembly Gene search.
Applied Bioinformatics Week 5. Topics Cleaning of Nucleotide Sequences Assembly of Nucleotide Reads.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Human Genome.
Molecular Biology II Lecture 1 OrR. Restriction Endonuclease (sticky end)
Development of a Cotton Marker Database (CMD) for Gossypium genome and genetic research CMD Main Goals Collect and integrate.
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
What is BLAST? Basic BLAST search What is BLAST?
Plan A Topics? 1.Making a probiotic strain of E.coli that destroys oxalate to help treat kidney stones in collaboration with Dr. Lucent and Dr. VanWert.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Improvement of SSR Redundancy Identification by Machine Learning Approach Using Dataset from Cotton Marker Database Pengfei Xuan 1,2, Feng Luo 2, Albert.
The Bovine Genome Sequence: potential resources and practical uses. Nicola Hastings, Andy Law and John L. Williams * * Department of Genetics and Genomics,
What is BLAST? Basic BLAST search What is BLAST?
Fall HORT6033 Molecular Plant Breeding
Virginia Commonwealth University
GENETIC MARKERS (RFLP, AFLP, RAPD, MICROSATELLITES, MINISATELLITES)
Basics of BLAST Basic BLAST Search - What is BLAST?
Section 3: Gene Technologies in Detail
Introduction to Sequencing
Presentation transcript:

HICF-based physical mapping of Mimulus guttatus and M. lewisii Anna Blenda 1, John Willis 2, Todd Vision 3, Eric Fang 1, Barbara Blackmon 1, Jeanice Troutman 1, David Henry 1, Stephen Ficklin 1, Michael Atkins 1 and Jeff Tomkins 1 1 Clemson University Genomics Institute, Clemson, SC, 29634, USA 2 Dept. Biology, Duke University, Durham, NC, 27708, USA 3 Dept. Biology, University of North Carolina, Chapel Hill, NC, 27599, USA This project is supported by NSF-FIBR grant No and NSF-MRI grant No This project is supported by NSF-FIBR grant No and NSF-MRI grant No Mimulus guttatus (Monkeyflower) Mimulus lewisii (Pink Monkeyflower) Genomic regions conferring reproductive barriers and adaptive differences between species in the genus Mimulus, a well- developed ecological plant model, have been identified by QTL mapping in two different populations. Further progress on understanding the molecular genetic basis of adaptation and speciation in Mimulus requires comparative genomic tools. Comprehensive structural, functional and comparative studies of a genome are increasingly dependent upon the availability of integrated physical/genetic maps. Anchoring molecular markers to specific large-insert DNA clones (BACs) associated with various traits, particularly QTLs, is an important objective in developing a physical framework. The high information content fingerprinting (HICF) technique, based on 5-enzyme digestion, ABI SNaPshot labeling kit and ABI3730 capillary electrophoresis, was used to construct physical maps for two Mimulus species; M. guttatus and M. lewisii. Physical map data for both species is available at To integrate the genetic and physical maps, cDNA-derived markers were anchored to all three Mimulus BAC libraries by overgo hybridization using a high-throughput probe pooling approach. Physical Mapping Status. A total of 56,624 (80%) clones from both M. guttatus BAC libraries were used for contig construction after filtering the data set in the GeneMapper and GenoProfiler programs. Prior to FPC assembly, Mimulus libraries were screened with chloroplast and mitochondrial DNA probes and the non-nuclear organelle BAC clones removed from the data set. The contig assembly produced 2,535 contigs containing 38,337 clones (68% of total clones used for FPC) with an average Sulston score of Similarly, 33,163 (91%) clones were used for construction of the M. lewisii physical map of 1,913 contigs. The integrated genetic/physical map is available at Chromosome 6 Marker info Genetic map display FPC contigs display References Lazo GR, Lui N, Gu YQ, Kong X, Coleman-Derr D, Anderson OD. Hybsweeper: a resource for detecting high-density plate gridding coordinates. Biotechniques (3):320,322,324. Luo M-C, Thomas C, You FM, Hsiao J, Ouyang S, Buell CR, Malandro M, McGuire PE, Anderson OD, Dvorak J. High- throughput fingerprinting of bacterial artificial chromosomes using the SNaPshot labeling kit and sizing of restriction fragments by capillary electrophoresis Genomics 82: Nelson WM, Bharti AK, Butler E, Wei F, Fuks G, Kim H, Wing RA, Messing J, Soderlund C. Whole-genome validation of high- information-content fingerprinting. Plant Physiol (1): Methods and Procedures. Fingerprinting pipeline was used for data processing and analysis, which incorporates the use of the following software applications: ABI Data Collection v2.0, ABI GeneMapper v3.7 (software packages for ABI DNA sequencers), GenoProfiler v1.1, and FPC v CUGI uses the publicly available phred and crossmatch programs to perform base calling and to remove residual vector from sequences. We use our in-house scripts to filter low quality sequences from the final data sets. Sequences with at least 100 base pairs with a phred value of 20 or higher and less than 5% ambiguous bases are considered successful. All others are removed from the dataset. The final BESs submitted to Genbank contain only the longest contiguous non-vector sequence of bases with low quality bases trimmed off both the beginning and end of the sequence. Accession numbers: M. guttatus BAC-end sequences were submitted to the Genbank under accession numbers ED ED806636, M. lewisii BAC-end sequences were submitted to the Genbank under accession numbers ED ED Integration of Genetic and Physical Map Data. Hybridization of genetically mapped ESTs (containing SSRs) onto the fingerprinted Mimulus BAC libraries is being done via overgo probe pooling (5 x 5 x 5 design). The HybSweeper program (Lazo et al., 2005) is used for high-throughput hybridization data scoring and the data is then de-convoluted using a PERL script. At present, 523 and 376 ESTs have been anchored to the M. guttatus and M. lewisii physical maps, respectively. Using the FPC contig assemblies, ~5,000 clones for each species were selected and re-arrayed into minimum tile libraries for comparative studies and BAC-end sequenced (BES). BES data is being used to develop additional genetic markers (SSRs). Goal = 1,000 STS markers anchored to both M. guttatus and M. lewisii physical maps. M. guttatus HICF raw data after GeneMapper processing Overgo design scheme Overgo hybridization scheme Pool 1 Pool 2 Pool 3 Pool 4 Pool 5 Overgo probe pooling design (5 x 5 x 5) used to hybridize STS markers onto Mimulus BAC libraies CUGI website Mimulus project pages M. guttatus WebFPC v. 2.1 FPC contig with hybridization data HybSweeper HybSweeper program for high-throughput hybridization data scoring Mimulus BES Homology Blast and SSR Analysis Results. To gain a glimpse into the Mimulus genome and to provide a resource for other researchers, the successful BESs were queried against the Swiss-Prot protein database, NCBI's non-redundant protein database, and MIPS Arabidopsis database. The results of the December, 2006 homology searches are provided in the form of downloadable Excel spreadsheets. Alternatively, researchers are able to perform BLAST or FASTA homology searches of their own sequences against the Mimulus publicly available BES library through CUGI's online FASTA and BLAST servers. In-house scripts were used to mine SSRs in the Mimulus BESs (Table 1), and the Primer3 program for locating primers. The resulting SSRs are defined as dinucleotides (motifs with 2 bp), trinucleotides (motifs with 3 bp), tetranucleotides (motifs with 4 bp), pentanucleotides (motifs with 5 bp) and hexanucleotides (motifs with 6 bp). Only dinucleotides with at least five repeats, trinucleotides with at least four repeats, and tetra-, penta- and hexanucleotides with at least three repeats are included in the result set. Forward and reverse primers for SSRs were also generated. Primers are mostly generated for SSRs from sequences that have a GC content between 40% and 60% with at least 20 base pairs of sequence on either side of the SSR. The resulting Excel spreadsheet also indicates if the SSRs are located within a putative coding region.