Information System for Comparative Analysis of Legume Genomes Anita Dalwani Advisors: Dr. Roger Innes, Dr. Haixu Tang.

Slides:



Advertisements
Similar presentations
1.1.3 MI.
Advertisements

Lettuce genetic map viewer is written in PHP and uses GD library. The viewer interacts with tables in the relational mySQL database and creates graphical.
The IWGSC: Building the sequence-based foundation for accelerated wheat breeding Kellye A. Eversole IWGSC Executive Director & The IWGSC Cereals for Food,
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
9 Genomics and Beyond Brief Chapter Outline
Bioinformatics for the Canadian Potato Genome Project David De Koeyer, Martin Lagüe and Rebecca Griffiths Wageningen September 18, 2004.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
The Human Genome Race. Collins vs. Venter Collins Venter.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Genetic technology Unit 4 Chapter 13.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
Mouse Genome Sequencing
AP Biology Ch. 20 Biotechnology.
Trends in Biotechnology
Applied Genetics: DNA Technology & Genomics
Title: GeneWiz browser: An Interactive Tool for Visualizing Sequenced Chromosomes By Peter F. Hallin, Hans-Henrik Stærfeldt, Eva Rotenberg, Tim T. Binnewies,
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Introducing Mouse Model Archive. What is Mouse Model Archive (MMA)? A web-based platform for design and analysis, management, recording, and sharing of.
What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Probes can be designed in an evolutionary hierarchy.
Genome sequencing Haixu Tang School of Informatics.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
Chapter 13 Table of Contents Section 1 DNA Technology
DNA Technology. Overview DNA technology makes it possible to clone genes for basic research and commercial applications DNA technology is a powerful set.
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
3/24/2005 TIGP 1 Bioinformatics for Microarray Studies at IBS Pei-Ing Hwang, Ph.D. Mar. 24, 2005.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
INDIAN INITIATIVE FOR TOMATO GENOME SEQUENCING Nagendra Singh National Research Centre on Plant Biotechnology Indian Agricultural Research Institute New.
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
Chapter 7 Analyzing DNA and gene structure, variation and expression 1.Sequencing and genotyping DNA Standard/manual DNA sequencing using dideoxynucleotide.
Applied Bioinformatics Week 5. Topics Cleaning of Nucleotide Sequences Assembly of Nucleotide Reads.
Human Genome.
GENETIC ENGINEERING CHAPTER 20
Central Arizona Phoenix LTER Center for Environmental Studies Arizona State University Data Query Peter McCartney RDIFS Training Workshop Sevilleta LTER.
Center for Integrated Fungal Research
ARGOS (A Replicable Genome InfOrmation System) for FlyBase and wFleaBase Don Gilbert, Hardik Sheth, Vasanth Singan { gilbertd, hsheth, vsingan
Molecular Biology II Lecture 1 OrR. Restriction Endonuclease (sticky end)
Effects of Visualization and Interface Design on User Comprehensibility of Composite Data Asheem Chhetri, Apoorv Wairagade, Mahesh Gorantla, Hanye Xu,
IMDB: A Generic Insertional Mutagenesis Database Xiaokang Pan and Lincoln Stein Cold Spring Harbor Laboratory.
BIOL 433 Plant Genetics Term 2, Instructors: Dr. George Haughn Dr. Ljerka Kunst BioSciences 2239BioSciences Tel
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Culturable Bacterial Communities Analyzer DIANA VANESSA SARRIA-ZUNIGA ELIANA TORRES-ZELADA April 29, 2016.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
HICF-based physical mapping of Mimulus guttatus and M. lewisii Anna Blenda 1, John Willis 2, Todd Vision 3, Eric Fang 1, Barbara.
The Bovine Genome Sequence: potential resources and practical uses. Nicola Hastings, Andy Law and John L. Williams * * Department of Genetics and Genomics,
Virginia Commonwealth University
Microbial genomics.
BIOL 433 Plant Genetics Term 2,
Pre-genomic era: finding your own clones
Evolving Resistance: Research Goals and Goods
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Sequencing update of tomato chromosome 3 Chinese Academy of Sciences
BARLEX – the Barley Draft Genome Explorer
BIOL 433 Plant Genetics Term 2,
Introduction to Sequencing
Genetic Engineering Chapter 13.
Presentation transcript:

Information System for Comparative Analysis of Legume Genomes Anita Dalwani Advisors: Dr. Roger Innes, Dr. Haixu Tang

LAYOUT Motivation Participants Background Design Results/Demo Future Work

Motivation?

Motivation Goal of legume genome project -Investigate the process of genome restructuring following polyploidization in plants (soybean and its relatives in the Glycine genus) -Try answering questions like : - Genome evolution on both short( 50 million yrs) time scale - Evolution of disease resistance (R) genes.

Motivation To answer these questions: -1 Mbp syntenic genomic regions from six taxa as well as their duplicated regions in the polyploidy members (12 such regions in total) will be sequenced and analyzed. -These regions contain several important disease resistance (R) genes.

Motivation Plant species and accessionNo. of regions to be analysed Whole Genome size (megabases) G. max cultivar Williams G. max PI G. tomentella G1188 (2n=80)42083 G. tomentella race D3 (2n-40)21103 Teramnus labialus1< 700 Medicago truncatula1466

Motivation Information System -central repository for the data -stores and retrieves updated information -bioinformatics and visualization tools

Participants ParticipantsUniversity Roles Roger Innes Tom Ashfield Anita Dalwani Murali Mohan Innes Lab Indiana University, Bloomington. Principal Investigator R gene evolution Database development, Web application. Database development. Nevin Young Steve Cannon Roxanne Denny Young Lab, University of Minnesota Co-PI phylogenetic; R genes; comparative genomics. Lab Manager Jeff Doyle Bernard Pfeil Doyle Lab Cornell University Co-PI phylogenetic and polyploidy Bruce Roe Majesta Siegfried Roe Lab, Oklahoma University Co-PI Bac sequencing Saghai Maroof Milind Ratnaparkhe Jafar Mammado Maroof Lab, Virginia Tech Co-PI R genes; comparative genomics

Background Procedure 1.Create and make available Bacterial Artificial Chromosome (BAC) libraries of each species. Indexing available BAC, BAC end sequences, library, probes, vector, gel images

Background 2. Assemble syntenic BAC contigs from each library i. Strategically chosen soybean clones are used as probes

Probe 53 - ACCCGT Probe 21 - AATTC Probe 9 - GTACTT Probe 26 - AAACT Probe 1 - CCCC Probe 3 - AATC ACCCGT AATTC GTACTT AAACT CCCC AATC

ii. Individual probes are hybridized to high-density BAC filters representing all the target genomes

Background

Background iii. Integrity of contigs is confirmed by fingerprinting iv. Set of clones that hybridize to two or more probes are selected v. BACs representing the tentative minimum tiling path will be end sequenced

Probe53Probe21Probe9Probe26Probe1Probe3 Bac1 Bac2 Bac3 Bac4 Bac5 Bac6 Bac7 Bac8

Probe53Probe21Probe9Probe26Probe1Probe3 Bac2 Bac3 Bac4 Bac8

ACCCGTAATTCGTACTTAAACTCCCCAATC ACCCGTAAATC CTTCTT CCGCAATCT AATCCCCC

Background 3. DNA sequencing, Assembly, Annotation 4. Compare the content, order and sequence of gene 5. Results available for public

Importance Information System -Centrally available data -User-friendly interface for retrieving the information -Updated progress information -Tools for interpreting the results. Works as an Laboratory Management Information System

Design Steps for designing the Information System. 1.Design the Database - Data: BAC, BES, Probes, Libraries, vector, library screen hits etc.

Design - Visualize the relationship between these large amount of data. For example, Library table stores detailed information about each library used rather than having each BAC storing the library information

Design -Created tables based on these relationship Main tables used in the database are: BAC BES GEL IMAGES GENOMIC SOUTHERNS GENOTYPE LIBRARY LIBRARY SCREENS LIBRARY SCREEN HITS PRIMER PROBE PROBE WITHIN BACS VECTOR

Design

Design

Design 2. Populate the database with initial set of data -Initial set of data was stored in form of MS- Excel. -Perl script for parsing information.

Design Web Database Application -understanding the needs for the project -Web database interface - displays information about the project - add and update interface - tools for analyses

Design For determining the tiling path - Designing a Visualization tool - displays the locations of the clones with respect to probes - Probes are strategically chosen from soybean genomes

Design - Input : library name - subset of probes with at least one hit with the library are selected -BAC clones for the library are generated which have hits with probes -Probes are arranged in order of their position -BACs are mapped to these probes.

Design System Specifications -Database: Oracle 9i -Languages: PHP, Perl, HTML, JavaScript -Web Server: Apache Platform: Unix (SunOS 5.9)

Results

Future Work Comparative physical Mapping Bioinformatics tools Public interface

Acknowledgements Dr. Roger Innes Dr. Haixu Tang Dr. Sun Kim Legume genome project team

References Innes, Roger W. Comparative Analysis of Legume Genome Evolution, Proposal submitted to National Science Foundation. Tang, Haixu. Comparative physical mapping: ordering clones by cross species hybridization Dec