Expansion of the USDA-ARS Germplasm Resources Information Network (GRIN) Database to Accommodate Molecular Data Gayle Volk, Christopher Richards USDA-ARS-National.

Slides:



Advertisements
Similar presentations
Applications of genome sequencing projects 1) Molecular Medicine 2) Energy sources and environmental applications 3) Risk assessment 4) Bioarchaeology,
Advertisements

applications of genome sequencing projects
V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
the global plant genebank information management system
The National Plant Germplasm System: Current Status and Future Prospects Peter Bretting USDA/ARS Office of National Programs.
Anopheles gambiae PopGenBase Groundwork for a vector population genetics database UC Davis - UCLA.
Molecular Tools for Aiding and Breeding of Aquaculture Species Masters in Aquaculture and Fisheries Genetic and Selection 2014 Tamara Moedas nº 47853;
Peer Assessment of 5-year Performance ARS National Program 301: Plant, Microbial and Insect Genetic Resources, Genomics and Genetic Improvement Summary.
Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY.
Structural and Functional Genomics of Tomato Barone et al Tomato (Solanum Lycopersicon) – economically important crop worldwide, – intensively investigated.
The National Plant Germplasm System: Status and Prospects Peter Bretting USDA/ARS Office of National Programs.
C ONVERTING P UBLISHED D ATA TO D ESCRIPTORS ( PERMISSION / CITATION ) Clare Coyne and Dave Stout USDA Agricultural Research Service Western Regional Plant.
Dan Masiga Molecular Biology and Biotechnology Department International Centre of Insect Physiology and Ecology, Nairobi, Kenya BARCODE Data Standard The.
Reading the Blueprint of Life
Dorrie Main, Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy and Don Jones.
GRIN-Global Project the global plant genebank information management system.
GRIN-Global Project the global plant genebank information management system.
Creating a Kinship Matrix using Microsatellite Analyzer (MSA) Zhifen Zhang The Ohio State University.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Centre for Genetic Resources, the Netherlands Information Standards for Plant Genebanks Theo van Hintum Centre for Genetic Resources, The Netherlands.
Richard White Biodiversity Data. Outline Biodiversity: what is it? – Definitions: is biodiversity: A resource? Something which can be measured? How to.
Molecular Marker Evaluation Data Laura Fredrick Marek ISU/NCRPIS, Ames, IA WRPIS, Pullman, WA supporting presentations by: grape SSR informationpea SNP.
GRIN-Global a global plant genebank management system Update for ARS Administrator’s Council December 5, 2012.
Prepared for the 3rd SBB telecon 20 Mar 2012 Michele Walters, BI-01 task coordinator.
Phenotyping Clare Coyne & Melanie Harrison-Dunn, curators.
The National Plant Germplasm System: 2015 Status, Prospects, and Challenges Peter Bretting USDA/ARS Office of National Programs
Molecular marker data and their impact on gene bank management Chris Richards NCGRP, Fort Collins, CO Curator Workshop, Atlanta Georgia.
Supplementary Data for GERMINATE Tutorial: Retrieving Genetic Data from GERMINATE using the Perl-CG Interface.
GLOBAL BIODIVERSITY INFORMATION FACILITY TDWG 2009, Montpelier, November 12, 2009 Dag Endresen (NordGen)Samy Gaiji (GBIF) Dag Endresen (NordGen) & Samy.
Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy, Don Jones, Dorrie Main.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
An Efficient Method of Generating Whole Genome Sequence for Thousands of Bulls Chuanyu Sun 1 and Paul M. VanRaden 2 1 National Association of Animal Breeders,
APPLICATION OF MOLECULAR MARKERS FOR CHARACTERIZATION OF LATVIAN CROP PLANTS Nils Rostoks University of Latvia Vienošanās Nr. 2009/0218/1DP/ /09/APIA/VIAA/099.
TDWG 2006, Missouri, U.S.A. Exchange of germplasm datasets with PyWrapper/BioCASE October 16, 2006 TDWG annual Meeting 2006 Missouri Botanical Garden St.
Diversity Bioinformatics Terry Casstevens Institute for Genomic Diversity, Cornell University GMOD Meeting at NESCent Durham, NC – June 29-30, 2006.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
American Oat Workers Conference, Fargo 2006 Towards a Global Strategy for the Conservation of Oat Genetic Resources Federal Centre of Breeding Research.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
Biodiversity research and informatics in Bioversity International TDWG 2009 meeting ‘e-knowledge about Biodiversity and Agriculture’ Montpellier, 9-13.
Managing the Impacts of Change on Archiving Research Data A Presentation for “International Workshop on Strategies for Preservation of and Open Access.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Oat Molecular Markers: Status and Opportunities Howard W. Rines USDA-ARS and Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul,
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Meredith A. Lane CODATA/ERPANET Workshop: Scientific Data Selection &
Gramene: Interactions with NSF Project on Molecular and Functional Diversity in the Maize Genome Maize PIs (Doebley, Buckler, Fulton, Gaut, Goodman, Holland,
François Ancien Sascha Kretzschmann Olivier Suplis Genotyping Errors Causes, Consequences and Solutions Genotyping Errors.
Core 2: Bioinformatics NCBO-Berkeley. Core 2 Specific Aims 1.Apply ontologies  Software toolkit for describing and classifying data 2.Capture, manage,
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Data Management for Integrated Breeding
This tutorial will describe how to navigate the section of Gramene that provides descriptions of alleles associated with morphological, developmental,
Gramene V. 211 Gramene Diversity Gramene Genetic Diversity database contains SSR and SNP allelic data and passport descriptions for rice, maize and wheat.
GRIN-Global Project the global plant genebank information management system.
P.M. VanRaden and D.M. Bickhart Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA
Welcome to the NPGS Crop Germplasm Committee Chairs Virtual Meeting November 15, 2012 The meeting will begin at 1:30 pm EST.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Development of a Cotton Marker Database (CMD) for Gossypium genome and genetic research CMD Main Goals Collect and integrate.
Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng, Jodi L. Humann, Deah McGaughey, Heidi Hough, Stephen P. Ficklin, B. Todd Campbell,
Dan Born, Peter Bradbury, Ed Buckler Payan Canaran, Terry Casstevens, Dallas Kroon, Dave Matthews, Susan McCouch, Junjian Ni, Isaak Yosief Tecle, Doreen.
Progress on TripalBIMS Breeding Information Management System in Tripal Sook Jung, Taein Lee, Chun-Huai Chen, Jing Yu, Ksenija Gasic, Todd Campbell, Kate.
Introduction to Data Management Arllet M. Portugal Integrated Breeding Platform Breeding Management System Intensive Workshop on Data Management Jan. 26,
Simple-Sequence Length Polymorphisms
Fall HORT6033 Molecular Plant Breeding
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
CottonGen: An Up-to-Date Resource Enabling Genetics, Genomics and Breeding Research for Crop Improvement Plant and Animal Genome Conference XXV Jing Yu1,
the global plant genebank information management system
Relationship between Genotype and Phenotype
Access to Sequence Data and Related Information
Welcome to the Gene and Allele Database Tutorial
What is GDPC GDPC is the Genomic Diversity and Phenotype Connection.
Relationship between Genotype and Phenotype
Presentation transcript:

Expansion of the USDA-ARS Germplasm Resources Information Network (GRIN) Database to Accommodate Molecular Data Gayle Volk, Christopher Richards USDA-ARS-National Center for Genetic Resources Preservation, 1111 S. Mason St., Ft. Collins, CO Mark Bohning, Quinn Sinnott USDA-ARS-National Germplasm Resources Laboratory, Baltimore Avenue, Bldg. 003 BARC-West, Beltsville, MD MARKER CROP GENETIC OBSERVATION GENOTYPIC ASSAY LIT MARKER CITATION EVALUATION INVENTORY ACCESSION Relationships among molecular tables (not shaded) and connected pre-existing tables within GRIN (shaded). Area structure arrangement within GRIN. Genetic observation table (gob): raw data matrix (allele calls or sequence alignment) for each individual within an inventory Genetic observation number (gobno) Genotypic assay number (gano) Inventory identifier (ivid) Individual (indiv) Genetic observation (gob) Genbank link (genbank_link) Image link (image_link) Genotypic assay table (ga): Laboratory methods to collect genotypic data Genotypic assay number (gano) Marker number (mrkno) Evaluation number (eno) Method (method) Scoring method (scoring_method) Control values (control_values) Number of observed alleles (no_obs_alleles) Maximum observed alleles (max_gob_alleles) Size alleles (size_alleles) Unusual alleles (unusual_alleles) Comment (cmt) Marker table (mrk): Information about the markers used in the assay Marker nmber (mrkno) Crop number (cropno) Site code (site) Marker (marker) Synonyms (synonyms) Repeat motif (repeat_motif) Primers (primers) Standard assay conditions (assay_conditions) Range products (range_products) Genbank number (genbank_no) Map location (map_location) Position (position) Comment (cmt) Polymorphic type (poly_type) Marker Citation Table (mcit): Citation number (citno) Marker number (mrkno) Reference abbreviation (abbr) Citation title (cittitle) Author of publication (author) Citation year (cityr) Citation reference (citref) Comment (cmt) Columns within Genetic Marker/observation area of GRIN. REFERENCES Casstevens T.M. and E.S. Buckler GDPC: connecting researchers with multiple integrated data sources. Bioinformatics 20: Mowder J.D. and A.K. Stoner Information systems. Plant Breed. Rev. 7: Perry M., A.K. Stoner and J.D. Mowder Plant germplasm information management system: Germplasm Resources Information Network. HortScience 23: USDA, ARS, National Genetic Resources Laboratory Germplasm Resources Information Network - (GRIN). [Online Database] National Germplasm Resources Laboratory, Beltsville, Maryland. 15 Feb Volk G.M. and C.M. Richards Availability of genotypic data for USDA-ARS National Plant Germplasm System accessions using the Genetic Resources Information Network (GRIN) database. HortScience (in press). RATIONALE AND SCOPE USDA’s National Plant Germplasm System (NPGS) maintains the world’s largest living collection of plant genetic resources. The NPGS is tasked with acquiring, preserving, characterizing, and distributing over 450,000 accessions. Major components of the system include more than 20 active field evaluation sites, a base collection for long term storage, quarantine, taxonomy, and plant exploration units. Critical information, including passport data and phenotypic characteristics about these plant materials, is available through the Genetic Resources Information Network (GRIN) database (Mowder and Stoner 1988; Perry et al. 1988; USDA, ARS, National Genetic Resources Laboratory 2008). The new molecular tables in GRIN add the capacity to associate genotypic data with the existing phenotypic data (Volk and Richards 2008). The revised tables accommodate multiple marker types, provide raw data for individuals, accept polyploid data, provide a record of methods, standards, and control values. Presentation of data at the individual level is a new feature for GRIN; now specific seeds (or individuals) within an inventory can be genotyped and documented. The revised tables also are structured so that interoperability with other databases will be possible. The genomics community continues to generate large data sets comprised of sequence data from expressed sequence tag (EST) studies, mapping projects and fragment analysis for diverse species. The development of GRIN molecular tables serves as a resource for a great number of taxa for which there currently exists no alternative database model organism databases for sequence data. The four new tables added to GRIN accommodate specific molecular data types: amplified fragment length polymorphism, allozyme, sequence, microsatellite, restriction fragment length polymorphism and their variants. In addition, sequence-based markers will be accommodated including genomic and single nucleotide polymorphisms (SNPs). The improved capacity for holding genotypic data in GRIN has made apple, pear, blueberry, hops, hazelnut, and cacao allelic data publicly available. Additional data will be uploaded as datasets are published. It is currently possible to download tables of allelic data from GRIN. Databases must evolve to become interoperable with other genomic, genebank, and environmental (or geographical information system) databases. Interoperability allows for the combination and synthesis of data from disparate sources using middleware services (Casstevens and Buckler 2004). An example of interoperability in the biodiversity discipline is the Global Biodiversity Information Facility ( of which the NPGS is a data provider. Additionally, direct links have already been established between sequence data in GRIN and NCBI. It is also recognized that model organism and clade organism databases provide users with valuable map, marker, and genomic data. Complementary features of these databases and the passport and phenotypic data available in GRIN enhance the prospects of future interoperability.