Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY.

Slides:



Advertisements
Similar presentations
Diversity Data at MaizeGDB Ethalinda KS Cannon 1, Carson M. Andorf 2, Bremen L. Braun 2, Darwin A. Campbell 2, Mary L. Schaeffer 3,4, Cheng-Ting Yeh 5,
Advertisements

Potato Mapping / QTLs Amir Moarefi VCR
Sook Jung, Taein Lee, Stephen Ficklin, Kate Evans, Cameron Peace and Dorrie Main.
How to use GDR, the Genome Database for Rosaceae Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Jing Yu, Sushan Ru, Kate Evans, Cameron.
GDR, the Genome Database for Rosaceae, in Chado and Tripal Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Sushan Ru, Ping Zheng,
Gramene Comparative & Phylogenomics Resources for Plants Joshua C. Stein 1, William Spooner 1, Sharon Wei 1, Liya Ren 1, Doreen Ware 1,2 1 Cold Spring.
PAG 2011 TASSEL Terry Casstevens 1, Peter Bradbury 2,3, Zhiwu Zhang 1, Yang Zhang 1, Edward Buckler 1,2,4 1 Institute.
1 United States Department of Agriculture-Agriculture Research Service (USDA-ARS), U.S. Arid-Land Agricultural Research Center, North Cardon Lane,
Chapter 7: Molecular markers in breeding
Lab 13: Association Genetics. Goals Use a Mixed Model to determine genetic associations. Understand the effect of population structure and kinship on.
Intro to Genetics. We can predict what traits are possible among the offspring from certain parents by using a Punnett square. A: True B: False.
Abstract The goal of the Conifer Translational Genomics Network (CTGN) project is to provide tree breeders across the United States with new tools to enhance.
Panzea Allele Frequency Display Tutorial With the Panzea Allele Frequency Geographic Display tool, you can display pie graphs that show.
The National Plant Germplasm System: Status and Prospects Peter Bretting USDA/ARS Office of National Programs.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Chapter 17: Evolution of Populations
New Data and Functionality of GDR, the Genome Database for Rosaceae Sook Jung, Taein Lee, Stephen Ficklin, Chun-Huai Cheng, Ping Zheng, Anna Blenda, Sushan.
Update in GDR, The Genome Database for Rosaceae S Jung, T Lee, S Ficklin, CH Cheng, I Cho, P Zheng, K Evans, C Peace, N Oraguzie, A Abbott, D Layne, M.
Development of the Generation Challenge Program Ontology for Crops Elizabeth Arnaud (Bioversity International) and Rosemary Shrestha (CRIL-CIMMYT), Richard.
Dorrie Main, Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy and Don Jones.
Introduction to NRSP databases and other breeding databases.
Building Database Resources For Translational Research in Rosaceae Sook Jung, Taein Lee, Stephen Ficklin, Chun-Huai Cheng, Anna Blenda, Sushan Ru, Ping.
Expansion of the USDA-ARS Germplasm Resources Information Network (GRIN) Database to Accommodate Molecular Data Gayle Volk, Christopher Richards USDA-ARS-National.
A Comparative Mapping Resource for Grains Gramene Outreach Summary 2005 Claire Hebbard Presented to the Gramene SAB January 13, 2006.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
A Comparative mapping resource GRAMENE Doreen Ware USDA ARS Cold Spring Harbor Laboratory
Gramene’s Outreach Program. Outreach Components Workshops Website Improvements / Additions Public Announcements High School Outreach Collaborators and.
Phenotyping Clare Coyne & Melanie Harrison-Dunn, curators.
Molecular marker data and their impact on gene bank management Chris Richards NCGRP, Fort Collins, CO Curator Workshop, Atlanta Georgia.
Using Gramene: A genomics and genetics resource for rice and other grasses. PAG XIV Jan 15, 2006 Immanuel (Noel) Yap Claire Hebbard.
1 Building Communities Around Ontology Development Pankaj Jaiswal Dept. of Plant Breeding and Genetics Cornell University Ithaca, NY FAO,
Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy, Don Jones, Dorrie Main.
A Comparative Genomics Resource for Grains. Tutorial Tips If you are viewing this tutorial with Adobe Acrobat Reader, click the "bookmarks" on the left.
A comparative mapping resource for the grasses Gramene Plant Biology July 25, 2004 Molly Fogleman Doreen Ware Pankaj Jaiswal.
APPLICATION OF MOLECULAR MARKERS FOR CHARACTERIZATION OF LATVIAN CROP PLANTS Nils Rostoks University of Latvia Vienošanās Nr. 2009/0218/1DP/ /09/APIA/VIAA/099.
讲 座 提 纲讲 座 提 纲 1 什么是分子育种 2 历史回顾 3 全基因组策略 4 基因型鉴定 5 表现型鉴定 6 环境型鉴定 (etyping) 7 标记 - 性状关联分析 8 标记辅助选择 9 决策支撑系统 10 展望.
Diversity Bioinformatics Terry Casstevens Institute for Genomic Diversity, Cornell University GMOD Meeting at NESCent Durham, NC – June 29-30, 2006.
GDR in Drupal facilitating community building and efficient maintenance.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Introduction to the Gramene Genetic Diversity module 5/2010 Build #31.
A Comparative Genomic Mapping Resource for Grains.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
NextGen Pipeline: Enabling the Plant Science Community Tom Brutnell (lead), Steve Rounsley (co-lead), Matt Vaughn (Engagement Lead) Ed Buckler, Justin.
Gramene: Interactions with NSF Project on Molecular and Functional Diversity in the Maize Genome Maize PIs (Doebley, Buckler, Fulton, Gaut, Goodman, Holland,
TRPGR Gramene: A Platform for Comparative Plant Genomics Doreen Ware USDA ARS Cold Spring Harbor Laboratory Pankaj Jaiswal Oregon State University Joshua.
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
A Comparative Genomics Resource for Grains V26. Tutorial Tips If you are viewing this tutorial with Adobe Acrobat Reader, click the "bookmarks" on the.
C R O P S T O I N D U S T R Y WP 2 Tasks: 2-1 to 2-3 Task leader: Dimitra Milioni Crops2Industry “Non-food Crops-to-Industry schemes in EU27”
Exploring Rice Phenotypes Using Gramene A Comparative Mapping Resource GRAMENE.
Phenotype Curation Susan R. McCouch Department of Plant Breeding Cornell University.
Gramene V. 211 Gramene Diversity Gramene Genetic Diversity database contains SSR and SNP allelic data and passport descriptions for rice, maize and wheat.
Gramene V. 221 Gramene Diversity Gramene Genetic Diversity database contains SSR and SNP allelic data and passport descriptions for rice, maize and wheat.
Dan Born, Peter Bradbury, Ed Buckler Payan Canaran, Terry Casstevens, Dallas Kroon, Dave Matthews, Susan McCouch, Junjian Ni, Isaak Yosief Tecle, Doreen.
Progress on TripalBIMS Breeding Information Management System in Tripal Sook Jung, Taein Lee, Chun-Huai Chen, Jing Yu, Ksenija Gasic, Todd Campbell, Kate.
A Comparative Genomic Mapping Resource for Grains.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
5/12/2018 Genome Database for Rosaceae New Data and New Functionality
Gramene Technical Improvements
Breeding Information Management System
11-5 Linkage and Gene Maps.
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
the Genome Database for Rosaceae: New Data and Functionality
What is GDPC GDPC is the Genomic Diversity and Phenotype Connection.
for the Cotton Community
Linkage analysis and genetic mapping
Membership Login/sign in
Membership Login/sign in
Membership Login/sign in
Tutorial for: Gramene Website Navigation
Presentation transcript:

Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY

AIM: Make more useful plants by conserving, finding and combining better alleles. NEED: The National Germplasm conserves 464,000 accessions and may contain 100,000,000 distinct alleles, but there is no index.

Population structure Familial relatedness Genetic mapping is the basis of the index, and QTL mapping approaches now exist for virtually all types of populations. Near gene level resolution achieved in multiple species Identification of genes controlling flowering, starch, nutrients, wood quality Positive Results in: Maize Rice Arabidopsis Conifers

What needs to happen? Genotyping (0.5Mdp per accession) Phenotyping (500dp per accession) Bioinformatics (GRIN) Mapping Tools Breeder Decision Tools

What data is currently available outside NPGS? Several large NSF Plant Genome projects on diversity with NPGS germplasm at the heart of these projects Numerous smaller projects (however, most data gets lost over time from these) Millions of genotypic and phenotypic data points in just maize, wheat, and rice projects. Database aware analysis tools (eg. TASSEL)

GDPDM Gramene Panzea (Maize) Rice Evol. TASSEL Other Analysis Tools Alignment & SNP Display DBs Middleware Analysis Panzea Web Data Access Display Upload Tools GDPC Germinate GDPC Data Browser GRIN GRIN?

GDPDM Germplasm Genotype Phenotype Environment Used by maize, wheat, and rice diversity projects.

GDPDM Gramene Panzea (Maize) Rice Evol. TASSEL Other Analysis Tools Alignment & SNP Display DBs Middleware Analysis Panzea Web Data Access Display Upload Tools GDPC Germinate GDPC Data Browser GRIN

Purpose The purpose of GDPC is to simplify access to the large genomic and phenotypic datasets that are becoming available in plant biology.

GDPC Data Flow Diagram

GDPC Data Flow Diagram

Databases Where has GDPC been mapped? Panzea (GDPDM schema) Gramene (GDPDM) Germinate (generic schema) GRIN (passport data)

GDPC: Select Data Service

GDPC: Select Taxa

GDPC: Nucleotide Data

GDPC: Trait Data

GDPC Browser Demo GDPC: Marker Data

Current GDPC Limitations XML is not efficient for large datasets –Several avenues are possible for improving efficiency More visualization and analysis tools need to be developed –Linkage Mapping –Breeder Decision Tools –Geographic interfaces –Pedigree Interfaces (in progress)

What should GRIN consider? Becoming the lead repository for genotypic and phenotypic diversity data Lead efforts for the consolidation of community diversity data Implement several middleware or web services standards (eg. GDPC and perhaps others IRRI) Collaborate on the development of data visualization tools

All of the software can be accessed through