3rd UK Cereal Genetics and Genomics Workshop. Next generation Genomics challenges.

Presentation transcript:

3rd UK Cereal Genetics and Genomics Workshop. Next generation Genomics challenges. The challenge of connecting traits to genes through genomics. Daryl J. Somers and Mark Jordan, Agriculture and Agri-Food Canada – Cereal Research Centre, Winnipeg, MB, Canada. John Innes Centre, Norwich, 6–7 April 2006.

Cereal improvement through breeding and molecular genetics has always benefited from knowing the precise location and function of genes. “Next generation challenges” 25 years ago? Understand the structure of the cereal genomes (JIC groups!). Sequences of genes with key biological relevance. Today, 25 years later: Good understanding of the wheat/barley genomes. 750K – 1M gene sequences, most of unknown biological relevance.

“Next generation genomics challenges”: still, knowing the precise location, sequence and function of genes toward applied cereal improvement. Applied genomics: 3 simple elements. We need to identify TARGETS for producers, processors, consumers. Design/invent/improve GENOMIC TECHNOLOGY to characterize the targets. Perform the research with APPLICATION and VALIDATION of the result/discovery.

Target – improved cereal quality for bread making and nutritional value; largely a consumer target, but processors benefit too. Genomic technology – fusion of genetic mapping, association genetics and microarray-based gene expression analysis. Application/validation – genetic experiments and seed quality analysis to validate the discoveries. All elements require a multidisciplinary team: genomics, breeding, chemistry.

Independent approaches to identifying targets for research. 1. Expression level polymorphisms (eQTL) (M. Jordan, T. Banks): RL4452 x AC Domain (HRS, 40 DH lines) segregates for dough, milling and bread quality; the population is mapped, with QTL analysis of 49 traits. 2. Association genetics (T. Banks, AAFC, U of SK): analysis of 192 HRS at 370 loci and 96 durum at 245 loci; examine population structure, LD analysis, association analysis.

Phenotypic data is the expression level of a single gene which is a “quantitative assessment of gene activity” (Doerge 2002). The change in activity of a single gene is an expression level polymorphism (ELP) (St. Clair, Michelmore, Doerge).

[Figure: Identification of Regulatory Regions. Marker genotype scores (A/B) for each line are combined with expression data for Gene 1,2,3,4 and Gene 5,6,7,8 in a QTL analysis. Adapted from: er_for_Bioinformatics/combining_qtl_analysis_with_microarray_data.html]

2004/2005 Gene Expression Experiment: RL4452 x AC Domain. 3 locations, 3 reps/location. Total of 43 entries (including parents), for a grand total of 387 rows (43 x 3 x 3). RNA samples at 5 days post anthesis (dpa) for all lines; 3 and 10 dpa for some.

2004/2005 collection of developing seeds, 5 dpa.

Affymetrix wheat gene chip data collection. 5 genotypes, 1 location, 3 reps: rep effect was non-significant. ELP mapping data: 39 genotypes plus parents, 1 location, 2 replicate RNA samples.

Procedure for analysis of Affymetrix data. RMA pre-processing of all chips, normalize to median; 1 site, 2 reps. Determine the genes significantly different among genotypes by ANOVA (Benjamini-Hochberg False Discovery Rate, error level) using CGEM (GeneSpring), as there are only 2 reps. Significant genes (1,327) are ranked by kurtosis to identify bi-modal (qualitative) data, quantitative data, and data skewed by off samples: 558 have negative kurtosis (lowest -1.9) and 577 have kurtosis greater than 1 (max 32).
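The two statistical steps named on this slide, false discovery rate control and ranking by kurtosis, can be sketched in a few lines of Python. This is a minimal illustration, not the GeneSpring/CGEM procedure the authors ran: the function names, the population (biased) excess-kurtosis formula and the default alpha of 0.05 are all assumptions made for the example.

```python
from statistics import mean


def benjamini_hochberg(pvalues, alpha=0.05):
    """Return one boolean per p-value: True if it passes BH step-up at level alpha."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    # Find the largest 1-based rank k with p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            cutoff_rank = rank
    passed = [False] * m
    for idx in order[:cutoff_rank]:
        passed[idx] = True
    return passed


def excess_kurtosis(values):
    """Population excess kurtosis m4 / m2**2 - 3 (assumed definition).

    Strongly negative values suggest a bimodal (qualitative) profile;
    large positive values suggest a profile skewed by a few off samples.
    """
    mu = mean(values)
    m2 = mean([(v - mu) ** 2 for v in values])
    m4 = mean([(v - mu) ** 4 for v in values])
    if m2 == 0:
        raise ValueError("kurtosis is undefined for constant data")
    return m4 / m2 ** 2 - 3.0


def rank_by_kurtosis(expression_by_gene):
    """Sort genes from most negative to most positive excess kurtosis."""
    scored = [(gene, excess_kurtosis(vals)) for gene, vals in expression_by_gene.items()]
    return sorted(scored, key=lambda pair: pair[1])
```

For a perfectly two-valued profile such as `[0, 0, 0, 0, 1, 1, 1, 1]` this definition gives -2.0, the theoretical minimum, which is consistent with the lowest value of -1.9 reported on the slide.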

[Figure: Transcriptional Frequency Classes (Gibson and Weir 2005). Panels compare RL4452 and Domain expression classes: a qualitative ELP (highly negative kurtosis) and a quantitative ELP (eQTL).]

ELP Mapping Summary. Top 800 genes ranked by kurtosis were considered. 101 were binarized based on qualitative distribution: 77 were mapped (40 individuals using JoinMap V3.0) and 24 were unassigned. 699 were subjected to CIM (QTL Cartographer): 402 were assigned to an interval (1 major LOD peak) and 297 had more than 1 peak and were not assigned.
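The slide does not say how the 101 qualitative genes were binarized. One simple possibility, shown here purely as an assumed sketch, is to split the lines at the widest gap in the sorted expression values and then label each group by the parent it resembles, producing A/B scores that a mapping program can treat like any other marker. The function name, the line names and the expression values are hypothetical.

```python
def binarize_by_largest_gap(expr_by_line, parent_a_value, parent_b_value):
    """Score each line 'A' or 'B' from a bimodal expression profile.

    The threshold is the midpoint of the widest gap between adjacent
    sorted expression values (an assumed rule, not the authors' method).
    Lines on the same side of the threshold as parent A are scored 'A'.
    """
    ordered = sorted(expr_by_line.values())
    if len(ordered) < 2:
        raise ValueError("need at least two lines to find a gap")
    # (gap width, index of the lower value) for each adjacent pair
    gaps = [(ordered[i + 1] - ordered[i], i) for i in range(len(ordered) - 1)]
    _, i = max(gaps)
    threshold = (ordered[i] + ordered[i + 1]) / 2
    a_is_high = parent_a_value > parent_b_value
    calls = {}
    for line, value in expr_by_line.items():
        is_high = value > threshold
        calls[line] = "A" if is_high == a_is_high else "B"
    return calls


# Hypothetical log-scale expression values for five doubled-haploid lines
example = {"DH01": 2.1, "DH02": 2.3, "DH03": 8.9, "DH04": 9.2, "DH05": 2.0}
calls = binarize_by_largest_gap(example, parent_a_value=9.0, parent_b_value=2.2)
```

With these made-up values the high-expressing lines DH03 and DH04 are scored A and the rest B. Profiles without a clear gap would not suit this rule, which fits the slide's split between binarized genes and those sent to CIM.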

[Figure: QTL Cartographer CIM analysis for probe set Ta S1_at. QTL scan of LOD score along chromosome 3B.]

The ELP genomics challenge: We have the very low hanging fruit. How do you get at the rest? If we back off on stringency we introduce more errors (more reps needed). Lack of annotations: gene expression to phenotype. Cis eQTLs have larger effects on transcription. As stringency increases, the proportion of cis eQTLs increases. We have ~60% cis when the reported average is closer to 33% (Gibson and Weir 2005). Still, even at this level there is a high trans effect. Need to develop an automated, statistically based method coupled with assignment to “transcriptional frequency classes”. Identification of cis vs trans requires more physical transcript mapping (SNPs?) and an easier way to move between Affy gene chip annotation and the bin-mapping results.
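Once a transcript has a map position, the cis/trans call itself is a simple comparison, which is why the slide stresses physical transcript mapping. The sketch below assumes a hypothetical `Locus` record and an arbitrary 10 cM window; neither comes from the presentation, and a gene with no mapped position is left unclassified rather than guessed.

```python
from typing import NamedTuple, Optional


class Locus(NamedTuple):
    chromosome: str      # e.g. "3B"
    position_cm: float   # position on the genetic map, in cM


def classify_eqtl(gene: Optional[Locus], eqtl: Locus, window_cm: float = 10.0) -> str:
    """Return 'cis', 'trans' or 'unknown' for one gene/eQTL pair.

    'cis' means the eQTL peak lies on the gene's own chromosome and
    within window_cm of it (the window size is an assumption).
    """
    if gene is None:
        return "unknown"  # transcript not yet mapped
    same_chromosome = gene.chromosome == eqtl.chromosome
    if same_chromosome and abs(gene.position_cm - eqtl.position_cm) <= window_cm:
        return "cis"
    return "trans"


def cis_fraction(calls):
    """Share of classified eQTLs that are cis, ignoring 'unknown' calls."""
    classified = [c for c in calls if c != "unknown"]
    if not classified:
        raise ValueError("no classified eQTLs")
    return classified.count("cis") / len(classified)
```

A summary such as `cis_fraction` is the kind of figure being compared when the slide sets ~60% cis against a reported average nearer 33%.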

Linkage Disequilibrium and Association Genetics. Wheat microsatellite allele database: 192 HRS x 370 loci; 96 durum x 245 loci. Whole genome LD calculations. Syntenic pairs of loci across genome. LD within 8 subpopulations. B genome – T. aestivum; B genome – T. durum. Marker order is based on consensus map (Somers et al. 2004).

Population Structure. NTSYS – UPGMA; Structure (Pritchard et al.). 192 bread wheat – 42 SSRs.

[Table: Alleles and haplotypes in LD in the total population for barc98 and wmc457; common barc98-wmc457 haplotypes listed with expected (exp) and observed (obs) values.] Disequilibrium!
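The expected-versus-observed comparison on this slide is the core of any LD statistic: under equilibrium a haplotype's frequency is the product of its allele frequencies, and disequilibrium is the departure from that product. The sketch below computes D, D' and r² for a single pair of alleles from four haplotype counts. It is a simplified two-allele illustration with made-up counts; microsatellite loci such as barc98 and wmc457 are multi-allelic, and the presentation does not state which LD measure was used.

```python
def ld_stats(n_AB, n_Ab, n_aB, n_ab):
    """LD statistics for alleles A (locus 1) and B (locus 2).

    Arguments are counts of the four haplotypes A-B, A-b, a-B, a-b,
    where a and b stand for "any other allele" at each locus.
    """
    n = n_AB + n_Ab + n_aB + n_ab
    if n == 0:
        raise ValueError("no haplotypes supplied")
    p_A = (n_AB + n_Ab) / n
    p_B = (n_AB + n_aB) / n
    observed_AB = n_AB / n
    expected_AB = p_A * p_B          # frequency expected under equilibrium
    d = observed_AB - expected_AB    # disequilibrium coefficient D
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if d_max == 0 or denom == 0:
        raise ValueError("LD is undefined when a locus is monomorphic")
    return {
        "expected_AB": expected_AB,
        "observed_AB": observed_AB,
        "D": d,
        "D_prime": d / d_max,
        "r2": d * d / denom,
    }


# Made-up counts: the A-B haplotype is seen more often than expected
stats = ld_stats(40, 10, 10, 40)
```

With these counts the A-B haplotype is expected at 0.25 but observed at 0.40, giving D of about 0.15, D' of about 0.6 and r² of about 0.36.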

[Figure: LD analysis on chromosome 4D. Panels: total population; Northern US introductions.]

TARGETS and GENOMICS TECHNOLOGY are described. APPLICATION and VALIDATION, an example: the interval on 3B could be examined through genetics. wmc418-barc164 common haplotypes: null. Transfer haplotype-specific segments. Examine ELP and seed quality.

Acknowledgements: Western Grains Research Foundation; AAF-Matching Investment Initiative; AAFC-Canadian Crop Genomics Initiative. Brenda Terwisscha – Affy hybs. Kerry Ward – Affy data analysis. Travis Banks – assorted bioinformatics tasks. Zlatko Popovic – SSR allele database. Monika Eng – SSR allele database. Daryl’s and Mark’s labs plus many summer students – days spent in the field, rain or shine. Breeders: J. Clarke, C. Pozniak, S. Fox, R. Depauw, G. Humphreys.