Markers, mapping, and expression using arrays Justin Borevitz Salk Institute naturalvariation.org.

Slides:



Advertisements
Similar presentations
Planning breeding programs for impact
Advertisements

Identification of markers linked to Selenium tolerance genes
Qualitative and Quantitative traits
Genetics of Adaptation: Arabidopsis thaliana as an ecological model Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
1.Generate mutants by mutagenesis of seeds Use a genetic background with lots of known polymorphisms compared to other genotypes. Availability of polymorphic.
Genomic Approaches to the Genetics of Adaptation Justin Borevitz Ecology & Evolution University of Chicago
Toward the genomics of Adaptation to seasonal environments in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Genomics of Natural Variation in Arabidopsis thaliana Justin Borevitz Salk Institute naturalvariation.org.
Natural Variation in Arabidopsis thaliana Light Response: Genomic Approaches Justin Borevitz Salk Institute naturalvariation.org.
MicroArray Evolution: expression to mapping and back again Justin Borevitz Salk Institute naturalvariation.org.
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Light response QTL in Arabidopsis thaliana: LIGHT1 cloning Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
Microarrays for mapping and expression analysis: Toward the genetic determinants of light response adaptation in Arabidopsis and Aquilegia Justin Borevitz.
Genomic Methods for Cloning QTL Justin Borevitz University of Chicago naturalvariation.org.
Arrays as tools for Natural Variation studies: Mapping, Haplotyping, and gene expression Justin Borevitz University of Chicago naturalvariation.org`
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Identification and Genotyping of Single Feature Polymorphisms in Complex Genomes Justin Borevitz University of Chicago naturalvariation.org.
Natural Variation in Light Response using Whole Genome Tiling Arrays Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Genomics tools to identify the molecular basis of complex traits Justin Borevitz Salk Institute naturalvariation.org.
Genetics and Genomics of Light Response adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Global dissection of cis and trans regulatory variations in Arabidopsis thaliana Xu Zhang Borevitz Lab.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
High Density Oligo Arrays for Single Feature Polymorphism Genotyping and Mapping Justin Borevitz Ecology & Evolution University of Chicago
QTL mapping using Single Feature Polymorphisms Justin Borevitz Salk Institute naturalvariation.org.
Haplotype mapping with Single Feature Polymorphisms in Arabidopsis Justin Borevitz Ecology & Evolution University of Chicago
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz Ecology & Evolution University of Chicago naturalvariation.org.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
EXtreme Array Mapping and Haplotype analysis Using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Genomic Systems underlying the genetics of adaptation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
SNP/Tiling arrays for very high density marker based breeding and QTL candidate gene identification Justin Borevitz Ecology & Evolution University of Chicago.
Mechanisms of Sustainable re Development: Lessons from Plants Justin Borevitz Ecology & Evolution University of Chicago
Towards the Arabidopsis Haplotype Map using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Studies of Genome Wide Molecular Variation in Arabidopsis thaliana using Arrays Justin Borevitz Salk Institute naturalvariation.org.
Toward the genetic basis of adaptation: Arrays/Association Mapping Justin Borevitz Ecology & Evolution University of Chicago
Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang Borevitz Lab Whole genome transcriptome variation in Arabidopsis thaliana Xu Zhang.
What is a QTL? What are QTL?. Current methods for QTL  Single Marker Methods ( Student, 17?? )  t-tests  Interval Mapping Method (Lander and Botstein,
Tiling arrays for genetic, epigentic, and environmental variation in Arabidopsis thaliana Justin Borevitz Ecology & Evolution University of Chicago
Array Genotyping to Dissect Quantitative Trait Loci in Arabidopsis thaliana Justin Borevitz Ecology and Evolution University of Chicago naturalvariation.org.
Special Topics in Genomics Lecture 1: Introduction Instructor: Hongkai Ji Department of Biostatistics
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Using mutants to clone genes Objectives 1. What is positional cloning? 2.What is insertional tagging? 3.How can one confirm that the gene cloned is the.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
High Resolution Patterns of Variation in the Arabidopsis Genome Justin Borevitz University of Chicago naturalvariation.org.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
Ecological and Evolutionary Systems biology: Conceptual and molecular tools for analysis Justin Borevitz Ecology & Evolution University of Chicago
Toward the genetic basis of adaptation using arrays Justin Borevitz Ecology & Evolution University of Chicago
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
The trait defines the two major germplasm groups in barley
ChipViewer is coded to visualize and analyze the tiling chip data.
School of Pharmacy, University of Nizwa
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Map-based cloning of interesting genes
Some mutations affect a single gene, while others affect an entire chromosome.
Volume 17, Issue 8, Pages (April 2007)
Volume 24, Issue 16, Pages (August 2014)
Volume 6, Issue 6, Pages (November 2013)
Using mutants to clone genes
Presentation transcript:

Markers, mapping, and expression using arrays Justin Borevitz Salk Institute naturalvariation.org

Talk Outline Intro Natural Variation/ QTL Transcription profiling to identify QTL candidate genes Identification of Single Feature Polymorphisms Future Prospects

PHYB

differences may be due to expression or hybridization

Deletions

Spatial Correction Spatial Artifacts Improved reproducibility

False Discovery and Sensitivity PM only SAM threshold 5% FDR GeneChip SFPs nonSFPs Cereon marker accuracy % Sequence Sensitivity Polymorphic % Non-polymorphic False Discovery rate: 3% Test for independence of all factors: Chisq = , df = 1, p- value = 1.845e -40 SAM threshold 18% FDR GeneChip SFPs nonSFPs Cereon marker accuracy % Sequence Sensitivity Polymorphic % Non-polymorphic False Discovery rate: 13% Test for independence of all factors: Chisq = , df = 1, p-value = 1.309e-59 90%80%70% 41%53%85% 90%80%70% 67%85%100% Cereon may be a sequencing Error TIGR match is a match

Chip genotyping of a Recombinant Inbred Line 29kb interval

bibb mutant phenotypes Colbib-1det days bib-3 three independent recessive alleles medial sepals remain attached “cabbage-like” rosette leaves flowers open prematurely fruit appear more slender bib-3 Ler bib-3

bibb mapping ChipMap AS1 Bulk segregant Mapping using Chip hybridization bibb maps to Chromosome2 near ASYMETRIC LEAVES1

BIBB=AS1 Sequenced AS1 coding region from bib-1 …found g -> a change that would introduce a stop codon in the MYB domain bibbas1-101 MYB bib-1 W49* as-101 Q107* as1 bibb AS1 (ASYMMETRIC LEAVES1) = MYB closely related to PHANTASTICA located at 64cM

Future Bulk segregant Mapping Extend to Quantitative Traits Map multiple genes (mutation modifiers) Fine mapping after PCR identification of recombinants Multiple models (epistasis) Changes in allele F2, F3 frequency (selection) 20% variance 20 simulations

Potential Deletions 111 potential deletions 45 confirmed by Ler sequence 23 (of 114) transposons Disease Resistance (R) gene clusters Single R gene deletions Genes involved in Secondary metabolism Unknown genes

Gene expression revisited Now that we know what features are polymorphic we can determine which gene expression differences are real and which are due to polymorphism. New RNA analysis algorithm— account for spatial correction, polymorphisms, and feature differences

Genes Expression for a candidate gene Features (probes) for At1g22360 Spatially corrected log PM intensity Gene Expression index that accounts for feature differences and polymorphisms

Gene expression revisited SAM True False Difference FDR threshold Positive Positive

Gene expression revisited FLC controls flowering time Difference detected at 3 day old seedling stage

Gene expression revisited PAG1 down regulated in Cvi pag1 Knock Out is pale

pag1 KO is light insensitive pag1 KO has long hypocotyl in red light

pag1 KO is early flowering

25 bp 16 bp 25 bp 8 bp 1st Set 2nd Set 3rd Set Complete Genome Tiling Chip Polymorphisms (re-sequencing) Global methylation (Methylome) Comparative Genomics (Brassica) New Gene Discovery Improve Annotation Alternative Splicing Micro RNAs 9 Whole-Genome Expression Chips 2 Splicing Chips 2 5’ Mapping Chips Validate features Extra Chips: ChIP – Chip (DNA binding sites)

ChipViewer: Mapping of transcriptional units of ORFeome From 2000v At1g09750 (MIPS) to the latest AGI At1g v Annotation (MIPS) The latest AGI Annotation

Syngenta Hur-Song Chang Tong Zhu NaturalVariation.org Salk Jon Werner Todd Mockler Sarah Liljegren Joanne Chory Detlef Weigel Joseph Ecker UC Davis Julin Maloof UC San Diego Charles Berry Scripps Elizabeth Winzeler Salk Jon Werner Todd Mockler Sarah Liljegren Joanne Chory Detlef Weigel Joseph Ecker UC Davis Julin Maloof UC San Diego Charles Berry Scripps Elizabeth Winzeler

Future Projects Design 2 nd generation expression array $ ,000 25mer features, expect 12,000 SFPs (2accessions) Validated gene models (exon intron) 2 features per exon, alternative splicing 5’ 3’ Untranslated region for gene family/ polymorphism Micro RNAs Validated “good” hybridization intensities

Haplotype Map – 20 Accessions 3 replicates, SFP discovery estimated 85,000 SFPs 1.4kb resolution Association Studies – 120 Accessions 1 replicate, Genotyping Bulk Segregant Mapping – Confirm Associations in specific crosses Future Projects DNA

Future work with Natural Variation VanC advanced intercross RIL population Backcross collections

True natural variation in gene expression 20 Accessions 3 replicates, (polymorphism accounted for) Cis regulatory variation/ Imprinting reciprocal F1s 3 replicates Transcriptome QTL Map – 100 best VanC Advanced Intercross Lines How many loci control the variation in gene transcription? Candidate TF and binding sites? Future Projects RNA