Sequencing type and molecular markers

Slides:



Advertisements
Similar presentations
Applications of genome sequencing projects 1) Molecular Medicine 2) Energy sources and environmental applications 3) Risk assessment 4) Bioarchaeology,
Advertisements

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Identification of markers linked to Selenium tolerance genes
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Molecular Markers.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Molecular Tools for Aiding and Breeding of Aquaculture Species Masters in Aquaculture and Fisheries Genetic and Selection 2014 Tamara Moedas nº 47853;
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Chapter 3 -- Genetics Diversity Importance of Genetic Diversity Importance of Genetic Diversity -- Maintenance of genetic diversity is a major focus of.
Biodiversity IV: genetics and conservation
PLANT GENETIC MARKERS Plant Biotechnology Dr.Ir. Sukendah, MSc.
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
CS177 Lecture 10 SNPs and Human Genetic Variation
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Phylogenomics “The intersection of phylogenetics and genomics”
François Ancien Sascha Kretzschmann Olivier Suplis Genotyping Errors Causes, Consequences and Solutions Genotyping Errors.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
MSFD monitoring guidance WG GES proposed amendments and way forward Nikolaos ZAMPOUKAS Andreas PALIALEXIS Georg HANKE.
An quick overview of human genetic linkage analysis

Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
Synteny - many distantly related species have co- linear maps for portions of their genomes; co-linearity between maize and sorghum, between maize and.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Restriction Fragment Length Polymorphism. Definition The variation in the length of DNA fragments produced by a restriction endonuclease that cuts at.
Arun Kumar. B M.Sc 1st Year Biotechnology SSBS
Gene flow and speciation. Mechanism for speciation Allopatric speciation Sympatric speciation.
Announcement NSERC Undergraduate Student Research Awards (USRA) in Universities 16 weeks $5,625
Molecular Tools for Detection of Plant Pathogenic Fungi ByMAZIN.S.SELMAN.
Simple-Sequence Length Polymorphisms
Molecular Markers Using DNA polymorphisms in plant breeding
Interpreting exomes and genomes: a beginner’s guide
Single Nucleotide Polymorphisms (SNPs
GENETIC MARKERS (RFLP, AFLP, RAPD, MICROSATELLITES, MINISATELLITES)
Genomic Analysis: GWAS
Ø Novel approaches for linkage mapping in dairy cattle
Molecular Marker Characterization of plant genotypes
MOLECULAR MARKERS.
Crystiana Tsujiura (’14) and Judy L. Stone
MULTIPLE GENES AND QUANTITATIVE TRAITS
Lucas D. Baker1 Vikram E. Chhatre2 Hayley C. Lanier1
Genetic Variation Genetic Variation in Populations
Global and local spatial autocorrelation
Spotted bat (Euderma maculatum) microsatellite marker discovery
Landscape genetics Landscape genomics
Environmental data Landscape genomics
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Human Cells Human genomics
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Multivariate environmental characterization of samples
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Patterns of Linkage Disequilibrium in the Human Genome
Conclusions of Hardy-Weinberg Law
Applied Molecular Genetics Molecular Marker and Technique
Land cover data: a short theoretical introduction
MULTIPLE GENES AND QUANTITATIVE TRAITS
With Genetic Analyzers from Applied Biosystems
DNA Polymorphisms: DNA markers a useful tool in biotechnology
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
genetic variation is meaningful only in the context of a population
There is a Great Diversity of Organisms
Statistical Analysis and Design of Experiments for Large Data Sets
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Restriction Fragment Length Polymorphism (RFLP)
9-3 DNA Typing with Tandem Repeats
Evaluation of power for linkage disequilibrium mapping
Presentation transcript:

Sequencing type and molecular markers Landscape genomics Physalia courses , November 6-10, 2017, Berlin Sequencing type and molecular markers Dr Stéphane Joost – Oliver Selmoni (Msc) Laboratory of Geographic Information Systems (LASIG) Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland ESF CONGENOMICS RNP Spring School - March 2014 – Stéphane Joost, Kevin Leempoel, Sylvie Stucki

Introduction Recent and upcoming advances in high throughput DNA sequencing leads to ever increasing availability of genomic sequences The limiting factor in future studies will no longer be the molecular laboratory work, but the development of statistical, bioinformatics and modelling tools for identifying both genes under selection, and the environmental factors acting as selective pressures The genomic tools available have characteristics we must take into account to design landscape genomic studies Rellstab, C., Gugerli, F., Eckert, A. J., Hancock, A. M., & Holderegger, R. (2015). A practical guide to environmental association analysis in landscape genomics. Molecular Ecology, 24(17), 4348–4370. doi:10.1111/mec.13322 Manel, S., Poncet, B. N., Legendre, P., Gugerli, F., & Holderegger, R. (2010). Common factors drive adaptive genetic variation at different spatial scales in Arabis alpina. Molecular Ecology, 19, 3824–3835. doi:DOI 10.1111/j.1365-294X.2010.04716.x

Overview of advantages and drawbacks of the main genomic resources available for landscape genomics studies

AFLPs Amplified fragment length polymorphisms Until recently AFLP was the method of choice to obtain large numbers of molecular markers for non-model organism genomic studies (up to ~2’000) It does not require prior sequenced-based information AFLP markers are bi-allelic, dominant and they usually cover the entire genome although they sometimes tend to cluster around centromeres A recurring issue associated with AFLP is fragment size homoplasy → nonhomologous AFLP fragments co-migrate A variant of the AFLP protocol is the Diversity Array Technology (DArT) Up to several thousands of DNA polymorphisms can be detected in a single hybridization assay on a microarray slide The major advantage of DArTs over AFLPs: their sequences are easily accessible

Microsatellites Microsatellites are codominant and generally multiallelic This makes them useful to monitor decreases in intrapopulation genetic variability observed in the vicinity of adaptive genes or to identify particular alleles specifically associated with environmental variables However, microsatellites have a high mutation rate and a complex mutation pattern, characteristics which can be difficult to accommodate when searching for selection of signatures using traditional population genomics models Moreover, microsatellites can be sparse in the genome of some species and thus difficult to find Up to now, the development of hundreds of microsatellites was time-consuming and expensive, and these markers were also not particularly amenable to massively parallel genotyping As a result, microsatellite resources were sufficient to be exploited in a population genomics context with model species only The increased availability of high-throughput sequencing data will facilitate microsatellite discovery and typing in non-model species Based on microsatellite data, Sork et al. (2010) detected climatically associated genetic variation in populations of valley oak in California, suggesting that the potential for future adaptation in the face of climate change is limited in this long-lived species

SNPs SNPs are the most abundant type of polymorphism in genomes For example, on average there is one SNP every Kb in the 3-billion-base human genome They are usually biallelic and evolve according to a simple infinite sites mutation model One of the major drawbacks of SNPs is their susceptibility to ascertainment bias, i.e. the bias introduced by using a subset of the studied individuals or populations for marker discovery purposes and which can lead to a skew in the distribution of allelic frequencies Detecting SNPs also requires a priori information on the studied genome sequence, but once this task is completed, SNPs present a high potential for an automated high-throughput analysis at a moderate cost Fortunately, next-generation sequencing technologies boosted the use of SNPs for both model and non-model organims Turner et al. (2010) investigated the genetic basis of adaptation to serpentine soils in Arabidopsis lyrata using about 8 millions SNPs

High density SNP chips E.g. Illumina 50k, 600k Specific to species Often used in domestic animals Made of selected SNPs to monitor loci related to traits involved in productive aspects (milk production, hair growth, temperature regulation) Not a ~random set of loci

Sequencing techniques

RAD-Seq In restriction-site associated DNA sequencing (RAD-Seq), the complexity of the genome is reduced using restriction enzymes The flanking regions of restriction sites are sequenced by NGS This approach has successfully been applied to pooled population samples RAD-Seq identifies fewer polymorphisms, from a few thousand to tens of thousands, as compared to millions of SNPs when using whole-genome Pool-Seq Controversy about the use of RAD-Seq without proper understanding of the biology and genome structure of the species to be studied See … Lowry DB, S Hoban, JL Kelley, KE Lotterhos, LK Reed, MF Antolin, and A Storfer. 2016. Breaking RAD: An evaluation of the utility of restriction site associated DNA sequencing for genome scans of adaptation. Mol. Ecol. Resources. 17:142–52. … and answers published in MER

Pool-Seq Pool-Seq is a cost-effective method of NGS because the DNAs of several individuals are pooled before sequencing E.g. in plants, 10 individuals per plot can be sampled and their DNA then pooled before sequencing This approach can lead to accurate SNP allele frequency estimates As a drawback, individual multi-locus genotypes and information on heterozygosity are inaccessible As many landscape genomics tools can handle population allele frequencies, the use of whole-genome Pool-Seq is an attractive option, but only BAYENV yet accounts for the variance introduced by variation in sequencing coverage in Pool-Seq Whole-genome Pool-Seq data have only rarely been used in landscape genomics so far Does not permit to fully exploit local environmental variation

Conclusion: main distinction Study adaptation for conservation purposes AFLP or Microsatellites are key (cheaper) Reinforce the feasability of using genetic data in conservation (absolutely not the case yet, academia excepted) Fundamental research to understand the mechanisms of adaptation SNPs WGS, HD (modulation according to funding) RADSeq

Exercise Data filtering: MAF, missingness Spatial genetic variation (F-stats, Ht, Fis, etc.) Population structure in a spatial context