AFLP and microsatellite analysis. Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information.

Slides:



Advertisements
Similar presentations
RAPD markers Larisa Gustavsson (Garkava)
Advertisements

Lecture 2 Strachan and Read Chapter 13
Lab 3 : Exact tests and Measuring of Genetic Variation.
Lab 3 : Exact tests and Measuring Genetic Variation.
DNA polymorphisms Insertion-deletion length polymorphism – INDEL Single nucleotide polymorphism – SNP Simple sequence repeat length polymorphism – mini-
Biotech Continued… How do forensic scientists determine who’s blood has been left at a crime scene? How do forensic scientists determine who’s blood.
Using DNA sequences to identify target organisms Obtain sequence Align sequences, number of parsimony informative sites Gap handling Picking sequences.
Generation and Analysis of AFLP Data
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
SSR.
PCR based Requiring sequence knowledge
What Can You Do With qPCR?
Chapter 6 Biology of STRs: Stutter Products, Non-template Addition, Microvariants, Null Alleles, and Mutation Rates ©2002 Academic Press.
DNA Forensics. DNA Fingerprinting - What is It? Use of molecular genetic methods that determine the exact genotype of a DNA sample in a such a way that.
GENETIC FINGERPRINT ESTABLISHED FOR THE SELECTED ALFALFA GENOTYPES USING MOLECULAR MARKERS.
Manipulating DNA Genetic Engineering uses the understanding of the properties of DNA to study and change DNA sequences in living organisms – Invitro… in.
Genomic walking (1) To start, you need: -the DNA sequence of a small region of the chromosome -An adaptor: a small piece of DNA, nucleotides long.
PLANT GENETIC MARKERS Plant Biotechnology Dr.Ir. Sukendah, MSc.
DNA Technology Chapter 20.
Work by Antonio Izzo Based on 36 soil cores from a total of 9 plots contained within a 2.5 hectare region.
Module 1 Section 1.3 DNA Technology
Chapter : DQA1/PM Chapter 18: Autosomal STR Profiling.
Targeted next generation sequencing for population genomics and phylogenomics in Ambystomatid salamanders Eric M. O’Neill David W. Weisrock Photograph.
Molecular identification of living things. Molecular Markers Single locus marker Multi-locus marker RFLP Microsatellite DNA Fingerprinting AFLP RAPD.
What is a microsatellite?
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
François Ancien Sascha Kretzschmann Olivier Suplis Genotyping Errors Causes, Consequences and Solutions Genotyping Errors.
1. 2 VARIANTS OF PCR APPLICATIONS OF PCR MECHANICS OF PCR WHAT IS PCR? PRIMER DESIGN.
Taqman Technology and Its Application to Epidemiology Yuko You, M.S., Ph.D. EPI 243, May 15 th, 2008.
USDA-ARS, Stoneville, Mississippi
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Advantages of STR Analysis
Molecular Markers CRITFC Genetics Workshop December 8, 2015.
Genotyping and Genetic Maps Bas Heijmans Leiden University Medical Centre The Netherlands.
Molecular Genetic Technologies Gel Electrophoresis PCR Restriction & ligation Enzymes Recombinant plasmids and transformation DNA microarrays DNA profiling.
Population genetics of Liothyrella neozelanica in Breaksea Sound Erik Suring University of Otago, Dunedin, New Zealand Marine Science 480 Research Project.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
Polymerase Chain Reaction What is PCR History of PCR How PCR works Optimizing PCR Fidelity, errors & cloning PCR primer design Application of PCR.
Introduction to PCR Polymerase Chain Reaction
The genetic engineers toolkit A brief overview of some of the techniques commonly used.
The Case of the Crown Jewels: Investigate a Crime Scene Using DNA Restriction Analysis (DNA Fingerprinting) Module developed at Boston University School.
1 Chapter 8: Fingerprints, diversity analysis, specific markers Cultivar identification (fingerprint) Specific markers Distance analysis (genetic relatedness)
PCR Polymerase chain reaction. PCR is a method of amplifying (=copy) a target sequence of DNA.
Arun Kumar. B M.Sc 1st Year Biotechnology SSBS
Simple-Sequence Length Polymorphisms
Introduction to PCR Polymerase Chain Reaction
Polymerase Chain Reaction
GENETIC MARKERS (RFLP, AFLP, RAPD, MICROSATELLITES, MINISATELLITES)
Molecular Marker Characterization of plant genotypes
Genotyping module.
Accurate size calling, consistent band intensities, and low run-to-run migration variations by electrophoresis on ABI 3730xl DNA Analyzer Sizing to within.
Genemapper.
DNA profiling DNA profiling is a technique by which individuals can be identified and compared via their respective DNA profiles. Definitions you will.
Relationship between Genotype and Phenotype
Relationship between Genotype and Phenotype
How are areas of DNA that don’t code for proteins (genes) used by our cells? How can we make use of these areas?
Lab 8: PTC Polymerase Chain Reaction Lab
Applied Molecular Genetics Molecular Marker and Technique
Recombinant DNA Unit 12 Lesson 2.
Forensic Biology by Richard Li
ChIP DNA Sample Preparation
Telomere-End Processing
Sequential Steps in Genome Mapping
9-3 DNA Typing with Tandem Repeats
Relationship between Genotype and Phenotype
SBI4U0 Biotechnology.
Presentation transcript:

AFLP and microsatellite analysis

Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information about genome needed Genome wide overage Small amount of DNA needed Cons: Markers are dominant (i.e. heterozygotes are scores as homozygotes) Can be tedious to score Size homoplasy Reproducibility?

STEP 1: Restriction-Ligation

EcoRI PRE-SELECTIVE PRIMER MseI PRE-SELECTIVE PRIMER GTAGACTGCGTACC AATT CA CA AT GAGTCCTGAGTA STEP 2: Pre-selective PCR

SELECTIVE PRIMER GTAGACTGCGTACC AATT CACT GACA AT GAGTCCTGAGTA GTAGACTGCGTACC AATT CA CA AT GAGTCCTGAGTA EcoRI SELECTIVE PRIMER (labeled) MseI SELECTIVE PRIMER STEP 3: Selective PCR FAM

MseI EcoRI MseI EcoRI MseI EcoRI MseI EcoRI MseI EcoRI: 6bp cutter -->one cut every 4096 bp MseI: 4bp cutter --> one cut every 256 bp Selective PCR product contains many unlabeled fragments that will not be visible on ABI

Number of bands in AFLP profile is determined by 1 Genome size:larger genome ---> more bands 2 Number of selective nucleotides in selective primers 3 Dilution of PCR product Low (noise) peaks get magnified Why optimize number of bands? 1 Size homoplasy !!!!! 2 Difficult to score

EcoR1-AGTMseI-CGT EcoR1-AGC MseI-CGA MseI-CGC MseI-CGG etc. MseI-CGTG MseI-CG Choosing selective primer combinations An additional nucleotide reduces number of peaks 4-fold One less nucleotide increases number of peaks 4-fold Use few of these (expensive), but allows use of multiple colors (multiplex run on ABI) Use many of these to get enough markers (cheap) And use these to optimize number of bands

Reproducibility High reproducibility has generally been reported However, DNA quality is crucial component (use same DNA extraction protocol for all samples!) Assess quality of data by repeating several samples from scratch i.e. starting with DNA extraction

Note: Genome size is correlated with noise level Around 20% of primer combinations provide profiles that are suitable for high throughput genotyping. 1 Well separated peaks 2 Right number of peaks 2 Little noise 3 Peaks are distributed across size range 4 High level of Polymorphism Ideal AFLP profile

A very fine example

Too many peaks

Optimizing AFLP reactions 1 DNA quality 2 DNA qualityA successful AFLP analyses depends crucially on this 3 DNA quality 4 Increase restriction time to 2 hours 5 Increase ligation time to 16 hours 6 Use fresh T4 ligase 7 Increase amount of DNA (rest-lig) added to pre-selective PCR (15 ul DNA’ in 50ul reaction) 8 Reduce amount of DNA in Selective PCR 9 Increase amount of cycles in Selective PCR 10 Increase amount of TAQ in Selective PCR 11 Several people have reported better results with TaqI vs MseI (but this requires different adaptors)

Scoring AFLP profiles Normalize samples: Arbitrary cut-off peak height has to be used and this needs to be relative since different samples have different intensity. Set high cut-off for inclusion as marker (that is, at least one individual has to have this cut-off peak height), then reduce peak height for scoring the presence/absence for remainder of individuals. In Genemapper do not use auto-bin option. Make your own bins Analyze all samples for the same primer set in the same project. This allows you to assess the reliability of the marker by scrolling across samples. Also prevents you from including non-polymorphic markers. Also, normalization performed on all samples at the same time. Do not include peaks that do not show clear presence or absence in most cases. Score blindly to avoid bias. Check for overflow from different dye

Normalization

Genemapper Freeware for scoring AFLP from ABI runs: Genographer v 1.6 GenoProfiler 2.0

A few population genetic programs for AFLP analyses RAPDFst: Fst (Lynch and Milligam, 1994) MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979) Arlequin, TFPGA: Amova Genalex:  st, analog of F st, Amova Structure, BAPS: inference of population structure. Hickory: Bayesian estimation of F statistics for dominant markers

A few population genetic programs for AFLP analyses RAPDFst: Fst (Lynch and Milligam, 1994) MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979) Arlequin, TFPGA: Amova Genalex:  st, analog of F st, Amova Structure, BAPS: inference of population structure. Hickory: Bayesian estimation of F statistics for dominant markers Assumes H-W equilibrium

A few population genetic programs for AFLP analyses RAPDFst: Fst (Lynch and Milligam, 1994) MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979) Arlequin, TFPGA: Amova Genalex:  st, analog of F st, Amova Structure, BAPS: inference of population structure. Hickory: Bayesian estimation of F statistics for dominant markers Treats multilocus data as single haplotype Assumes H-W equilibrium

A few population genetic programs for AFLP analyses RAPDFst: Fst (Lynch and Milligam, 1994) MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979) Arlequin, TFPGA: Amova Genalex:  st, analog of F st, Amova Structure, BAPS: inference of population structure. Hickory: Bayesian estimation of F statistics for dominant markers Assumes H-W equilibrium Treats multilocus data as single haplotype No assumption of H-W equilibrium Low information content

Microsatellites * Di- or tri-nuleotide repeats * Ubiquitous * High mutation rate ( ) High level of variability

Mutational mechanism Slippage during replication (also happens during PCR) ACCGAGTCGATCGTGTGTGTGTGTGTGTGTACGCTA TGGCTCAGCTAGCACACA C A C A C A C ACCGAGTCGATCGTGTGTG TGTGTGTGTGTACGCTA TGGCTCAGCTAGCACACAC ACACACACACATGCGAT CA Slippage increases with number of repeats Reduces or decreases number of repeats

Obtaining Microsatellites Screening sequenced genomes Screening enriched genomic library Glenn and Schable (2005) Methods in Enzymology 395: This paper is particularly useful. It comes from a Lab that has isolated microsatellites from 125+ species

SELECTING LOCI Too few repeats Low variability Too many repeats Difficult to score, Homoplasy Choosing loci: repeats uninterrupted repeats Screening of loci: Number of allelesCloning pool of PCR amplicons, followed by labeled PCR Heterozygosity, allelic richness M13 labeled primers

M13 tailed primer Forward primer Reverse primer M13-tail Forward primerReverse primer M13 primer Forward primer FAM (Low concentration) Boutin-Ganache et al (2001) Biotechniques 31, 26-28

Some scoring issues Great looking heterozygote

Some scoring issues Extra peak because of partial A overhang addition of Taq Stutter bands of the two high peaks due to slippage

Some scoring issues Heterozygote

Some scoring issues A single large allele with many repeats Lots of slippage

35 repeats Some scoring issues Increase in slippage with increase in repeat number

Some scoring issues How many alleles?

Some scoring issues Find a heterozygote that clearly shows the shape of a single allele

Some scoring issues The alleles

Some scoring issues Electrophoresis artifacts (Fernando et al (2001) Mol. Ecol. Notes 1, ) The figures shows the difference in peak shape of the same PCR products loaded at different concentration

Some scoring issues Electrophoresis artifacts (Fernando et al (2001) Mol. Ecol. Notes 1, ) Do not overload your gel ! Also keep in mind that in different PCR’s the left peak or the right peak may be dominant

Optimizing PCR Avoid Null Alleles (or try to) Minimize annealing temp lowest temp that produces clean bands MgCl 2 concentrationincrease reduces specificity Different speciesdesign new primers (if possible) ( In my limited experience with cross species amplification null alleles can be big problem) Reduce stutter: Reduce number of cycles Reduce amount of MgCl 2 Touchdown PCR 2/2/8 PCR (2 sec denat, 2 sec anneal, 8 sec extens.) BSA, DMSO Addition of A Increase final extension time Add Pigtail (GTTTCTT) on 5’end of reverse primer to facilitate addition of A overhang Seems to be most successfull

Analysis Issues Null allelesAre loci in HW equilibrium? Linkage disequilibrium? Possible solutions: Remove loci from analysis (if enough loci are available) Check if HW disequilibrium influences results by temporarily removing affected loci. Adjust allele and genotype frequencies (Microchecker) Microsats biggest problem Population subdivision causes both. Null alleles only cause HW disequilibrium.

Some population genetics software Microsatellite toolkit: Excel plug-in for creating Arlequin, FSTAT and Genepop files. Microchecker: Estimate null allele frequency. Adjust allele frequencies. Arlequin: HW equilibrium, Linkage Disequilibrium, Fst, exact test of differentiation, Amova, Mantel test FSTAT: Allelic richness, Fst per locus (to check contribution of each locus to observed pattern of differentiation) Structure, BAPS: Population structuring, population assignment. Migrate: Estimates of effective population size and migration rates Bottleneck: Check for very recent population bottlenecks