Mapping analysis software. Dr Ian Carr PhD, MCSD. Leeds Institute of Molecular Medicine, St James's University Hospital.


Autozygosity
But!
LA = local (common) ancestor
LI = local inheritance

Autozygosity
You only know part of the picture, and what you don't know can be more important than what you do know.
DA = distant (common) ancestor
DI = distant inheritance

Analysis
New way:
  Send DNA off at £300 per sample.
  Wait three weeks.
  Stare at a million uninformative SNPs' worth of data and wonder what to do with it!
Old way:
  Spend 1.5 years mapping a family with highly informative microsatellites.
  Analyse data as you go.
  Hope you find something!

AutoSNPa
What is it:
  It's one big database which draws pretty pictures.
  There is no maths, because there is no complete knowledge of the system.
Assumptions:
  All affecteds are consanguineous and have the same mutation, and hence a common haplotype.
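The assumption above (every affected person shares one homozygous haplotype) can be illustrated with a minimal sketch. This is not AutoSNPa's own code: the function name, the 'AA'/'AB'/'BB' genotype coding and the `min_snps` threshold are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

def shared_homozygous_runs(genotypes: Dict[str, List[str]],
                           min_snps: int = 3) -> List[Tuple[int, int]]:
    """Return (first, last) SNP indices of runs where every affected
    person is homozygous for the same allele.

    genotypes maps a sample name to its calls in map order, coded
    'AA', 'AB' or 'BB' (hypothetical coding, not AutoSNPa's format).
    """
    samples = list(genotypes.values())
    n = len(samples[0])
    runs = []
    start = None
    for i in range(n):
        calls = {sample[i] for sample in samples}
        # Concordant: one shared call, and that call is homozygous.
        concordant = len(calls) == 1 and calls <= {"AA", "BB"}
        if concordant and start is None:
            start = i
        elif not concordant and start is not None:
            if i - start >= min_snps:
                runs.append((start, i - 1))
            start = None
    # Close a run that reaches the end of the chromosome.
    if start is not None and n - start >= min_snps:
        runs.append((start, n - 1))
    return runs

affecteds = {
    "patient1": ["AB", "AA", "BB", "AA", "AB", "BB"],
    "patient2": ["AA", "AA", "BB", "AA", "BB", "BB"],
}
print(shared_homozygous_runs(affecteds))
```

Adding more affected people, or more families with the same founder mutation, can only shorten the shared runs, which is the narrowing seen across pedigrees one to three.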

AutoSNPa: Pedigree one
First family
Results: 135Mb region on chromosome 4
Outcome: too many genes. Move on.

AutoSNPa: Pedigree two
Two families
New results: 45Mb region on chromosome 4
Outcome: still too many genes. Move on.

AutoSNPa: Pedigree three
Three families
New new results: 4.5Mb region on chromosome 4
Outcome: 8 genes, one good candidate. Sequenced it and published.

The problem with AutoSNPa
It requires a large family with multiple affected people who will give a DNA sample, or a number of families with the same founder mutation.
In reality, large families are as rare as hen's teeth and each family tends to have its own mutation.

IBDfinder
What is it:
  It's another big database which draws pretty pictures.
  Again, no maths.
Assumptions:
  The affecteds are consanguineous and most have mutations in the same gene.

Molar pregnancies and IBDfinder
Disease has social stigma, so no pedigree data.
Most patients are unrelated to each other.
2 have mutations in a different gene.
2 have an IBD region of one SNP in the data set.
[Figure: number of patients homozygous for each region of chromosome 19, from 19p-tel to 19q-tel]
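The quantity plotted for the molar pregnancy cohort, the number of patients homozygous for each region, can be sketched as follows. This is an illustrative assumption rather than IBDfinder's implementation: the function names, the 'AA'/'AB'/'BB' coding and the `min_run` threshold are hypothetical.

```python
from typing import Dict, List

def homozygous_run_mask(calls: List[str], min_run: int) -> List[bool]:
    """Mark SNPs that lie inside a run of at least min_run
    consecutive homozygous calls ('AA' or 'BB')."""
    mask = [False] * len(calls)
    start = 0
    # Iterate one past the end so the final run is closed too.
    for i in range(len(calls) + 1):
        is_hom = i < len(calls) and calls[i] in ("AA", "BB")
        if not is_hom:
            if i - start >= min_run:
                for j in range(start, i):
                    mask[j] = True
            start = i + 1
    return mask

def patients_homozygous_per_snp(cohort: Dict[str, List[str]],
                                min_run: int = 3) -> List[int]:
    """Count, at each SNP, how many patients sit in a long homozygous run."""
    masks = [homozygous_run_mask(calls, min_run) for calls in cohort.values()]
    return [sum(column) for column in zip(*masks)]

cohort = {
    "patient1": ["AA", "BB", "AA", "AB", "AA"],
    "patient2": ["AB", "AA", "AA", "BB", "AA"],
    "patient3": ["AA", "AA", "AB", "BB", "BB"],
}
print(patients_homozygous_per_snp(cohort))
```

Because each patient is scored separately, the count still peaks over the disease gene when some patients carry different mutations or, as with two patients here, have a mutation in a different gene.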

Milk drinkers and IBDfinder
The ability of adults to drink milk is relatively new and there are only a few genotypes that have the phenotype. Therefore most of us are homozygous at the LCT gene on chromosome 2.

Problems with IBDfinder
DNA from affecteds is not always easy to come by.

SAMPLE: Shadow Autozygosity MaPping by Linkage Exclusion
What is it:
  A program that finds disease genes without the DNA of an affected patient, using only DNA from the parents and siblings of affecteds.
Assumptions:
  An inbred family is 3 times more likely to have an unaffected child than an affected one (for a recessive disease with carrier parents, 3/4 versus 1/4), and none of the unaffected children will be homozygous for the disease-causing allele.
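A simplified per-SNP exclusion rule that follows from these assumptions is sketched below. It is not the SAMPLE algorithm itself: the function names and genotype coding are hypothetical, and the two rules are only the most direct consequences of assuming that both parents carry the same ancestral disease haplotype and that no unaffected child inherited it from both parents.

```python
from typing import List

def snp_excluded(father: str, mother: str, unaffected_sibs: List[str]) -> bool:
    """Return True if this SNP cannot lie on the shared disease haplotype.

    Genotypes are coded 'AA', 'AB' or 'BB' (hypothetical coding).
    """
    # Rule 1: parents homozygous for different alleles cannot both
    # carry the same ancestral haplotype at this position.
    if {father, mother} == {"AA", "BB"}:
        return True
    # Rule 2: with both parents heterozygous, an unaffected child who
    # is homozygous for an allele rules that allele out. If both
    # alleles are ruled out, the SNP is excluded.
    if father == "AB" and mother == "AB":
        return "AA" in unaffected_sibs and "BB" in unaffected_sibs
    return False

def excluded_fraction(father: List[str], mother: List[str],
                      sibs: List[List[str]]) -> float:
    """Fraction of SNPs excluded in one family (all lists in map order)."""
    flags = [
        snp_excluded(f, m, [sib[i] for sib in sibs])
        for i, (f, m) in enumerate(zip(father, mother))
    ]
    return sum(flags) / len(flags)

father = ["AA", "AB", "AB", "AA"]
mother = ["BB", "AB", "AB", "AB"]
sibs = [["AB", "AA", "AA", "AA"],
        ["AB", "BB", "AB", "AB"]]
print(excluded_fraction(father, mother, sibs))
```

A single family excludes relatively little with rules this strict. Combining several families only makes sense if they all have a mutation in the same gene.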

Meckel-Gruber Syndrome (MKS3)
DNA available from individuals with yellow symbols. No data from affected individuals.

SAMPLE test data
SAMPLE excludes most of the genome (~98%), and the remaining regions can be checked using microsatellites.

Problems with SAMPLE
All the pedigrees have to have a mutation in the same gene.
It works at the level of individual SNPs and does not consider extended haplotypes.

Phaser
What is it:
  A program that uses logic to determine the phase of the genotypes of the SNPs on each chromosome.
  It can then calculate how autozygous each person is and how related one pedigree is to another, and find common haplotypes in affecteds.
Requirements:
  It needs SNP data for the parents and at least two children, and ideally a number of pedigrees.
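The simplest step of phasing by logic is sketched below: at one SNP, work out which of a child's alleles came from which parent. This is an assumption about what such logic might look like, not Phaser's actual code. The function name and the two-character genotype strings are hypothetical.

```python
from typing import Optional, Tuple

def phase_child(father: str, mother: str,
                child: str) -> Optional[Tuple[str, str]]:
    """Return (paternal_allele, maternal_allele) for one SNP, or None
    when the phase is ambiguous or the trio is Mendelian-inconsistent.

    Genotypes are unordered two-character strings such as 'AA' or 'AB'.
    """
    first, second = child[0], child[1]
    if first == second:
        # Homozygous child: phase is trivial if both parents carry the allele.
        if first in father and first in mother:
            return (first, first)
        return None
    # Heterozygous child: try both assignments and keep those that
    # are compatible with the parental genotypes.
    options = [
        (pat, mat)
        for pat, mat in ((first, second), (second, first))
        if pat in father and mat in mother
    ]
    return options[0] if len(options) == 1 else None

print(phase_child("AA", "AB", "AB"))  # father can only have given A
print(phase_child("AB", "AB", "AB"))  # both assignments fit: ambiguous
```

SNPs where everyone in the trio is heterozygous stay unresolved at this step. Resolving more of them would need information beyond a single SNP in a single trio, such as further children or neighbouring SNPs already phased.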

Meckel-Gruber Syndrome (MKS3)
Phaser identifies the segments of chromosomes present in individuals, allowing the user to analyse dominant and recessive diseases.

Degree of relatedness

By knowing how related two pedigrees are, it is possible to judge how likely they are to have a common haplotype.

The problem with Phaser
It has not been tested exhaustively and so may not work!

Sequence analysis
Sanger sequencing mutation detection
Next generation clonal sequencing mutation detection

Genescreen
Rapid detection and annotation of sequence variants

Annotation of simple mutations
Single base mutations are automatically annotated with genomic, cDNA and protein information.
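As a rough illustration of annotating a single base change at the cDNA and protein level, here is a minimal sketch. It is not Genescreen's code: the function name, the HGVS-like output with one-letter amino acid codes, and the assumption that the input is a coding sequence starting at the ATG are all choices made for this example. Genomic coordinates are left out.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def annotate_substitution(cds: str, pos: int, alt: str):
    """Describe a single base substitution in a coding sequence.

    cds: coding sequence starting at the ATG (hypothetical input format)
    pos: 1-based cDNA position of the changed base
    alt: the new base
    Returns (cDNA description, protein description).
    """
    ref = cds[pos - 1]
    if ref == alt:
        raise ValueError("alt base equals reference base")
    codon_index = (pos - 1) // 3          # 0-based codon number
    offset = (pos - 1) % 3                # position within the codon
    ref_codon = cds[codon_index * 3: codon_index * 3 + 3]
    alt_codon = ref_codon[:offset] + alt + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    change = "=" if ref_aa == alt_aa else alt_aa   # '=' marks a silent change
    return f"c.{pos}{ref}>{alt}", f"p.{ref_aa}{codon_index + 1}{change}"

cds = "ATGGCTTGGTAA"  # Met-Ala-Trp-stop
print(annotate_substitution(cds, 4, "A"))
print(annotate_substitution(cds, 9, "A"))
```

Building the codon table from two strings keeps the sketch short. A real annotator would also need exon structure to map between genomic and cDNA positions.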

Annotation of complex mutations
Heterozygous indels are deconvoluted and annotated. This window also annotates indels and homozygous insertions and deletions.

Exporting data
Plain text, LOVD import file or a web page.
The web page is updatable and so acts as both a data display and a database.

Clonal sequencing
Nothing lasts for ever, so the current sequencing project is to create a program that analyses Illumina sequence data.
At the moment the base program analyses data at a rate of 3.6 billion bases an hour, or 320Mb of data a minute.

Underlying data for a heterozygous base change

Underlying data for a heterozygous base pair insertion

All released programs can be obtained from: