Inheritance Model testing Andrew Stubbs Dept. Bioinformatics.

Slides:



Advertisements
Similar presentations
Pedigrees Who do we inherit our traits from? DO YOU LOOK LIKE YOUR AUNT OR UNCLE? DO YOU AND YOUR COUSIN SHARE TRAITS?
Advertisements

Rollercoaster of Genes by Dr. Annette M. Parrott
Analysis of imputed rare variants
A. Dereeper, G. Sarah, F. Sabot, Y. Hueber Exploiting SNP polymorphism data Formation Bio-informatique, 9 au 13 février 2015.
Huntington’s Disease! Hamza Khan Jeremy Tague Period 2 January 29, 2010.
SMART/FHIR Genomic Resources An overview... For latest see
FOR FRESHERS Mendelian Inheritance. Mendelian inheritance There are two alleles of a gene on different sister chromosomes. Dominant alleles trump recessive.
:NEUROPSYCHIATRIC GENETICS [BIOSTATISTICS|BIOINFORMATICS] CORE BIOSTATISTIC/BIOINFORMATIC TOOLS FOR GENETICS DATA: DATA MANAGEMENT AND ANALYSIS RICHARD.
Mendelian pedigree patterns
High Throughput Sequencing
Supplementary slides. Mock-ups Exome overview Genomic coverage: lower quartile 1, median 23, upper quartile 35 Protocols: Aligner used: BWA v2.3 Reference.
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
Whole Exome Sequencing for Variant Discovery and Prioritisation
Guideline for ClinLabGeneticist tool Jinlian Wang
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
MES Genome Informatics I - Lecture VIII. Interpreting variants Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute,
1 Father of genetics. Studied traits in pea plants.
Do Now : Think-Pair-Share For a height characteristic when tall is dominant What would be the phenotypic ratio for offspring of heterozygous and homozygous.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
NGS data analysis CCM Seminar series Michael Liang:
Alexis DereeperCIBA courses – Brasil 2011 Detection and analysis of SNP polymorphisms.
ChrGeneticist introduction for reviewer Jinlian Wang 10/8/2014.
SCRIPPS GENOME ADVISER Galina Erikson Senior Bioinformatics Programmer The Scripps Translational Science Institute Scripps Translational Science Institute.
Pedigrees. What is a pedigree? A family tree that shows the passage of a trait.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Variant Calling Workshop.
A collaborative tool for sequence annotation. Contact:
Bioinformatics trainings, Vietnam Hanoi, November, 2015
Trinity College Dublin, The University of Dublin GE3M25: Data Analysis, Class 4 Karsten Hokamp, PhD Genetics TCD, 07/12/2015
Trinity College Dublin, The University of Dublin Data download: bioinf.gen.tcd.ie/GE3M25/project Get.fastq.gz file associated with your student ID
Lecture 3 Pedigrees and Human Conditions Genes and BioTechnology.
1 Human Genetics: Pedigrees. Pedigree Looks at family history and how a trait is inherited over several generations and can help predict inheritance patterns.
Punnett Squares. Traits P - Parental Generation Genotype: TT or tt Gametes: T or t F1 – First Filial Generation Genotype: Tt Gametes: T or t F2 – Second.
Genome STRiP ASHG Workshop demo materials
PEDIGREE CHARTS (and Test Crosses) Biology 12. Review: Each Pair of Gene Alleles Is Separated and Two Alleles End Up in Different Gametes.
© 2012 Genomatix GeneGrid finding disease causing variants in NGS data Claudia Gugenmus Genomatix Software GmbH Bayerstrasse 85a
Genetic disorders can be due to any of the following factors: A. Monogenetic Disorders: Caused by a mutation in a single gene 1. Autosomal recessive alleles:
Personalized genomics
Traits and Inheritance Have you wondered why you resemble the other people in your family?
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 2.
CHISOM AMAEFUNA ADELINE LAURENTE 1/29/10 PERIOD 2 Sickle Cell Anemia.
Introduction to Exome Analysis in Galaxy Carol Bult, Ph.D. Professor Deputy Director, JAX Cancer Center Short Course Bioinformatics Workshops 2014 Disclaimer…I.
Identifying disease causal variants Mendelian disorders A. Mesut Erzurumluoglu 1.
IGV Demo Slides:/g/funcgen/trainings/visualization/Demos/IGV_demo.ppt Galaxy Dev: 0.
Understanding PEDIGREEs.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is a federally funded research.
Patterns of single gene inheritance Mahmoud A. Alfaqih BDS PhD Jordan University of Science and Technology School of Medicine Department of Biochemistry.
From Reads to Results Exome-seq analysis at CCBR
Canadian Bioinformatics Workshops
NGS Analysis Using Galaxy
CHAPTER 9 Human Genetics
The Human-Mouse: Disease Connection in MGI (BETA)
Yonglan Zheng Galaxy Hands-on Demo Step-by-step Yonglan Zheng
Introduction to Data Formats and tools
ID Mapping tools: Converting Accessions between Databases
Week 9 Vocab Definitions
Human Genetics Pp
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
PEDIGREE CHARTS (and Test Crosses)
What do you notices about the x-link recessive mode of inheritance?
Linkage Analysis Problems
BF528 - Genomic Variation and SNP Analysis
Tracking Family History
Test Cross.
Pedigrees A Pedigree allows you to trace an inherited (genetic) disease through a family. The pattern of a pedigree helps determine: If the disease is.
Quiz #3, Problem #1 Consider the pedigree below, the allele responsible for the trait (a) is recessive to the wild-type allele (A). a. (2 pts) What is.
The Variant Call Format
Presentation transcript:

Inheritance Model testing Andrew Stubbs Dept. Bioinformatics

NGS Protocol Overview

GATK NGS Protocol Overview

Overview of Inheritance Models

Phasing by Transmission

Phasing a VCF file by Transmission

[Clinical]-Annotation

Clinical Significance

GEMINI (GEnome MINIng)

What is GEMINI

GEMINI Framework

Pedigree format

VCF representation of genotypes ZygosityVCF representation Heterozygous0/1; 1/2; 0/2; etc. Homozygous Reference Alternate 0/0; 1/1; 2/2; etc. 0/0 1/1; 2/2; 3/3; etc. Missing./0;./1;./.; etc.

Pre-GEMINI processing: Normalise VCF

Pre-GEMINI processing: Erasmus Galaxy VCF VEP Annotation

Pre-GEMINI processing: VCF Annotation

GEMINI Phasing by Transmission

GEMINI workflow

GEMINI Automated tools for disease inheritance models

GEMINI in our Galaxy

Data Sets VCF: trio.trim.vep.vcf.gz : column names: trio.trim.names.txt Ped: Maternal: mat.dominant.ped BAM: Maternal:1805.BAM.sorted.bam : Paternal:1847.BAM.sorted.bam : Proband:4805.BAM.sorted.bam Notes: 1.The VCF has been normalized and decomposed with VT 2.The VCF has been annotated with VEP.

Practical #1: Autosomal Dominant

1.Upload:trio.trim.vep.vcf.gz - 17,237 lines, 123 comments, format vcf, database hg19 mat.dominant.ped 2.Search for gemini autosomal. – Select dominant model – Use gemini database – Use default paramaters

Practical #1: Autosomal Dominant 1.Create new history mat.dom.ped 2.Upload: trio.trim.vep.vcf.gz & mat.dominant.ped

Practical #1: Autosomal Dominant Load database: VCF (VEP annotated format) & ped file Rename database : gemini_mat.dom.db

Practical #1: Autosomal Dominant Search for autosomal dominant and run model on database HOW MANY VARIANTS ARE DETECTED?

Practical #1: Autosomal Dominant 1.Filtering and sort: Filter – High severity : c71 == ‘HIGH’ – aaf_esp_ea : c95 <= 0.01 – aaf_exac_all : c127 <= 0.01

Practical #1: Autosomal Dominant

High severity : c71 == ‘HIGH’ aaf_esp_ea : c95 <= 0.01 aaf_exac_all : c127 <= 0.01

Practical #1: Autosomal Dominant 1.Generate output Column Select – Chrom, Start, End – Ref, Alt – gene, impact – gts, gt_depths 2.output

Practical #1: Autosomal Dominant 1.Filtering and sort: Filter – Maternal Dominant inheritance: 1,514 variants – High severity: c71 == ‘HIGH’ : 15 variants – aaf_esp_ea : c95 <= 0.01: 7 variants – aaf_exac_all : c127 <= 0.01: 4 variants 2.Column Select

Practical #1: Autosomal Dominant 1.Variant visualisation – Create new history – Get Data: Load files from disk: 3 BAM files

Practical #1: Autosomal Dominant 1.Variant visualisation – Create trackster visual  add other bam files

Practical #1: Autosomal Dominant 1.Variant visualisation – Check variants 1847 paternal 1805 maternal 4805 proband

Practical #2: Autosomal Dominant 1.Create ped file for PATERNAL DOMINANT 2.Re-run Analysis for PATERNAL DOMINANT 3.What is the outcome? 4.Are any of these variants known to underlie a clinical phenotype in either Maternal of Paternal dominant inheritance model? We will review the results together ….

Practical #3: Autosomal Dominant

Command -q SELECT chrom, start, end, ref, alt, gene, impact, (gts).(*), (gt_depths).(*) FROM variants WHERE filter is NULL and impact_severity == 'HIGH‘ and (aaf_esp_ea <= 0.01 or aaf_esp_ea is NULL) and (aaf_exac_all <= 0.01 or aaf_exac_all is NULL) --header --gt-filter (gt_types).(phenotype==2).(==HET).(all) and (gt_types).(phenotype==1).(==HOM_REF).(all) and (gt_depths).(*).(>=20).(all)

Practical #3: Autosomal Dominant Use the gemini query tool to select maternal dominant inheritance. Is there a difference with your results in Practical # 1