Sequencing shRNA libraries with DNA Sudoku Yaniv Erlich Hannon Lab Yaniv Erlich Hannon Lab Compressed Genotyping Cold Spring Harbor.

Slides:



Advertisements
Similar presentations
Biology Day 1.
Advertisements

“Emery’s Elements of Medical genetics”
Applications of HGP Genetic testing Forensics. Testing for a pathogenic mutation in a certain gene in an individual that indicate a persons risk of developing.
Huong Le Department of Molecular & Clinical Genetics, Royal Prince Alfred Hospital Click mouse to move to the next slide.
Lab 10: Mutation, Selection and Drift. Goals 1.Effect of mutation on allele frequency. 2.Effect of mutation and selection on allele frequency. 3.Effect.
Discover Biology FIFTH EDITION
shRNA libraries with DNA Sudoku Yaniv Erlich Hannon Lab Yaniv Erlich Hannon Lab shRNA libraries sequencing using DNA Sudoku.
PRIORITIZING REGIONS OF CANDIDATE GENES FOR EFFICIENT MUTATION SCREENING.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
Hardy-Weinberg Equilibrium
AP Biology Lab Review.
C HAPTER 14- H UMAN H EREDITY BIG IDEA: How can we use genetics to study human inheritance?
Population Genetics: An introduction Change in Populations & Communities: Population Genetics.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Population Genetics “The study of genetic variation and its causes in population”
One-way migration. Migration There are two populations (x and y), each with a different frequency of A alleles (px and py). Assume migrants are from population.
Population Genetics direct extension of Mendel’s laws, molecular genetics, and the ideas of Darwin Instead of genetic transmission between individuals,
Hybrid Dense/Sparse Matrices in Compressed Sensing Reconstruction
Introducing the Hardy-Weinberg principle The Hardy-Weinberg principle is a mathematical model used to calculate the allele frequencies of traits with dominant.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Modern View of Evolution: Genetic Change. Genes and Variation.
Title: Population Genetics 12th February 2014
Unit 6 Lesson 5 Punnett Squares and Pedigrees
Ensuring the Quality of Genetic Testing ICORD Meeting September 14, 2007 Lisa Kalman, PhD Coordinator, GeT-RM CDC
Detection of Rare-Alleles and Their Carriers Using Compressed Se(que)nsing Or Zuk Broad Institute of MIT and Harvard In collaboration.
Chapter 14: Genetic Screening and Counselling. Higher Human Biology Unit 1: Cell Function and Inheritance 07/09/20151Mrs Smith: Ch14: Genetic Screening.
Population Genetics Learning Objectives
Linear Reduction for Haplotype Inference Alex Zelikovsky joint work with Jingwu He WABI 2004.
Learning Goal 1 Natural Selection is a major mechanism of evolution
14 Population Genetics and Evolution. Population Genetics Population genetics involves the application of genetic principles to entire populations of.
Chapter 7 Population Genetics. Introduction Genes act on individuals and flow through families. The forces that determine gene frequencies act at the.
Evolution Chapters Evolution is both Factual and the basis of broader theory What does this mean? What are some factual examples of evolution?
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Human Inheritance- Important Facts 1) Sex chromosomes carry genes that determine whether a person is male or female. Girls have XX Boys have XY 2) Sex.
Computational Molecular Biology Non-unique Probe Selection via Group Testing.
Fossil Record & Homologies  Scientists have used the fossil record to construct a history of life on Earth. This is only a theory Fossil record is not.
Watson School of Biological Sciences Cold Spring Harbor Laboratory Watson School of Biological Sciences Cold Spring Harbor Laboratory.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Mechanisms of Population Evolution
CHAPTER 9 Patterns of Inheritance Part 3. Human Genetic Analysis  Since humans live under variable conditions, in different places, and have long life.
Advances in Genetics SPI 0707.T/E.3 Distinguish between the intended benefits and the unintended consequences of a new technology.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
BIOLOGY 12 Introduction to Genetics. “If our strands of DNA were stretched out in a line, the 46 chromosomes making up the human genome would extend more.
SNPs, Haplotypes, Disease Associations Algorithmic Foundations of Computational Biology II Course 1 Prof. Sorin Istrail.
Computational Molecular Biology Non-unique Probe Selection via Group Testing.
Genetic Disorders Ch. 5 section 2.
Heredity. In the past, traits were thought to be carried by the blood. We now know what genes are responsible for certain traits and that the genes are.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
The process of evolution n Macroevolution –The creation of new species and the extinction of old ones. The accumulation of genotypic and phenotypic changes.
Unit 1 – Living Cells.  The study of the human genome  - involves sequencing DNA nucleotides  - and relating this to gene functions  In 2003, the.
Computational Molecular Biology
EVOLUTION …via Natural Selection. Organisms produce more offspring than can survive.
National Taiwan University Department of Computer Science and Information Engineering An Approximation Algorithm for Haplotype Inference by Maximum Parsimony.
Personal DNA Testing Melanie Wark Mackenzie Steen.
© 2007 McGraw-Hill Higher Education. All rights reserved. Chapter 2 Genetics: You and Your Family Health History.
POINT > Define Hardy-Weinberg Equilibrium POINT > Use Hardy-Weinberg to determine allele frequencies POINT > Define “heterozygous advantage” POINT > Describe.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Psychology Unit 1 Vocabulary. Unit 1 - Psychology 1. Applied research 2. Basic research 3. Biological perspective 4. Cognitive perspective 5. Functionalism.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
Chapter 5: Genetics and genomics perspectives in nursing
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
Genetic Engineering.
Genetic Variation Within Populations
Inheriting a Genetic Disorder
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
KEY CONCEPT Hardy-Weinberg equilibrium provides a framework for understanding how populations evolve.
Presentation transcript:

Sequencing shRNA libraries with DNA Sudoku Yaniv Erlich Hannon Lab Yaniv Erlich Hannon Lab Compressed Genotyping Cold Spring Harbor Laboratory

Poster in a nutshell Genotyping is the process of determining the genetic variation for a certain trait in an individual. It is one of the main diagnostic tools in medical genetics - Finding carriers for rare genetic diseases such as Cystic Fibrosis - Tissue matching in organ donation - Forensic DNA analysis Until now - only serial genotyping is possible. This is expensive and tedious. Taking advantage on the ‘signal sparsity’, we developed and tested a compressed genotyping framework.

Sequencing shRNA libraries with DNA Sudoku Significant volumes of knowledge have been accumulated in recent years linking subtle genetic variations to a wide variety of medical disorders from cystic fibrosis to mental retardation. Nevertheless, there are still great challenges in applying this knowledge routinely in the clinic, largely due to the relatively tedious and expensive process of DNA sequencing. Since the genetic polymorphisms that underlie these disorders are relatively rare in the human population, the presence or absence of a disease-linked polymorphism can be thought of as a sparse signal. Using methods and ideas from compressed sensing and group testing, we have developed a cost-effective reconstruction protocol, called "DNA Sudoku", to retrieve useful data. In particular, we have adapted our scheme to a recently developed class of high throughput DNA sequencing technologies, and assembled a mathematical framework that has some important distinctions from 'traditional' compressed sensing ideas in order to address different biological and technical constraints. Abstract

The genotyping problem Input: Thousands of specimens Output: Genotype of each specimen Genotype

Genotyping as a sparse graph reconstruction Samples Alleles An example of carrier screen for Cystic Fibrosis. There are two allele nodes, the Wild Type (WT) and the and the Cystic Fibrosis mutation. Samples 1, 2, 3, 5 are WT, while specimen 4 is a carrier. The specimen labeled with ’X’ is affected and does not enter to the screen. Genotyping is equivalent of finding the edges in the graph. THE GRAPH IS SPARSE 1.Number of carriers is very low 2.No affected individuals 3.The degree of every sample node is always two (human genome is diploid) Genotyping is equivalent to reveal the edges of the bipartite graph

Sequencing shRNA libraries with DNA Sudoku The main idea – pooled processing One could reveal the graph edges by DNA sequence each sample - expensive, tedious, and slow Better: Pool the samples and then sequence the pools

Sequencing shRNA libraries with DNA Sudoku Allele Pool What the observer sees The biadjacency matrix of the graph What the observer wants The pooling design A binary matrix (‘1’ – in the pool, ‘0’ – otherwise) Mathematically speaking Pool Specimen

What is a good pooling design WhyAttribute Decodability Less genotyping assays Small number of pools The robot can pull several specimens every step Constant column weight Less robotics efforts Low column weight Reducing the chance for biological noise Low row weight Trivial compressed sensing demands Biological oriented requirements We need a light-weight d-disjunct matrix

Inputs: N (number of specimens) Column Weight (robotics efforts) Algorithm: 1. Find W numbers {x 1,x 2,…,x w } such that: (a)Bigger than (b)Pairwise coprime 2. Generate W modular equations: 3. Construct the pooling matrix upon the modular equations Output: Pooling matrix Light Chinese Design The algorithm reaches the bound derived by Kautz & Singleton (1964)

Example of a pooling matrix

Decoding the genotyping results by Belief Propagation The pooled results can be decoded as using Belief Propagation Specimens Pools Genotyping results A-priori biological information

03/06/09 Example of Belief Propagation Specimens Pools Specimen is in a pool #1 #2 #3 #4 #5 #6 #7 CBAD CBAD CBAD CBAD CBAD CBAD CBAD DCA ACB CBA CDB 1.You can be either A, C, or D Possible genotypes: 2. I can’t be B 3.Specimen #3, #6 and #7: One of you guys should be B CBAD CBAD CBAD

Simulation results 1000 specimens W = 5 Total pools = 180 Number of carriers

Real results – biotechnology application 40,000 specimens W = 5 Total pools = 1900

Work in progress

References & Acknowledgments Compressed Genotyping. Yaniv Erlich, Assaf Gordon, Michael Brand, Gregory J. Hannon & Partha P. Mitra. Submitted to IEEE Trans. Info. Theory DNA Sudoku - harnessing high-throughput sequencing for multiplexed specimen analysis. Yaniv Erlich, Kenneth Chang, Assaf Gordon, Roy Ronen, Oron Navon, Michelle Rooks & Gregory J. Hannon. Genome Research Lindsay-Goldberg Fellowship