Cell Lineage Analysis of a Mouse Tumor

Slides:



Advertisements
Similar presentations
Algorithms (and Datastructures) Lecture 3 MAS 714 part 2 Hartmut Klauck.
Advertisements

Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
METHODS FOR HAPLOTYPE RECONSTRUCTION
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic reconstruction
DNA fingerprinting Every human carries a unique set of genes (except twins!) The order of the base pairs in the sequence of every human varies In a single.
Molecular Evolution Revised 29/12/06
DNA marker analysis Mrs. Stewart Medical Interventions Central Magnet School.
Tutorial #6 by Ma’ayan Fishelson Based on notes by Terry Speed.
Methods of identification and localization of the DNA coding sequences Jacek Leluk Interdisciplinary Centre for Mathematical and Computational Modelling,
Heuristic alignment algorithms and cost matrices
4. Ad-hoc I: Hierarchical clustering
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
CSE182-L17 Clustering Population Genetics: Basics.
Incorporating Mutations
Microsatellite Instability Detection by Next Generation Sequencing S.J. Salipante, S.M. Scroggins, H.L. Hampel, E.H. Turner, and C.C. Pritchard September.
Phylogenetic Tree Construction and Related Problems Bioinformatics.
Phylogenetic trees Sushmita Roy BMI/CS 576
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Habil Zare Department of Genome Sciences University of Washington
Molecular phylogenetics
Additive genetic variance and heritability One of the most important questions we can ask to understand evolutionary change is how the phenotypes of parents.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Calculating branch lengths from distances. ABC A B C----- a b c.
We obtained breast cancer tissues from the Breast Cancer Biospecimen Repository of Fred Hutchinson Cancer Research Center. We performed two rounds of next-gen.
By the end of this lecture, students will learn: 1.What is cancer. 2.Genetics of cancer. 3.Oncogenes 4.Tumor suppressor genes. 5.DNA Repair genes 6.Genes.
Introduction to History of Life. Biological evolution consists of change in the hereditary characteristics of groups of organisms over the course of generations.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 Microarray Clustering. 2 Outline Microarrays Hierarchical Clustering K-Means Clustering Corrupted Cliques Problem CAST Clustering Algorithm.
DNA marker analysis Mrs. Stewart Medical Interventions Central Magnet School.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
Dan Frumkin*, Adam Wasserstrom*, Shai Kaplan, Uriel Feige & Ehud Shapiro * equal contributors Genomic variability within an organism exposes its cell lineage.
Simple-Sequence Length Polymorphisms
A Multi-stage Approach to Detect Gene-gene Interactions Associated with Multiple Correlated Phenotypes Zhou Xiangdong,Keith Chan, Danhong Zhu Department.
Colorectal Pretumor Progression Before and After Loss of DNA Mismatch Repair  Peter Calabrese, Jen-Lan Tsao, Yasushi Yatabe, Reijo Salovaara, Jukka-Pekka.
Evolutionary genomics can now be applied beyond ‘model’ organisms
Distance based phylogenetics
Linkage and Linkage Disequilibrium
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Multiple Alignment and Phylogenetic Trees
Genetics of qualitative and quantitative phenotypes
Biological Classification: The science of taxonomy
Ranking Tumor Phylogeny Trees by Likelihood
Patterns in Evolution I. Phylogenetic
Molecular Clocks Rose Hoberman.
Distinguishing de Novo Second Cancer Formation from Tumor Recurrence
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Single-Cell Transcriptomics Meets Lineage Tracing
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Outline Cancer Progression Models
Phylogeny.
Unit Genomic sequencing
Winter 2019 Lecture 11 Minimum Spanning Trees (Part II)
Tutorial #6 by Ma’ayan Fishelson
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
INTRODUCTION A graph G=(V,E) consists of a finite non empty set of vertices V , and a finite set of edges E which connect pairs of vertices .
Presentation transcript:

Cell Lineage Analysis of a Mouse Tumor Elena Helman November 6, 2008

Classic model of cancer cell lineage Tumor is derived from a single founder cell that has acquired a growth advantage over normal cells by genetic modification. The progeny of the founder cell expands in an evolutionary pattern.

Microsatellites (MS) Microsatellite (or Simple Sequence Repeat) : loci of DNA consisting of repeating units of 1-6 bp in length. Mutations in MS loci act as “molecular tumor clocks” that record past tumor histories Why MS mutation as a marker in cell lineage reconstruction? MS slippage mutations are coupled to cell division. high and broad mutation rates occur independently at different loci, usually without affecting phenotype as most are found in noncoding genomic sequences; highly abundant in human, mouse, and many other organisms;

Procedure (biology) 37 single MLH1 –/– mouse lymphoma cells and adjacent normal cells For each single cell : DNA extracted Amplified by WGA Genotyped at 120 MS loci by PCRs and capillary electrophoresis DNA extracted from a tail clipping of the same animal, which represented the DNA of the zygote

Procedure (computational) Input: 120 MS for 37 cells Each cell is represented by a vector of length 120 (“digital identifier”), where each element is the repeat count of the MS at that locus. Vi = count of MSi Cell v = {V1,V2,…,V120} Cell w = {W1,W2,…,W120} Distance function: Dist(v,w) = i=1Σ120 (Vi – Wi) / 120 Apply NJ using this pairwise distance matrix Output: reconstructed cell lineage tree, rooted from the zygote sample Incorporate depths to translate vertical branch distances into absolute numbers of cell division

Depth assignment Depth = number of mitotic divisions since the zygote A calibration CCT (cultured cell tree) is created and reconstructed Multiplier – number representing the ratio between the reconstructed and actual depths (256 in this case). The multiplier of the CCTs is the slope of the linear regression between actual and reconstructed depths of CCT nodes Estimated depth of each cell was its reconstructed depth multiplied by the multiplier obtained from the mouse CCT

Depth calibration

Cancer cell lineage reconstruction All cells from tumor foci clustered on a single subtree. Supports common clonal origin Statistically significant – hypergeometric test – score observed compared to random permutations 3 foci interspersed – local invasion not metastasis

Physical Distance Lineage distance among cancer cells is correlated to their physical distance  coherent growth pattern Divided cells into three categories corresponding to increasing physical distances. Lineage distance measure – sum of branch lengths that connects both cells to their most recent (i.e., deepest) common ancestor) Mean lineage distance was smallest between cells obtained from the same tissue section. Student’s t tests

Cell Depths Significant difference in depths between cancer cells and normal lung epithelial cells (236 vs 121) Kolmogorov-Smirnov test Normal T lymphocytes were not extracted so depth of tumor cells could not be directly compared. Age: Founder of tumor subtree is at depth 102 cell divisions, 134 cell divisions younger than the average cancer cell. Tumor is ~ 5 months old

Presence of TP53 mutation Amplified/sequenced a 240 bp fragment, spanning the mutational hotspot in codon 270 of exon 8 of commonly mutated TP53 gene. Identical mutation found in some, but not all, tumor cells. Why? Multiple independent events or single early event Present but not detectable (allele dropout) Mutation only in tumor subclone

Problems, future directions Is MS mutation the best indicator? Assume no selection (MS neutral) What if MS mutation confers advantage? Assume tumor and normal cells divide at same rate for age/depth. Definition of “cancer” “reversion” of (R-4) / mistaken identity MLH-/- tumors good cancer model?? Maybe that’s why monoclonal… Never reconstruct common ancestors

Graph construction: ‘triplet algorithm’ View every cell as a vertex in an auxiliary graph G. In an execution of a triplet subroutine that outputs cells A and B (say, on input A, B, and C), put an edge between A and B. As long as there are more than two connected components in the graph G, pick three vertices from three different components and execute a triplet subroutine on them, thereby adding an edge to the graph and decreasing the number of connected components by one. After m steps, two connected components remain. Each one of them necessarily corresponds to a subtree of depth at most d.The condensed version of each of the subtrees can be inferred separately by repeating the above procedure. The basic primitive of the triplet algorithm is a ‘‘triplet subroutine.’’ Given identifiers for three cells (say, A, B, and C), the triplet subroutine counts for every pair of cells the number of common mutations, namely, the number of loci in which the two cells have the same label, and moreover, this label is different from the corresponding label of the root. The pair of cells that maximize this count (say, A and B) are output by the triplet subroutine. We say that the triplet subroutine is ‘‘successful’’ if the pair of cells that it outputs is the one that has the longer common branch (or equivalently, the deeper common ancestor).