Download presentation
Presentation is loading. Please wait.
Published byTodd Roberts Modified over 9 years ago
1
Estimating Mutation Rates from Clonal Tree Data (using modeling to understand the immune system) Steven H. Kleinstein Department of Computer Science Princeton University
2
The Immune System Protects the body from dangerous pathogens –viruses, bacteria, parasites Provides basis for vaccines (e.g., flu) Implicated in disease: –Autoimmune (Lupus, MS, Rheumatoid Arthritis) –Sepsis, Cancer Relatively new science, began with Jenner in 1796 Understanding will lead to better diagnostics and therapies
3
Why Model the Immune System? Immune response involves the collective and coordinated response of 10 12 cells and molecules Distributed throughout body –blood, lymph nodes, spleen, thymus, bone marrow, etc. Interactions involve feedback loops and non-linear dynamics Experiments often require artificial constructs High variability observed in experimental results Somatic Hypermutation: important component of response Experiments provide only a static window onto the real dynamics of immunity
4
Talk Outline Background Immunology 1.What are clonal trees? 2.What can we learn from them? 3.Method for estimating the hypermutation rate Simulation Method Analytical Method 4.Results on simulation and experimental data Summary, Conclusions and Open Problems
5
Germinal Center Effector Cells Overview of Humoral Immune Response Ag B B B B B Germinal Centers are the Site of Affinity Maturation (selection for cells with affinity-increasing mutations) Germinal Centers are the Site of Affinity Maturation (selection for cells with affinity-increasing mutations) B B Foreign Pathogen (Antigen) B B B
6
More about Germinal Centers Hundreds of germinal centers in spleen Composed of many cell types (B, T, FDC) Spatial structure that changes over time Driven by stochastic mutation process Observation involves sacrifice of animal While experiments have elucidated basic mechanisms, how these fit together is not well understood While experiments have elucidated basic mechanisms, how these fit together is not well understood
7
How do we learn about somatic hypermutation? 1.Inject mouse with antigen to provoke an immune response 2.Harvest spleen, cross-section and stain to identify proliferating cells B cells T cells 3.Microdissect groups of cells 4.Sequence DNA coding for B cell receptor
8
Experimental Microdissection Data How can we estimate the rate of mutation?
9
Estimating the B Cell Mutation Rate Counting mutations is not sufficient, number of cell divisions at time of measurement is unknown Also, problems of mutation schedule and positive/negative selection Number of Cell Divisions Number of Mutations High Mutation Rate Low Mutation Rate Observed Number of Mutations
10
Clonal Tree Data from Micro-Dissection Clonal trees from sequence data based on pattern of shared mutations GermlineGGGATTCTC 1-C-----G- 2-------G- 3A------GA 4A---C--GA Clonal tree ‘shapes’ reflect underlying dynamics 1(G A) 9(C A) 3 13 4 8(T G) 2(G C) 5(T C) Germline
11
Clonal Tree ‘Shape’ Reflects Underlying Dynamics BACD gctc BAD Initial Sequence a t a D t gct BA Investigate with computer simulation of B cell clonal expansion Simulate clonal trees with known mutation rate and number of divisions Compare:Rate of 0.2 division -1 for 14 divisions Rate of 0.4 division -1 for 7 divisions Use simulation to identify relevant shape measures
12
Intermediate Vertices is Useful Measure Compare:Rate of 0.2 division -1 for 14 divisions Rate of 0.4 division -1 for 7 divisions Shape measures can supplement information from mutation counting (Simulation Data)
13
Method for Estimating Mutation Rate Find mutation rate that produces distribution of tree ‘shapes’ most equivalent to observed set of trees Assumes equivalent mutation rate in all trees, although number divisions may differ Also developed analytical method based on same underlying idea (Kleinstein, Louzoun and Shlomchik, The Journal of Immunology, In Press) Also developed analytical method based on same underlying idea (Kleinstein, Louzoun and Shlomchik, The Journal of Immunology, In Press) Experimental Observations Set of Observed Tree Shapes Mutation Rate Distribution of Tree Shapes Simulation of B cell expansion Number of mutations Intermediate vertices Sequences at root =
14
Simulation Method 12…d Tree 1 Tree 2 … Tree n # divisions (d) 12…d Tree 1 Tree 2 … Tree n # divisions (d) Equivalent, E(t,d) Observable, O(t,d) Used version of Golden Section Search to optimize mutation rate ( ) 2.Likelihood of experimentally observed tree t: 3.Likelihood of experimental dataset: 1.Run simulation many times to fill in equivalent and observable matrices Reasonable clone size? Same tree shape? For each mutation rate ( ) and lethal fraction ( )… t t
15
Validating the Simulation Method Use simulation to construct artificial data sets with limited number of trees/sequences reflecting currently available experimental data Method works even with limited number of clonal trees and sequences Results for simulation method Estimate of method precision (SD = 0.035 division -1 )
16
Analytical Method Formulas to approximate tree shapes… Minimize error X( ) over all experimentally observed trees (t) For each observed tree, choose number of divisions to minimize error Observed shapeCalculated shape The average number of mutations per sequence (M) The average number of sequences present at the root of the tree (R) Total number of sequences in nodes with repeated sequences (P)
17
Validating the Analytical Method Use simulation to construct artificial data sets with limited number of trees/sequences reflecting currently available experimental data Method works even with limited number of clonal trees and sequences
18
Experimental Data from Autoimmune Response Data set consists of 31 trees from 7 mice, average 6 sequences / tree Data set particularly well-suited for estimating mutation rate Defined pick sizes provide upper-bound on clone size Small picks (< 50 cells) minimize positive selection Micro-dissection from extra-follicular areas in MRL/lpr AM14 heavy chain transgenic mice (William, Euler, Christensen, and Shlomchik. Science. 2002 ) B cells (Idiotype + RF) T cells (CD3 + ) Single Micro-dissections
19
Stems Pruned to Reflect Local Expansion Closely related cells remain in close spatial proximity Mutation information retained in local branching structure Long stems may be artifact of local micro-dissection Initial Sequence Time Stem is removed New tree root Pick
20
Mutation Rate in an Autoimmune Response Estimated mutation rate is 0.7 x 10 -3 – 0.9 x 10 -3 base-pair -1 division -1 Consider range of values for lethal mutation frequency ( ) Likely value based on FWR R:S Ratios
21
Mutation Rate in the Primary NP Response Data set consists of 23 trees, average 7 sequences / tree (Jacob et al., 1991), (Jacob and Kelsoe, 1992), (Jacob et al., 1993) and (Radmacher et al., 1998) Estimated mutation rate is 0.9 x 10 -3 – 1.1 x 10 -3 base-pair -1 division -1 Data based on many large picks: Clone sizes may be very large In estimation, limited to: 10 3 in simulation estimate 10 4 in analytical estimate Positive selection clearly factor Still evaluating validity of methods for this NP data set
22
Summary for Estimating Mutation Rate Mutation rate reflected in clonal tree ‘shapes’ Developed simulation and analytical methods Synthetic datasets to estimate precision Applied to autoimmune response & NP response Most accurate mutation rate estimate to date Future improvements with additional data
23
Noisy Optimization Can we adapt the number of runs at each parameter setting to the minimum required to determine which direction to look? How many runs should be used for each likelihood evaluation?
24
Bias in Simulation Method Can we remove the bias from the simulation method? No bias is present in the analytical method
25
Acknowledgements For more information: stevenk@cs.princeton.edu, www.cs.princeton.edu/~stevenk Yoram Louzoun (Bar-Ilan University) Mark Shlomchik (Yale University) (Kleinstein, Louzoun and Shlomchik, The Journal of Immunology, In Press)
26
PICASso PICASso Program in Integrative Information, Computer and Application Sciences Interdisciplinary Computational Seminars –Graduate student-oriented –Wednesdays at noon PICSciE Computational Colloquium –Tuesdays at 4pm Information and mailing list (see website) www.cs.princeton.edu/picasso
27
Summary of Experimental Data A n Total number of cells used to generate the sequences U n Number of unique sequences used to create the tree T n The average number of mutations per sequence M n The number of unique mutations in the tree I N The number of intermediate nodes (with sampled sequences) R N The number of sampled sequences at the root of the tree Autoimmune Response Primary NP Response
28
Acknowledgements For more information: stevenk@cs.princeton.edu, www.cs.princeton.edu/~stevenk Yoram Louzoun (Bar-Ilan University) Mark Shlomchik (Yale University) (Kleinstein, Louzoun and Shlomchik, The Journal of Immunology, In Press)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.