Pejman Mohammadi, Niko Beerenwinkel, Yaakov Benenson  Cell Systems 

Slides:



Advertisements
Similar presentations
Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Advertisements

Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
Volume 44, Issue 1, Pages (January 2016)
Volume 2, Issue 2, Pages (February 2016)
Synthetic Gene Circuits Learn to Classify
Comparative Analysis of Single-Cell RNA Sequencing Methods
Volume 10, Issue 6, Pages (June 2018)
High-Resolution Laser Scanning Reveals Plant Architectures that Reflect Universal Network Design Principles  Adam Conn, Ullas V. Pedmale, Joanne Chory,
Influence of RNA Labeling on Expression Profiling of MicroRNAs
Signal, Noise, and Variation in Neural and Sensory-Motor Latency
Araceli Ramirez-Cardenas, Maria Moskaleva, Andreas Nieder 
Volume 6, Issue 5, Pages e5 (May 2018)
Volume 3, Issue 1, Pages (July 2016)
Volume 95, Issue 3, Pages e4 (August 2017)
Volume 1, Issue 6, Pages (December 2015)
Robust Detection of DNA Hypermethylation of ZNF154 as a Pan-Cancer Locus with in Silico Modeling for Blood-Based Diagnostic Development  Gennady Margolin,
Volume 84, Issue 1, Pages (October 2014)
Volume 94, Issue 2, Pages e6 (April 2017)
Volume 94, Issue 4, Pages e7 (May 2017)
Translation of Genotype to Phenotype by a Hierarchy of Cell Subsystems
Metabolic Network Prediction of Drug Side Effects
Feature- and Order-Based Timing Representations in the Frontal Cortex
Divisive Normalization in Olfactory Population Codes
Confidence as Bayesian Probability: From Neural Origins to Behavior
Hedging Your Bets by Learning Reward Correlations in the Human Brain
A Switching Observer for Human Perceptual Estimation
A Role for the Superior Colliculus in Decision Criteria
Dustin E. Stansbury, Thomas Naselaris, Jack L. Gallant  Neuron 
Cascade Models of Synaptically Stored Memories
Volume 66, Issue 4, Pages (May 2010)
Volume 3, Issue 1, Pages (July 2016)
Volume 94, Issue 2, Pages e6 (April 2017)
Andrew D. Bosson, Jesse R. Zamudio, Phillip A. Sharp  Molecular Cell 
Continuum of Gene-Expression Profiles Provides Spatial Division of Labor within a Differentiated Cell Type  Miri Adler, Yael Korem Kohanim, Avichai Tendler,
Contrast Adaptation and Representation in Human Early Visual Cortex
Volume 85, Issue 4, Pages (February 2015)
Volume 5, Issue 4, Pages e4 (October 2017)
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
The Spatiotemporal Organization of the Striatum Encodes Action Space
Franco Pestilli, Marisa Carrasco, David J. Heeger, Justin L. Gardner 
Volume 4, Issue 5, Pages e5 (May 2017)
A Switching Observer for Human Perceptual Estimation
Volume 3, Issue 5, Pages e13 (November 2016)
Neuronal Selectivity and Local Map Structure in Visual Cortex
Medial Axis Shape Coding in Macaque Inferotemporal Cortex
Volume 91, Issue 5, Pages (September 2016)
A Scalable Population Code for Time in the Striatum
Volume 95, Issue 5, Pages e5 (August 2017)
NF-κB Dynamics Discriminate between TNF Doses in Single Cells
Volume 89, Issue 6, Pages (March 2016)
Volume 6, Issue 2, Pages e5 (February 2018)
James M. Jeanne, Tatyana O. Sharpee, Timothy Q. Gentner  Neuron 
Patrick Kaifosh, Attila Losonczy  Neuron 
Guilhem Ibos, David J. Freedman  Neuron 
Volume 26, Issue 5, Pages (March 2016)
Franco Pestilli, Marisa Carrasco, David J. Heeger, Justin L. Gardner 
Daniel E. Winkowski, Eric I. Knudsen  Neuron 
Volume 6, Issue 2, Pages e5 (February 2018)
Volume 88, Issue 4, Pages (November 2015)
Volume 82, Issue 3, Pages (May 2014)
Supervised Calibration Relies on the Multisensory Percept
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways  Nathan Archer, Mark D. Walsh, Vahid Shahrezaei,
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Volume 26, Issue 12, Pages e5 (March 2019)
Volume 66, Issue 4, Pages (May 2010)
Supratim Ray, John H.R. Maunsell  Neuron 
Xiaoshu Chen, Jianzhi Zhang  Cell Systems 
Comparing 3D Genome Organization in Multiple Species Using Phylo-HMRF
Patrick Kaifosh, Attila Losonczy  Neuron 
Speed-Accuracy Tradeoff in Olfaction
Presentation transcript:

Automated Design of Synthetic Cell Classifier Circuits Using a Two-Step Optimization Strategy  Pejman Mohammadi, Niko Beerenwinkel, Yaakov Benenson  Cell Systems  Volume 4, Issue 2, Pages 207-218.e14 (February 2017) DOI: 10.1016/j.cels.2017.01.003 Copyright © 2017 Elsevier Inc. Terms and Conditions

Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 1 Cell Type Classification with Synthetic Gene Circuits (A) Schematic representation of classifier circuit operation. The circuit senses three endogenous miRNA species designated by blue, red, and green colors. When delivered into a mixed cell population, the circuit triggers apoptosis (dotted red line) in cells with high concentration of red or blue miRNA and low concentration of green miRNA. (B) A classifier gene circuit (left) and the biochemical parameters used in the model. The corresponding logic circuit is on the right. (C) Schematic presentation of logic of a classifier circuit. General circuit is shown using American National Standards Institute (ANSI) symbols for logic gates. A circuit may include disjunctions (OR gates) with up to three miRNA inputs and a number of standalone NOT gates that feed into a conjunction (AND) gate with the total number of inputs not exceeding ten. Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 2 Evaluating Classifier Circuit Performance Using Expected On-Off Ratio (A) Schematics of calculating the expected On-Off ratio (YES gate, top, AND gate, bottom). The plots on the left show the probability distributions for input gene expression (G1 and G2). The middle plots show the logic output I next to the continuous model output, C(θ), calculated with parameters θ. The violin plots on the right show the output probability distributions for the samples that fall into on and off input domains, respectively. (B) Distribution of parameter values in circuits with the top 10% of the On-Off ratios shown for NOT (top) and a YES (bottom) gates. Parameter correlations are displayed on the right. Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 3 Classifier Gene Circuits before and after Parameter Optimization (A) Left to right, input-output relation for a logical YES gate, continuous YES gate with best-guess parameters (Table S1), and those optimized for the highest On-Off ratio. (B) Plots similar to (A) for a two-input AND circuit. (C and D) Input-output relation for YES (C) and two-input AND (D) circuits calculated using three parameter sets optimized for binarization thresholds (t) of 0.2% (θ1), 1% (θ2), and 5% (θ3) of the total cellular miRNA pool (Table S1). Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 4 Evaluating Circuit Performance on Annotated Data (A) Performance evaluation of two example classifier circuits over a given input dataset. The dataset (left) includes three positive and seven negative samples. Circuit outputs and the annotation are used to calculate the Area Under the receiver operator curve (AUC). True positive rate (TPR) and false positive rate (FPR) refer to the false and the true positive rate, respectively. Average margin (m¯) and worst margin (w) are shown. (B) Propagation of the binary margins along the circuit. A set of inputs, the output, and their respective margins are shown for a four-input example circuit. The output value, on/off, is calculated using Boolean logic. The margin at each gate is the smallest fold change that can switch its output value. The critical margin values propagated to the next part of the circuit are highlighted in bold red font. Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 5 Synthetic Data Experiment (A) The simulation workflow. (B–D) Upper and lower panels correspond respectively to the specific and general search methods. (B) Overall fraction of perfect classifiers identified using the search methods. (C) Probability of missing a perfect solution as a function of the employed convergence threshold. (D) Correlation between classification margins of circuits used to produce data (x axis) and the margin of learned circuits (y axis). The red line is the linear regression fit. Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 6 Summary of Classifier Circuits Found Using the Specific Search Method on 18 Example Problems from Three Public Datasets For the breast cancer dataset and the human miRNA atlas data, all healthy samples in the data are used as negative samples. For the mouse lymphopoiesis data, the designated cell types are contrasted with all the remaining cell types in the dataset. (A) Circuit output levels. Each triangle corresponds to one sample, with red upward triangles corresponding to the positive and blue downward triangles to the negative samples. (B–D) Other panels show the area under the receiver operating characteristic (ROC) curve (AUC) (B), the average (C), and the worst (D) classification margin, respectively, for each of the classification tasks. The black bars show the values from the best learnt classifier (training performance), while the white boxes are the estimated values from 3-fold cross-validation (generalization performance). The error bars show the SEM estimate. Each classification was repeated for three different parameter sets θ1, θ2, and θ3 (Table S1), and the parameter set giving the best performance is reported in parentheses. (E) Performance of the identified classifier circuits as a function of implementation accuracy. Performance was estimated using Monte Carlo simulation. Each point is the average over 1,000 simulations of each circuit with different sets of biochemical parameters. All parameters are perturbed simultaneously and the x axis is the expected deviation of the implemented circuit parameters from the intended optimized parameter values. (F and G) Classification accuracy as a function of cell-to-cell variation in the input miRNA expressions (F) and in both miRNAs and the circuit parameters (G). Each point is estimated from 10,000 simulated single cells. The columns “Learning” and “Cross val.” correspond to the black and white bars in (B), and σ/μ is the coefficient of variation (see also Figures S5 and S6). Colors in the ROC AUC panels represent the average margin using the original parameters (i.e., the black bars in C). Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions

Figure 7 Reducing the Size of Learnt miRNA Classifier Circuits with Bootstrap-Based Pruning (A) Examples of learnt networks for all breast cancer samples (upper panel) and breast cancer cell lines (lower panel). (B) The same examples after pruning. (C) The left panel shows the number of genes in the best learnt circuits, and the corresponding pruned circuits for 18 real data experiments. The next two panels show the area under the ROC curve and average classification margin for the best and pruned classifiers estimated from the learning data (L) and cross-validation (CV). (D) Comparison of circuit performance learnt using specific and the general method in cross-validation samples. Cell Systems 2017 4, 207-218.e14DOI: (10.1016/j.cels.2017.01.003) Copyright © 2017 Elsevier Inc. Terms and Conditions