Design Flow Enhancements for DNA Arrays Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department, University of California.

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.
Marius Nicolae Computer Science and Engineering Department
Quality and Error Control Coding for DNA Microarrays Olgica Milenkovic ECE Department University of Colorado, Boulder IEEE Denver ComSoc.
Minimum Implant Area-Aware Gate Sizing and Placement
Optimal Testing of Digital Microfluidic Biochips: A Multiple Traveling Salesman Problem R. Garfinkel 1, I.I. Măndoiu 2, B. Paşaniuc 2 and A. Zelikovsky.
Multi-Project Reticle Floorplanning and Wafer Dicing Andrew B. Kahng 1 Ion I. Mandoiu 2 Qinke Wang 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department, University.
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Border Length Minimization in DNA Array Design A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Design and Optimization of Universal DNA Arrays Ion Mandoiu CSE Department & BME Program University of Connecticut.
Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.
Background: Scan-Based Delay Fault Testing Sequentially apply initialization, launch test vector pairs that differ by 1-bit shift A vector pair induces.
Evaluation of Placement Techniques for DNA Probe Array Layout Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department,
Estimation of alternative splicing isoform frequencies from RNA-Seq data Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
: Grid graph :Draw two rays from each concave point Rays are divided into non-intersected ray-segments Conflict pair: two ray segments from the same point.
Power-Aware Placement
WaferReticle Project Yield-Driven Multi-Project Reticle Design and Wafer Dicing Andrew B. Kahng 1, Ion Mandoiu 2, Xu Xu 1, and Alex Z. Zelikovsky 3 1.
Reticle Floorplanning With Guaranteed Yield for Multi-Project Wafers Andrew B. Kahng ECE and CSE Dept. University of California San Diego Sherief Reda.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
On Legalization of Row-Based Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA 92093
Yield- and Cost-Driven Fracturing for Variable Shaped-Beam Mask Writing Andrew B. Kahng CSE and ECE Departments, UCSD Xu Xu CSE Department, UCSD Alex Zelikovsky.
Fast and Area-Efficient Phase Conflict Detection and Correction in Standard-Cell Layouts Charles Chiang, Synopsys Andrew B. Kahng, UC San Diego Subarna.
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Implementation of Planted Motif Search Algorithms PMS1 and PMS2 Clifford Locke BioGrid REU, Summer 2008 Department of Computer Science and Engineering.
Layout-aware Scan-based Delay Fault Testing Puneet Gupta 1 Andrew B. Kahng 1 Ion Mandoiu 2 Puneet Sharma 1 1 ECE Department, University of California –
Border Length Minimization in DNA Array Design A.B. Kahng, I.I. Mandoiu, P.A. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Estimation of alternative splicing isoform frequencies from RNA-Seq data Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
Triple Patterning Aware Detailed Placement With Constrained Pattern Assignment Haitong Tian, Yuelin Du, Hongbo Zhang, Zigang Xiao, Martin D.F. Wong.
May 25, GSU Biotech Symposium1 Minimum PCR Primer Set Selection with Amplification Length and Uniqueness Constraints Ion Mandoiu University of.
Engineering a Scalable Placement Heuristic for DNA Probe Arrays A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Engineering a Scalable Placement Heuristic for DNA Probe Arrays A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
with an emphasis on DNA microarrays
Molecular Biology Dr. Chaim Wachtel April 4, 2013.
1 Outline Last time: –Molecular biology primer (sections ) –PCR Today: –More basic techniques for manipulating DNA (Sec. 3.8) Cutting into shorter.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
Development and Evaluation of a Comprehensive Functional Gene array for Environmental Studies Zhili He 1,2, C. W. Schadt 2, T. Gentry 2, J. Liebich 3,
Ultra Scale High Density Hybrid DNA Memory Mohamad Al-Sheikhly, William Bentley, Aris Christou, Joseph Silverman Department of Materials Science and Department.
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
Graph Theory And Bioinformatics Jason Wengert. Outline Introduction to Graphs Eulerian Paths & Hamiltonian Cycles Interval Graph & Shape of Genes Sequencing.
Microarray - Leukemia vs. normal GeneChip System.
VarDetect: a nucleotide sequence variation exploratory tool VarDetect Chumpol Ngamphiw 1, Supasak Kulawonganunchai 2, Anunchai Assawamakin 3, Ekachai Jenwitheesuk.
JM - 1 Introduction to Bioinformatics: Lecture III Genome Assembly and String Matching Jarek Meller Jarek Meller Division of Biomedical.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Combinatorial Optimization Problems in Computational Biology Ion Mandoiu CSE Department.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
1 Efficient Obstacle-Avoiding Rectilinear Steiner Tree Construction Chung-Wei Lin, Szu-Yu Chen, Chi-Feng Li, Yao-Wen Chang, Chia-Lin Yang National Taiwan.
Labeling with Genisphere Kit Mary Lee S. Ledbetter College of the Holy Cross.
Genomics I: The Transcriptome
GeneChip® Probe Arrays
Genotype Calling Jackson Pang Digvijay Singh Electrical Engineering, UCLA.
Consolidating Software Tools for DNA Microarray Design and Manufacturing Mourad Atlas Nisar Hundewale Ludmila Perelygina Alex Zelikovsky.
AFFYMETRIX GENECHIP ARRAY STATION Pierce Chuang, Vishu Rajamani, Jeremy Rasiah By.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Outline Motivation and Contributions Related Works ILP Formulation
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Introduction to Oligonucleotide Microarray Technology
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
Research Techniques Made Simple: Next-Generation Sequencing:
Ultra Scale High Density Hybrid DNA Memory Mohamad Al-Sheikhly, William Bentley, Aris Christou, Joseph Silverman Department of Materials Science.
James D. Z. Ma Department of Electrical and Computer Engineering
L. Perelygina (BIO-GSU)
Presentation transcript:

Design Flow Enhancements for DNA Arrays Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department, University of California at San Diego (2) CSE Department, University of Connecticut (3) CS Department, Georgia State University

Introduction to DNA microarrays and manufacturing challenges Outline DNA microarray design flow DNA microarray design flow enhancements: Integration of Probe Placement and Embedding Integration of Probe Selection and Physical Design Conclusions and future research directions

Uses of DNA arrays Introduction to DNA microarrays Practical experiment using DNA arrays DNA manufacturing process Problems and challenges in DNA manufacturing process

Introduction to DNA Probe Arrays DNA Arrays (Gene Chips) used in wide range of genomic analyses gene expression detection drug discovery mutation detection Diverse fields from health care to environmental sciences DNA Arrays are composed of probes where each probe is a sequence of 25 nucleotides

Images courtesy of Affymetrix. Tagged RNA fragments flushed over array Laser activation of fluorescent tags Optical scanning of hybridization intensities DNA Array Hybridization Experiment

DNA Array Manufacturing Process Very Large-Scale Immobilized Polymer Synthesis (VLSIPS) Treat substrate with chemically protected linker molecules Selectively expose array sites to light Flush chip’s surface with solution of protected A, C, G, T Repeat last two steps until desired probes are synthesized

Probe Synthesis array probes A 3×3 array CGACACG ACAC ACGAGAG CG AGAGC Nucleotide Deposition Sequence ACG A  Mask 1 A A A A A

Probe Synthesis array probes A 3×3 array CGCGACACG ACAC ACGACGAGAG CGCG AGAGC Nucleotide Deposition Sequence ACG C  Mask 2 C C C C C C A A A A A

Probe Synthesis array probes A 3×3 array CGCGACG ACGAGAG CGCG AGAGC Nucleotide Deposition Sequence ACG G  Mask 3 C C C C C C A A A A A G GG G G G A Nucleotide Deposition Sequence defines the order of nucleotide deposition A Probe Embedding specifies the steps it uses in the nucleotide sequence to get synthesized

VLSIPS Manufacturing Challenges Lamp Mask Array Problem: Diffraction, internal reflection, scattering, internal illumination Occurs at sites near to intentionally exposed sites Reduce interference  Increase yield  Reduce cost Design objective: Minimize the border length

Unwanted Illumination and Border Cost array probes A 3×3 array CGACG ACGAG CG AGC Nucleotide Deposition Sequence ACG A  Mask 1 A A A A A Border = 8 Border Reduction  Unwanted illumination  Chip’s yield

Introduction to DNA arrays manufacturing challenges Outline DNA array design flow DNA array design flow enhancements: Integration of Probe Placement and Embedding Integration of Probe Selection and Physical Design Conclusions

Previous Work Border minimization was first introduced by Feldman and Pevzner. “Gray Code masks for sequencing by hybridization,” Genomics, 1994, pp Work by Hannenhalli et al. gave heuristics for the placement problem by using a TSP formulation. Kahng et al. “Border length minimization in DNA Array Design,” WABI02, suggested constructive methods for placement and embedding Kahng et al. “Engineering a Scalable Placement Heuristic for DNA Probe Arrays,” RECOMB03, suggested scalable placement improvement and embedding techniques

Basic DNA Array Design Flow Probe Selection Design of Test Probes Probe Placement Probe Embedding DNA Array Logic Synthesis BIST and DFT Placement Routing VLSI Chip Physical Design Probe Placement Probe Embedding Probe Selection Design of Test Probes Logic Synthesis BIST and DFT Physic al Design Routing Placement Analogy

Design Flow Outline Physical Design  Degrees of freedom (DOF) in probe embedding  DOF exploitation for border conflict reduction Probe Embedding Probe Placement  Similar probes should be placed close together  Constructive placement  Placement improvement operators

Key DOF: Probe Embedding (Alignment) A A A C C C G G G T T T Deposition Sequence C T G Hypothetical Probe Group C G T Synchronous Embedding C T G As Soon As Possible (ASAP) Embedding C G T Another Embedding

Embedding Determines Border Conflicts A A A C C C T T T G G G A C T G A G T G T G A A Synchronous Embedding A G T A G G T A Deposition Sequence Probes G A A G T A G T ASAP Embedding G

Optimal Probe Embedding T A A G A G T A C A T G Before optimal re-embedding A T A A G G T A C A T G After optimal re-embedding A Using Dynamic Programming to optimally re- embed a probe Problem: Optimally embedding a probe with respect to its neighbors Kahng et al. “Border Length Minimization in DNA Array Design,” WABI02 A A A A A A

Placement Polishing Using Re-Embedding Use optimal re-embedding algorithm to re-embed each probe with respect to its neighbors

Placement Objective: Minimize Border Radix-sort the probes in lexicographical order Probe 1 Probe 2 Probe 3 Probe 4 Probe 5 TATTATAAA A CA GGCC CGGG TATT ATAA A A CA GGCC CGGG 123 Problem: How to place the 1-D ordering of probes onto the 2-D chip? Radix-sorting the probes order reduces discrepancies between adjacent probes

Placement By Threading TATTATAAA A CA GGCC CGGG Probe 1 Probe 2 Probe 3 Probe 4 Probe 5 Thread on the chip

Row-Epitaxial Placement Improvement Array of size 4 × 4 For each site position (i, j): From within the next k rows, find the best probe to place in (i, j) Move the best probe to (i, j) and lock it in this position Row placement = sort + thread + row epitaxial

Introduction to DNA arrays manufacturing challenges Outline DNA array design flow DNA array design flow enhancements: Integration of Probe Placement and Embedding Integration of Probe Selection and Physical Design Conclusions

DNA array design flow enhancements Integration of Probe Placement and Embedding Integration of Probe Selection and Physical Design  Initial embeddings influence the placement results  Propose and implement two flows  Probe pools add additional degrees of freedom  Integrate probe selection into physical design  Propose and implement two flows incorporating probe pools Physical Design Probe Selection Design of Test Probes Probe Placement Probe Embedding DNA Array

Integration of Probe Placement and Embedding Probe Selection Design of Test Probes Probe Placement Probe Embedding DNA Array Integrating placement and probe embedding gives a further reduction in border conflicts. Probe Placement Probe Embedding Analogous to tighter integration between placement and routing in VLSI physical design

Integration of Probe Placement and Embedding 1. Synchronous initial embedding 3. Re-embedding using DP 2. Row placement Flow A Row Epitaxial Re-embedding ASAP initial embedding 1. As Soon As Possible (ASAP) initial embedding 3. Re-embedding using DP 2. Row placement Flow B Chip size Conflicts 6%

Placement + Embedding Runtimes Row Epitaxial Re-embedding Chip size CPU (s) 1. Synchronous initial embedding 3. Re-embedding using DP 2. Row placement Flow A Row Epitaxial Re-embedding ASAP initial embedding 1. As Soon As Possible (ASAP) initial embedding 3. Re-embedding using DP 2. Row placement Flow B

Second Enhancement: Probe Pools Probe Selection Design of Test Probes Probe Placement Probe Embedding DNA Array Physical Design Problem: Given a probe pool for every target sequence, select a probe for every target sequence such that the total conflict after placement and alignment is minimum. Probe 1Probe 2Probe 3Probe 4 Gene Target Sequence Probe Pool – Pool Size = 4

Integrating Probe Selection and Physical Design 1. Perform ASAP embedding of all probe candidates 3. Re-embedding 2. Run row placement selecting the probe from the pool that gives the minimum conflict Flow A ASAP initial embedding 1. Perform ASAP embedding of all probe candidates 3. Run row placement using the selected candidates 2. From each probe pool select the probe that fits in the least number of steps using ASAP Flow B 4. Re-embedding

Results (Conflicts) of Probe Pools Chip size = 100 Pool Size Conflicts Chip size = 300 Pool Size Conflicts Chip size = 500 Pool Size Conflicts Chip size = 200 Pool Size Conflicts

Comparison of Probe Pools Flows Chip size = 100 Pool Size Conflicts Chip size = 300 Pool Size Conflicts Chip size = 500 Pool Size Conflicts Chip size = 200 Pool Size Conflicts

Results (runtime) of Probe Pools Chip size = 100 Pool Size CPU (1000s) Chip size = 300 Pool Size CPU (1000s) Chip size = 500 Pool Size CPU (1000s) Chip size = 200 Pool Size CPU (1000s)

Interpretation and Summary of Experimental Data Initial ASAP embeddings produce a decent reduction in border conflicts. Probe pools offer an extra degree of freedom exploited to further reduce border conflicts Integration of placement and embedding yield up to 6% improvement Probe pools add an extra 12-13% improvement Total improvement up to 18% compared to results published in the literature

Open Research Directions Probe selection should incorporate ability to uniquely detect target sequences present in sample. This should be done with no ambiguity. Methods similar to Boolean covering and test diagnosis can be used. P1P1 P2P2 P3P3 P4P4 P5P5 T1T1 T2T2 T3T3 T4T4 Each target sequence should have a unique signature Probes Target Sequences

Open Research Directions Stronger placement operators leading to further reduction in the border cost. Insertion of probe test can benefit from test and diagnosis topics for VLSI circuits. Future work also covers next generation chips 10k × 10k

Conclusions We presented a DNA design flow benefiting from experiences of the VLSI design flow We introduced feedback loops and integrated a number of steps for further reduction in the border cost and hence unwanted illumination We examined the effects of probe selection on both placement and embedding We examined the embedding options and placement on the total border cost

Thanks for your attention