L. Perelygina (BIO-GSU)

Slides:



Advertisements
Similar presentations
Design Flow Enhancements for DNA Arrays Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department, University of California.
Advertisements

Molecular Biology Primer
DNA microarray and array data analysis
Border Length Minimization in DNA Array Design A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Evaluation of Placement Techniques for DNA Probe Array Layout Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department,
Class 6 DNA Arrays BIOMEMS, Fall Content u Polymerase Chain Reaction or PCR u DNA Detection Process u DNA Micro Arrays u Electronic DNA Arrays u.
Probe design for microarrays using OligoWiz. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
The Human Genome Project and ~ 100 other genome projects:
Bacterial Physiology (Micr430)
Class 7 DNA and Protein Arrays BIOMEMS, Winter 2009.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Accurate Method for Fast Design of Diagnostic Oligonucleotide Probe Sets for DNA Microarrays Nazif Cihan Tas CMSC 838 Presentation.
Border Length Minimization in DNA Array Design A.B. Kahng, I.I. Mandoiu, P.A. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Introduce to Microarray
Engineering a Scalable Placement Heuristic for DNA Probe Arrays A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Engineering a Scalable Placement Heuristic for DNA Probe Arrays A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
with an emphasis on DNA microarrays
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Data Type 1: Microarrays
Microarray Technology
Introduction to DNA microarrays DTU - May Hanne Jarmer.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Genomics I: The Transcriptome
GeneChip® Probe Arrays
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Consolidating Software Tools for DNA Microarray Design and Manufacturing Mourad Atlas Nisar Hundewale Ludmila Perelygina Alex Zelikovsky.
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Example of a DNA array used to study gene expression (note green, yellow red colors; also note.
Solution of Satisfiability Problem on a Gel-Based DNA computer Ji Yoon Park Dept. of Biochem Hanyang University.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
A Little More Advanced Biotechnology Tools
Part 3 Gene Technology & Medicine
PCR Optimization To optimize PCR the following must be evaluated and fine tuned Target DNA Too small or too large Good or poor quality Complete or incomplete.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Gene Expression Analysis
CHAPTER 20 PART 3: A LITTLE MORE ADVANCED BIOTECHNOLOGY TOOLS
Microarray - Leukemia vs. normal GeneChip System.
The Basics of cDNA Microarray Technology
Primer design.
Microarray Technology and Applications
Example of a DNA Array (note green, yellow red colors; also note that only part of the total array is depicted)
Chapter 20 – DNA Technology and Genomics
Solution of Satisfiability Problem on a Gel-Based DNA computer
Introduction to cDNA Microarray Technology
DNA Technology.
Overview Expression data basics Introduction Biological network data
A Little More Advanced Biotechnology Tools
The Basics of Microarray Image Processing
Gene Chips.
Introduction to Microarrays.
A Little More Advanced Biotechnology Tools
A Little More Advanced Biotechnology Tools
A Little More Advanced Biotechnology Tools
Microarrays: biotechnology's discovery platform for functional genomics  Mark Schena, Renu A Heller, Thomas P Theriault, Ken Konrad, Eric Lachenmeier,
Data Type 1: Microarrays
Affymetrix GeneChip oligonucleotide microarray.
Design Issues Lecture Topic 6.
Presentation transcript:

L. Perelygina (BIO-GSU) Consolidating Software Tools for DNA Microarray Design and Manufacturing M. Atlas (CS-GSU) N. Hundewale (CS-GSU) L. Perelygina (BIO-GSU) A. Zelikovsky (CS-GSU)

Microarray Technology It is a device that allows for DNA to be bound to it for analysis with homologous cDNA or RNA. Refer to by other names: microchip, biochip, DNA chip, DNA microarray, gene array, GeneChip®, and genome chip Microarrays provide a tool for answering a wide variety of questions about the dynamics of cells: In which cells is each gene active? Under what environmental conditions is each gene active? How does the activity level of a gene change under different conditions? -Stage of a cell cycle? -Environmental conditions? -Diseases? What genes seem to be regulated together?

Microarray Technology Cont Two general types that are popular: Spotted Arrays (Pat Brown, Stanford). Oligonucleotide Arrays (Affymetrix). Both based on the same basic principles. anchoring pieces of DNA to glass/nylon slides. complementary hybridization. Spotted Arrays Control Cells (left) and Target Cells (right). Harvesting mRNA from both cell group Tagging the mRNA with green and red dye. Applying the mRNA to the cDNA microarray. Reading the result using a laser A false-color composite representing the results. Oligonucleotide Arrays Gene Chips Instead of putting entire genes on array, put sets of DNA 25-mers (synthesized oligonucleotides). Produced using a photolithography process similar to the ones used to make semiconductor chips. mRNA samples are processed separately instead of in pairs.

Hybridization experiment Analysis of hybridization intensities Genome ID DNA Array Flow Downloading genome sequence and producing ORF in FASTA format. For each gene G, find probes that hybridize to G at a given TM. do not hybridize to any other gene at that TM. In probe placement, determine for each probe a site on the array 2-D surface for it to be placed or synthesized. And in probe embeddings, the step where each nucleotide is synthesized is determined. DNA array goes through a combination of photolithography and combinatorial chemistry process. Each probe is quantified, and probes are diluted so that all are at an equal concentration. Hybridization intensities can be measured by a laser scanner and converted to a quantitative readout. Reading genomic data Probe selection Physical design Mask and array manufacturing Hybridization experiment Analysis of hybridization intensities

Or: ORF Finder Extracting Reading Genomic Data Genome ID Downloading genome sequence from genbank Bioperl ORF Extraction GeneMark (Bordovsky GaTech) Or: ORF Finder Extracting Extra ORFs: ( ) ORF parser ORF in FASTA format Probe selection

Probe Selection Probe selection ORF preprocessing Promide Choosing the best melting temperature Ocand :find all candidate for given temperature Temp Checker’s Pool of probes Physical design Max/ Min Temp

Probe Design Constraints Sequence Related Length of probes Deviation of melting temperature of probe-target hybrids must be low (for physical reasons) No self complementary regions longer than four nucleotides (not descriptive enough) Melting temperatures of target and non-target seq. must be larger than a predefined (too close, too hard to identify) Ensuring a minimum number of mismatches is enough (homologous sequences) System Related Execution Time Usability

Deposition sequence design Physical Design Probe selection Deposition sequence design Test control 2D-probe placement 3D-probe embedding Mask and array manufacturing

Mask and Array manufacturing

Array Manufacturing Very Large-Scale Immobilized Polymer Synthesis: Treat substrate with chemically protected “linker” molecules, creating rectangular array Site size = approx. 10x10 microns Selectively expose array sites to light Light deprotects exposed molecules, activating further synthesis Flush chip surface with solution of protected A,C,G,T Binding occurs at previously deprotected sites Repeat steps 2&3 until desired probes are synthesized

Benchmarks: Herpes B virus B virus is a member of the subfamily Alphaherpesvirinae from the genus Simplex virus. Herpes B (HB) virus is mild localized or asymptotic infection in its internal hosts . In contrast HB virus infection in foreign host, humans or monkeys species other than macaques often result in encephalitis, encephalomyelitis, and death. Herpes B virus genome is 156,798 bps long and includes 74 genes[Perelygina03]. In this study, we design a chip for Perelygina to carry on her experiments on HB virus.

Experimental Study Melting Temperature: Number of Candidates: In our experiment we have considered the following parameters and we measured the results for different values of these parameters. Melting Temperature: We choose the temperatures 60C and 65C as best melting temperatures for our DNA probe array. Number of Candidates: We experimented with different values of K (number of candidates) for each pools of probes: 1 and 2. Chip Size: We ran our Experiments with 2 different chip sizes. We experimented with 50x50 and 60x60. We give the number of conflict and runtime for each algorithm for the Herpes B virus and simulated data

  Herpes B Virus Simulated Data  K=1 # Conflicts CPU Time(sec) Initial 107577 265992 Tsort 98830 0.17 231526 0.08 Tsp 95640 0.22 227960 0.09 Lalign 79254 0.25 189272 0.1 Reptx 2 64830 4.45 154766 1.58 Chessboard 63594 15.58 150812 7.1   Herpes B Virus Simulated Data K=2 # Conflicts CPU Time(sec) Initial 54205 265328 Tsort 49746 0.3 232954 0.14 Tsp 48541 0.34 227762 0.15 LAlign 42858 0.42 182972 0.16 Reptx 2 32098 7.84 149332 3.16 Chessboard 31498 20.93 146708 10.89

Conclusion and Future work Our experiments show: The genomic data follow the pattern predicted by simulated data. In case of Herpes B virus, like simulated data, increasing number of candidates per probe (k) decreases number of border conflicts during the probe placement algorithms. The number of border conflicts is several times smaller than for simulated data. The trade-off between number of border conflicts and the CPU time taken for the various algorithms that are defined in the physical design. We give a concatenate software solution for the entire DNA array flow. We explore all steps in a single automated software suite of tools. Future work The entire software suite be made available through web services. Users can enter name of organism or ID and with an option of choosing to set the required parameters the suite will produce the DNA probe micro-array chip layout.