Probe Selection Problems in Gene Sequences. (C) 2003, SNU Biointelligence Lab, DNA Microarrays cDNA: PCR from.

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

Modeling sequence dependence of microarray probe signals Li Zhang Department of Biostatistics and Applied Mathematics MD Anderson Cancer Center.
9 Genomics and Beyond Brief Chapter Outline
Probe design for microarrays using OligoWiz Rasmus Wernersson, Assistant Professor Center for Biological Sequence Analysis Technical University of Denmark.
Genetic algorithms applied to multi-class prediction for the analysis of gene expressions data C.H. Ooi & Patrick Tan Presentation by Tim Hamilton.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Finding approximate palindromes in genomic sequences.
Probe design for microarrays using OligoWiz. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
Selection of Optimal DNA Oligos for Gene Expression Arrays Reporter : Wei-Ting Liu Date : Nov
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Using a Genetic Algorithm for Approximate String Matching on Genetic Code Carrie Mantsch December 5, 2003.
Introduction to computational genomics – hands on course Gene expression (Gasch et al) Unit 1: Mapper Unit 2: Aggregator and peak finder Solexa MNase Reads.
©2003/04 Alessandro Bogliolo Primer design. ©2003/04 Alessandro Bogliolo Outline 1.Polymerase Chain Reaction 2.Primer design.
Interdisciplinary Center for Biotechnology Research
PCR Primer Design Guidelines
PCR Primer Design
Technology and Methods Seminar
Biostatistics-Lecture 15 High-throughput sequencing and sequence alignment Ruibin Xi Peking University School of Mathematical Sciences.
IN THE NAME OF GOD. PCR Primer Design Lecturer: Dr. Farkhondeh Poursina.
PCR- Polymerase chain reaction
The methods used by molecular biologists to study DNA have been developed through adaptation of the chemical reactions and biological processes that occur.
Affymetrix vs. glass slide based arrays
Ch5 Sec3 Advances in Genetics. Key Concepts What are three ways of producing organisms with desired traits? What is the goal of the Human Genome Project?
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Screening a Library Plate out library on nutrient agar in petri dishes. Up to 50,000 plaques or colonies per plate.
Intro. ANN & Fuzzy Systems Lecture 36 GENETIC ALGORITHM (1)
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Dave Palmer Primer Design Dave Palmer
A mathematical model and a computerized simulation of PCR using complex templates Nucleic Acid Research, 1996, Vol. 24, No. 18, pp E. Rubin.
The Fidelity of the Tag-Antitag System J. A. Rose, R. J. Deaton, M. Hagiya, And A. Suyama DNA7 poster Summarized by Shin, Soo-Yong.
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Remember the limitations? –You must know the sequence of the primer sites to use PCR –How do you go about sequencing regions of a genome about which you.
More on Heuristics Genetic Algorithms (GA) Terminology Chromosome –candidate solution - {x 1, x 2,...., x n } Gene –variable - x j Allele –numerical.
5.3 – Advances in Genetics Trashketball!. Selecting organisms with desired traits to be parents of the next generation is… A. Inbreeding A. Inbreeding.
Chapter 4.1 Beyond “Classic” Search. What were the pieces necessary for “classic” search.
Design of oligonucleotides for microarrays and perspectives for design of multi-transcriptome arrays Henrik Bjorn Nielsen, Rasmus Wernersson and Steen.
GeneChip® Probe Arrays
CHAPTER SIX Nucleic acid hybridization: principles and applications 생물정보학협동과정 강민호.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Computational and experimental analysis of DNA shuffling : Supporting text N. Maheshri and D. V. Schaffer PNAS, vol. 100, no. 6, Summarized by.
Solving Function Optimization Problems with Genetic Algorithms September 26, 2001 Cho, Dong-Yeon , Tel:
Brückner et al., Fig. 1b Brückner et al., Fig. 1B a c b 6 Fig. 1. Circular representation of Streptococcus pneumoniae genome comparisons.
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Introduction to Oligonucleotide Microarray Technology
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
Components Of A Typical PCR Reaction Mix PCR Reaction Buffer (usually supplied at 10X) 100mM Tris-HCl - Essentially a pH Buffer. Maintains appropriate.
D. Darban, Ph.D Department of Microbiology School of Medicine Alborz University of Medical Sciences 1 Probe and Primer Design.
Detecting DNA with DNA probes arrays. DNA sequences can be detected by DNA probes and arrays (= collection of microscopic DNA spots attached to a solid.
Genetic Algorithm(GA)
Evolutionary Design of the Closed Loop Control on the Basis of NN-ANARX Model Using Genetic Algoritm.
Polymerase Chain Reaction
Using GA’s to Solve Problems
Author :Shigeomi HARA Hiroshi DOUZONO Yoshio NOGUCHI
From: Optimized design and assessment of whole genome tiling arrays
PCR TECHNIQUE
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
Artificial Intelligence Project 2 Genetic Algorithms
Selection of Oligonucleotide Probes for Protein Coding Sequences
Lecture 4: Probe & primer design
Chapter 14 Bioinformatics—the study of a genome
DNA-based technology New and old technologies that are utilized in biotechnology DNA cloning DNA libraries Polymerase chain reaction (PCR) Genome sequencing.
Nucleic Acid Interactions Practicalities
Fitness measures for DNA Computing
Information Theoretical Probe Selection for Hybridisation Experiments
Genetic algorithms: case study
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001)
Nearest-neighbor model for Tm
Presentation transcript:

Probe Selection Problems in Gene Sequences

(C) 2003, SNU Biointelligence Lab, DNA Microarrays cDNA: PCR from clones Oligonucleotide: design specific probes

(C) 2003, SNU Biointelligence Lab, Gene Detection Using Microarrays ….CCCATGGACTCAAG…. ….CCCTGGCGACAGTT…. ….AATCCTACGACGGC…. ….AGCTCTAGGCCCAT…. Each probe  Selected from gene sequences  Same length (20mer~)  Detect one gene

(C) 2003, SNU Biointelligence Lab, Probe Selection Uniqueness  Comparing potential probe sequences with the full-length sequences of other genes being monitored Hybridization characteristics  Tm among probes  GC content  Secondary structure  Position of core region  Number of mismatch

(C) 2003, SNU Biointelligence Lab, Affymetrix Probe Selection Criteria Single base(As, Ts, Cs, Gs)  Not exceeds 50% of the probe size Length of contiguous As and Ts or Cs and Gs region  Less than 25% of the probe size (G+C)%  Between 40 and 60% of the probe sequence  (G+C)% can be adjusted based on G+C contents of a genome sequence Contiguous repeat  No 15-long repeat anywhere in the entire coding sequence of the whole genome No self-complementary within probe sequence

(C) 2003, SNU Biointelligence Lab, Tm Calculation Approximate target DNA concentration  : enthalpy for helix formation  : entropy for helix formation  R : molar gas constant ( )  c : total molar concentration of the annealing oligonucleotides when oligonucleotides are not self-complementary

(C) 2003, SNU Biointelligence Lab, General Approach Probe sequence 생성  임의의 서열을 만들어 후보 probe 를 생성한다.  Target gene 서열로부터 후보 probe 를 생성한다.  일반적으로 유전자의 처음부터 sliding window 방식으로 생성 한다. ACGCGTCGCGAGGCCTAGGCC… 후보 probe 시퀀스

(C) 2003, SNU Biointelligence Lab, General Approach 후보 probe 서열 중 target sequences 이외의 유전자에 흔히 존재하는 것 들을 삭제한다. (blast 등과 같은 sequence alignment 프로그램을 이용한다 )  cross hybridization 발생 가능성 억제 Target 과 probe 간의 hybridization 이 잘되게 하기 위해 Tm 값을 이용해 후보 probe 서열들을 필터링한다. Sequence 의 2 차원 구조에 따른 필터링을 한다. Intramolecular 구조가 형성될 수 있는 것들을 제거.

(C) 2003, SNU Biointelligence Lab, GA Representation Chromosome represents possible probe set. Each position of a chromosome is the starting point of probe sequence in gene sequence. one probe (parameter: probe length c) probe set (size n: number of target genes) start position of selected region

(C) 2003, SNU Biointelligence Lab, GA Operators Fitness function  Linear combination of sequence match and probe characteristics  Combination rate can be varied with generation  Sequence match between probe and target gene  Perfect match: 1  Mismatch: 0  Considered probe characteristics  Tm  (G+C)%  Self-complementary (by sequence comparison) Population  Population  Parents are selected from roulette-wheel  One-point crossover with Pc, mutation with Pm

(C) 2003, SNU Biointelligence Lab, Optimizing Probe Set One probe(spot) is designed to detect one gene. Assumption  Optimal probe set will contain as few oligonucleotide as possible Goal  M probes detect N genes (M is less than or equal to N)  Find M probes which detect N genes

(C) 2003, SNU Biointelligence Lab, 3 target genes ….CCCATAGGCTCAAG…. ….CCTAGGCGCGCTCA…. ….AGCTCTAGGCCCAT….

(C) 2003, SNU Biointelligence Lab, GA Representation Probe sequences are selected from target gene sequences one probe (parameter: probe length c) probe set (size m: number of probes) start position of selected region gene id

(C) 2003, SNU Biointelligence Lab, Sequence Matching Target, probe Matching result between target c i and probe set  Can be represented as list or vector   fingerprint  Two clones are distinguishable  Find probe set which make all fingerprints different each other.

(C) 2003, SNU Biointelligence Lab, Consider number of mismatch Consider Blast search Design of probe sequences For fitness conditions, probe characteristics can be learned?