A Design Method of DNA chips for SNP Analysis Using Self Organizing Maps Author : Honjoy Saga Graduate : Chien-Ming Hsiao.

Slides:



Advertisements
Similar presentations
Microarray Technique, Analysis, and Applications in Dermatology Jennifer Villaseñor-Park 1 and Alex G Ortega-Loayza 2 1 Department of Dermatology, University.
Advertisements

Lecture 2 Strachan and Read Chapter 13
Applications of genome sequencing projects 1) Molecular Medicine 2) Energy sources and environmental applications 3) Risk assessment 4) Bioarchaeology,
applications of genome sequencing projects
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Cloning lab results Cloning the human genome Physical map of the chromosomes Genome sequencing Integrating physical and recombination maps Polymorphic.
Single nucleotide polymorphisms and applications Usman Roshan BNFO 601.
Border Length Minimization in DNA Array Design A.B. Kahng, I.I. Mandoiu, P. Pevzner, S. Reda (all UCSD), A. Zelikovsky (GSU)
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
DNA Barcode Data Analysis: Boosting Assignment Accuracy by Combining Distance- and Character-Based Classifiers Bogdan Paşaniuc, Sotirios Kentros and Ion.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
The Extraction of Single Nucleotide Polymorphisms and the Use of Current Sequencing Tools Stephen Tetreault Department of Mathematics and Computer Science.
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
SNP Discovery in the Human Genome C244/144 November 21, 2005.
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Genotyping of James Watson’s genome from Low-coverage Sequencing Data Sanjiv Dinakar and Yözen Hernández.
Tracking Video Objects in Cluttered Background
Information Aspects of Nucleic Acids Measurement Technologies Description of nucleic acid measurement technologies Algorithmic, optimization, data analysis.
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
Phylogeny - based on whole genome data
©2003/04 Alessandro Bogliolo Primer design. ©2003/04 Alessandro Bogliolo Outline 1.Polymerase Chain Reaction 2.Primer design.
1 Unity of Invention: Biotech Examples TC1600 Special Program Examiner Julie Burke (571)
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Todd J. Treangen, Steven L. Salzberg
Data Type 1: Microarrays
U.S. Patent and Trademark Office Technology Center 1600 Michael P. Woodward Unity of Invention: Biotech Examples.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
1 Velvet: Algorithms for De Novo Short Assembly Using De Bruijn Graphs March 12, 2008 Daniel R. Zerbino and Ewan Birney Presenter: Seunghak Lee.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Identification of Regulatory Binding Sites Using Minimum Spanning Trees Pacific Symposium on Biocomputing, pp , 2003 Reporter: Chu-Ting Tseng Advisor:
Microarrays.
An algorithm to guide selection of specific biomolecules to be studied by wet-lab experiments Jessica Wehner and Madhavi Ganapathiraju Department of Biomedical.
Microarray - Leukemia vs. normal GeneChip System.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
MAPPING GENOMES – genetic, physical & cytological maps Genetic distance (in cM) 1 centimorgan = 1 map unit, corresponding to recombination frequency of.
Paper: Large-Scale Clustering of cDNA-Fingerprinting Data Presented by: Srilatha Bhuvanapalli INFS 795 – Special Topics in Data Mining.
1 Gene Therapy Gene therapy: the attempt to cure an underlying genetic problem by insertion of a correct copy of a gene. –Tantalizingly simple and profound.
Combinatorial Optimization Problems in Computational Biology Ion Mandoiu CSE Department.
A genetic approach to the automatic clustering problem Author : Lin Yu Tseng Shiueng Bien Yang Graduate : Chien-Ming Hsiao.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
Deconstructing Storage Arrays Timothy E. Denehy, John Bent, Florentina I. Popovici, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau University of Wisconsin,
Hierarchical Clustering of Gene Expression Data Author : Feng Luo, Kun Tang Latifur Khan Graduate : Chien-Ming Hsiao.
Genotype Calling Jackson Pang Digvijay Singh Electrical Engineering, UCLA.
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Identification of Copy Number Variants using Genome Graphs
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Channel Independent Viterbi Algorithm (CIVA) for Blind Sequence Detection with Near MLSE Performance Xiaohua(Edward) Li State Univ. of New York at Binghamton.
Mining A Stream of Transactions for Customer Patterns Author: Diane Lambert Advisor: Dr. Hsu Graduate: Yan-cheng Lin.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Rival-Model Penalized Self-Organizing Map Yiu-ming Cheung.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Disease Diagnosis by DNAC MEC seminar 25 May 04. DNA chip Blood Biopsy Sample rRNA/mRNA/ tRNA RNA RNA with cDNA Hybridization Mixture of cell-lines Reference.
EE150a – Genomic Signal and Information Processing On DNA Microarrays Technology October 12, 2004.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Jessica K. Ting Michael K. Ng Hongqiang Rong Joshua Z. Huang 國立雲林科技大學.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
ViSOM - A Novel Method for Multivariate Data Projection and Structure Visualization Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Hujun Yin.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Community self-Organizing Map and its Application to Data Extraction Presenter: Chun-Ping Wu Authors:
Introduction to Oligonucleotide Microarray Technology
Using category-Based Adherence to Cluster Market-Basket Data Author : Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen Graduate : Chien-Ming Hsiao.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
Synteny - many distantly related species have co- linear maps for portions of their genomes; co-linearity between maize and sorghum, between maize and.
Author :Shigeomi HARA Hiroshi DOUZONO Yoshio NOGUCHI
Microarray - Leukemia vs. normal GeneChip System.
Controlling the Cost of Reliability in Peer-to-Peer Overlays
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Molecular Classification of Cancer
Self-organizing map numeric vectors and sequence motifs
Reseeding-based Test Set Embedding with Reduced Test Sequences
Presentation transcript:

A Design Method of DNA chips for SNP Analysis Using Self Organizing Maps Author : Honjoy Saga Graduate : Chien-Ming Hsiao

Outline  Motivation  Objective  Introduction  DNA Chip and Sequence Analyses  Feature Mapping of DNA sequence by SOM  Experimental results  Conclusions  Opinion

Motivation  Conventional DNA chips are showing tendency to be comprised of longer probes and larger in size to achieve a higher resolution.  To shrink the size of DNA chips.

Objective  Applied SOM to obtain common features of DNA sequences with small number of probes which efficiently cover the target sequence with sufficient resolution for finding the correct position of SNPs.

Introduction  DNA chip are powerful tools for sequencings SNP analyses of DNA sequences.  SNPs can be detected by cDNA micro array as the lacks of cDNA sequences in target sequences.  Exact positions of SNPs can known with conventional sequencings.

DNA Chip and Sequence Analyses  DNA chip is a biochemical chip on which probes are printed as an array of short sub-sequences.  DNA is represented by symbols (A,G,T,C) and each DNA hybridizes with its complement DNA. Such as A with T and G with C  SNPs are variations of DNAs which are found in every nucleotides, and they correspond to the effectiveness and side-effect of the medicines.

DNA Chip and Sequence Analyses

Feature Mapping of DNA sequence by SOM  algorithms Step-1 : Initialize the map of probes using random sequences of specified length. Step-2 : Select a position of reference sequence (RS) randomly and find the sub-sequence which starts from that position. Step-3 : Update the closest probe Pr found in Step-2 as follows. Step-4 : Update the probe Pr’ whose distance is closer than M-Dist from Pr geometrically using the same procedure in Step-3. Repeat Step-2 to Step-4, changing the value Th-U and M- Dist.

Experimental results

Conclusions  This algorithm can select a set of small amount of probes that represent the feature of DNA sequence for SNP analyses.  This algorithm shows better results according to the covering rates, but shows worse results for detection rates of SNPs.  The organize probes of 9bp can not sufficiently cover the reference sequence.

Opinion  can consider to use another cluster algorithms to obtain common features of DNA sequences with small number of probes which efficiently cover the target sequence with sufficient resolution for finding the correct position of SNPs.