Bioinformatics, Lecture 1: Introduction. Dr. Aladdin Hamwieh, Khalid Al-shamaa, Abdulqader Jighly. 2010-2011. Aleppo University, Faculty of Technical Engineering.

Presentation transcript:

Bioinformatics
Lecture 1: Introduction
Dr. Aladdin Hamwieh, Khalid Al-shamaa, Abdulqader Jighly
Aleppo University, Faculty of Technical Engineering, Department of Biotechnology

Main Lines
Definition
Bioinformatics areas
Bioinformatics data
– Data types
– Applications for these data
Next generation sequencing
Bioinformatics algorithms
Joint international programming initiatives

Definition
Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline.
Bioinformatics is the science of managing and analyzing biological data using advanced computing techniques.
Bioinformatics applies principles of information science to make the vast, diverse, and complex life sciences data more understandable and useful.

Definition
There are two extremes in bioinformatics work:
– Tool users (biologists): know the biology and how to press the buttons, but have no clue what happens inside the program
– Tool shapers (informaticians): know the algorithms and how the tool works, but have no clue about the biology

Bioinformatics areas
Molecular sequence analysis
1. Sequence alignment
2. Sequence database searching
3. Motif discovery
4. Gene and promoter finding
5. Reconstruction of evolutionary relationships
6. Genome assembly and comparison
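As a small illustration of the sequence-analysis tasks listed above, the following sketch counts overlapping k-mers, a common first step in simple motif discovery. It is a minimal sketch, not a method from the lecture: the function name `count_kmers` and the toy sequence are assumptions made for illustration.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count every overlapping substring of length k in the sequence."""
    sequence = sequence.upper()
    # If the sequence is shorter than k, the range is empty and so is the Counter.
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


# Hypothetical toy sequence, not taken from the lecture.
toy_sequence = "ATGCGATGCAATGC"
kmer_counts = count_kmers(toy_sequence, 4)

# The most frequent k-mer is a naive candidate motif.
candidate_motif, occurrences = kmer_counts.most_common(1)[0]
print(candidate_motif, occurrences)
```

Real motif discovery tools also allow mismatches and compare counts against a background model; this sketch only counts exact matches.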

Bioinformatics areas
Molecular structural analysis
1. Protein structure analysis
2. Nucleic acid structure analysis
3. Comparison
4. Classification
5. Prediction

Bioinformatics areas
Molecular functional analysis
1. Gene expression profiling
2. Protein–protein interaction prediction
3. Protein sub-cellular localization prediction
4. Metabolic pathway reconstruction
5. Simulation

Bioinformatics data
Different data types are commonly used in bioinformatics.
The same data may be used in different areas.

Data types
DNA sequences
RNA sequences
Expression (microarray) profile
Proteome (X-ray, NMR) profile
Metabolome profile
Haplotype profile
Phenotype profile

1- DNA Sequences
Simple sequence analysis
– Database searching
– Pairwise and multiple analysis
Regulatory regions
Gene finding
Whole genome annotation
Comparative genomics
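To make the "gene finding" item above more concrete, here is a deliberately naive sketch that scans the forward strand for open reading frames (ORFs): stretches that begin with ATG and end at the first in-frame stop codon. The function name `find_orfs`, the `min_codons` parameter, and the example sequence are assumptions for illustration. Real gene finders also scan the reverse strand and use statistical models.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 2) -> List[Tuple[int, int]]:
    """Return (start, end) positions of forward-strand ORFs.

    start is the index of the A in ATG; end is exclusive and includes the
    stop codon. min_codons counts codons before the stop, ATG included.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Step through the sequence one codon at a time in this frame.
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)


# Hypothetical example sequence, not taken from the lecture.
example = "CCATGAAATTTTAGGG"
for start, end in find_orfs(example):
    print(start, end, example[start:end])
```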

2- RNAs
Splice variants
Tissue specific expression
2D structure
3D structure
Single gene analysis
Microarray

2D and 3D structure of tRNA

2D and 3D structure of rRNA

Microarray
20,000 to 60,000 short DNA probes of specified sequences are tethered in an ordered arrangement on a small slide. Each probe corresponds to a particular short section of a gene.

Microarray
DNA microarrays measure RNA abundance with either 1 channel (one color) or 2 channels (two colors).
Stanford microarrays (two channels) use competitive hybridization to measure the relative expression under a given condition (labeled with the red fluorescent dye Cy5) compared to its control (labeled with the green fluorescent dye Cy3).
Affymetrix GeneChip has 1 channel and uses either the red fluorescent dye Cy5 or the green fluorescent dye Cy3.
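For a two-channel array, relative expression is commonly summarized as the log2 ratio of the Cy5 (condition) intensity to the Cy3 (control) intensity. The sketch below shows only that arithmetic. The function name `log2_ratio`, the gene names, and the intensity values are hypothetical, and background subtraction and normalization are assumed to have been done already.

```python
import math


def log2_ratio(cy5_intensity: float, cy3_intensity: float) -> float:
    """Log2 of condition (Cy5) over control (Cy3) intensity for one spot."""
    if cy5_intensity <= 0 or cy3_intensity <= 0:
        raise ValueError("intensities must be positive")
    return math.log2(cy5_intensity / cy3_intensity)


# Hypothetical spot intensities as (Cy5, Cy3); not data from the lecture.
spots = {
    "gene_a": (800.0, 200.0),  # higher under the condition
    "gene_b": (100.0, 100.0),  # unchanged
    "gene_c": (50.0, 200.0),   # lower under the condition
}

for gene, (cy5, cy3) in spots.items():
    print(gene, log2_ratio(cy5, cy3))
```

A positive value means higher expression under the condition than in the control, zero means no change, and a negative value means lower expression.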

3- Proteins
Protein sequence analysis
– Database searching
– Pairwise and multiple analysis
2D structure
3D structure
Classification of protein families
Protein arrays

3D structure

Animation

4- Metabolome and molecular biology
Metabolic pathways
Regulatory networks
Helps to understand systems biology

5- Haplotype
Molecular Markers
– RFLP
– RAPD
– SSR
– ISSR
– AFLP
– DArT
– SNP
– ….

SNP

6- Phenotype
Morphological data
Physiological data
Stress tolerance
Pathogenic infections
Disease resistance
Cancer types
…..

Haplotype & Phenotype

Next Generation Sequencing
Sequencing machine: SMRT, Helicos, AB SOLiD, Illumina Solexa, Roche GS FLX, ABI 3730
Target release / Launched
Read length
Reads/run: NA, 85M, 170M, 120M, 400K, 96
Throughput per run: NA, 2 GB, 6 GB, 100 MB, 0.1 MB
Cost/Mb: NA, $5.81 k, $5.97 k, $84.39, High cost

Short reads assembly problems

Algorithms in bioinformatics
String algorithms
Dynamic programming
Machine learning (NN, k-NN, SVM, GA, ..)
Markov chain models
Hidden Markov models
Markov Chain Monte Carlo (MCMC) algorithms
Stochastic context free grammars
EM algorithms
Gibbs sampling
Clustering
Tree algorithms (suffix trees)
Graph algorithms
Text analysis
Hybrid/combinatorial techniques
….
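Dynamic programming, listed above, is the basis of classical pairwise sequence alignment. The sketch below computes a global alignment score in the style of the Needleman-Wunsch algorithm with a linear gap penalty. The function name `global_alignment_score` and the scoring values (match 1, mismatch -1, gap -2) are assumptions chosen for illustration, not values from the lecture.

```python
def global_alignment_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -2) -> int:
    """Best global alignment score of a and b with a linear gap penalty."""
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against the empty string costs one gap per character.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    return score[-1][-1]


# Hypothetical toy sequences, not taken from the lecture.
print(global_alignment_score("ACGT", "AGT"))
```

The table has (len(a) + 1) x (len(b) + 1) cells, so time and memory both grow with the product of the two sequence lengths. Recovering the alignment itself would additionally require a traceback through the table.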

Joint international programming initiatives
Bioperl: http://
Biopython: http://
BioTcl: http://wiki.tcl.tk/12367
BioJava: www.biojava.org/wiki/Main_Page

Thank You