Bio + Informatics: An Overview AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC WWW.IBP.IR Iranian Bioinformatics Portal

Presentation transcript:

1 Bio + Informatics: An Overview
Iranian Bioinformatics Portal

2 Outline
Introduction
DNA
Definitions
Problems in bioinformatics
Conclusion

3 “Sciences reach a point where they become mathematized!” Leonard Adleman

4 Computing Devices
Computers → electronic components (transistors, …)
Brains → biological components (neurons, …)
Cells → biomolecular components (DNA, …)

5 DNA
Deoxyribonucleic acid: DNA
Four nucleotides (bases), or building blocks: A, T, G, C
Two strands zip together into a double helix through base pairing:
→ A with T
→ G with C
DNA is essentially digital
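The pairing rule on this slide (A with T, G with C) means that one strand determines its partner. A minimal sketch, assuming sequences are plain uppercase strings; the helper name `reverse_complement` is hypothetical and not from the slides:

```python
# Watson-Crick pairing as stated on the slide: A pairs with T, G pairs with C.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(dna):
    """Return the partner strand, read in its own 5'->3' direction.

    The two strands of a double helix run antiparallel, so the
    complement is reversed before it is returned.
    """
    return "".join(PAIR[base] for base in reversed(dna))

print(reverse_complement("AAACTG"))  # CAGTTT
```

Any character other than A, T, G or C raises a `KeyError`, which is probably the desired behaviour for a sketch that assumes clean input.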

6 Bioinformatics
Biomolecular computation
→ idea: use biomolecules and biochemical processes for solving computational problems
Computational molecular biology
→ goal: understand/explain biomolecular systems and mechanisms

7 After going through an age of specialization, the sciences are now reuniting into a common mode of inquiry. “The next generation could produce a scientist in the old sense, a real generalist.” Leonard Adleman

8 Biomolecular Computation
Idea: use biomolecules and biochemical processes for solving computational problems
Starting point: Leonard Adleman, 1994
→ solving the Hamiltonian Path Problem using liquid-phase DNA chemistry
Potential advantages:
→ speed through massive parallelism
→ energy efficiency
→ high storage density

9 Computational Molecular Biology (Bioinformatics)
Goal: understand/explain biomolecular systems and mechanisms
Application of computer technology to the management of biological information.
Using computers to gather, store, analyze and integrate biological and genetic information.

10 Problems in Bioinformatics

11 Sequencing Genomes
Full sequence: GAGGGAACACAGTCTGCACACTCCTTCCGATAT
Fragments: GAGGGAACACA, GTCTGCACACT, CCTTCCGATAT

12 Sequencing Genomes
Full sequence: GAGGGAACACAGTCTGCACACTCCTTCCGATAT
Overlapping fragments: GAGGGAACACAGT, AGTCTGCACACTC, CTCCTTCCGATAT

13 Sequencing Genomes
Concrete problem: Sequence assembly problem
→ given: fragments of a large DNA sequence with overlaps (multiple coverage)
→ want: the entire sequence
Complicating factors
→ computational complexity: can be seen as a variant of the shortest common superstring problem, which is known to be NP-hard
→ incorrect/missing nucleotides in the fragment data
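Because the exact shortest common superstring problem is NP-hard, assemblers rely on heuristics. The sketch below is one such heuristic, a greedy merge of the pair with the longest suffix/prefix overlap. It is an illustration only, not the method used in the slides: it assumes error-free fragments that all come from the same strand, and the function names `overlap` and `greedy_assemble` are hypothetical. The input is the three overlapping fragments from slide 12.

```python
def overlap(a, b):
    """Length of the longest suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0

def greedy_assemble(fragments):
    """Repeatedly merge the two fragments with the largest overlap.

    Greedy merging does not guarantee the shortest superstring;
    it is a heuristic for an NP-hard problem.
    """
    frags = list(fragments)
    if not frags:
        return ""
    while len(frags) > 1:
        best_k, best_i, best_j = -1, 0, 1
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        # Append the second fragment without its overlapping prefix.
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for idx, f in enumerate(frags)
                 if idx not in (best_i, best_j)] + [merged]
    return frags[0]

reads = ["GAGGGAACACAGT", "AGTCTGCACACTC", "CTCCTTCCGATAT"]
print(greedy_assemble(reads))  # GAGGGAACACAGTCTGCACACTCCTTCCGATAT
```

With real data the second complicating factor applies: sequencing errors break exact suffix/prefix matches, so practical tools score approximate overlaps instead.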

14 Relationships between Organisms
Concrete problem: Phylogenetic tree inference
→ given: homologous DNA sequences from multiple species
→ want: evolutionary tree relating these sequences
Complicating factors
→ errors in the sequences
→ complexity/quality of multiple sequence alignment
→ limited knowledge of evolutionary processes
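One common family of tree-inference methods starts from pairwise distances between aligned sequences; the slides do not say which method they have in mind, so the following is only a sketch of that first step. It assumes the sequences are already aligned to equal length, and the toy sequences and names (`p_distance`, `closest_pair`, `sp1` to `sp3`) are made up for illustration.

```python
from itertools import combinations

def p_distance(a, b):
    """Fraction of aligned positions at which two sequences differ."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)

def closest_pair(seqs):
    """Return the names of the two sequences with the smallest p-distance."""
    return min(combinations(sorted(seqs), 2),
               key=lambda pair: p_distance(seqs[pair[0]], seqs[pair[1]]))

# Hypothetical, already-aligned toy sequences.
toy = {"sp1": "AAACTGCT", "sp2": "AAACTGCA", "sp3": "ATACAGGA"}
print(closest_pair(toy))  # ('sp1', 'sp2')
```

A distance-based tree builder would join the closest pair first and repeat on the reduced matrix; that clustering step is left out here.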

15 Sequence Alignment

16 DNA-Genes-Proteins
DNA is the basic molecule of life: it directly controls the fundamental biology of life
Proteins determine the biological makeup of humans and all other living organisms
Variations and errors in the genomic DNA may lead to different diseases or disorders
DNA → Genes → Proteins

17 DNA → Proteins
DNA (gene)
↓
mRNA
↓
Protein
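The DNA → mRNA → protein flow can be sketched in a few lines. This is a simplification under stated assumptions: the input is the coding (sense) strand, translation starts at position 0 rather than searching for a start codon, splicing is ignored, and the standard genetic code is used. The names `transcribe` and `translate` are hypothetical.

```python
# Standard genetic code, laid out in the usual U, C, A, G order
# for the first, second and third codon position. "*" marks a stop codon.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def transcribe(coding_dna):
    """Coding-strand DNA to mRNA: same sequence with T replaced by U."""
    return coding_dna.replace("T", "U")

def translate(mrna):
    """Read codons from position 0 until a stop codon or the end."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

mrna = transcribe("ATGGCCTAA")
print(mrna)             # AUGGCCUAA
print(translate(mrna))  # MA
```

The result is a string of one-letter amino acid codes, here methionine followed by alanine.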

18 Computational Gene Finding
Given: raw sequence data
Predict:
→ coding and non-coding regions
→ exons/introns
→ splicing patterns
→ transcription factors
Pre-mRNA: Exon1, Intron1, Exon2, Intron2, Exon3
mRNA: Exon1, Exon2, Exon3
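The simplest signal a gene finder can look for in raw sequence is an open reading frame: a start codon followed, in the same frame, by a stop codon. The sketch below scans only the forward strand and ignores introns and splicing, so it is far cruder than the statistical models real gene finders use. The function name `find_orfs` and the `min_codons` parameter are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=2):
    """Return (start, end) index pairs of forward-strand open reading frames.

    An ORF here runs from an ATG to the first in-frame stop codon;
    `end` is exclusive and includes the stop codon. `min_codons` counts
    the codons before the stop.
    """
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return orfs

seq = "CCATGAAATAGGG"
for begin, end in find_orfs(seq):
    print(begin, end, seq[begin:end])  # 2 11 ATGAAATAG
```

A fuller version would also scan the reverse complement and would need separate models for exon/intron boundaries.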

19 Structure Prediction: RNA & Protein
Minimum free energy
RNA structure:
→ primary structure: single-stranded sequence of A, U, G, C
→ secondary structure: intramolecular base pairs among its bases
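Minimum free energy prediction needs an energy model that the slides do not give. As a stand-in, the sketch below uses the simpler Nussinov-style dynamic program, which maximizes the number of base pairs rather than minimizing free energy and cannot produce pseudoknots. It assumes Watson-Crick and G-U wobble pairs and a minimum hairpin loop of three unpaired bases; the name `max_base_pairs` and the `min_loop` parameter are hypothetical.

```python
# Allowed pairs: Watson-Crick (A-U, G-C) plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}

def max_base_pairs(rna, min_loop=3):
    """Maximum number of nested (pseudoknot-free) base pairs in rna.

    dp[i][j] holds the best pair count for the subsequence rna[i..j].
    """
    n = len(rna)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            # case 2: base j pairs with some k, leaving >= min_loop bases between
            for k in range(i, j - min_loop):
                if (rna[k], rna[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]

print(max_base_pairs("GGAAACC"))  # 2
```

The table has O(N^2) entries and each takes O(N) work, so the sketch runs in O(N^3) time.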

20 Secondary Structure
5’-GAGGGAACACAGUCUGCACACUCCUUC-3’

21 Arc Diagram Representation

22 Loops
Hairpin loop
Interior loop
Multi loop
External loop
Bulge loop
Stacked pair

23 Pseudoknotted Structure

24 Structure Prediction Algorithms
Dynamic programming algorithms
→ restricted class of pseudoknotted structures
→ Rivas and Eddy (R&E): O(N^6) time
Heuristic algorithms
→ search over the solution space

25 Motif Discovery

26 Genes and Diseases
Proteins perform most of life’s essential functions
Changes in the genomic DNA sequence can have disastrous consequences

27 Real World Applications

28 Related Aspects
Computational models of organisms or biological systems
Nature-inspired algorithms
→ genetic algorithms
→ neural networks
→ ant colony optimization
Artificial life
→ life-like behavior of artificial systems
→ (re)design of biological organisms

29 Conclusion
Bioinformatics: using computers for gathering, storing and analyzing biological data
Analyzing AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC

30 Thank you!
Baharak Rastegari, Bio Informatics

31 Genetic Process

32 Introduction
Sciences reach a point where they become mathematized!
Math and other sciences
→ Physics: at the time of the Renaissance
→ Chemistry: after John Dalton developed atomic theory

33

34 DNA
Gene expression? Two genes
DNA → Genes → Proteins

35 Genomic Sequence Data Interpretation
Gene finding
Structure prediction
Pattern discovery
Classification
Clustering

36 Understanding the Cell
Concrete problem: Gene regulatory relationship inference
→ given: expression profiles of two genes A, B
→ want: decide if there is a (direct) regulatory relationship between A and B, and whether it is an activating or an inhibiting one
Complicating factors
→ imprecision/limitation in measuring expression profiles
→ indirect/complex regulatory relationships
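A very rough first pass at this inference problem is to correlate the two expression profiles: a strong positive correlation hints at activation, a strong negative one at inhibition. This is only a sketch under loud assumptions. Correlation cannot separate direct from indirect regulation (the second complicating factor above), the 0.8 threshold is arbitrary, and the names `pearson` and `guess_relationship` and the example profiles are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equally long profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("a constant profile has no defined correlation")
    return cov / sqrt(var_x * var_y)

def guess_relationship(profile_a, profile_b, threshold=0.8):
    """Label a gene pair from the sign and strength of the correlation."""
    r = pearson(profile_a, profile_b)
    if r >= threshold:
        return "possibly activating"
    if r <= -threshold:
        return "possibly inhibiting"
    return "no clear relationship"

# Made-up expression levels of genes A and B at four time points.
gene_a = [1.0, 2.0, 3.0, 4.0]
gene_b = [8.0, 6.0, 4.0, 2.0]
print(guess_relationship(gene_a, gene_b))  # possibly inhibiting
```

Measurement noise, the first complicating factor, lowers the correlation of genuinely related genes, so any fixed threshold trades missed relationships against false ones.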