BIOINFORMATICS Summary

Slides:



Advertisements
Similar presentations
Web Resources for Bioinformatics Vadim Alexandrov and Mark Gerstein.
Advertisements

Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Interoperation of Molecular Biology Databases Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International Menlo Park, CA
1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a.
Definition (one of many possible): (Molecular) bio – informatics: bioinformatics is conceptualizing biology in terms of molecules (in the sense of physical.
Using Bioinformatics to Make the Bio- Math Connection The Confessions of a Biology Teacher.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Structural Bioinformatics Dr. Avraham Samson Course no.: Credit points: 1.5 Final grade is based on 10 assignments Course homepage:
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
Protein Structures.
Ayesha Masrur Khan Spring Course Outline Introduction to Bioinformatics Definition of Bioinformatics and Related Fields Earliest Bioinformatics.
Presented by Liu Qi An introduction to Bioinformatics Algorithms Qi Liu
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Intelligent systems in bioinformatics Introduction to the course.
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Bioinformatics: Theory and Practice – Striking a Balance (a plea for teaching, as well as doing, Bioinformatics) Practice (Molecular Biology) Theory: Central.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Introduction to Bioinformatics (Lecture for CS397-CXZ Algorithms in Bioinformatics) Jan. 21, 2004 ChengXiang Zhai Department of Computer Science University.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu BIOINFORMATICS Structures Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a (last edit.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Information Technology in the Natural Sciences Biology – Chemistry – Physics.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
An overview of Bioinformatics. Cell and Central Dogma.
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
داده های عظیم در دوران پساژنوم Big Data in Post Genome Era مهدی صادقی پژوهشگاه ملی مهندسی ژنتیک و زیست فناوری پژوهشکده علوم زیستی، پژوهشگاه دانش های بنیادی.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
High throughput biology data management and data intensive computing drivers George Michaels.
BME435 BIOINFORMATICS.
Bioinformatics Overview
International Centre for Genetic Engineering and Biotechnology
Data-intensive Computing: Case Study Area 1: Bioinformatics
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Basics of Comparative Genomics
Computational Chemistry
Techniques for Finding Patterns in Large Amounts of Data: Applications in Biology Vipin Kumar William Norris Professor and Head, Department of Computer.
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Introduction to bioinformatics
1 Department of Engineering, 2 Department of Mathematics,
Bioinformatics Biological Data Computer Calculations +
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
1 Department of Engineering, 2 Department of Mathematics,
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Nancy Baker SILS Bioinformatics Seminar January 21, 2004
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Sequence Based Analysis Tutorial
Protein Structures.
BIOINFORMATICS Introduction
Basics of Comparative Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Computational Biology
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Hyunghoon Cho, Bonnie Berger, Jian Peng  Cell Systems 
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

BIOINFORMATICS Summary Mark Gerstein, Yale University gersteinlab.org/courses/452 (last edit in fall '06, handout version, including in-class changes)

Used in class M11 [2006,12.06]

You'll Forget… [From S Harris's Science Cartoons, http://www.sciencecartoonsplus.com]

… So I'll distill

What is Bioinformatics? (Molecular) Bio - informatics One idea for a definition? Bioinformatics is conceptualizing biology in terms of molecules (in the sense of physical-chemistry) and then applying “informatics” techniques (derived from disciplines such as applied math, CS, and statistics) to understand and organize the information associated with these molecules, on a large-scale. Bioinformatics is “MIS” for Molecular Biology Information. It is a practical discipline with many applications.

Data Types Sequences Structures Functional Genomics

"Core" Bioinformatics Core Stuff Computing with sequences and structures protein structure prediction biological databases and mining them New Stuff: Networks and Expression Analysis `Will teach these in CS 545 (Data Mining) next semester Fairly Speculative: simulating cells

Hierarchical Structure of Course Information Memorize the previous summary Good familarity with main points in lectures (quizzes) Rest of overheads and readings for reference on projects and …

Cross-cutting Themes Algorithms for Comparison Predictions Dynamic programming Different measures of similarity (RMS vs. Structural similarity; PAM & Blossum vs %ID) Generalized similarity matrix in threading Statistical scoring schemes (with P-values) For sequences, structures, sequence to structure, and even expression data Time complexity of the comparisons Predictions LOD scores (# with features / expectation ) Progressive more complex features Amount of features information IN vs. prediction OUT Testing against benchmarks with cross-validation (sec. struc. prediction, seq. comparison scoring, datamining) Other methods, need for heuristics

Cross-cutting Themes Increasing the chemically reality and complexity of genes Character strings, fold (just CAs), volumes and surfaces from all atom representation, energy and minimization, dynamics (time and velocity) Simulation Vector configuration boiled down a scalar E through potential Compute intensive exploration of configurations (MC, MD) Averages over correctly weighted configurations Importance of simplification The Survey Mode Collecting information in DB tables Importance of integration and interoperation Organizing it around "part" classifications Surveying it for useful statistics (taking into account biases) Doing datamining to find more tenuous relationships

Anti-Themes

Depth v Breadth

Historical Perspective 1980 2005 2000 1990 1985 1995 Single Structures Modeling & Geometry Forces & Simulation Docking Sequences, Sequence-Structure Relationships Alignment Structure Prediction Fold recognition Genomics Dealing with many sequences Gene finding & Genome Annotation Databases Integrative Analysis Expression & Proteomics Data Datamining Simulation again….

(from CooperToons, http://members.aol.com/ChipCooper/cartoon26.html)