CS 7010: Computational Methods in Bioinformatics (course introduction)

Slides:



Advertisements
Similar presentations
CP411 Computer Graphics, Wilfrid Laurier University Introduction # 1 Welcome to CP411 Computer Graphics 2012 Instructor: Dr. Hongbing Fan Introduction.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
CS 598SS Probabilistic Methods in Biological Sequence Analysis Saurabh Sinha.
Bioinformatics David Brodin BEA core facility MOLEKYLÄRBIOLOGI MED GENETIK – BIOINFORMATIK HT -07 Course web page:
AP Biology Teaching Biology Through Bioinformatics Real world genomics research in your classroom Kim B. Foglia Division Ave. High School Levittown.
CS 7010: Computational Methods in Bioinformatics (course review) Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University.
Prof. Jesús A. Izaguirre Department of Computer Science and Engineering Computational Biology and Bioinformatics at Notre Dame.
Introduction to Computer Science CS 21a: Introduction to Computing I Department of Information Systems and Computer Science Ateneo de Manila University.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
CS 466 Introduction to Bioinformatics Saurabh Sinha.
Jianlin Cheng, PhD Informatics Institute, Computer Science Department University of Missouri, Columbia Fall, 2011.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
The Education of a Software Engineer Mehdi Jazayeri Presented by Matthias Hauswirth.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Workshop in Bioinformatics 2010 What is it ? The goals of the class… How we do it… What’s in the class Why should I take the class..
Computational Genomics Lecture 1, Tuesday April 1, 2003.
Bio/CS 251 Introduction to Bioinformatics. Class Web Site This site will contain all important documents.
Sanguinetti, 2012Bio2 lecture 1 Bioinformatics 2 “My main problem these days is that I don’t understand how we go from an experiment in the lab to a number.
Algorithms in Computational Biology Tanya Berger-Wolf Compbio.cs.uic.edu/~tanya/teaching/CompBio January 13, 2006.
Bioinformatics Curriculum Guidelines: Toward a Definition of Core Competencies Lonnie Welch School of Electrical Engineering & Computer Science Biomedical.
EECS 395/495 Algorithmic Techniques for Bioinformatics General Introduction 9/27/2012 Ming-Yang Kao 19/27/2012.
Data and Text Mining for Computational Biology Introduction.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Medical Informatics Basics
BIO337 Systems Biology/Bioinformatics (course # 50524) Spring 2014 Tues/Thurs 11 – 12:30 PM BUR 212 Edward Marcotte/Univ. of Texas/BIO337/Spring 2014.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Cpt S 471/571: Computational Genomics Spring 2015, 3 cr. Where: Sloan 9 When: M WF 11:10-12:00 Instructor weekly office hour for Spring 2015: Tuesdays.
SYSC System Analysis and Design 1 Part I – Introduction.
By the Ahmadiyya Muslim Women's’ Student Association.
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Medical Informatics Basics
Medical Informatics Basics Lection 1 Associated professor Andriy Semenets Department of Medical Informatics.
Biologically Inspired computing Info rm atics luis rocha 2007 biologically-inspired computing.
Master’s Degrees in Bioinformatics in Switzerland: Past, present and near future Patricia M. Palagi Swiss Institute of Bioinformatics.
Bachelor’s Program in CIVIL ENGINEERING Duration of Studies: 8 terms Academic Degree to be Awarded Bachelor of Civil Engineering.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
CS397-CXZ Algorithms in Bioinformatics ChengXiang (“Cheng”) Zhai, Robert Skeel (Department of Computer Science) Nick Sahinidis (Department of Chemical.
Intelligent systems in bioinformatics Introduction to the course.
Molecular Diagnostics Certificate Program Information Session.
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Introduction to Bioinformatics Biostatistics & Medical Informatics 576 Computer Sciences 576 Fall 2008 Colin Dewey Dept. of Biostatistics & Medical Informatics.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Introduction to Bioinformatics (Lecture for CS397-CXZ Algorithms in Bioinformatics) Jan. 21, 2004 ChengXiang Zhai Department of Computer Science University.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Overview of Bioinformatics 1 Module Denis Manley..
Design of a Typical Course s c h o o l s o f e n g I n e e r I n g S. D. Rajan Professor of Civil Engineering Professor of Aerospace and Mechanical Engineering.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
COMPUTERS IN BIOLOGY Elizabeth Muros INTRO TO PERSONAL COMPUTING.
EB3233 Bioinformatics Introduction to Bioinformatics.
Bioinformatics lectures at Rice University Li Zhang Lecture 11: Networks and integrative genomic analysis-3 Genomic data
Application of Bioinformatics in Genetic Research Instructors: Dr. Henry Baker Dr. Luciano Brocchieri Dr. Michele Tennant Dr. Lei Zhou
9/03 Data Mining – Introduction G Dong (WSU)1 CS499/ Data Mining Fall 2003 Professor Guozhu Dong Computer Science & Engineering WSU.
Molecular Diagnostics Certificate Program January 23, 2008 Information Session.
B. Prabhakaran1 Multimedia Systems Reference Text “Multimedia Database Management Systems” by B. Prabhakaran, Kluwer Academic Publishers. – Kluwer bought.
High throughput biology data management and data intensive computing drivers George Michaels.
BCH339N Systems Biology/Bioinformatics (course # 54040) Spring 2016 Tues/Thurs 11 – 12:30 PM BUR 212.
Course Overview Stephen M. Thebaut, Ph.D. University of Florida Software Engineering.
Sub-fields of computer science. Sub-fields of computer science.
Introduction to Bioinformatics and Functional Genomics
Syllabus Introduction to Computer Science
Lecture 1 - Introduction
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Introduction to Bioinformatic
Introduction to Bioinformatics
Presentation transcript:

CS 7010: Computational Methods in Bioinformatics (course introduction) Dong Xu Computer Science Department 109 Engineering Building West E-mail: xudong@missouri.edu 573-882-7064 http://digbio.missouri.edu

Challenges of Our Civilization -1 top 125 unsolved problems in science over the next quarter-century (http://www.sciencemag.org/sciext/125th/) The Top 25 What Is the Universe Made Of? What is the Biological Basis of Consciousness? Why Do Humans Have So Few Genes? To What Extent Are Genetic Variation and Personal Health Linked? Can the Laws of Physics Be Unified? How Much Can Human Life Span Be Extended? What Controls Organ Regeneration? How Can a Skin Cell Become a Nerve Cell?

Challenges of Our Civilization-2 How Does a Single Somatic Cell Become a Whole Plant? How Does Earth's Interior Work? Are We Alone in the Universe? How and Where Did Life on Earth Arise? What Determines Species Diversity? What Genetic Changes Made Us Uniquely Human? How Are Memories Stored and Retrieved? How Did Cooperative Behavior Evolve? How Will Big Pictures Emerge from a Sea of Biological Data?

Challenges of Our Civilization-3 How Far Can We Push Chemical Self-Assembly? What Are the Limits of Conventional Computing? Can We Selectively Shut Off Immune Responses? Do Deeper Principles Underlie Quantum Uncertainty and Nonlocality? Is an Effective HIV Vaccine Feasible? How Hot Will the Greenhouse World Be? What Can Replace Cheap Oil -- and When? Will Malthus Continue to Be Wrong?

Lecture Outline What does bioinformatics do? Course topics Course Organization Workload/grades

Technical Definitions NIH (http://www.bisti.nih.gov/) Bioinformatics: “research, development, or application of computational tools and approaches for expanding the use of biological, medical, behavioral or health data, including those to acquire, represent, describe, store, analyze, or visualize such data”. Computational Biology: “the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavioral, and social systems”.

Scope of Bioinformatics: Studying Biology on Computer data management; data mining; modeling; prediction; theory formulation bioinformatics genes, proteins, protein complexes, pathways, cells, organisms, ecosystem an indispensable part of biological science with its own methodology engineering aspect scientific aspect computer science, biology, statistics physics, mathematics, chemistry, engineering,…

Why Bioinformatics is So Hot? (I) More than 80 universities offer graduate degrees in bioinformatics At cross-section of two most active fields: computer science and molecular biology Exponential growths in computer technologies (hardware, Internet) pave the way for bioinformatics development

Why Bioinformatics is So Hot? (II) Analytical technology High-throughput data Biological knowledge Medicine & bioengineering

What Can Computing Do for Biology? Data interpretation in analytical technologies Data management and computational infrastructure Discovery from data mining Modeling, prediction and design Theoretical / in silico biology Almost cover every area of computer science

Data Interpretation in Analytical Technologies (I) Analytical technologies are the driving force of new (large-scale) biology: DNA sequencing (genomics) X-ray / NMR structure determination (structural genomics) Protein identification using mass spectrometry (proteomics) Microarray chips (functional genomics)

Data Interpretation in Analytical Technologies (II) peak assignment structural restraint extraction NMR spectra NMR protein structure determination i+4 i+3 i+2 i i-1 i+1 structure calculation protein structure

Data Interpretation in Analytical Technologies (III) From image to data (imaging processing) Large-scale data cannot be handled without computer Noisy data (optimization with under-constraint / over-constraint) Computer algorithms/programs can mimic human interpretation process and do it much faster Automation of experimental data interpretation

Data Management and Computational Infrastructure Track instruments, experiment conditions and results at each step of a complicated biological experiment (LIMS at modern wet labs) Data storage and retrieval (database) Data visualization Data query and analysis pipeline

Discovery from Data Mining (I)

Discovery from Data Mining (II) Pattern/knowledge discovery from data many biological data are generated by biological processes which are not well understood interpretation of such data requires discovery of convoluted relationships hidden in the data which segment of a DNA sequence represents a gene, a regulatory region which genes are possibly responsible for a particular disease Complicated data Large-scale, high-dimension Noisy (false positives and false negatives)

Modeling, Prediction and Design (I) Modeling and prediction of biological objects/processes modeling of biochemistry enzyme reaction rates modeling of biophysics dynamics of biomolecules modeling of evolution prediction of phylogeny

Modeling, Prediction and Design (II) Prediction of outcomes of biological processes computing will become an integral part of modern biology through an iterative process of From prediction to engineering design Protein structure prediction to protein engineering Design genetically modified species model formulation experimental validation computational prediction

Theoretical / In Silico Biology Generate new hypothesis, formulate and test fundamental theories of biology new hypothesis about detailed evolutionary history, through mining genomic sequence data? new hypothesis about a particular signaling network, through data mining? new hypothesis about protein folding pathways, through simulations?

Bioinformatics Application to Biological Systems bacteria (Synechococcus) yeast (Saccharomyces cerevisia) plants (Arabidopsis) neural systems (neurons) viruses (SARS)

Can Biology Help Computing? Computational techniques inspired by biology: Neural network (artificial intelligence) Genetic algorithm, automata A new driver of computer science: Better hardware (supercomputers) New data representation Develop new theoretical framework: DNA computing Network communication (communication between ants, see http://news-service.stanford.edu/news/2003/may7/antchat-57.html)

Computing versus Biology what computer science is to molecular biology is like what mathematics has been to physics ...... -- Larry Hunter, ISMB’94 molecular biology is (becoming) an information science ....... -- Leroy Hood, RECOMB’00 Bioinformatics is still in its infancy!

Lecture Outline What does bioinformatics do? Course topics Course Organization Workload/grades

Course Topics Data interpretation in analytical technologies Data management and computational infrastructure Discovery from data mining Modeling, prediction and design Theoretical / in silico biology Cover classical/mainstream bioinformatics problems from computer science prospective

Course Schedule See http://digbio.missouri.edu/cs7010/   Course Schedule See http://digbio.missouri.edu/cs7010/ First take home exam: --given on 9/29; due on 10/6 Second take home exam: --given on 11/17; due on 11/29 Three phases of project: --9/22, 10/20, 11/17, final report due 12/8

What I Will Teach A general introduction to a few major problems in the field of bioinformatics problems definitions: from biological problem to computable problem some key computational techniques A way of thinking: tackling “biological problem” computationally how to look at a biological problem from a computational point of view how to formulate a computational problem to address a biological issue how to collect statistics from biological data how to build a computational model how to design algorithms for the model how to test and evaluate a computational algorithm how to access confidence of a prediction result

New Ways of Thinking Critical thinking Analytical thinking Quantitative thinking Algorithmic thinking

Lecture Outline What does bioinformatics do? Course topics Course Organization Workload/grades

A Brief Survey Register for the course? Academic department? Computer background? Biology background? Statistical background? Taken another bioinformatics course?

Prerequisites CS 2050 (Algorithm Design and Programming II) or equivalent training Statistics 2500 (Introduction to Probability and Statistics I) or equivalent training Programming skills in any programming language are required No biology background is necessary

Course Info Co Instructor: Trupti Joshi (joshitr@missouri.edu) Course Web Site: http://digbio.missouri.edu/cs7010/

Reference Books - 1 • Neil C.Jones and Pavel A. Pevzner: An Introduction to Bioinformatics Algorithms (Computational Molecular Biology). MIT Press, 2004. • Pavel Pevzner: Computational Molecular Biology - An Algorithmic Approach. MIT Press, 2000. • Current Topics in Computational Molecular Biology, edited by Tao Jiang, Ying Xu, and Michael Zhang. MIT Press. 2002.

Reference Books - 2 • Pierre Baldi and Soren Brunak: Bioinformatics – The Machine Learning Approach (second edition). MIT Press, 2001. • Dan Gusfield: Algorithms on Strings, Trees, and Sequences. Cambridge University Press. 1997. • Warren J. Ewens and Gregory R. Grant: Statistical Methods in Bioinformatics – An Introduction. Springer. 2001. • Terry Speed: Statistical analysis of gene expression of gene expression microarray data. Chapman&Hall/CRC. 2003.

Lectures 3:30pm – 4:45pm, Tuesday and Thursday Powerpoint sides for each lecture (posted before the lecture) Questions/answers in the beginning and end of lecture Discussions are encouraged during the lecture (A topic discussion may be at the end of a lecture)

Office Hours 4:45pm-5:35pm, Tuesdays and Thursdays The instructor who deliver the lecture will give the office hour Dong Xu: Room 109, Engineering Building West (882-7064) Trupti Joshi: Room 317, Engineering Building North (884-3528) Special office hours will be arranged close to the final Appointments at other time

Lecture Outline What does bioinformatics do? Course topics Course Organization Workload/grades

Minimum Requirement Attend class regularly Read suggested class handout after class Deliver the two take-home exams Deliver final project (for graduate students) Expected workload: 5-6 hours / week in addition to class attendance

How to Get Maximum out of the Course Study suggested reading/slide before class Study optional reading Ask questions on class Frequent visits at office hours Perform homework assignments (not graded) Not required (not counted in the final grade) but encouraged.

Grading A final grade of A, B, C, etc. will be assigned, Final project 2 take-home exams (20% each) Project : 3 Phase Reports (5% each), Final Report (15%), Software Demo (15%), Presentation (15%) Final project A working bioinformatics program that can be used by biologists or comprehensive computational analysis on bioinformatics tool outputs One student one project (independent development) with consultation from instructors Potential for publication

Three Phases of Project Phase 1 (due 9/22): Define your project subject. A brief literature survey and illustration of its importance. Phase 2 (due 10/20): Describe key methods. Phase 3 (due 11/17): Present key results. Final report: due 12/8

Discussion What do you expect from this course? - content? - ways of teaching? - how the instructors can help? -…

Assignments Suggested reading: Optional reading: http://bioinfo.mbb.yale.edu/e-print/whatis-mim/text.pdf Bioboxes in “Neil C.Jones and Pavel A. Pevzner: An Introduction to Bioinformatics Algorithms (Computational Molecular Biology). MIT Press, 2004.” Optional reading: Chapter 1 in “Current Topics in Computational Molecular Biology, edited by Tao Jiang, Ying Xu, and Michael Zhang. MIT Press. 2002.” http://www.ncbi.nih.gov/About/primer/bioinformatics.html