Presentation is loading. Please wait.

Presentation is loading. Please wait.

Master’s course Bioinformatics Data Analysis and Tools Lecture 1: Introduction Centre for Integrative Bioinformatics FEW/FALW

Similar presentations


Presentation on theme: "Master’s course Bioinformatics Data Analysis and Tools Lecture 1: Introduction Centre for Integrative Bioinformatics FEW/FALW"— Presentation transcript:

1 Master’s course Bioinformatics Data Analysis and Tools Lecture 1: Introduction Centre for Integrative Bioinformatics FEW/FALW heringa@cs.vu.nl

2 Course objectives There are two extremes in bioinformatics work –Tool users (biologists): know how to press the buttons and the biology but have no clue what happens inside the program –Tool shapers (informaticians): know the algorithms and how the tool works but have no clue about the biology Both extremes are dangerous, need a breed that can do both

3 At the end of this course… You will have seen a couple of algorithmic examples You will have got an idea about methods used in the field You will have a firm basis of the physics and thermodynamics behind a lot of processes and methods You will have an idea of and some experience as to what it takes to shape a bioinformatics tool

4 Bioinformatics “Studying informatic processes in biological systems” (Hogeweg) Applying algorithms and mathematical formalisms to biology (genomics) “Information technology applied to the management and analysis of biological data” (Attwood and Parry-Smith)

5 This course General theory of crucial algorithms (GA, NN, HMM, SVM, etc..) Method examples Research projects within own group –Repeats –Contact alignment –Domain boundary prediction Physical basis of biological processes and tools

6 Bioinformatics Large - external (integrative)ScienceHuman Planetary ScienceCultural Anthropology Population Biology Sociology SociobiologyPsychology Systems Biology Biology Medicine Molecular Biology Chemistry Physics Small – internal (individual) Bioinformatics

7 Genomic Data Sources DNA/protein sequence Expression (microarray) Proteome (xray, NMR, mass spectrometry, PPI) Metabolome Physiome (spatial, temporal) Integrative bioinformatics

8 Protein structural data explosion Protein Data Bank (PDB): 14500 Structures (6 March 2001) 10900 x-ray crystallography, 1810 NMR, 278 theoretical models, others...

9 Mathematics Statistics Computer Science Informatics Biology Molecular biology Medicine Chemistry Physics Bioinformatics Bioinformatics inspiration and cross-fertilisation

10 Algorithms in bioinformatics string algorithms dynamic programming machine learning (NN, k-NN, SVM, GA,..) Markov chain models hidden Markov models Markov Chain Monte Carlo (MCMC) algorithms stochastic context free grammars EM algorithms Gibbs sampling clustering tree algorithms (suffix trees) graph algorithms text analysis hybrid/combinatorial techniques and more…

11 Joint international programming initiatives Bioperl http://www.bioperl.org/wiki/Main_Page http://bioperl.org/wiki/How_Perl_saved_human_genome Biopython http://www.biopython.org/ BioTcl http://wiki.tcl.tk/12367 BioJava www.biojava.org/wiki/Main_Page

12 Integrative bioinformatics @ VU Studying informational processes at biological system level From gene sequence to intercellular processes Computers necessary We have biology, statistics, computational intelligence (AI), HTC,.. VUMC: microarray facility, cancer centre, translational medicine Enabling technology: new glue to integrate New integrative algorithms Goals: understanding cellular networks in terms of genomes; fighting disease (VUMC)

13 Bioinformatics @ VU Progression: DNA: gene prediction, predicting regulatory elements, alternative splicing mRNA expression Proteins: (multiple) sequence alignment, docking, domain prediction, PPI Metabolic pathways: metabolic control Cell-cell communication

14 Fold recognition by threading: THREADER and GenTHREADER Query sequence Compatibility scores Fold 1 Fold 2 Fold 3 Fold N

15 Polutant recognition by microarray mapping: Compatibility scores Cond. 1 Cond. 2 Cond. 3 Cond. N Contaminant 1 Contaminant 2 Contaminant 3 Contaminant N Query array

16 ENFIN WP4 Functional threading From sequence to function –Multiple alignment –Secondary structure prediction, Solvation prediction, Conservation patterns, Loop enumeration

17 ENFIN WP4 Functional threading From sequence to function –Multiple alignment –Secondary structure prediction, Solvation prediction, Conservation patterns, Loop enumeration D H S StructFunc DHS DB of active site descrip tors

18 ENFIN WP5 - BioRange (Anton Feenstra) Protein-protein interaction prediction Mesoscopic modelling Soft-core MD –Fuzzy residues –Fuzzy (surface) locations

19 ENFIN WP6 Silicon Cell –Database of fully parametrized pathway model (differential equations) solver Jacky Snoep (Stellenbosch, VU/IBIVU) Hans Westerhoff (VU, Manchester)

20 New neighbouring disciplines Computational Systems Biology Computational systems biology aims to develop and use efficient algorithms, data structures and communication tools to orchestrate the integration of large quantities of biological data with the goal of modeling dynamic characteristics of a biological system. Modeled quantities may include steady-state metabolic flux or the time-dependent response of signaling networks. Algorithmic methods used include related topics such as optimization, network analysis, graph theory, linear programming, grid computing, flux balance analysis, sensitivity analysis, dynamic modeling, and others. Translational Medicine A branch of medical research that attempts to more directly connect basic research to patient care. Translational medicine is growing in importance in the healthcare industry, and is a term whose precise definition is in flux. In particular, in drug discovery and development, translational medicine typically refers to the "translation" of basic research into real therapies for real patients. The emphasis is on the linkage between the laboratory and the patient's bedside, without a real disconnect. This is often called the "bench to bedside" definition. Neuro-informatics Neuroinformatics combines neuroscience and informatics research to develop and apply the advanced tools and approaches that are essential for major advances in understanding the structure and function of the brain

21 Natural progression

22 Bioinformatics @ VU Qualitative challenges: High quality alignments (alternative splicing) In-silico structural genomics In-silico functional genomics: reliable annotation Protein-protein interactions. Metabolic pathways: assign the edges in the networks Cell-cell communication: find membrane associated components New algorithms

23 Bioinformatics @ VU Quantitative challenges: Understanding mRNA expression levels Understanding resulting protein activity Time dependencies Spatial constraints, compartmentalisation Are classical differential equation models adequate or do we need more individual modeling (e.g macromolecular crowding and activity at oligomolecular level)? Metabolic pathways: calculate fluxes through time Cell-cell communication: tissues, hormones, innervations Need ‘complete’ experimental data for good biological model system to learn to integrate

24 Bioinformatics @ VU VUMC Neuropeptide – addiction Oncogenes – disease patterns Reumatic diseases

25 Bioinformatics @ VU Quantitative challenges: How much protein produced from single gene? What time dependencies? What spatial constraints (compartmentalisation)? Metabolic pathways: assign the edges in the networks Cell-cell communication: find membrane associated components

26 Integrate data sources Integrate methods Integrate data through method integration (biological model) Integrative bioinformatics

27 Data Algorithm Biological Interpretation (model) tool Integrative bioinformatics Data integration

28 Data 1Data 2Data 3

29 Integrative bioinformatics Data integration Data 1 Algorithm 1 Biological Interpretation (model) 1 tool Algorithm 2 Biological Interpretation (model) 2 Algorithm 3 Biological Interpretation (model) 3 Data 2Data 3

30 “Nothing in Biology makes sense except in the light of evolution” (Theodosius Dobzhansky (1900-1975)) “Nothing in Bioinformatics makes sense except in the light of Biology” Bioinformatics


Download ppt "Master’s course Bioinformatics Data Analysis and Tools Lecture 1: Introduction Centre for Integrative Bioinformatics FEW/FALW"

Similar presentations


Ads by Google