Presentation is loading. Please wait.

Presentation is loading. Please wait.

BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin

Similar presentations


Presentation on theme: "BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin"— Presentation transcript:

1 BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin martin@biochem.ucl.ac.uk http://www.bioinf.org.uk/

2 Data Creation Analysis Prediction Presentation Searching Organizing Sequences DNA Protein Computers Structures

3 Introductionary Lecture

4 Introduction helps you create data example of fragment assembly Bioinformatics…

5 Introduction provides tools to store and search data databases and databanks primary/secondary/composite/gateways Bioinformatics…

6 Introduction allows you to make predictions prediction techniques –moving windows, –computer learning Bioinformatics…

7 Introduction allows you to create 3D models separate lecture Bioinformatics…

8 Introduction allows transfer of annotations homologous proteins likely to perform similar functions Bioinformatics…

9 Introduction Annotations… Pre-genome world Post-genome world Annotations will change

10 Genomes and Gene Prediction Lectures

11 Genome structure C-paradox Compare prokaryotes and eukaryotes Complexity of eukaryotes: –Introns/exons, –Repeated sequences, –Transposable elements, –Pseudogenes Problems introduced by these...

12 ORF Scanning in Eukaryotes exonintron exon 5’ 3’ Intron/exon splice sites

13 Finding Genes in Genomic DNA Ab initio methods Similarity based methods Integrated approaches 30 40 TRY4

14 Prediction accuracy Nucleotide level Exon level Measures for assessment

15 Computing Lecture

16 Computers

17 Operating systems What is an operating system? Examples of operating systems Choice of operating systems for different areas of research

18 Computers and computer science Data structures and information retrieval –Relational databases –Design of databases to reduce errors in data Simple examples of SQL and structuring data into tables Must handle:

19 Computers and computer science Algorithms: how to solve a problem –Defined an algorithm –Looked at an example Must handle:

20 Computers and computer science Data mining and machine learning –Extract patterns, etc from data –Computer software which learns from examples and is then able to make predictions Must handle:

21 Comparative Modelling Lecture

22 What is comparative modelling? Build a three-dimensional (3D) model of a protein... …based on known structure of a (generally) homologous protein sequence "Homology Modelling" is misleading: –fold recognition and threading allow recognition of non-homologous sequences which adopt the same fold

23 Stages in CM 1. Identify templates (or ‘parents’)‏ 2. Align the target sequence with the parent(s), 3. Find: structurally conserved regions structurally variable regions 4. Inherit the SCRs from the parent(s)‏ 5. Build the SVRs 6. Build the sidechains 7. Refine the model 8. Evaluate errors in the model

24 Correct alignment is the structural alignment. Align target with parent(s)‏ Structure of Target Optimal alignment based on Structural Equivalents Structure of Parent We don’t have this! Guess structural alignment from sequence alignment

25 An example MLSA

26 Sequence alignment quality

27 Assessing the model Ideal is to compare the model with the true target structure - 4-6Å; 2Å; 0.5Å

28 Model quality The main factors are:  The sequence identity with the primary parent  The number and size of indels  The quality of the alignment  The amount of change which has been necessary to the parent(s) to create the model.

29 Summary of CASP2 results CASP8 ran summer 2008 http://predictioncenter.gc.ucdavis.edu

30 Medical Applications Lecture

31 Mutations, Alleles & Polymorphisms Mutation: –any change in DNA sequence Allele: –alternative form of a genetic locus; one inherited from each parent –e.g. eye colour locus - brown and blue alleles Polymorphism: –genetic variation present in >= 1% of a normal population

32 How are SNPs useful? Understanding evolution –Some alleles may be advantageous in one environment, but disadvantageous in another DNA fingerprinting Markers to map traits –diseases, characteristics Pharmacogenomics –genotype-specific medications

33 Drug responses Drug efficacy may be affected by: transporters metabolism receptors signalling pathways, etc.

34 Potentially lethal SNPs First described ~2000 years ago “What is food to some men may be fierce poison to others” Lucretius Caro

35 Protein Sequence DNA Sequence Protein StructureProtein Function Mutation Altered Sequence Altered Structure Altered Function Understand Structure & Function Restore Structure Restored Function Design Drugs

36 Looked at p53... Local level - effects of mutations General classes –Functional –Fold Preventing –Destabilizing Types of mutations

37

38 How human? Chimeric: 67% human Humanized: 90% human Mouse: 0% human

39 Antibody Humanization

40 Summary –Diagnosis of disease –Prediction of disease risk –Prognosis –Customized response to disease –Identifying drug targets - treatments –Engineering of proteins for therapy

41 Docking and Drug Design Lecture

42 Van der Waals forces Electrostatic (Salt bridge) Interaction Hydrogen bonds Hydrophobic bonding ++ - + + Surface complementarity - + + + + +

43 Six degrees of freedom - protein and ligand both treated as rigid - 3 rotations / 3 translations Docking methods - rigid body Just like docking the space shuttle with a satellite Image from NASA

44 Treat receptor as static / ligand as flexible Dock ligand into binding pocket - generate large number of possible orientations Evaluate and select by energy function Docking methods - flexible ligand

45 Ligand Matching Match sphere centres against ligand atoms Find possible ligand orientations Often >10,000 orientations possible Find the transformation (rotation + translation) to maximize sphere matching DOCK

46 Virtual Screening Docking can be used for virtual screening Scan a library of potential drug molecules Identify leads

47 LUDI (InsightII) - find fragments that can bind GRID - uses molecular mechanics potential to find interaction sites for probe groups X-site - uses an empirical potential to find interaction sites for probe groups De Novo Drug Design

48 Stupid mistakes... Don't confuse secondary databases with secondary structure! Ensure you know the difference between SCOP/PFam functional domains and CATH structural domains

49 Summary Find pockets Principles for docking - complementarity Docking –rigid body / ligand flexibility Virtual screening Identifying probe interaction sites –build ligands de novo


Download ppt "BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin"

Similar presentations


Ads by Google