Ulf Schmitz, Introduction to genomics and proteomics II1 Bioinformatics Introduction to genomics and proteomics II

Presentation transcript:

Ulf Schmitz, Introduction to genomics and proteomics II1 Bioinformatics Introduction to genomics and proteomics II Bioinformatics and Systems Biology Group

Ulf Schmitz, Introduction to genomics and proteomics II2 Outline
1. Proteomics
   - Motivation
   - Post-translational modifications
   - Key technologies
   - Data explosion
2. Maps of hereditary information
3. Single nucleotide polymorphisms

Ulf Schmitz, Introduction to genomics and proteomics II3 Proteomics
- Proteomics: the large-scale study of proteins, particularly their structures and functions. The term was coined by analogy with genomics and is often viewed as the "next step", but proteomics is much more complicated than genomics. Most importantly, while the genome is a rather constant entity, the proteome is constantly changing through its biochemical interactions with the genome. One organism will have radically different protein expression in different parts of its body and at different stages of its life cycle.
- Proteome: the entirety of proteins in existence in an organism.

Ulf Schmitz, Introduction to genomics and proteomics II4 Proteomics If the genome is a list of the instruments in an orchestra, the proteome is the orchestra playing a symphony. R.Simpson

Ulf Schmitz, Introduction to genomics and proteomics II5 Proteomics
- Describing all 3D structures of proteins in the cell is called Structural Genomics.
- Finding out what these proteins do is called Functional Genomics.
[Figure: from GENOME to PROTEOME; labels: DNA microarray, genetic screens, protein-ligand interactions, protein-protein interactions, structure]

Ulf Schmitz, Introduction to genomics and proteomics II6 Proteomics Motivation:
- The basic goal is a spatio-temporal description of the deployment of proteins in the organism.
- What kind of data would we like to measure?
- What mature experimental techniques exist to determine them?

Ulf Schmitz, Introduction to genomics and proteomics II7 Proteomics Things to consider:
- the rates of synthesis of different proteins vary among different tissues and different cell types and states of activity
- methods are available for efficient analysis of transcription patterns of multiple genes
- because proteins ‘turn over’ at different rates, it is also necessary to measure proteins directly
- the distribution of expressed protein levels is a kinetic balance between rates of protein synthesis and degradation
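The "kinetic balance" in the last point can be made concrete with a small sketch. It assumes the simplest possible model, which is not stated on the slide: constant synthesis at rate k_syn and first-order degradation at rate k_deg, so that dP/dt = k_syn - k_deg * P. The function names and rate values are hypothetical and chosen only for illustration.

```python
import math


def steady_state(k_syn, k_deg):
    """Protein level at which synthesis and degradation balance (dP/dt = 0)."""
    return k_syn / k_deg


def protein_level(t, k_syn, k_deg, p0=0.0):
    """Closed-form solution of dP/dt = k_syn - k_deg * P with P(0) = p0."""
    p_ss = steady_state(k_syn, k_deg)
    return p_ss + (p0 - p_ss) * math.exp(-k_deg * t)


# Two hypothetical proteins with the same synthesis rate but different turnover.
slow_turnover = steady_state(k_syn=10.0, k_deg=0.5)  # 20.0
fast_turnover = steady_state(k_syn=10.0, k_deg=2.0)  # 5.0
```

Under this assumed model, two proteins made at the same rate settle at different levels simply because they are degraded at different rates, which is one reason transcript measurements alone may not predict protein abundance.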


Ulf Schmitz, Introduction to genomics and proteomics II9 Why do Proteomics?
Are there differences between amino acid sequences determined directly from proteins and those determined by translation from DNA?
- Pattern recognition programs that address this question are prone to the following errors and complications:
  - a genuine protein sequence may be missed entirely
  - an incomplete protein may be reported
  - a gene may be incorrectly spliced
  - genes for different proteins may overlap
  - genes may be assembled from exons in different ways in different tissues
- Often, molecules must be modified to make a mature protein that differs significantly from the one suggested by translation:
  - in many cases the missing post-translational modifications are quite important and have functional significance
  - post-translational modifications include addition of ligands, glycosylation, methylation, excision of peptides, etc.
- In some cases mRNA is edited before translation, creating changes in the amino acid sequence that cannot be inferred from the genes.
A protein inferred from a genome sequence is a hypothetical object until an experiment verifies its existence.

Ulf Schmitz, Introduction to genomics and proteomics II10 Post-translational modification
- A protein is a polypeptide chain composed of 20 possible amino acids.
- There are far fewer genes that code for proteins in the human genome than there are proteins in the human proteome (~33,000 genes vs ~200,000 proteins).
- Each gene encodes as many as six to eight different proteins, due to post-translational modifications such as phosphorylation, glycosylation or cleavage.
- Post-translational modification extends the range of possible functions a protein can have:
  - changes may alter the hydrophobicity of a protein and thus determine whether the modified protein is cytosolic or membrane-bound
  - modifications like phosphorylation are part of common mechanisms for controlling the behavior of a protein, for instance activating or inactivating an enzyme

Ulf Schmitz, Introduction to genomics and proteomics II11 Post-translational modification
- Phosphorylation: the addition of a phosphate (PO4) group to a protein or a small molecule (usually on serine, tyrosine, threonine or histidine).
  - In eukaryotes, protein phosphorylation is probably the most important regulatory event.
  - Many enzymes and receptors are switched "on" or "off" by phosphorylation and dephosphorylation.
  - Phosphorylation is catalyzed by various specific protein kinases, whereas phosphatases dephosphorylate.
- Acetylation: the addition of an acetyl group, usually at the N-terminus of the protein.
- Farnesylation: the addition of a farnesyl group.
- Glycosylation: the addition of a glycosyl group to either asparagine, hydroxylysine, serine, or threonine, resulting in a glycoprotein.

Ulf Schmitz, Introduction to genomics and proteomics II12 Proteomics

Ulf Schmitz, Introduction to genomics and proteomics II13 Key technologies for proteomics
1. 1-D electrophoresis and 2-D electrophoresis are used for the separation and visualization of proteins.
2. Mass spectrometry, x-ray crystallography, and NMR (nuclear magnetic resonance) are used to identify and characterize proteins.
3. Chromatography techniques, especially affinity chromatography, are used to characterize protein-protein interactions.
4. Assays such as the yeast two-hybrid system and FRET (fluorescence resonance energy transfer) can also be used to characterize protein-protein interactions.

Ulf Schmitz, Introduction to genomics and proteomics II14 Key technologies for proteomics
High-resolution two-dimensional polyacrylamide gel electrophoresis (2D PAGE) shows the pattern of protein content in a sample.
[Figure: reference map of the lymphoblastoid cell line PRI, soluble proteins. 110 µg of protein loaded; strip 17 cm, pH gradient 4-7; SDS-PAGE gels 20 x 25 cm, % T; staining by the silver nitrate method (Rabilloud et al.); identification by mass spectrometry. The pink labels on the spots indicate the ID in the Swiss-Prot database.]
Browse the SWISS-2DPAGE database for more 2D PAGE images.

Ulf Schmitz, Introduction to genomics and proteomics II15 Proteomics
- X-ray crystallography is a means to determine the detailed molecular structure of a protein, nucleic acid or small molecule.
- Typically, a sample is purified to homogeneity, crystallized and subjected to an X-ray beam, and diffraction data are collected.
- With a crystal structure we can explain the mechanism of an enzyme, the binding of an inhibitor, the packing of protein domains, the tertiary structure of a nucleic acid molecule, etc.

Ulf Schmitz, Introduction to genomics and proteomics II16 High-throughput Biological Data
Enormous amounts of biological data are being generated by high-throughput capabilities; even more are coming:
- genomic sequences
- gene expression data (microarrays)
- mass spec. data
- protein-protein interactions (chromatography)
- protein structures (x-ray crystallography)
- ...

Ulf Schmitz, Introduction to genomics and proteomics II17 Protein structural data explosion Protein Data Bank (PDB): Structures (1 November 2005) x-ray crystallography, NMR

Ulf Schmitz, Introduction to genomics and proteomics II18 Maps of hereditary information
The following maps are used to find out how hereditary information is stored, passed on, and implemented.
1. Linkage maps of genes
   - mini- / microsatellites
2. Banding patterns of chromosomes
   - physical objects with visible landmarks called banding patterns
3. DNA sequences
   - Contig maps (contiguous clone maps)
   - Sequence tagged sites (STS)
   - SNPs (single nucleotide polymorphisms)

Ulf Schmitz, Introduction to genomics and proteomics II19 Linkage map

Ulf Schmitz, Introduction to genomics and proteomics II20 Maps of hereditary information
Variable number tandem repeats (VNTRs, also minisatellites)
- regions 8-80 bp long, repeated a variable number of times
- the distribution and the size of the repeats is the marker
- inheritance of VNTRs can be followed in a family and mapped to a pathological phenotype
- first genetic data used for personal identification: genetic fingerprints, in paternity and in criminal cases
Short tandem repeat polymorphisms (STRPs, also microsatellites)
- regions of 2-7 bp, repeated many times
- usually consecutive copies

Ulf Schmitz, Introduction to genomics and proteomics II21 [Figure: chromosome with centromere and a tandem repeat with a 3 bp unit, shown on both strands: CGTCGTCGTCGTCGTCGTCGTCGT... / GCAGCAGCAGCAGCAGCAGCAGCA...]
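Because the marker is the number of consecutive repeat copies, a tandem repeat can be sketched in code as a search for the longest uninterrupted run of a motif. The following is a minimal illustration, not a genotyping method: the flanking sequences and the two alleles are invented, and the function name is hypothetical. Only the CGT repeat unit is taken from the slide.

```python
import re


def longest_tandem_repeat(sequence, motif):
    """Return (start, copies) for the longest run of consecutive motif copies.

    Returns (-1, 0) if the motif does not occur at all.
    """
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    best_start, best_copies = -1, 0
    for match in pattern.finditer(sequence):
        copies = len(match.group(0)) // len(motif)
        if copies > best_copies:
            best_start, best_copies = match.start(), copies
    return best_start, best_copies


# Two made-up alleles that differ only in the number of CGT copies.
allele_a = "TTGA" + "CGT" * 8 + "AACC"
allele_b = "TTGA" + "CGT" * 5 + "AACC"

print(longest_tandem_repeat(allele_a, "CGT"))  # (4, 8)
print(longest_tandem_repeat(allele_b, "CGT"))  # (4, 5)
```

The differing copy numbers (8 versus 5) are what would distinguish the two alleles when such a repeat is used as a marker.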

Ulf Schmitz, Introduction to genomics and proteomics II22 Maps of hereditary information Banding patterns of chromosomes

Ulf Schmitz, Introduction to genomics and proteomics II23 Maps of hereditary information Banding patterns of chromosomes [Figure labels: p arm (petite, the short arm), centromere, q arm (queue, the long arm)]

Ulf Schmitz, Introduction to genomics and proteomics II24 Maps of hereditary information
Contig map (also contiguous clone map)
- series of overlapping DNA clones of known order along a chromosome from an organism of interest, stored in yeast or bacterial cells as YACs (Yeast Artificial Chromosomes) or BACs (Bacterial Artificial Chromosomes)
- a contig map produces a fine mapping (high resolution) of a genome
- a YAC can contain up to 10^6 bp, a BAC about bp
Sequence tagged site (STS)
- short, sequenced region of DNA, bp long, that appears in a unique location in the genome
- one type arises from an EST (expressed sequence tag), a piece of cDNA
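The idea of a contig as a chain of overlapping clones can be sketched as an interval-merging problem. This is a deliberate simplification: it assumes each clone's start and end coordinates along the chromosome are already known, whereas in practice the order and overlaps have to be inferred, for example from shared markers. The clone names, coordinates and the function name are hypothetical.

```python
def build_contigs(clones):
    """Group clones, given as (name, start, end), into contigs.

    A clone joins the current contig if it starts at or before the
    position where that contig currently ends.
    """
    contigs = []
    for name, start, end in sorted(clones, key=lambda clone: clone[1]):
        if contigs and start <= contigs[-1]["end"]:
            contigs[-1]["clones"].append(name)
            contigs[-1]["end"] = max(contigs[-1]["end"], end)
        else:
            contigs.append({"start": start, "end": end, "clones": [name]})
    return contigs


# Hypothetical clones with made-up coordinates (arbitrary units).
clones = [("clone_B", 120, 260), ("clone_A", 0, 150), ("clone_C", 400, 520)]
for contig in build_contigs(clones):
    print(contig["start"], contig["end"], contig["clones"])
```

With these invented coordinates, clone_A and clone_B overlap and form one contig, while clone_C is left as a separate contig because a gap remains between 260 and 400.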

Ulf Schmitz, Introduction to genomics and proteomics II25 Maps of hereditary information
Imagine we know that a disease results from a specific defective protein:
1. If we know the protein involved, we can pursue rational approaches to therapy.
2. If we know the gene involved, we can devise tests to identify sufferers or carriers.
3. Knowledge of the chromosomal location of the gene, by contrast, is in many cases unnecessary for either therapy or detection; it is required only for identifying the gene, providing a bridge between the patterns of inheritance and the DNA sequence.

Ulf Schmitz, Introduction to genomics and proteomics II26 Single nucleotide polymorphisms (SNPs)
- a SNP (pronounced ‘snip’) is a genetic variation between individuals
- single base pairs that can be substituted, deleted or inserted
- SNPs are distributed throughout the genome, on average one every 2000 bp
- SNPs provide markers for mapping genes
- not all SNPs are linked to diseases
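As a rough back-of-the-envelope check, the stated average spacing implies an order of magnitude for the total number of SNPs. The genome size used here is an assumption that does not appear on the slide (a human genome of roughly 3 billion base pairs); only the 2000 bp spacing comes from the slide.

```python
GENOME_SIZE_BP = 3_000_000_000  # assumption: approximate human genome size
AVERAGE_SPACING_BP = 2000       # average distance between SNPs, from the slide

expected_snps = GENOME_SIZE_BP // AVERAGE_SPACING_BP
print(f"{expected_snps:,}")  # 1,500,000
```

Under these assumptions the spacing corresponds to about 1.5 million SNPs genome-wide, which gives a sense of how dense a SNP-based marker map can be.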

Ulf Schmitz, Introduction to genomics and proteomics II27 Single nucleotide polymorphisms (SNPs)
- nonsense mutation: the changed codon codes for a stop, which can truncate the protein
- missense mutation: the changed codon codes for a different amino acid
- silent mutation: the changed codon codes for the same amino acid, so the protein sequence is unaffected
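The three categories above can be illustrated by translating a codon before and after a single-base substitution. This is a minimal sketch using the standard genetic code; it covers substitutions only, not insertions or deletions, and it assumes the original codon encodes an amino acid. The function name and the example codon are chosen for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_substitution(codon, position, new_base):
    """Classify a single-base substitution in a coding-strand DNA codon.

    position is 0, 1 or 2. Assumes the original codon is not a stop codon.
    """
    mutated = codon[:position] + new_base + codon[position + 1:]
    before, after = CODON_TABLE[codon], CODON_TABLE[mutated]
    if after == before:
        return "silent"
    if after == "*":
        return "nonsense"
    return "missense"


print(classify_substitution("GAG", 2, "A"))  # silent:   GAG (Glu) -> GAA (Glu)
print(classify_substitution("GAG", 1, "T"))  # missense: GAG (Glu) -> GTG (Val)
print(classify_substitution("GAG", 0, "T"))  # nonsense: GAG (Glu) -> TAG (stop)
```

The same starting codon yields all three outcomes depending on which base changes, which is why the position of a SNP within a codon matters for its effect on the protein.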

Ulf Schmitz, Introduction to genomics and proteomics II28 Outlook: coming lecture
Bioinformatics Information Resources and Networks
- EMBnet (European Molecular Biology Network): DBs and tools
- NCBI (National Center for Biotechnology Information): DBs and tools
- Nucleic acid sequence databases
- Protein information resources
- Metabolic databases
- Mapping databases
- Databases concerning mutations
- Literature databases

Ulf Schmitz, Introduction to genomics and proteomics II29 Thanks for your attention!