Protein databases Henrik Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.

Slides:



Advertisements
Similar presentations
Click Here to Begin Your Lab
Advertisements

Translation By Josh Morris.
Mutations. DNA mRNA Transcription Introduction of Molecular Biology Cell Polypeptide (protein) Translation Ribosome.
Transcription & Translation Worksheet
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Transcription and Translation
Transcription and Translation
Bioinformatics.
Proteins are made by decoding the Information in DNA Proteins are not built directly from DNA.
FEATURES OF GENETIC CODE AND NON SENSE CODONS
Introduction to databases Tuomas Hätinen. Topics File Formats Databases -Primary structure: UniProt -Tertiary structure: PDB Database integration system.
Concepts and Applications Eighth Edition
How Proteins are Produced
DNA.
Sec 5.1 / 5.2. One Gene – One Polypeptide Hypothesis early 20 th century – Archibald Garrod physician that noticed that some metabolic errors were found.
DNA The Secret of Life. Deoxyribonucleic Acid DNA is the molecule responsible for controlling the activities of the cell It is the hereditary molecule.
PowerPoint ® Lecture Slides prepared by Janice Meeking, Mount Royal College C H A P T E R Copyright © 2010 Pearson Education, Inc. 3 Cells: The Living.
Wellcome Trust Workshop Working with Pathogen Genomes Module 2 Gene Prediction.
GENE EXPRESSION. Gene Expression Our phenotype is the result of the expression of proteins Different alleles encode for slightly different proteins Protein.
Gene Expression: From Gene to Protein
RNA Structure Like DNA, RNA is a nucleic acid. RNA is a nucleic acid made up of repeating nucleotides.
7. Protein Synthesis and the Genetic Code a). Overview of translation i). Requirements for protein synthesis ii). messenger RNA iii). Ribosomes and polysomes.
Chapter 11 DNA and Genes.
Cell Division and Gene Expression
Chapter 14 Genetic Code and Transcription. You Must Know The differences between replication (from chapter 13), transcription and translation and the.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
Today 14.2 & 14.4 Transcription and Translation /student_view0/chapter3/animation__p rotein_synthesis__quiz_3_.html.
Example 1 DNA Triplet mRNA Codon tRNA anticodon A U A T A U G C G
G U A C G U A C C A U G G U A C A C U G UUU UUC UUA UCU UUG UCC UCA
Protein Synthesis Translation e.com/watch?v=_ Q2Ba2cFAew (central dogma song) e.com/watch?v=_ Q2Ba2cFAew.
Figure 17.4 DNA molecule Gene 1 Gene 2 Gene 3 DNA strand (template) TRANSCRIPTION mRNA Protein TRANSLATION Amino acid ACC AAACCGAG T UGG U UU G GC UC.
How Genes Work: From DNA to RNA to Protein Chapter 17.
Gene Translation:RNA -> Protein How does a particular sequence of nucleotides specify a particular sequence of amino acids?nucleotidesamino acids The answer:
F. PROTEIN SYNTHESIS [or translating the message]
DNA.
From DNA to Protein.
Protein databases Henrik Nielsen
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Please turn in your homework
The blueprint of life; from DNA to Protein
Where is Cytochrome C? What is the role? Where does it come from?
What is Transcription and who is involved?
From Gene to Phenotype- part 2
Ch. 17 From Gene to Protein Thought Questions
Gene Expression: From Gene to Protein
Overview: The Flow of Genetic Information
Section Objectives Relate the concept of the gene to the sequence of nucleotides in DNA. Sequence the steps involved in protein synthesis.
Protein Synthesis Translation.
Overview: The Flow of Genetic Information
DNA The Secret of Life.
Cards created by Kelly Riedell Brookings High School Brookings, SD
Transcription You’re made of meat, which is made of protein.
Gene Expression: From Gene to Protein
Chapter 17 From Gene to Protein.
SC-100 Class 25 Molecular Genetics
Warm Up 3 2/5 Can DNA leave the nucleus?
Protein Structure Timothy G. Standish, Ph. D..
Today’s notes from the student table Something to write with
Transcription and Translation
Central Dogma and the Genetic Code
Bellringer Please answer on your bellringer sheet:
DNA, RNA, Amino Acids, Proteins, and Genes!.
How does DNA control our characteristics?
DNA and Words Activity.
Mutations Timothy G. Standish, Ph. D..
12.2 Replication of DNA DNA replication is the process of copying a DNA molecule. Semiconservative replication - each strand of the original double helix.
Presentation transcript:

Protein databases Henrik Nielsen

Background- Nucleotide databases GenBank, National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), USA. EMBL, European Bioinformatics Institute (EBI), England (Established in 1980 by the European Molecular Biology Laboratory, Heidelberg, Tyskland) DDBJ, National Institue of Genetics, Japan Together they form International Nucleotide Sequence Database Collaboration,

Protein databases Swiss-Prot, Established in 1986 in Swizerland ExPASy (Expert Protein Analysis System) Swiss Institute of Bioinformatics (SIB) and European Bioinformatics Institute (EBI) PIR, Established in 1984 National Biomedical Research Foundation, Georgetown University, USA In 2002 merged into: UniProt, A collaboration between SIB, EBI and Georgetown University.

UniProt UniProt Knowledgebase (UniProtKB) UniProt Reference Clusters (UniRef) UniProt Archive (UniParc) UniProt Knowledgebase Release 2011_02 (08-Feb-11) consists of: UniProtKB/Swiss-Prot: Annotated manually (curated) 525,207 entries UniProtKB/TrEMBL: Computer annotated 13,499,622 entries

Growth of UniProt Swiss-Prot TrEMBL

Content of UniProt Knowledgebase Amino acid sequences Functional and structural annotations –Function / activity –Secondary structure –Subcellular location –Mutations, phenotypes –Post-translational modifications Origin –organism: Species, subspecies; classification –tissue References Cross references

Amino acid sequences From where do you get amino acid sequences? Translation of nucleotide sequences (GenBank/EMBL/DDBJ) Amino acids sequencing: Edman degradation Mass spectrometry 3D-structures

Protein structure Primary structure: Amino acid sequence Secondary structure: ”Backbone” hydrogen bonding Alpha helix / Beta sheet / Turn Tertiary structure: Fold, 3D coordinates Quaternary structure: subunits

Subcellular location An animal cell:

Post-translational modifications Cleavage of signal peptide, transit peptide or pro-peptide Phosphorylation Glycosylation Lipid anchors Disulfide bond Prosthetic groups (e.g. metal ions)

Content of UniProt Knowledgebase Name, entry data etc Organism Functional annotations (comments) Sequence References Cross references –3D structure - PDB –EMBL –..

Evidence 3 types of non-experimental qualifiers in Sequence annotation and General comment: –Potential: Predicted using sequence analysis –Probable: Uncertain experimental evidence –By similarity: Predicted using sequence similarity

Cross references Other databases (there are ~100 in total): Nucleotide sequences 3D structure Protein-protein interactions Enzymatic activities and pathways Gene expression (microarrays and 2D-PAGE) Ontologies Families and domains Organism specific databases

The genetic code Degenerate (redundant) but not ambiguous Almost universal (deviations found in mitochondria)

Reading Frames 1 A piece of an mRNA-strand: 5' 3' augcccaagcugaauagcguagagggguuuucaucauuugaggacgauguauaa can be divided into triplets (codons) in three ways: 1 aug ccc aag cug aau agc gua gag ggg uuu uca uca uuu gag gac gau gua uaa M P K L N S V E G F S S F E D D V * 2 ugc cca agc uga aua gcg uag agg ggu uuu cau cau uug agg acg aug uau C P S * I A * R G F H H L R T M Y 3 gcc caa gcu gaa uag cgu aga ggg guu uuc auc auu uga gga cga ugu aua A Q A E * R R G V F I I * G R C I Each possible set of triplets is called a reading frame.

Reading Frames 2 Since there are two strands in DNA, there are six possible reading frames in a piece of DNA (three in each direction): 3 A Q A E * R R G V F I I * G R C I 2 C P S * I A * R G F H H L R T M Y 1 M P K L N S V E G F S S F E D D V * 5' ATGCCCAAGCTGAATAGCGTAGAGGGGTTTTCATCATTTGAGGACGATGTATAA 3’ 3' TACGGGTTCGACTTATCGCATCTCCCCAAAAGTAGTAAACTCCTGCTACATATT 5’ H G L Q I A Y L P K * * K L V I Y L -1 G L S F L T S P N E D N S S S T Y -2 A W A S Y R L P T K M M Q P R H I -3 A reading frame from a start codon to the first stop codon is called an open reading frame (underlined above).