ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
BIOINFORMATICS Ency Lee.
How to use the web for bioinformatics Molecular Technologies Ethan Strauss X 1171
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Archives and Information Retrieval
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
The Protein Data Bank (PDB)
Protein databases Henrik Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Resources in The Internet Freeware and Web Sites.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Chapter 2 Sequence databases A list of the databases’ uniform resource locators (URLs) discussed in this section is in Box 2.1.
Bioinformatics Resources and Tools on the Web: A Primer.
An Introduction to Bioinformatics Molecular Biology Databases.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Bioinformatics for biomedicine
Introduction to databases Tuomas Hätinen. Topics File Formats Databases -Primary structure: UniProt -Tertiary structure: PDB Database integration system.
Master’s Degrees in Bioinformatics in Switzerland: Past, present and near future Patricia M. Palagi Swiss Institute of Bioinformatics.
Biological Databases By : Lim Yun Ping E mail :
Doug Raiford Lesson 3.  More and more sequence data is being generated every day  Useless if not made available to other researchers.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Copyright © 2009 Pearson Education, Inc. Art and Photos in PowerPoint ® Concepts of Genetics Ninth Edition Klug, Cummings, Spencer, Palladino Chapter 21.
1 Review of Biological Database Utilization. 2 Biological Databases We will discuss: Usefulness to the bioinformaticist Database types Search methods.
Biological Databases and Tools Sandra Sinisi / Kathryn Steiger November 25, 2002.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
An Automated System for Deep Proteome Annotation Gary Van Domselaar †, Savita Shrivastava, Paul Stothard and David S. Wishart ‡ Unannotated Protein Sequence.
Organizing information in the post-genomic era The rise of bioinformatics.
Copyright © 2009 Pearson Education, Inc. Genomics, Bioinformatics, and Proteomics Chapter 21 Lecture Concepts of Genetics Tenth Edition.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
Protein Database David Shiuan Department of Life Science Institute of Biotechnology Interdisciplinary Program of Bioinformatics National Dong Hwa University.
REMINDERS 2 nd Exam on Nov.17 Coverage: Central Dogma of DNA Replication Transcription Translation Cell structure and function Recombinant DNA technology.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
XML Standards for Proteomics Data Andrew Jones, Dr Jonathan Wastling and Dr Ela Hunt Department of Computing Science and the Institute of Biomedical and.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
EB3233 Bioinformatics Introduction to Bioinformatics.
Protein Domain Database
Application of Bioinformatics in Genetic Research Instructors: Dr. Henry Baker Dr. Luciano Brocchieri Dr. Michele Tennant Dr. Lei Zhou
Bioinformatics and Computational Biology
Computer Storage of Sequences
B i o i n f o r m a t i c s / B i o m e d i c a l A p p l i c a t i o n s i n E E L A Mexico, D.F., october 22 – 26, e – s c i e n c e M e x i c.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
High throughput biology data management and data intensive computing drivers George Michaels.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Bioinformatics Computing 1 CMP 807 – Day 4 Kevin Galens.
BME435 BIOINFORMATICS.
Archives and Information Retrieval
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
생물정보학 Bioinformatics.
Mangaldai College, Mangaldai
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview

Bioinformatics Bioinformatics in the context of molecular biology Traditionally, molecular biology research was carried out entirely at the experimental laboratory bench but the huge increase in the scale of data being produced in this genomic era has seen a need to utilize computational processing.

Bioinformatics Following on from the explosion in the volume of genomic data, similar increase in data have been observed in the fields of proteomics, transcriptomics and metabalomics. There are three central biological processes around which bioinformatics tools have been developed: DNA sequence determines protein sequence Protein sequence determines protein structure Protein structure determines protein function

Bioinformatics Widely defined, bioinformatics is the application of computer technology to the management and analysis of biological data. The result is that computers are being used to gather, store, analyze and merge biological data.

Bioinformatics The ultimate goal of bioinformatics is to uncover the wealth of biological information hidden in the mass of data and obtain a clearer insight into the fundamental biology of organisms. The integration of information learned about these key biological processes should allow us to achieve the ultimate goal However, the molecular biology of an organism is a very complex issue even with research being carried out at different levels including the genome, proteome, transcriptome and metabalome levels, more can also be looked at in an integrative comu

Bioinformatics The challenge facing the bioinformatics community today is: Intelligent and efficient storage of this mass of data. Development of tools to allow the extraction of meaningful biological information. 



Examples of Bioinformatics Database interfaces Genbank/EMBL/DDBJ, Medline, SwissProt, PDB, … Sequence alignment / Multiple sequence alignment BLAST, FASTA, Clustal, Muscle Gene finding Genscan, GenomeScan, GeneMark, GRAIL Protein Domain analysis and identification pfam, BLOCKS, ProDom, Pattern Identification/Characterization Gibbs Sampler, AlignACE, MEME Protein Folding prediction PredictProtein, SwissModeler

Some key bioinformatics websites NCBI (The National Center for Biotechnology Information) http://www.ncbi.nlm.nih.gov/ EBI (The European Bioinformatics Institute) http://www.ebi.ac.uk/ UCSC Genome bioinformatics http://genome.ucsc.edu/ SwissProt/ExPASy (Swiss Bioinformatics Resource) http://www.expasy.org/ PDB (The Protein Databank) http://www.rcsb.org/pdb/ You are using someone else’s computer You are (probably) getting a reduced set of options or capacity Servers are great for sporadic or proof-of-principle work, but for intensive work, the software should be obtained and run locally

NCBI (The National Center for Biotechnology Information) Entrez interface to databases Medline/OMIM Genbank/Genpept/Structures BLAST server(s) Many flavors of blast Draft Human Genome Much more…

EBI (The European Bioinformatics Institute) SRS (sequence retrieval system) database interface EMBL, SwissProt, and many more Many server-based tools ClustalW, DALI, …

SwissProt/ExPASy Curation! Error rate in the information is greatly reduced in comparison to most other databases. Extensive cross-linking to other data sources SwissProt is the ‘gold-standard’ by which other databases can be measured, and is the best place to start if you have a specific protein to investigate

ExPASy Overview Historically Expasy has been one of the main online bioinformatics resources for proteomics. Evolving into an extensible and integrative portal for accessing many scientific resources, databases, and software in different areas of life sciences: Genomics Phylogeny/evolution Systems biology Population genetics Transcriptomics etc.

ExPASy Overview (cont.) The individual resources databases web-based and downloadable software tools are hosted in a decentralized way by different groups of the SIB Swiss Institute of Bioinformatics and partner institutions. Single web portal provides a common entry point to a wide range of resources developed and operated by different SIB groups and external institutions.

Visual Guidance (entry point)

Categorized Resources

Categories (example 1)

Categories (example 2)

Categories Proteomics Genomics Protein sequences and identification Structural bioinformatics System biology Phylogeny/evolution Population genetics Transcriptomics Biophysics Imaging IT infrastructure Drug design Proteomics Protein sequences and identification Mass spectrometry and 2-DE data Protein characterization and function Families, patterns and profiles Post-translational modification Protein structure Protein-protein interaction Similarity search/alignment 11 Categories Genomics Sequence alignment Similarity search Characterization/annotation

Advanced search/query Query all databases (i.e. launch a query across several SIB resources) Find resources: discover resources hosted on the portal (incl. auto-completion)

Querying All Databases

Finding Resources