EBI is an Outstation of the European Molecular Biology Laboratory. EBI patent related services Jennifer McDowall Senior Scientist, EMBL-EBI 3 rd Annual.

Presentation transcript:

EBI is an Outstation of the European Molecular Biology Laboratory.
EBI patent related services
Jennifer McDowall, Senior Scientist, EMBL-EBI
3rd Annual Forum for SMEs, September 3-4, 2009

Overview
- Databases available
- Sequence archives
- Searching the databases

Databases available…

Sequence data from patent literature
Patent offices: EPO, USPTO, JPO
Databases: EMBL, GenBank, DDBJ
September 2009: nucleotide > 9.4m sequences; protein > 2.5m sequences
EPO policy: data are released to the public (and to EMBL) 18 months after the patent application date, independent of whether the patent has been granted.
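The 18-month rule above is simple date arithmetic. The sketch below is only an illustration: the function names are hypothetical, and clamping to the last day of a shorter month is an assumption, since the slide does not say how the EPO counts the period.

```python
from datetime import date
import calendar


def add_months(d: date, months: int) -> date:
    """Shift a date by whole calendar months.

    Assumption: if the target month is shorter, the day is clamped to
    that month's last day (the slide does not specify this detail).
    """
    index = d.year * 12 + (d.month - 1) + months
    year, month0 = divmod(index, 12)
    last_day = calendar.monthrange(year, month0 + 1)[1]
    return date(year, month0 + 1, min(d.day, last_day))


def earliest_public_release(application_date: date) -> date:
    """Earliest release date under the 18-month policy quoted on the slide."""
    return add_months(application_date, 18)


if __name__ == "__main__":
    print(earliest_public_release(date(2008, 3, 3)))
```

The release date depends only on the application date, which matches the slide's point that grant status plays no part.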

EMBL: Know the Data… Nucleotides
Release and updates

EMBL: Know the Data… Nucleotides
Release and updates
Divided into classes and divisions...
Classes:
- ANN – Annotated Constructed Seq
- CON – Constructed Sequence
- EST – Expressed Sequence Tag
- GSS – Genome Survey Sequence
- HTC – High Throughput cDNA
- HTG – High Throughput Genome
- PAT – Patent
- STS – Sequence Tagged Site
- STD – Standard
- TPA – Third Party Annotation
- TSA – Transcriptome Shotgun Assembly
- WGS – Whole Genome Shotgun

EMBL: Know the Data… Nucleotides
Release and updates
Divided into classes and divisions...
Divisions:
- HUM – Human
- MUS – Mouse
- ROD – Rodent (excluding mouse)
- MAM – Mammal (excluding human, mouse, rodent)
- VRT – Vertebrate (excluding human, mouse, rodent, mammal)
- FUN – Fungi
- PRO – Prokaryote
- ENV – Environment
- INV – Invertebrate
- PHG – Phage
- SYN – Synthetic
- PLN – Plant
- VIR – Viral
- TGN – Transgenic
- UNC – Unclassified

EMBL: Know the Data… Nucleotides
Release and updates
Divided into classes and divisions...
Supplementary sets: EMBL-CDS, EMBL-MGA
Specialist databases:
- Immunoglobulins (IMGT/HLA, IMGT/LIGM)
- Alternative splicing (ASTD)
- Completed proteomes (Ensembl, Integr8)
- Variation (HGVBase, dbSNP)

EMBL Patent Sequence Entry
- Version, dates, archive
- Patent number, title, link to patent

UniProt: Know the Data… Proteins
Release and updates

UniProt: Know the Data… Proteins
Release and updates
Divided into 3 sections:
- UniProtKB: taxonomic info, annotated sequence
  - Swiss-Prot: manual annotation
  - TrEMBL: automatic annotation
- UniRef: combines sequences by % ID (UniRef100, UniRef90, UniRef50)
- UniParc: protein archive; covers ALL proteins (including UniMES)

UniProt: Know the Data… Proteins
Release and updates
Divided into 3 sections
Specialist databases linked to UniProt:
- Structure (PDBe, SGT)
- Immunoglobulins (IMGT/HLA)
- Alternative splicing (ASTD)
- Completed proteomes (Ensembl, Integr8)
- Protein interactions (IntAct)
- Protein signatures (InterPro)
- Patent proteins (EPO, USPTO, JPO, KIPO)

Bulk download
- Nucleotide sequences
- Protein sequences

Bulk download: ftp.ebi.ac.uk/pub/databases/embl/patent/
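A bulk download location like the one above can be browsed from a script. This is a minimal sketch using Python's standard ftplib. Only the host and path come from the slide. The function names are hypothetical, anonymous login is an assumption, and the server layout may have changed since 2009.

```python
from ftplib import FTP

# Location quoted on the slide; nothing else about the server is known here.
PATENT_LOCATION = "ftp.ebi.ac.uk/pub/databases/embl/patent/"


def split_location(location: str) -> tuple[str, str]:
    """Split 'host/path/' into (host, '/path/')."""
    host, _, path = location.partition("/")
    return host, "/" + path


def list_patent_files(location: str = PATENT_LOCATION) -> list[str]:
    """List the names in the patent directory.

    Assumption: the server accepts anonymous FTP logins.
    """
    host, path = split_location(location)
    with FTP(host, timeout=30) as ftp:
        ftp.login()      # no arguments means an anonymous login
        ftp.cwd(path)
        return ftp.nlst()


if __name__ == "__main__":
    print(split_location(PATENT_LOCATION))
    # list_patent_files() needs network access, so it is not called here.
```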

Sequence archives…

Sequence archives
- EMBL nucleotide sequence version archive (SVA)
- UniSave – UniProt sequence/annotation version archive

EMBL sequence version archive (SVA)
- Enter accession #
- View old entries

Sequence record from EMBL SVA

Comparing versions in EMBL SVA
- Select and compare versions

UniProtKB sequence annotation version server (UniSave)
- Enter accession #

UniSave results
- Select and compare versions
- View old entries

Searching the databases…

EB-eye search by patent number
Search for patent WO

EB-eye search by patent number

EB-eye nucleotide sequences from WO

Sequence Similarity Search Tools
Toolbox:
- BLAST: NCBI-BLAST, WU-BLAST
- FASTA: FASTA suite
- Smith-Waterman: MPsrch, ScanPS, SSEARCH
- PSI search: PSI-SEARCH, PSI-BLAST

Blast v. patent nucleotide sequences

Fasta v. patent protein sequences

Tools: Genomes & Proteomes FASTA

When to use which search?
[Figure: FASTA, WU-BLAST, NCBI BLAST and PSI-SEARCH placed by database size and query length]

When to use which search?
[Figure: time to search with FASTA, WU-BLAST, NCBI BLAST and PSI-SEARCH across PDB, Swiss-Prot, UniRef50, UniRef90, UniRef100, UniProtKB and UniParc]

InterProScan protein signature search

InterPro signature database

Some search guidelines…

Search Guidelines
#1 Use the most appropriate tool for your search
- Don’t assume one tool will cater to all your search needs
[Figure: FASTA, WU-BLAST, NCBI BLAST and PSI-SEARCH placed by database size and query length]

Search Guidelines
#1 Use the most appropriate tool for your search
#2 Search options, from best to worst:
- Best: protein seq v. protein DB
- 2nd: translated DNA seq v. protein DB
- 3rd: DNA seq v. DNA DB
- Worst: protein seq v. translated DNA DB
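The ranking in guideline #2 can be written as a small lookup. The sketch below is illustrative only. The ranking comes from the slide, while the mapping to NCBI BLAST program names (blastp, blastx, blastn, tblastn) and the function name are additions made here, not part of the talk.

```python
# (query type, database type), ordered best to worst as on the slide.
RANKING = [
    ("protein", "protein"),          # best
    ("translated DNA", "protein"),   # 2nd
    ("DNA", "DNA"),                  # 3rd
    ("protein", "translated DNA"),   # worst
]

# Assumption: the corresponding NCBI BLAST program for each combination.
PROGRAM = {
    ("protein", "protein"): "blastp",
    ("translated DNA", "protein"): "blastx",
    ("DNA", "DNA"): "blastn",
    ("protein", "translated DNA"): "tblastn",
}


def best_option(candidates):
    """Pick the highest-ranked (query, database) pair among those available.

    Raises ValueError if a candidate is not one of the four ranked pairs.
    """
    return min(candidates, key=RANKING.index)


if __name__ == "__main__":
    # With a coding DNA query you can search DNA directly or translate it first.
    available = [("DNA", "DNA"), ("translated DNA", "protein")]
    choice = best_option(available)
    print(choice, PROGRAM[choice])
```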

Search Guidelines
#1 Use the most appropriate tool for your search
#2 Best search option: protein seq v. protein DB
#3 Search the smallest DB likely to have your sequence
#4 Check statistics – histograms...
#5 Change parameters when necessary (gap penalties, scoring matrices...)
#6 Don’t assume homologues have the same function
- Orthologs have similar functions
- Paralogs acquire different functions
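To see why guideline #5 matters, the sketch below computes a Smith-Waterman local alignment score with a simple match/mismatch scheme and a linear gap penalty. It is a teaching sketch under stated assumptions. The scoring values are arbitrary, and real tools use substitution matrices (such as BLOSUM62) and affine gap penalties.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score between a and b.

    Assumptions: flat match/mismatch scores instead of a substitution
    matrix, and a linear (per-residue) gap penalty.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    query, subject = "AAAACCCC", "AAAAGCCCC"
    for gap in (-2, -10):
        print(gap, smith_waterman_score(query, subject, gap=gap))
```

With a mild gap penalty the best alignment opens a gap to skip the extra residue. With a harsh one the ungapped alignment wins and the score changes, which is why the same pair of sequences can rank differently under different parameters.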

Search Guidelines
#1 Use the most appropriate tool for your search
#2 Best search option: protein seq v. protein DB
#3 Search the smallest DB likely to have your sequence
#4 Check statistics – histograms...
#5 Change parameters when necessary (gap penalties, scoring matrices...)
#6 Don’t assume homologues have the same function
#7 Use multiple sequence alignments to validate relatedness
#8 Consider filtering low complexity regions
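Guideline #8 can be illustrated with a toy filter that masks windows of low Shannon entropy. This is not the SEG or DUST method that search tools normally apply. The window size, threshold and function names are assumptions chosen for the example.

```python
import math
from collections import Counter


def window_entropy(window: str) -> float:
    """Shannon entropy (bits) of the residue composition of a window."""
    n = len(window)
    counts = Counter(window)
    return abs(-sum(c / n * math.log2(c / n) for c in counts.values()))


def mask_low_complexity(seq: str, window: int = 12,
                        threshold: float = 2.2) -> str:
    """Replace residues in any low-entropy window with 'X'.

    Assumption: window=12 and threshold=2.2 bits are illustrative values,
    not parameters taken from any real tool.
    """
    masked = list(seq)
    for start in range(len(seq) - window + 1):
        if window_entropy(seq[start:start + window]) < threshold:
            for i in range(start, start + window):
                masked[i] = "X"
    return "".join(masked)


if __name__ == "__main__":
    # A hypothetical sequence with a glutamine-rich stretch in the middle.
    example = "ACDEFGHIKLMN" + "Q" * 12 + "PRSTVWYACDEF"
    print(mask_low_complexity(example))
```

Masking such stretches before a search keeps repetitive, compositionally biased regions from producing high-scoring hits that do not reflect real relatedness.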

Typical workflow: search, review, check stats, compare, evolution, function

EBI is an Outstation of the European Molecular Biology Laboratory. Contacts: