Advanced SRS Course 12/12/02 -Linking -Subentries -Applications.

Slides:



Advertisements
Similar presentations
THOMSON REUTERS INTEGRITY SM : INTEGRATED DRUG DISCOVERY AND DEVELOPMENT PORTAL.
Advertisements

PantherSoft Financials Smart Internal Billing. Agenda  Benefits  Security and User Roles  Definitions  Workflow  Defining/Modifying Items  Creating.
European Bioinformatic Institute.
Orchard Harvest™ LIS Review Results Training
On line (DNA and amino acid) Sequence Information Lecture 7.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Microsoft Office 2007: Introductory Computer Applications 11.
Concepts of Database Management Seventh Edition
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
UNESCO ICTLIP Module 4. Lesson 3 Database Design, and Information Storage and Retrieval Lesson 3. Information storage and retrieval using WinISIS.
Access Tutorial 3 Maintaining and Querying a Database
XP Chapter 3 Succeeding in Business with Microsoft Office Access 2003: A Problem-Solving Approach 1 Analyzing Data For Effective Decision Making.
©CMBI 2007 Search tools Google, MRS, (SRS). ©CMBI 2007 Search tools Google= Thé best generic search and retrieval system MRS= Maarten’s Retrieval System.
©CMBI 2005 Search tools Google, MRS, SRS. ©CMBI 2004 Search tools SRS = Sequence Retrieval System MRS = Maarten’s Retrieval System Google = Thé best generic.
HKUHKU Computer Centre Introduction to SRS Frankie Cheung
Concepts of Database Management Sixth Edition
Chapter 2 Sequence databases A list of the databases’ uniform resource locators (URLs) discussed in this section is in Box 2.1.
Concepts of Database Management Sixth Edition
Access Tutorial 3 Maintaining and Querying a Database
Using structure alignment tools. Structure alignment View a structural alignment of the P53 1T4F protein with Catalytic And Tetramerization Domains From.
Using 3D-SURFER. Before you start 3D-Surfer can be accessed at For visualization.
COMPREHENSIVE Word Tutorial 10 Managing Long Documents.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
1 Access Lesson 3 Creating Queries Microsoft Office 2010 Introductory Pasewark & Pasewark.
1 Access Lesson 3 Creating Queries Microsoft Office 2010 Introductory.
Maintaining and Querying a Database Microsoft Access 2010.
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
XP New Perspectives on Microsoft Access 2002 Tutorial 51 Microsoft Access 2002 Tutorial 5 – Enhancing a Table’s Design, and Creating Advanced Queries and.
Chapter 4 The Relational Model 3: Advanced Topics Concepts of Database Management Seventh Edition.
Credit Union National Association Project Zip Code Using your Data: Queries and Repo rts.
Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.
Pathway Assignments. The assignment – Annotating Pathways KEGG Pathway Database.
CHAPTER 13 Creating a Workbook Part 2. Learning Objectives Work with cells and ranges Work with formulas and functions Preview and print a workbook 2.
Automating Database Processing Chapter 6. Chapter Introduction Design and implement user-friendly menu – Called navigation form Macros – Automate repetitive.
Analyzing Data For Effective Decision Making Chapter 3.
XP New Perspectives on Integrating Microsoft Office XP Tutorial 2 1 Integrating Microsoft Office XP Tutorial 2 – Integrating Word, Excel, and Access.
SAGExplore web server tutorial for Module II: Genome Mapping.
Moodle (Course Management Systems). Glossaries Moodle has a tool to help you and your students develop glossaries of terms and embed them in your course.
XP New Perspectives on Microsoft Office Access 2003 Tutorial 9 1 Microsoft Office Access 2003 Tutorial 9 – Using Action Queries, and Defining Table Relationships.
® Microsoft Office 2010 Access Tutorial 3 Maintaining and Querying a Database.
SEQUENCE RETRIEVAL SYSTEM SEQUENCE RETRIEVAL SYSTEM SRS SRS Ashwin Sivakumar, 02/12/03 Ashwin Sivakumar, 02/12/03 Hands on Workshop on Protein Analysis.
Concepts of Database Management Seventh Edition
® Microsoft Office 2010 Access Tutorial 3 Maintaining and Querying a Database.
Concepts of Database Management Seventh Edition
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Performing Calculations—1 of 2 In addition to using queries to retrieve, update, sort, and filter data in a database, you can use a query to perform calculations.
Copyright OpenHelix. No use or reproduction without express written consent1.
SRS Introductory Course 5/12/ Temporary and permanent sessions - Simple querying - Browsing indices - Standard and extended query forms - User defined.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Copyright OpenHelix. No use or reproduction without express written consent1.
SAGExplore web server tutorial. The SAGExplore server has three different modules …
Protein sequence databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen This also includes old material from my thesis
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Protein databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen and from CSC bio-opas
Access Module Implementing a Database with Microsoft Access A Great Module on Your CD.
Welcome to the combined BLAST and Genome Browser Tutorial.
Welcome to the Protein Database Tutorial. This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Basic Navigation in Oracle R12 BY: Muhammad Irfan.
SOURCE LANGUAGE TOOLS Paratext 7.6 Source Language Text Window Source Language Search Tool Source Language Dictionary.
DNA / protein sequence analysis 第九組成員: 吳宇軒 侯卜夫 朱子豪 王俊偉
In this session, you will learn to:
Single Sample Registration
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
Genome Center of Wisconsin, UW-Madison
Welcome to the Protein Database Tutorial
Benchmark Series Microsoft Word 2016 Level 2
Word offers a number of features to help you streamline the formatting of documents. In this chapter, you will learn how to use predesigned building blocks.
Basic Local Alignment Search Tool
Presentation transcript:

Advanced SRS Course 12/12/02 -Linking -Subentries -Applications

Linking in SRS

Types of Links Hyperlinks -links between entries which are displayed as hypertext -useful for examining entries that are referenced directly from entries Query links -allow you to construct queries using the relationships between databanks -require SRS to search through entries or indices in other databanks looking for matches

Links between Databases SWISS-PROT EMBL PDB InterPro PROSITE PFAM BLOCKS

Advantages of SRS Linking Links are bi-directional ABC Direct link from ‘A’ to ‘B’Direct link from ‘B’ to ‘C’ Multistep link from ‘A’ to ‘C’

Database Network Graph

The link page Two forms of the link page: -type you see if you initiate linking from either the query manager or the query result page -type you see if you initiate linking from an individual entry page The difference is at the top of these pages – one provides a “find all entries” option, the other does not (see next 2 slides)

From query manager or query result page

Find all entries options In the selected databanks which are linked to the current query - this returns entries from other databanks which have links with entries in the current query In the current query which are linked to all selected databanks - this limits the query so that it includes only the entries(from the original query) which are linked to all of the selected databanks In the current query which are not linked to any of the selected databanks - this limits the query so that it includes only the entries(from the original query) which do not have links to the specified databanks

From an individual entry page No ‘find all entries’ options available

Linking from query manager page Can link from a single query or from multiple queries two ways to link your queries from the query manager page: - tick the checkbox that corresponds to a query set and click the LINK button - use the text box beside the Expression button

Expression linking Useful alternative to using the linking pages Can be used to search for a link between two or more sets of results or between a set of results and a databank

Linking operators < entries in the set or databank to the left of the operator are returned if they have a link to any entries in the set or databank to the right of the operator > entries in the set or databank to the right of the operator are returned if they have a link to any entries in the set or databank to the left of the operator

Linking operations <Q1 < Q2In Q1 that link to Q2 >Q1 > Q2In Q2 that link to Q1 combined with logical operators: < &Q1<Q2 & Q3In Q1 that link to Q2 &Q3 < |Q1<Q2 | Q3In Q1 that link to Q2 or Q3 < !Q1<Q2 ! Q3In Q1 that link to Q2 but not Q3

A1 A2 A3 A4 A5 A6 B1 B2 B3 B4 B5 A B A > B is B2 B3 B4 (all entries in B that have links to A) A < B is A1 A2 A5 A6 (all entries in A that have links to B)

Subentries

Necessary when there is repeated structured information within an entry FT DOMAIN 1 12 LUMENAL (POTENTIAL).DOMAIN FT TRANSMEM POTENTIAL.TRANSMEM FT DOMAIN CYTOPLASMIC (POTENTIAL).DOMAIN FT TRANSMEM POTENTIAL.TRANSMEM FT DOMAIN LUMENAL (POTENTIAL).DOMAIN DR EMBL; L44581; AAA ; -.L44581AAA99933 DR EMBL; L44582; AAA ; -.L44582AAA99934 DR EMBL; L44583; AAA ; -.L44583AAA99935 DR EMBL; L44584; AAA ; -.L44584AAA99936 DR EMBL; L44585; AAA ; -.L44585AAA99937 …..

Use subentries to: Search for entries containing one or more subentries with certain values and obtain a list of entry references Search for subentries with certain values and obtain a list of subentry references

Subentries have a double function: They are part of the entry and often require data from other fields in order for their meaning to be resolved and displayed Example: a SWISS-PROT feature requires part of the entry’s sequence to be displayed They can be regarded as databanks themselves and can be indexed and queried independently from the entries Example: search all the transmembrane segments with a given range of length

Subentries available Protein databases have 5 subentries: –Reference –Comment –Links –Feature –Counter Nucleotide databases have 3 subentries: –Reference –Features –Counters

Controlled vocabularies Some of the fields belonging to the subentries have a predetermined number of keys ( as specified by the database documentation). These fields have a controlled vocabulary and when you use the extended query forms you can select a value from a drop down menu. Examples are: –CommentType –DbName –FtKey –CountItem

The Counter subentry This is a special subentry created by SRS on the fly. It counts the number of times particular feature keys, comment types and links to a certain database occur within an entry It can be used to answer questions like: –How many entries have 3 or more links to EMBL? –How many entries have more than 8 disulphide bridges? –How many entries have 2 or more comments about function?

Subentry fields In the standard query form each subentry field name is preceded by the name of the subentry to which it belongs: Reference:authors Feature:FtKey Links:DbName The extended query form is divided up into sections. The top section contains the fields belonging to entry and below this are the subentries and the fields that they contain

Links with sets containing subentries Two types: –Simple Links –Parent Links

It is not possible to combine sets of entries with sets of subentries using the logical operators but link operators may be used between sets of entries and sets of subentries [swissprot-org:human] > [swissprot-ftkey:transmem] gives a set of transmembrane segment subentries found in human proteins [swissprot-org:human] < [swissprot-ftkey:transmem] returns all human entries that have a transmembrane segment Simple links

Parent Links Sometimes it is necessary to do an explicit conversion from subentries to entries. This can be done using the operand parent. This method looks for links from the subentries to their respective parent entries and retrieves a set containing parent entries. [swissprot-ftkey:transmem] > parent gives the parent entries for the set of subentries from SWISS-PROT that have transmembrane sequence features Logical operators can then be used to combine the set of parent entries with another set of entries

Types of entries Query Form

Using entry….. Feature that is 10 aa in length

Using feature….. Only returns transmem regions of exactly 10 aa

Applications

Applications in SRS SWISSPROT Upload user owned data Sequence query Run BLAST launch BLAST results - text file BLAST Indexing linking Pathway Prosite

Protein Applications in SRS Homology and similarity tools: BLASTP : database search tool FASTA : database search tool MPSrch Protein function analysis tools: PPSearh : BLASTProdom ScanRegExp FingerPrintScan PfScan InterProScan MPSrch Sequence analysis tools: - ClustalW

Nucleotide applications in SRS Homology and similarity tools BLASTN NFASTA FASTX FASTY Sequence analysis tools: NClustalW RestrictionMap