Getting data out of XML These exercises provide an overview of how to use the native Taverna XPath services to get data out of XML.

Slides:



Advertisements
Similar presentations
Fathom Overview Workshop on using Fathom in School Improvement Planning (SIP)
Advertisements

XSL eXtensible Stylesheet Language. What is XSL? XSL is a language that allows one to describe a browser how to process an XML file. XSL can convert an.
1 XSLT – eXtensible Stylesheet Language Transformations Modified Slides from Dr. Sagiv.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.
Clearing your Cookies Google Chrome A short guide to help you navigate our website faster Brought to you by:
HELP GUIDE NEW USER REGISTRATION (SLIDE 2) TAKING A QUIZ (SLIDE 8) REVIEWING A QUIZ (SLIDE 17) GROUP MEMBERSHIP (SLIDE 26) CREATING QUIZZES (SLIDE 31)
® IBM Software Group © 2006 IBM Corporation The Eclipse Data Perspective and Database Explorer This section describes how to use the Eclipse Data Perspective,
Working with SharePoint Document Libraries. What are document libraries? Document libraries are collections of files that you can share with team members.
® IBM Software Group © 2006 IBM Corporation How to read/write XML using EGL This Learning Module shows how to utilize an EGL Library to read/write an XML.
So – You want to learn how to put an advanced article submission (cut and paste) onto the state website. (Note: If you have not done so, you will need.
Scottish Legal Aid Board Content Management using OpenCms Martin Spinks CTO Navyblue Tuesday, March 16, 2010.
If you are very familiar with SOAR, try these quick links: Principal’s SOAR checklist here here Term 1 tasks – new features in 2010 here here Term 1 tasks.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
1 iSee Player Tutorial Using the Forest Biomass Accumulation Model as an Example ( Tutorial Developed by: (
Using Backstage Lesson 2. Objectives Software Orientation: Backstage View Backstage view’s left-side navigation pane (see figure on the next slide) gives.
XP New Perspectives on XML, 2 nd Edition Tutorial 10 1 WORKING WITH THE DOCUMENT OBJECT MODEL TUTORIAL 10.
10-1 aslkjdhfalskhjfgalsdkfhalskdhjfglaskdhjflaskdhjfglaksjdhflakshflaksdhjfglaksjhflaksjhf.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
CHAPTER 9 Introducing Microsoft Office Learning Objectives Start Office programs and explore common elements Use the Ribbon Work with files Use.
CHAPTER 9 Introducing Microsoft Office Learning Objectives Start Office programs and explore common elements Use the Ribbon Work with files Use.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
XP New Perspectives on Integrating Microsoft Office XP Tutorial 2 1 Integrating Microsoft Office XP Tutorial 2 – Integrating Word, Excel, and Access.
So – You want to learn how to put a BLOG article onto the state website. (Note: If you have not done so, you will need to review the web training provided.
1 Working with MS SQL Server Textbook Chapter 14.
Putting Applets into Web Pages.  Two things are involved in the process of putting applets onto web pages ◦ The.class files of the applet ◦ The html.
2 Copyright © 2004, Oracle. All rights reserved. Running a Forms Developer Application.
C# Tutorial -1 ASP.NET Web Application with Visual Studio 2005.
Microsoft ® Office OneNote ® 2003 Training Get to know OneNote CGI presents:
Microsoft ® Outlook ® 2010 Training Mailbox management 1: Creating folders.
Execute Workflow. Home page To execute a workflow navigate to My Workflows Page.
SADI and Taverna 2 Tutorial David Withers. Preamble The Taverna 2 platform is constantly changing; while the look and feel of the workbench may change,
Basic & Advanced Reporting in TIMSNT ** Part Two **
Quick guide on making PowerPoint slides  PowerPoint is a presentation program  A PowerPoint slideshow is a stack of slides being presented one after.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
BioMoby and Taverna 2 Tutorial Mark Wilkinson, Edward Kawas, David Withers.
Level 1 Tutorial Project How to put a movie player on your Weebly website using an HTML code.
PowerPoint Basics Tutorial 4: Interactivity & Media PowerPoint can communicate with the outside world by linking to different applications, managing different.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Creating Your First Web Page – Topic: Creating a wiki, blog, image blog and podcast Click Arrow - Next Slide 1.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Debugging tools in Flash CIS 126. Debugging Flash provides several tools for testing ActionScript in your SWF files. –The Debugger, lets you find errors.
Primenumbers.co.uk This presentation will help you get the most out of this service.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
Describing and Annotating Experimental Data: Hands On.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
Exploring Taverna engine Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams
Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for.
Installing and Using Evolve Evolve is written in Java and runs on any system with Java 1.6 Download Evolve from Unzip.
Mendeley Reference Management Software Module II: Creating your Mendeley Library By Rehema Chande-Mallya (PhD)
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
Aleksandra Pawlik University of Manchester. Something that can be put into a workflow Well described - what the component does Behaves “well” - conforms.
Aleksandra Pawlik Alan Williams University of Manchester.
An Introduction to Designing and Executing Workflows with Taverna Part 2 – Importing and exporting data Norman Morrison University of Manchester Credits:
These exercises highlight the services that do not perform biological functions, but are vital for running life science workflows.
Product Training Program
IBM Rational Rhapsody Advanced Systems Training v7.5
Single Sample Registration
An Introduction to Designing and Executing Workflows with Taverna
T_C_N_L_G_ E D I D I E O Y O H I E B J I R E A A W.
Introducing Microsoft Office 2010
Taverna Tutorial exercise 2: REST services from BioCatalogue
An Introduction to Designing, Executing and Sharing Workflows with Taverna and myExperiment Katy Wolstencroft University of Manchester.
Designing and Using Normalization Rules
Download from Zotero Home Page
Tutorial 7 – Integrating Access With the Web and With Other Programs
Xpath service Getting data out of XML Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester 1.
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
An Introduction to Designing and Executing Workflows with Taverna
Presentation transcript:

Getting data out of XML These exercises provide an overview of how to use the native Taverna XPath services to get data out of XML

The Basics of XML  XML – eXtensible Markup Language  Designed for the storage and transport of data  This includes passing data between services or retrieving data from a Web page  Provides a machine readable dataset  Many service providers export data in XML

Example Katy Paul Reminder Don't forget about Bonn Trip! The following website has lost of information about XML and tutorials:

Exploring XML  Identify the root/top element in the example xml  Find all the child elements  What does each line end with ? If you get stuck, try exploring the W3Schools website for answers – the syntax page is especially good !!!

Workflows to retrieve XML

 Load into Taverna the ‘Search Pubmed’ workflow, from the Bonn myExperiment group   Run the workflow and see what output you get from Pubmed  try “Blood Clotting” as a search term if you can’t think of anything  Find the root and child elements in the xml  See if you can find the list of Pubmed ids  How many ids did you get for your search term?  There should be a count of them somewhere Exploring XML

 You should get something like this (with other elements too)  Familiarise yourself with this data  We’ll be extracting some of it next

Xpath and Getting the Data out  Xpath used to navigate through elements of XML  Used to find nodes, and data at those nodes  ‘Expressions’ are used to navigate through the document  Further details on what to use can be found at:  More information at: Sample Expressions

Lets have an example Katy Paul Reminder Don't forget about Bonn Trip! To get ‘Katy’ from the XML ‘Katy’ is under the element Navigate through the XML, starting at element, and ending at element So the Xpath expression would be: /note/to

Xpath in Taverna  Taverna has 2 modes of Xpath functionality  ‘XML from Text’ local java service  ‘Xpath Service’ Template  The local java service is designed for people who know the Xpath query they want to use and are confident in writing XPath  The Xpath Service Template is designed for a dynamic/exploratory retrieval of data…… and for those who are not confident writing XPath straight away  To start with, we will use only the Xpath Service Template

Xpath using the Service Template

Install the Xpath Plugin  To install the Xpath service template, you will have to update the Taverna Workbench  Click on 'Advanced', then select 'Updates and Plugins'  In the pop-up menu, click on the 'find new plugins' button  Find the Xpath update, and click 'Install'  You will need to restart Taverna for this to work correctly  Don't forget to save any workflows you have open !!!

Getting the Data out using the template  In Taverna, find the service template for XML data processing  Drag the service template onto an empty workflow  The configuration window should automatically open  Copy and Paste the example xml (the Katy XML from previous slide) into the relevant section of the popup box  If you haven’t got the data, you can get it from here:  Press the green arrow to generate XML tree structure (on the right hand side)

Getting the Data out Paste here Press this

Getting the Data out  You should be able to see the XML tree structure  Explore it by clicking on the “+” arrows to open and close nodes  Find the node and select it  Note, it also selects the root node – making a path through the XML to the IdList node  Click the ‘Generate Xpath Expression’ button  You should see the Xpath, or path to XML element, given as: /note/to

Getting the Data out Xpath Expression Data from XML

Getting the Data out using the template  In Taverna, find the service template for XML data processing  Drag the service template onto the ‘Search Pubmed’ workflow  The configuration window should automatically open  Paste the xml from your results pane into the relevant section  If you haven’t got the data, you can get it from here:  Press the green arrow to generate XML tree structure (on the right hand side)

Getting the Data out  What does /default:eSearchResult/default:IdList mean?  It describes how to navigate through the XML, from the root element ‘eSearchResult’ to get the IdList element.  ‘default’ represents the namespace for the elements, or a URI reference to where the data came from  Click on the ‘Show XML Tree’ button, and select ‘Show namespaces of XML elements’  This should show you the URI from where the data came from  When you have your Xpath query set up, click the apply button, close the popup window, and run the workflow  Try getting something else back from the XML by manually editing the generated Xpath query

XML advanced Using the native java Xpath service

Advanced XPath Service  Copy the XML from the results  Remove the Xpath Service template from the workflow  Locate the XPath service in the list of available services  Drag it onto the ‘Search PubMed’ workflow

Advanced XPath Service  Create an input for the service, called ‘xml_text’, and connect it to the port ‘xml-text’  Add another input port called ‘xpath_query’, and connect it to the ‘xpath’ port  Connect up the nodelist port to an output, called ‘element_text’  Run the workflow, using “Blood Clotting” as your search term  Enter an Xpath query that will retrieve – The TermSet counts for all terms in the TranslationStack – Re-write the Xpath to get the count only for the TermSet, whose term is: “Blood coagulation”[MeSH Terms] – Choose a data element of your own to get back from the XML