Programmatic access to EMBL-EBI resources

Slides:



Advertisements
Similar presentations
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Advertisements

EBI is an Outstation of the European Molecular Biology Laboratory. Web Services Course CBS, DK. EBI Web Services Teresa Miyar EMBL-EBI External Services.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Introduction to Web services MSc on Bioinformatics for Health Sciences May 2006 Arnaud Kerhornou Iván Párraga García INB.
Linux Operations and Administration
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
WIKI IN EDUCATION Giti Javidi. W HAT IS WIKI ? A Wiki can be thought of as a combination of a Web site and a Word document. At its simplest, it can be.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
SEMESTER PROJECT PRESENTATION CS 6030 – Bioinformatics Instructor Dr.Elise de Doncker Chandana Guduru Jason Eric Johnson.
EMBRACE Web Services Taavi Hupponen CSC – Center for Scientific Computing, Finland BOSC 2007.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
HTML Hyper Text Markup Language A simple introduction.
Adding GO for Large Datasets COST Functional Modeling Workshop April, Helsinki.
Web Pages with Features. Features on Web Pages Interactive Pages –Shows current date, get server’s IP, interactive quizzes Processing Forms –Serach a.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
CGI Common Gateway Interface. CGI is the scheme to interface other programs to the Web Server.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Moby Web Services Iván Párraga García MSc on Bioinformatics for Health Sciences May 2006.
Stian Soiland-Reyes myGrid, School of Computer Science University of Manchester, UK UKOLN DevSci: Workflow Tools Bath,
EMBOSS over a Grid 1. 1st EELA Grid School December 4th of 2006 Eduardo MURRIETA LEON Romualdo ZAYAS-LAGUNAS Pierre-Alain BRANGER Jérôme VERLEYEN Roberto.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Web Pages with Features. Features on Web Pages Interactive Pages –Shows current date, get server’s IP, interactive quizzes Processing Forms –Serach a.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
EMBL-EBI MSD Search and Visualization tools Jawahar Swaminathan.
CharMeck.org Contributer Training SharePoint 2013 Orientation and Basic Training.
Copyright OpenHelix. No use or reproduction without express written consent1.
Portals and my Grid Stefan Rennick Egglestone Mixed Reality Laboratory University of Nottingham.
HANDS-ON ConSurf! Web-Server: The ConSurf webserver.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
Exploring Taverna engine Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams
An Introduction to Running, Reusing and Sharing Workflows with Taverna – part 2 Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for.
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
July LJM Introduction to Bioinformatics Lisa Mullan, HGMP-RC.
Designing, Executing and Sharing Workflows with Taverna 2.4 Different Service Types Katy Wolstencroft Helen Hulme myGrid University of Manchester.
Take a REST from manual searching
Internet Made Easy! Make sure all your information is always up to date and instantly available to all your clients.
BI Share Quick reference guide.
Business Directory REST API
Take a REST from manual searching: PDBe, programmatically
EMBL-EBI, programmatically - take a REST from manual searching: Sequence analysis tools Web Production Team Anna Foix Joon Lee.
Section 13 - Integrating with Third Party Tools
Data Virtualization Community Edition
The EBI Search RESTful API
Job workflow Pre production operations:
Product Retrieval Statistics Canada / Statistique Canada Title page
IP Publishing From IP Data Base to IP list to IP catalog
Workshop on Microbiome and Health
Bomgar Remote support software
CMPE 152: Compiler Design ANTLR 4 and C++
Introducing Microsoft Office 2010
INSTALLING AND SETTING UP APACHE2 IN A LINUX ENVIRONMENT
CISC103 Web Development Basics: Web site:
Chapter 27 WWW and HTTP.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Information Technology Ms. Abeer Helwa
Shim (Helper) Services and Beanshell Services
Aleksandra Pawlik materials by Katy Wolstencroft
Reference Management Software Tools Mendeley (Part A)
Follow-up from last night: XSEDE credits
Carthage ios 8 onwards Dependency manager that streamlines the process of integrating the libraries into the project.
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
Welcome - webinar instructions
An Introduction to Designing and Executing Workflows with Taverna
Web Application Development Using PHP
Presentation transcript:

Programmatic access to EMBL-EBI resources Web Services Programmatic access to EMBL-EBI resources Andrew Cowley andrew.cowley@ebi.ac.uk www.ebi.ac.uk/webservices

Web interfaces are great for several uses Single jobs Hypothesis testing Publications Training But what if I want to examine every sequence in a proteome? Or my data needs lots of repetitive analysis?

Solution 1 – local install You could download programs/data to your own computer/server Lots of control Flexible Add your own programs/data But data is large Storage cost Maintenance/updating cost Just how friendly are you with your local IT?!

Solution 2 – Script against websites Literally automatic pasting data and clicking ‘run’??! NO! (well, please don’t) Instead, resources expose APIs (Application Programming Interface) via Web Services Use EMBL-EBIs compute resources, and up-to-date programs and data

Web Services Well-established protocols: REST and SOAP Built-in function help Large range of example clients available to download and run as is ‘Easy’ to plug into existing workflow tools Many third-party tools also exist that use our Web Services behind the scenes (In fact they’re so nice, we use them a lot either behind our own web interface or to chain together bioinformatics services)

EBI Search WS usage: Results enrichment Sequence similarity search tool results display referenced data, e.g. for BLAST/FASTA search:

Web Service examples Client front end and results, WP engine (UniProt, InterProScan)

Web Service examples External front end and results

EBI Search

EBI Search

How analysis tools are called Jobs per month

Web Services Synchronous & asynchronous

Web Services - synchronous

Web Services - synchronous

Web Services - asynchronous JobID “Is JobID done?” “Still cooking”

Web Services - asynchronous “Is JobID done?” “It’s ready!” Fetch JobID

Steps... Input parameters Meta-Information List parameters Get parameter details → Name, description, values... Submission Run (Email, title, values...) → Job Identifier Check status → RUNNING, FINISHED, ERROR... Results analysis List results available → Name, description, media type... Get result → Output, text, binaries (images)... Input parameters Job identifier (e.g. iprscan-S20110708-094729-0726-35857540-pg)

Web Services role play…

Key resources Documentation/example clients http://www.ebi.ac.uk/Tools/webservices Pre-compiled clients available for a large variety of languages SOAP WSDL: http://www.ebi.ac.uk/Tools/services/soap/{tool}?wsdl Most programming languages have SOAP client libraries Generate stubs or dynamically call methods REST http://www.ebi.ac.uk/Tools/services/rest/{tool}/{method}/{params} Basic HTTP requests Web browser, HTTP client libraries, CURL…

Tools at EMBL-EBI http://www.ebi.ac.uk/services Analyse your own data via ~100 tools: Search across our databases for homologous sequences using a suite of tools from classics such as BLAST and FASTA to recent advances such as PSI-Search Functionally annotate your sequence with protein domains and important sites using InterProScan Align your sequences to discover conserved regions with Clustal Omega, MUSCLE etc. Perform sequence translation, repeat analysis/masking, trans- membrane topology prediction etc. with EMBOSS and other tools Results enriched with data from EBI resources As well as providing data itself at EMBL-EBI, we also enable you to analyse your own data using our resources and data collections. The wide-ranging tools are developed internally, externally and by collaboration with leading authors. They are incorporated into our web framework to allow a consistent user experience, access to the latest data and to leverage cross-resource information, for example automatically annotating aligned regions of a BLAST protein search with domain and motif information. (and there are tools for things other than sequence! I’ve just concentrated on them for brevity) Sequence similarity searching: Central to genome annotation Characterising protein families Exploring distant evolutionary relationships http://www.ebi.ac.uk/services

Tools at EMBL-EBI http://www.ebi.ac.uk/services

Web Services exercise Linux command line help Open terminal cd =change directory/folder cd .. =go up directory mkdir =make directory cp <file1> <file2> =copy [tab] auto completes ls =list files in current directory ./[program] run program in my current directory cat =view contents of text file [shift PgUp/PgDn] scroll up/down a page [double click] select word [right click] paste www.ebi.ac.uk/~apc/Courses/ExploreBioSeq/ chmod a+x <file> =make file executable

Workflows

Workflows Galaxy Taverna

Web Services workflow $ncbiblast_lwp.pl --email email@example.org --program blastp --database uniprotkb_human --stype protein P01174.fasta $wsdbfetch_soaplite.pl fetchData @<jobid>.ids.txt fasta > P01174search2016_11_03.fasta $kalign_soaplite.pl --email email@example.org P01174search2016_11_03.fasta

Web Services workflow $ncbiblast_lwp.pl --email email@example.org --program blastp --database uniprotkb_human --stype protein P01174.fasta | wsdbfetch_soaplite.pl fetchData @- fasta | kalign_soaplite.pl --email email@example.org -

How to get help/contact us Most resources have Web Services or programmatic API links for more info Central collection of links/example clients: www.ebi.ac.uk/Tools/webservices Step-by-step instructions and example command lines: “Using EMBL-EBI Services via Web Interface and Programmatically via Web Services” Lopez R , Cowley A , Li W , McWilliam H Curr Protoc Bioinformatics. 2014 Dec 12;48:3.12.1-50. doi: 10.1002/0471250953.bi0312s48 Helpdesk: www.ebi.ac.uk/support/