A web portal for management of biological data and applications

Matteo Gnocchi, Alessandro Orro, Davide Di Pasquale, Luciano Milanesi
Institute for Biomedical Technologies – CNR, Segrate, Italy

Outline
- introduction
- motivation
- architecture
- ported tools
- genotype management system
- phenotype management system
- web portal
- XML-RPC interface
- Liferay web interface
- genetic analysis
- linkage analysis

Introduction: motivations
- management of the huge amount of data coming from a molecular biology laboratory
- availability of many tools for data analysis, which are
  - home-made
  - heterogeneous (formats and interfaces)
  - not integrated in any framework

Introduction
- implementation of a web-based application for resource integration
- technologies
  - ad hoc solution based on XML-RPC
  - Liferay
- main field
  - management of data coming from a genetics laboratory
  - medical genetics studies (association studies, linkage analysis)

Architecture

Ported tools
- genotype data management
- local database for gene annotation (UCSC, www.genome.ucsc.edu)
- phenotype data management
- statistical applications
- in development ...
- linkage analysis

Genotype data management
- management of data coming from high-throughput genotyping platforms (Illumina, Affymetrix)
- PostgreSQL + Python (scripts)

Genotype data management: data representation

Genotype data management: main features (BMC Bioinformatics, 26 March 2008)
- some LIMS features
  - users/projects
  - samples (by project)
  - metadata (chips)
- genotype manipulation
- data import/export
  - most important input formats (ped, map, ...)
  - standard data from Affymetrix and Illumina
- interface
  - command line only
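
The ped and map input formats mentioned above can be illustrated with a small parser. This is only a sketch, assuming the common whitespace-separated layout (map: chromosome, marker id, genetic distance, base-pair position; ped: six leading columns followed by one allele pair per marker). The class names, function names and sample data are hypothetical and are not taken from the tool described in the slides.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Marker:
    chromosome: str
    snp_id: str
    genetic_distance: float  # centimorgans, often 0 when unknown
    position: int            # base-pair position


@dataclass
class Individual:
    family_id: str
    individual_id: str
    father_id: str
    mother_id: str
    sex: str        # usual convention: 1 = male, 2 = female
    phenotype: str  # usual convention: 1 = unaffected, 2 = affected
    genotypes: List[Tuple[str, str]]  # one allele pair per marker


def parse_map(text: str) -> List[Marker]:
    """Parse map-style text: one marker per non-empty line."""
    markers = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        chrom, snp, dist, pos = fields[:4]
        markers.append(Marker(chrom, snp, float(dist), int(pos)))
    return markers


def parse_ped(text: str, n_markers: int) -> List[Individual]:
    """Parse ped-style text: six fixed columns, then 2 * n_markers alleles."""
    people = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        alleles = fields[6:]
        if len(alleles) != 2 * n_markers:
            raise ValueError("allele count does not match the number of markers")
        pairs = list(zip(alleles[0::2], alleles[1::2]))
        people.append(Individual(*fields[:6], pairs))
    return people


# Made-up example data, for illustration only.
MAP_TEXT = "1 rs0001 0 1000\n1 rs0002 0 2000\n"
PED_TEXT = "FAM1 IND1 0 0 1 2 A A G T\nFAM1 IND2 0 0 2 1 A C T T\n"

if __name__ == "__main__":
    markers = parse_map(MAP_TEXT)
    for person in parse_ped(PED_TEXT, len(markers)):
        print(person.individual_id, person.genotypes)
```

Checking the allele count against the map file catches the most common import error, a ped file that does not match its marker list.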

Genotype data management: Python API
- metadata
  - getAvailChips()
  - getChipInfo(chipname: string)
- dataset management
  - getDatasets(project: string)
  - newDataset(name: string, project: string)
- users/permissions/groups/projects
  - addUser, addGroup, addProject
  - delUser, ... etc.

Web portal: middleware
- porting of the API to an XML-RPC interface
- porting of the command-line application to XML-RPC
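
As a rough illustration of what exposing the Python API over XML-RPC could look like, the sketch below uses only the Python standard library. The function names getAvailChips and getChipInfo come from the API slide. Their return values, the in-memory chip table and the start_server helper are assumptions made for this example and do not reflect the actual implementation.

```python
import threading
from xmlrpc.client import ServerProxy
from xmlrpc.server import SimpleXMLRPCServer

# Hypothetical in-memory stand-in for the chip metadata kept in the database.
CHIPS = {
    "example_chip": {"vendor": "ExampleVendor", "markers": 3},
}


def getAvailChips():
    """Return the names of the known chips (assumed return type: list of strings)."""
    return sorted(CHIPS)


def getChipInfo(chipname):
    """Return the metadata of one chip (assumed return type: dictionary)."""
    if chipname not in CHIPS:
        raise ValueError("unknown chip: " + chipname)
    return CHIPS[chipname]


def start_server(host="127.0.0.1", port=0):
    """Start an XML-RPC server in a background thread; port 0 picks a free port."""
    server = SimpleXMLRPCServer((host, port), logRequests=False, allow_none=True)
    server.register_function(getAvailChips)
    server.register_function(getChipInfo)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


if __name__ == "__main__":
    server = start_server()
    host, port = server.server_address
    client = ServerProxy(f"http://{host}:{port}/")
    print(client.getAvailChips())
    print(client.getChipInfo("example_chip"))
    server.shutdown()
    server.server_close()
```

A client such as a portlet then needs only the server URL and the method names, which is what makes the same API reusable from the command line and from the portal.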

Web portal: portal
- implementation of a Liferay-based portal
- user management
  - mapping portal user <-> application user
- applications as portlets
  - a portlet is a resizable, independent window within the portal that implements a single application
  - each portlet uses one or more web services to accomplish its task

Web portal

Web portal

Phenotype management
- phenotype: any information about a sample (patient) related to their clinical/demographic/medical status
- heterogeneous data treatment
- temporal reference
- management software
  - metadata: defining new phenotypes
  - insert/delete/edit of phenotype values
  - import/export
- based on MySQL + Python
- command-line interface -> XML-RPC
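
One possible way to picture phenotype metadata together with a temporal reference is sketched below. It is an in-memory illustration only. The slides state that the real system is based on MySQL and Python but do not show its schema, so every class, field and method name here is hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Dict, List, Optional, Union

Value = Union[str, int, float]


@dataclass(frozen=True)
class PhenotypeDefinition:
    """Metadata describing a phenotype that users can define."""
    name: str
    kind: str       # for example "numeric" or "categorical"
    unit: str = ""


@dataclass
class Observation:
    """One phenotype value for one sample, with its temporal reference."""
    sample_id: str
    phenotype: str
    value: Value
    observed_on: date


@dataclass
class PhenotypeStore:
    definitions: Dict[str, PhenotypeDefinition] = field(default_factory=dict)
    observations: List[Observation] = field(default_factory=list)

    def define(self, definition: PhenotypeDefinition) -> None:
        self.definitions[definition.name] = definition

    def insert(self, sample_id: str, phenotype: str, value: Value,
               observed_on: date) -> None:
        # Values may only be stored for phenotypes that were defined first.
        if phenotype not in self.definitions:
            raise KeyError("undefined phenotype: " + phenotype)
        self.observations.append(
            Observation(sample_id, phenotype, value, observed_on))

    def latest(self, sample_id: str, phenotype: str,
               as_of: Optional[date] = None) -> Optional[Observation]:
        """Return the most recent observation, optionally as of a given date."""
        candidates = [
            o for o in self.observations
            if o.sample_id == sample_id and o.phenotype == phenotype
            and (as_of is None or o.observed_on <= as_of)
        ]
        return max(candidates, key=lambda o: o.observed_on, default=None)


if __name__ == "__main__":
    store = PhenotypeStore()
    store.define(PhenotypeDefinition("weight", "numeric", "kg"))
    store.insert("S1", "weight", 70.5, date(2007, 1, 10))
    store.insert("S1", "weight", 72.0, date(2008, 1, 10))
    print(store.latest("S1", "weight"))
```

Keeping the definition separate from the dated values is what allows new phenotypes to be added without changing the structure of the stored data.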

Phenotype management

Phenotype management

Data analysis
- genotype and phenotype data can be combined and analyzed with many tools
  - case/control (association)
  - linkage analysis
  - quality control
  - stratification analysis
- computation
  - not computationally intensive -> XML-RPC
  - linkage analysis -> gLite
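
To give an idea of why a case/control association test counts as a light computation, here is a minimal sketch of an allelic chi-square test on a 2x2 table of allele counts. The slides do not say which statistic the portal's tools compute, so this is a textbook example with hypothetical function names, not the authors' method.

```python
import math


def allelic_chi_square(case_a: int, case_b: int,
                       control_a: int, control_b: int) -> float:
    """Pearson chi-square (1 degree of freedom) for a 2x2 allele count table."""
    n = case_a + case_b + control_a + control_b
    row_case = case_a + case_b
    row_control = control_a + control_b
    col_a = case_a + control_a
    col_b = case_b + control_b
    denominator = row_case * row_control * col_a * col_b
    if denominator == 0:
        raise ValueError("every row and column of the table must be non-empty")
    return n * (case_a * control_b - case_b * control_a) ** 2 / denominator


def p_value_1df(chi_square: float) -> float:
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi_square / 2.0))


if __name__ == "__main__":
    # Made-up allele counts for one marker, for illustration only.
    statistic = allelic_chi_square(60, 40, 45, 55)
    print(statistic, p_value_1df(statistic))
```

The test needs only four counts per marker, so even a whole chip can be processed within a single remote call.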

Linkage analysis
Linkage analysis is a genetic analysis that permits the discovery of genetic correlations in complex diseases by following their transmission through family generations.
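
The basic quantity behind linkage analysis, the LOD score, can be sketched for the simplest textbook case: two loci and phase-known meioses, where recombinants can be counted directly. The slides do not describe the method or software actually used, and real pedigree analysis is far more involved, so the function below is only an illustration with hypothetical names.

```python
import math


def lod_score(recombinants: int, meioses: int, theta: float) -> float:
    """LOD score for a recombination fraction theta, phase-known meioses.

    Compares the likelihood of the data under linkage at theta with the
    likelihood under free recombination (theta = 0.5), on a log10 scale.
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must be in the interval (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")
    non_recombinants = meioses - recombinants
    log_linked = (recombinants * math.log10(theta)
                  + non_recombinants * math.log10(1.0 - theta))
    log_unlinked = meioses * math.log10(0.5)
    return log_linked - log_unlinked


if __name__ == "__main__":
    # Made-up counts: 1 recombinant out of 10 informative meioses.
    for theta in (0.05, 0.1, 0.2, 0.3, 0.4, 0.5):
        print(theta, round(lod_score(1, 10, theta), 3))
```

By convention a LOD score of 3 or more is taken as evidence for linkage. In full pedigrees the likelihood has to be evaluated over many markers and family structures, which is a far heavier computation than this two-locus case.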

Linkage analysis

Conclusions
- the application developed offers an easy way to interact with the various biological data and information
- case/control (association)
- Liferay and XML-RPC
Future work
- interaction with high-performance computing (HPC) infrastructures
- XML-RPC -> WSDL (compatibility with workflow (WF) systems)

thank you for your attention