Development of a web application for browsing BBMRI -omics data

Slides:



Advertisements
Similar presentations
Introductory to database handling Endre Sebestyén.
Advertisements

The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
BigBed/bigWig remote file access Hiram Clawson UCSC Center for Biomolecular Science & Engineering.
1 Chapter 12 Working With Access 2000 on the Internet.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
NCBI resources III: GEO and expression data analysis Yanbin Yin Fall
User Group 2015 Version 5 Features & Infrastructure Enhancements.
Before you begin If a yellow security bar appears at the top of the screen in PowerPoint, click Enable Editing. You need PowerPoint 2010 to view this presentation.
Web Content Management Systems. Lecture Contents Web Content Management Systems Non-technical users manage content Workflow management system Different.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Server-side Scripting Powering the webs favourite services.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Part 1. Persistent Data Web applications remember your setting by means of a database linked to the site.
Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
Web Pages with Features. Features on Web Pages Interactive Pages –Shows current date, get server’s IP, interactive quizzes Processing Forms –Serach a.
Mainframe (Host) - Communications - User Interface - Business Logic - DBMS - Operating System - Storage (DB Files) Terminal (Display/Keyboard) Terminal.
Introduction to the Gramene Genetic Diversity module 5/2010 Build #31.
Copyright OpenHelix. No use or reproduction without express written consent1.
MUGEN MICE DATABASE (MMdb) (
GVS: Genome Variation Server Materials prepared by: Warren C. Lathe, PhD Updated: Q Version 2.
Framework for Virtual Web Laboratory I. Petković M. Rajković.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Progress on TripalBIMS Breeding Information Management System in Tripal Sook Jung, Taein Lee, Chun-Huai Chen, Jing Yu, Ksenija Gasic, Todd Campbell, Kate.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is a federally funded research.
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Physics validation database
Stanford University, Stanford, CA, USA
Business Directory REST API
Hub Updates for Year 3 Carl Kesselman.
A web portal for management of biological data and applications
The Client-Server Model
Online BIOS QTL atlases
Development of an interactive pipeline for Genome wide association analysis Falola Damilare & Adigun Taiwo – Covenant University Bioinformatics research.
Statistical Information Systems Introducing SIS tool .Stat
Web Development Web Servers.
Node.js Express Web Applications
Functional Mapping and Annotation of GWAS: FUMA
CUAHSI HIS Sharing hydrologic data
Web Software Model CS 4640 Programming Languages for Web Applications
The EBI Search RESTful API
CottonGen: An Up-to-Date Resource Enabling Genetics, Genomics and Breeding Research for Crop Improvement Plant and Animal Genome Conference XXV Jing Yu1,
PHP / MySQL Introduction
FAIR Sample and Data Access
Iteration 1 Presentation
Web Browser server client 3-Tier Architecture Apache web server PHP
GDSS – Digital Signature
Deep Phenotyping for Deep Learning (DPDL): Progress Report
Lecture 1: Multi-tier Architecture Overview
Shared Genomics Sharing paths of exploration to support collaborative reasoning in genomic data analysis David Hoyle, Mark.
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Bioinformatics Research Group SRI International
for the Cotton Community
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Genome Database for Rosaceae:
Network Controllable MP3 Player
AIMS Equipment & Automation monitoring solution
One SNP at a Time: Moving beyond GWAS in Psoriasis
How to Effectively Search and Download Data in CottonGen
CottonGen: Enabling Cotton Research through Big-Data Analysis and Integration Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng,
BIOPAMA Data Management
Presentation transcript:

Development of a web application for browsing BBMRI -omics data Jan Bert van Klinken

Introduction In BBMRI-NL a unique collection of omics datasets have been generated, reflecting different layers of biological processes metabolome (NMR) transcriptome (RNA-seq) epigenome (CpG methylation) genome (SNP genotyping) Online BBMRI -omics atlas 31-03-2017

Introduction Within BBMRI-NL, the BIOS consortium aims at integrating these layers of data, in order to uncover molecular pathways involved in health and disease. Genome-wide association analyses between omics layers: genome -> methylome (meQTLs) genome -> transcriptome (eQTLs: gene, exon, exon ratio, polyA ratio) genome -> metabolome (metabolite QTLs) methylome -> transcriptome (eQTMs) methylome -> metabolome transcriptome -> metabolome Online BBMRI -omics atlas 31-03-2017

BBMRI -omics atlas An important task of BBMRI-NL 2.0 is to make the association results available to the public. To this end, a web browser has been developed by the BIOS team: http://genenetwork.nl/biosqtlbrowser Data can be downloaded all at once or the database can be queried for specific SNPs, CpG sites, genes. Currently, query results are presented in tabular format separately for each association study. Online BBMRI -omics atlas 31-03-2017

BBMRI -omics atlas (current online version)

BBMRI -omics atlas (current online version) bulk download query database query results

BBMRI -omics atlas Several aspects of the current browser have to be improved. Goal Make the BBMRI atlas accessible for and interpretable by a wide range of users* through a user-friendly** and sustainable*** web application. * users with different expertises and backgrounds ** present results in a visually more appealing way *** easy to maintain and to add new association results or data layers Online BBMRI -omics atlas 31-03-2017

BBMRI -omics atlas Features Make interface more user-friendly --> give browser more modern design --> interactive association tables --> visualise association results as a layered network Enable complex/composite queries --> submit a list of genetic variants, CpGs or genes --> submit e.g. variant-gene or variant-disease combinations Download query results Develop API to facilitate data access --> what type of interface with server? Online BBMRI -omics atlas 31-03-2017

BBMRI -omics atlas Features Calculate Linkage Disequilibrium between query SNP and top me/eQTLs from conditional analyses --> QTL tables only contain SNPs with strongest associations --> report also unconditional association of query SNP? Functional annotation of genetic variants and methylation sites --> import data from ENCODE on histone marks, DNAse activity, TF binding --> how to maintain? Gather gene-specific knowledge from external databases --> e.g. UniProt, ExPASy, TCDB, OMIM, Gene Ontology, etc. --> how to maintain? --> copyrighted database content; restricted content (KEGG, OMIM) Online BBMRI -omics atlas 31-03-2017

BBMRI -omics atlas Features FAIR-ification of BBMRI data (Findable Accessible Interoperable Reusable) --> store data in specific, machine readable format (RDF) --> assign metadata (data identifiers, provenance, usage/license) --> index or submit data to searchable resource include results of other BBMRI -omics (association) studies --> add metabolomics/phenotype layer --> splice site data, allele specific expression --> QTLs based on Haplotype Reference Consortium imputation Online BBMRI -omics atlas 31-03-2017

Implementation Overall workflow data standardisation import in PostgreSQL BBMRI -omics association study results SNP.txt LD.txt CpG.txt Gene.txt association_SNP_CpG.txt association_SNP_Gene.txt association_CpG_Gene.txt external SNP/ CpG/gene knowledge bases communication web-postgres server server.js Node+Express communication client-server via HTTP POST index.html main.js Online BBMRI -omics atlas 31-03-2017

Implementation Data standardisation import results from BBMRI association studies (standardise header) import SNP/CpG/gene specific knowledge from external databases import TSV files into PostgreSQL (CREATE TABLE and COPY) create index for fields that are queried SNP Id Chr ChrPos association_SNP_CpG SNP_Chr pvalue SNP_ChrPos Zscore CpG_Id type CpG Id Chr ChrPos association_SNP_Gene SNP_Chr pvalue SNP_ChrPos Zscore Gene_Id type association_CpG_Gene CpG_Id pvalue Gene_Id Zscore type LD Chr ChrPos1 ChrPos2 Gene Id Chr Name ChrPos ExtLinks ChrOrt Online BBMRI -omics atlas 31-03-2017

Implementation Interaction server-PostgreSQL communication via node-postgres module JSON query from browser/API is translated by the server into a series of consecutive SQL SELECT commands enable graph-like searches Example: SELECT Chr,ChrPos FROM SNP WHERE IdStr IN ('rs11781667'); SELECT * FROM LD INNER JOIN association_SNP_CpG ON (LD.Chr=association_SNP_CpG. SNP_Chr AND LD.ChrPos2=association_SNP_CpG.SNP_ChrPos) WHERE (LD.Chr,LD.ChrPos1) IN (('8 ',145040856)); SELECT * FROM LD INNER JOIN association_SNP_Gene ON (LD.Chr=association_SNP_Gene. SNP_Chr AND LD.ChrPos2=association_SNP_Gene.SNP_ChrPos) WHERE (LD.Chr,LD.ChrPos1) IN (('8 ',145040856)); SELECT * FROM association_CpG_Gene WHERE CpG_IdStr IN ('cg01640991','cg01640991',... SELECT * FROM SNP WHERE (Chr,ChrPos) IN (('8 ',145040856),('8 ',145037573),... SELECT * FROM CpG WHERE (IdStr) IN ('cg01640991','cg01640991',... SELECT * FROM Gene WHERE (IdNum) IN (178209,185189,255343,185189,... Online BBMRI -omics atlas 31-03-2017

Implementation Client-side implemented in JavaScript, Bootstrap for layout single page web application with Angular --> fast response with menu/tab switching, table sorting, popup window appearance, downloading results Client-server communication both the browser and the API communicate to the server via a HTTP POST, in which a JSON containing the query is sent, and the resulting association and SNP/CpG/gene tables are returned as a JSON Online BBMRI -omics atlas 31-03-2017

Results

Results Graph search: eQTMs affected by meQTLs are also returned Data access via API: send/ receive JSON via HTTP POST Graph search: eQTMs affected by meQTLs are also returned LD calculation: me/eQTLs in LD with query SNPs are returned Download query results (JSON/TSV) Query list of SNPs/CpGs/ genes Interactive table: SNPs/CpGs/genes can be clicked for more info

Current/future work Done: give browser modern design make it possible to query a list of SNPs/CpGs/genes calculate Linkage Disequilibrium (LD) between query SNP and top me/eQTLs from conditional analyses make it possible to download query results implement API to facilitate data access To do: put browser online enrich atlas with SNP/CpG/gene specific knowledge from external databases (ENCODE, pathway DBs, OMIM, etc) FAIR-ification of QTL results (Findable Accessible Interoperable Reusable) include results of other BBMRI -omics (association) studies Online BBMRI -omics atlas 31-03-2017

Current/future work Make browser more user-friendly and flexible: better represent omics data layers and associations between layers variants CpG sites genes phenotypes Online BBMRI -omics atlas 31-03-2017

Current/future work Alpha version of graph based interface of browser (query: OPLAH) Online BBMRI -omics atlas 31-03-2017

Conclusion Integration of GWAS loci with methylation/expression QTL data and knowledge bases is important for their interpretation. Task of BBMRI-NL 2.0 to develop a user-friendly and sustainable web application to browse results from -omics association studies. Current online version: http://genenetwork.nl/biosqtlbrowser . New browser has been developed, will soon be put online. Which additional features will/should be added is open for discussion. Online BBMRI -omics atlas 31-03-2017

Acknowledgements Peter-Bram 't Hoen Bas Heijmans Rick Jansen Leon Mei Maarten van Iterson Morris Swertz Lude Franke Joyce van Meurs Online BBMRI -omics atlas 31-03-2017