FaceBase Hub Years 1 through 5

Slides:



Advertisements
Similar presentations
Introduction Lesson 1 Microsoft Office 2010 and the Internet
Advertisements

GraduateCareersScotland.com. GradauteCareersScotland.com Content migration & revision Agcasscotland.org.uk graduatecareersscotland.com Content reorganised.
MAE Training for User July 8, Agenda Wiki FishEye Crucible Stash.
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
ENCODE Data Coordination at UCSC Kate Rosenbloom ENCODE DCC Technical Project Manager UCSC Genome Bioinformatics Group September 2010 Genome Browser SAB.
Accelerate Business Success With CRM CRM Interoperability.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore
The HMP Data Analysis and Coordination Center (DACC) plays the role of collecting, integrating & standardizing different data types from diverse sources.
OVERVIEW May 26, Overview Automate the performance management and compensation processes Integrate employee data between
Customized cloud platform for computing on your terms !
Understanding the Web Site Development Process. Understanding the Web Site Development You need a good project plan Larger projects need a project manager.
SharePoint and SharePoint Online: Today and what's next? Presented by Luke Abeling – IT Platforms.
Session 1 SESSION 1 Working with Dreamweaver 8.0.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Metadata in the iPlant Collaborative Cyberinfrastructure Birds of a Feather meeting at PAG XXII, Jan. 14, 2014.
Delivering high quality, cost effective, green IT services in an agile manner that support NTU strategic plans NOW Upgrade from 9.4 to 10.2.
CceHUB An Environment for Collaborative Cancer Research Ann Christine Catlin CCE Annual Retreat May 26, 2010 clinical dataobservational & scientific data.
Evaluating & Maintaining a Site Domain 6. Conduct Technical Tests Dreamweaver provides many tools to assist in finalizing and testing your website for.
Copyright OpenHelix. No use or reproduction without express written consent1.
Accessing and visualizing genomics data
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 2.
Fab25 User Training Cerium Labs LabCollector - LIMS Lynette Ballast.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
Dissemination of ONS Data - Future Channels and Tools Callum Foster, Web Data Access Project ONS 1.
Web Analytics Fundamentals Presented by Tejaswi, Chandrika, Sunil.
MSU Cognos Future Data Services September Cognos Improvements  Architecture  64- bit vs 32- bit  More server power, faster servers  Ghost.
Chapter Objectives Explain how to test a website before it is published Describe how to publish a website to a web server Identify ways to promote a published.
Towards a unified MOD resource: An Overview
What Is Adxstudio Portals?
Open Access and Research Data Symplectic Pilot
Summon® 2.0 Discovery Reinvented
Essential tools for implementing and testing websites
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Hub Updates for Year 3 Carl Kesselman.
Reporting and Analysis With Microsoft Office
Opening slide.
NGS Analysis Using Galaxy
Overview – SOE PatchTT November 2015.
Breeding Information Management System
User Guide PrimePortal – File Archive
Overview – SOE PatchTT December 2013.
Power BI Security Best Practices
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Naviance for the Novice
Microsoft FrontPage 2003 Illustrated Complete
Sport Clips Google Analytics for Franchisee - June 2017
SRA Submission Pipeline
Chapter 12: Automated data collection methods
How to customize your Microsoft SharePoint Online website
Functional Annotation of the Horse Genome
Oracle Sales Cloud Sales campaign
Linked Open Data Project
User Guide PrimePortal – File Archive
Power Apps Canvas and Model-Driven
What's New in eCognition 9
denblogs.com/jendorman
Enterprise Program Management Office
What’s New in I-Hub for ADP Workforce Now
2/24/2019 6:15 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
TOPMed Analysis Workshop Genetic Analysis Center Biostatistics Department University of Washington TOPMed Data Coordinating Center August 7-9, 2017 Introduction.
FaceBase Hub Updates of Year 4
What's New in eCognition 9
CFR Enhancement Session
eQTL Tools a collaboration in progress
What's New in eCognition 9
ArcGIS Online Steps for Success A best practices approach
CMP Creating Your Personal and Small Business Web Sites
Contract Management Software 100% Cloud-Based ContraxAware provides you with a deep set of easy to use contract management features.
Presentation transcript:

FaceBase Hub Years 1 through 5 Carl Kesselman

FaceBase Hub Goals Create an integrated, linked data resource, not just a repository of individual data sets Links to internal and external sources Promote self-curation to enable rapid turn around of data submission Promote data pipelines to support both raw data and derived data such as bioinformatics pipelines Promote FAIR principles, including focus on citable data Adapt rapidly to emerging data types, such as single cell gene expression Enhanced the end-user experience of data through online visualization

Years 1: Migration and improved data standards Transition from U Pitt to ISI Gathering of project requirements via short-term teams Initial new data model Updated request process and handling for human data Communications New wiki and mailing lists Monthly Steering Committee calls New FaceBase website

FaceBase 2 website

Years 2: Improving data standards Improved classification of data - ie, more accurate experiment types, adding phenotypes, support for transgenic enhancer data Clean up of existing data: consistent anatomical terms from OCDM, genotypes, Mouse Matrix page - rich visualization of all mouse control data Secure and flexible user and group management, support for fine-grained authorization User testing and usability enhancements

Mouse Matrix Link should be to main homepage

Year 3: Increase sophistication of repository Cross-cutting integrations and visualizations 3D Surface Model viewers - multi-mesh surface models and “landmark” annotations Higher resolution data model leads to more intensive inter-linkages: Dynamically generated navigation hyperlinks between linked data elements of the database Link from vocabulary terms (anatomy, phenotype, age stages, etc.) to annotated entities (datasets, samples, assays) Phenotype summaries (with integration Monarch Initiative) Gene Summaries (integration from Chai resource) Genome Browser - integrated custom browser within datasets Self-curation data submission tools

Year 4: Optimizing for collaboration and sharing Establishment of Bioinformatics Pipeline based on ENCODE More improvements on data model to represent diverse research data using FAIR principles Improved search and filtering interface Image Navigation via surface model viewer Improved integration with TrackHub and the internal JBrowse plugin for viewing genomic data internally and being able to compare with other datasets Data Submissions: Continued to streamline browser-based data submissions Added desktop & command-line data upload tools

Bioinformatics Pipeline Rationale - ensure that sequencing data between spokes can be compared. Solution - establish a common sequencing pipeline, (based on ENCODE) and operate on a cloud- based genome informatics service (DNAnexus). Process - Visel’s lab in Berkeley administers the routing of sequencing data from FaceBase to DNAnexus and back.

Highlights of Year 5: Bioinformatics Pipeline: coordinate curation of data and operation of pipeline, full automation. Vocabulary enhancements: finish integration with Uberon, improve semantic search Data curation: total data review, coordination with spokes, new curation tracking tools Image visualization and display: 3D mesh, imaging results across datasets, control vs mutant Usability enhancements: Bulk download capability Genome Browser/JBrowse integration and enhancements: ie, cross-dataset browsing of data

Highlights of Year 5 (cont.): FAIR Identifiers and Resolver Historical information tracking (versioning/provenance) Final push receiving and curating data from the spokes Migrating the HGAI website.

3D Mesh Viewer Building on the surface model viewer Connecting anatomical regions to the database. Clicking an image of an anatomical region pulls up the list of all datasets with data related to that region. Available on ALL FaceBase dataset pages https://www.facebase.org/chaise/record/#1/isa:dataset/RID=3V4A

Usage Statistics (past year) Database Statistics 832 datasets and growing 141 publications As of April 2019: over 4,300 individual data files - over 6 terabytes of data 18 different assay/experiment types Website Statistics Pageviews: 52,867 Sessions*: 19,560 Avg Session Duration: 3:40 Users**: 13,832 * Sessions: Total number of sessions within the data range. A Session is the period of time a users is actively engaged with the website. ** Users - as defined by Google = Unique Visitors = The number of unduplicated (counted only once) visitors to your website over the course of a specified time period. (Depending on cookies, so it’s not a foolproof number ie, user deletes cookies, visits from a different device.)

Data Download Statistics User activity within the Data Browser for the past year: 523 data file downloads 5,452 thumbnails* Usage of our Track Hub for the UCSC Genome Browser: 183,254 track downloads** * Filtering out for generic placeholder thumbnails ** The Genome Browser reads byte ranges of the part of the file the user is actually looking at

Possible Future Directions Continued alignment with FAIR guidelines and NIH COMMONS Enhancements planned for improving usability of self-curation, including curation task worklists and dashboards Codified curation quality metrics Next generation anatomical/visual search Advanced display of imaging data Enhanced genome browser configuration and integration Further integration and alignment with vocabularies Advanced semantic search capabilities Annotation tools for facilitating analysis of anatomy and phenotypes in datasets

Demos https://facebase.org/id/3V4A https://facebase.org/id/TMJ https://facebase.org/id/VXA Image navigation demo, revised JBrowse interface,

Let Us Know What You Think! Let us know your questions, comments, feedback at: help@facebase.org