External Data Access Adam Rauch, 6/05/08 Team: Geoff Snyder, Kevin Beverly, Cory Nathe, Matthew Bellew, Mark Igra, George Snelling.


Primary Goal
Allow a variety of commonly used tools & languages to easily load, query, process, and analyze live data stored in the Atlas database

Available Methods & Their Problems
Manual export to TSV, XLS, etc.
  – Example: Assay QC for Denny Lab
      Tedious, loses important column & run metadata
  – Example: Ad hoc SRA analysis with VISC assays
      Tedious, error prone, loses important metadata, introduces security risks

Available Methods & Their Problems
Manual export to TSV, XLS, etc.
Client API
  – Example: Record assay design history via Perl + API
      JSON format is inconvenient to work with
      Query API is incomplete
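
To make the "JSON format is inconvenient" point concrete, here is a minimal Python sketch of the flattening step a script needs before analysis tools can use a JSON query result. The response shape (a `columns` list with `name` and `type`, and `rows` as objects) and the sample values are assumptions made for illustration; the slides do not describe the actual client API format.

```python
import csv
import io
import json

# Hypothetical response; the real client API format may differ.
SAMPLE_RESPONSE = json.dumps({
    "columns": [
        {"name": "RunId", "type": "int"},
        {"name": "Analyte", "type": "string"},
        {"name": "Value", "type": "float"},
    ],
    "rows": [
        {"RunId": 1, "Analyte": "IL-2", "Value": 0.42},
        {"RunId": 2, "Analyte": "IFN-g", "Value": None},
    ],
})


def json_to_tsv(response_text):
    """Return (tsv_text, column_types) for a response shaped like SAMPLE_RESPONSE."""
    payload = json.loads(response_text)
    names = [col["name"] for col in payload["columns"]]
    # Keep the column types so the metadata is not lost in the flat file.
    types = {col["name"]: col["type"] for col in payload["columns"]}

    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(names)
    for row in payload["rows"]:
        # Missing or null values become empty cells.
        writer.writerow(["" if row.get(n) is None else row[n] for n in names])
    return buffer.getvalue(), types


if __name__ == "__main__":
    tsv, types = json_to_tsv(SAMPLE_RESPONSE)
    print(tsv)
    print(types)
```

Every script that consumes the API has to repeat a conversion like this, which is the kind of work a per-language package could do once.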

Available Methods & Their Problems
Manual export to TSV, XLS, etc.
Client API
Direct access to the database from SQL tools
  – No current usages; may need for cross-folder queries, performance, limitations of web UI
      Potential security issue: entire database is readable
      OntologyManager (OM), aka “The Blender”, schema is too difficult to query
      Other objects (e.g., folders) are difficult to work with

Available Methods & Their Problems
Manual export to TSV, XLS, etc.
Client API
Direct access to the database from SQL tools
Dataset Snapshot
  – Not used currently
      Potential issues: currently datasets only, requires manual step, requires connecting to the database to retrieve data (need to be careful with security), not live

Problem Summary
No way to load, process & analyze live Atlas data via key analysis tools & languages (Perl, Java, R, SAS, PHP)
Direct PostgreSQL queries against Ontology Manager (OM), aka “The Blender”, are too difficult
Accessing data by direct database connection is a potential security issue; need to minimize this
Query client API is incomplete
OM is too slow for large datasets

Two Big Categories of Tasks
“Programmer / Analyst” Tasks
  – “I want to manipulate all the data I can see on the Atlas web site with my tool or language”
  – All current tasks fall into this category
“Administrative” Tasks
  – Specialized tasks that require broad access to Atlas database via SQL
  – Performance, query flexibility, web site limitations

Programmer/Analyst Tasks: Requirements
Require live, read-only access to all data user can view on Atlas from the tools she uses regularly
Need ad hoc queries: easy to develop new queries and find the data of interest
Need automated processes: e.g., nightly analysis of current data using a fixed query
Need security: follow user’s Atlas permissions
Key tools: Perl, Java, R, SAS, PHP

Administrative Tasks: Requirements
A few tasks may require direct, read-only access to all data in the Atlas database using db admin tools, scripts, ODBC browsers, etc.
Do not need to follow folder permissions since user can read all LabKey schemas
Filtering by folder, finding lists & objects, etc. should be reasonably easy
Need queries with a small number of joins
  – Tabular OM data (lists, assays, samples) must be easier to query
  – Joins are okay… just reduce the complexity
Key tools: pgAdmin III, DbVisualizer, EMS SQL Manager, Perl

Programmer/Analyst Tasks: Proposed Solution
Provide their tools access to data via the Atlas “front door” as a wrapper on top of the query client API
  – Recommended: Custom package for each tool: Perl/Java/R/SAS/PHP
  – Possible Alternative: ODBC wrapper
      Directly usable from more tools (SAS EXEC SQL, RODBC)
      Clumsy, hard to develop, doesn’t help with rest of client API (insert/update? charts? assays & other specialized data retrieval?)
Encourage use of API-based solution to limit users who require direct database access
Challenges
  – Development effort to create wrappers for each language
  – SAS interface may be difficult due to proprietary nature of SAS
  – Add full SQL query support to API
  – Fix issues with current SQL syntax
  – Improve ease-of-use (build query in UI then “export to SAS/R script”)
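
As a rough picture of what a per-language package could look like, the following Python sketch wraps a query endpoint and returns rows as plain dictionaries. Everything named here is hypothetical: the `QueryClient` class, the URL layout, and the parameter names are illustrative assumptions, not the actual Atlas or LabKey client API. The transport is passed in as a callable, on the assumption that authentication happens there and the server applies the user's Atlas permissions.

```python
from urllib.parse import urlencode


class QueryClient:
    """Hypothetical thin wrapper; names and URL layout are illustrative only."""

    def __init__(self, base_url, fetch):
        # `fetch` is any callable that takes a URL and returns parsed JSON (a dict).
        # Injecting it keeps authentication and transport out of this sketch.
        self.base_url = base_url.rstrip("/")
        self.fetch = fetch

    def build_url(self, folder, schema, query, filters=None):
        """Build the request URL for one query, with optional equality filters."""
        params = {"schemaName": schema, "queryName": query}
        for column, value in sorted((filters or {}).items()):
            params["filter." + column] = value
        return "{}/{}/query?{}".format(
            self.base_url, folder.strip("/"), urlencode(params)
        )

    def select_rows(self, folder, schema, query, filters=None):
        """Return rows as a list of dicts, a shape most analysis code can use."""
        payload = self.fetch(self.build_url(folder, schema, query, filters))
        return list(payload.get("rows", []))


if __name__ == "__main__":
    def fake_fetch(url):
        # Stand-in for an authenticated HTTP GET; returns canned data.
        print("GET", url)
        return {"rows": [{"ParticipantId": "P1", "Visit": 1}]}

    client = QueryClient("https://atlas.example.org", fake_fetch)
    print(client.select_rows("Study/Demo", "lists", "Participants", {"Visit": 1}))
```

A nightly automated analysis would call `select_rows` with a fixed query, while ad hoc work would vary the schema, query, and filters interactively.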

Administrative Tasks and Performance Issues: Proposed Solution
Provide option to migrate tabular OM data from virtual to hard tables
  – User option for some types of data (e.g., lists)
  – Wholesale migration for others (e.g., datasets, flow)
  – Continue to use OM for data stored as trees or graphs (e.g., experiment)
Challenges
  – Development and test effort
  – Naming hard tables in a reasonably discoverable way
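
The difference between "virtual" and "hard" tables can be sketched with a small Python example using the standard `sqlite3` module. It assumes the virtual form is a generic one-row-per-property-value store; the slides do not describe OM's actual schema, so the table and column names (`object_property`, `samples`, and so on) are hypothetical.

```python
import sqlite3


def build_demo():
    """Create an in-memory database holding a hypothetical property-value store."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE object_property (   -- "virtual" storage: one row per value
            object_id INTEGER, property TEXT, value TEXT
        );
        INSERT INTO object_property VALUES
            (1, 'SampleName', 'S-001'), (1, 'Volume', '1.5'),
            (2, 'SampleName', 'S-002'), (2, 'Volume', '2.0');
    """)
    return conn


# Querying the virtual form needs one self-join per column of interest.
VIRTUAL_QUERY = """
    SELECT n.object_id, n.value AS sample_name, CAST(v.value AS REAL) AS volume
    FROM object_property n
    JOIN object_property v ON v.object_id = n.object_id
    WHERE n.property = 'SampleName' AND v.property = 'Volume'
    ORDER BY n.object_id
"""


def migrate_to_hard_table(conn):
    """Materialize the pivoted rows into an ordinary table named `samples`."""
    conn.execute("CREATE TABLE samples AS " + VIRTUAL_QUERY)


if __name__ == "__main__":
    conn = build_demo()
    migrate_to_hard_table(conn)
    # After migration the same data is reachable without any joins.
    print(conn.execute("SELECT sample_name, volume FROM samples").fetchall())
```

In this sketch the hard table is a one-time copy. A real migration would also need to keep it current and choose a discoverable name for it, which is the naming challenge listed above.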