© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Using the Purdue DB Technology to build.

Slides:



Advertisements
Similar presentations
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Advertisements

Win8 on Intel Programming Course Desktop : WPF Cédric Andreolli Intel Software
Intel Do-It-Yourself Challenge node.js
Win8 on Intel Programming Course Desktop : Sensors Cédric Andreolli Intel Software
Internet of Things with Intel Edison Web controller
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
The Keyboard Study Lecture /slide deck produced by Saul Greenberg, University of Calgary, Canada Notice: some material in this deck is used from other.
Bill Barnett, Bob Flynn & Anurag Shankar Pervasive Technology Institute and University Information Technology Services, Indiana University CASC. September.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
Automating Bespoke Attack Ruei-Jiun Chapter 13. Outline Uses of bespoke automation ◦ Enumerating identifiers ◦ Harvesting data ◦ Web application fuzzing.
1 Chapter 12 Working With Access 2000 on the Internet.
ESE Einführung in Software Engineering X. CHAPTER Prof. O. Nierstrasz Wintersemester 2005 / 2006.
Web applications. Javascript. Web 2.0: The dynamic, read-write web UC Santa Cruz CMPS 10 – Introduction to Computer Science
The OWASP Foundation Copyright © The OWASP Foundation Permission is granted to copy, distribute and/or modify this document under.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
Computer Science 101 Web Access to Databases Overview of Web Access to Databases.
1 Chapter 20 — Creating Web Projects Microsoft Visual Basic.NET, Introduction to Programming.
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
WEB DESIGNING Prof. Jesse A. Role Ph. D TM UEAB 2010.
 What I hate about you things people often do that hurt their Web site’s chances with search engines.
11 Games and Content Session 4.1. Session Overview  Show how games are made up of program code and content  Find out about the content management system.
Win8 on Intel Programming Course Modern UI : Features Cédric Andreolli Intel Software.
Intel Do-It-Yourself Challenge OpenCV
© Paradigm Publishing, Inc. 5-1 Chapter 5 Application Software Chapter 5 Application Software.
Win8 on Intel Programming Course The challenge Paul Guermonprez Intel Software
Chapter 33 CGI Technology for Dynamic Web Documents There are two alternative forms of retrieving web documents. Instead of retrieving static HTML documents,
Craig Stewart 23 July 2009 Cyberinfrastructure in research, education, and workforce development.
1 Web Basics Section 1.1 Compare the Internet and the Web Compare Web sites and Web pages Identify Web browser components Describe types of Web sites Section.
Using the Purdue DB Technology to build simple on-demand data exploration tools Michael Grobe Pervasive Technology Institute Indiana University Hubbub.
Introduction to SPSS Edward A. Greenberg, PhD
Goodbye from Indianapolis, IUPUI, and Craig A. Stewart Executive Director, Pervasive Technology Institute Associate Dean, Research Technologies Indiana.
Win8 on Intel Programming Course Modern UI HelloWorld in HTML5/JS Cédric Andreolli Intel.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
Internet of Things with Intel Edison Compiling and running Pierre Collet Intel Software.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
1 Data Bound Controls II Chapter Objectives You will be able to Use a Data Source control to get data from a SQL database and make it available.
September 6, 2013 A HUBzero Extension for Automated Tagging Jim Mullen Advanced Biomedical IT Core Indiana University.
© 2007 by Prentice Hall 1 Introduction to databases.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. The IQ-Table & Collection Viewer A.
The Animated Sequence Chapter 5.1 in Sketching User Experiences: The Workbook.
26 June 2008 DG REGIO Evaluation Network Meeting Ex-post Evaluation of Cohesion Policy Programmes co-financed by the European Fund for Regional.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
XRules An XML Business Rules Language Introduction Copyright © Waleed Abdulla All rights reserved. August 2004.
1 Welcome to CSC 301 Web Programming Charles Frank.
CSC 2720 Building Web Applications Server-side Scripting with PHP.
Using Prior Knowledge to Improve Scoring in High-Throughput Top-Down Proteomics Experiments Rich LeDuc Le-Shin Wu.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
November 18, 2015 Quarterly Meeting 30Aug2011 – 1Sep2011 Campus Bridging Presentation.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
Win8 on Intel Programming Course Paul Guermonprez Intel Software
1 WWW. 2 World Wide Web Major application protocol used on the Internet Simple interface Two concepts –Point –Click.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Update on EAGER: Best Practices and.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
HUBzero® Platform for Scientific Collaboration Copyright © 2012 HUBzero Foundation, LLC Collaboration and Contribution Emily Kayser Hub Liaison, HUBzero®
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Informatics Tools at the Indiana CTSI.
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
Research & Academic Computing Indiana University Statewide IT Conference 11 September 2003 Indianapolis IN.
1 New Perspectives on Access 2016 Module 8: Sharing, Integrating, and Analyzing Data.
IST 220 – Intro to Databases
Using E-Business Suite Attachments
Matt Link Associate Vice President (Acting) Director, Systems
Methodology Overview 2 basics in user studies Lecture /slide deck produced by Saul Greenberg, University of Calgary, Canada Notice: some material in this.
Introduction to electronic resources management
Chapter 23 – ASP.NET Outline 23.1 Introduction NET Overview
E-resource evaluation tips
Presentation transcript:

© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Using the Purdue DB Technology to build simple on-demand data exploration tools Michael Grobe Presented at Hubbub Annual Meeting, 24 September 2012

The general idea This presentation describes some of the work that came out of our collaboration with Ann-Christine Catlin’s group at Purdue. The primary goal involved making data from Connie Weaver’s Nutritional Science Lab at Purdue available on the Indiana CTSI Hub using the tools developed by Ann-Christine’s group, which we took to calling the “Purdue database technology toolset,” or the “HubZero database technology toolset.”

More intro At the time we were working on this project, the Purdue database technology was focused around several tools that are very useful for collecting and displaying data, and therefore for building integrated data base interfaces: - Dbparse : allows users to upload CSV files to backend database tables - DataView : displays contents of backend database tables - Photo_Gallery and the Collections manage r: maintain and display collections of images which may be displayed on-their- own or from within DataView views - com_form : a component for building web-based forms for data collection and display

More intro To use the DataViewer for data display, programmers simply prepare configuration files that identify database tables to be displayed, and define display appearance, including suitable plots. However, for non-programming users, it seemed likely that a small set of ad hoc data exploration tools might be useful for getting a quick look at the data. In particular, the following capabilities seem to constitute a basic package of capabilities: - frequency tables, - cross tabulations, - simple correlations/regression plots, or - simple correlation matrices.

The prototype and a question The com_form and DataViewer components were used to provide a PROTOTYPE version of a new HubZero component that performs these functions (com_datashape). com_form was used to build forms requesting and validating user input, and com_dataview was used to display frequency charts and regression plots as directed at run-time. (Something of a mutation of the DataViewer model.) So today’s presentation will: - illustrate how these components can be used together, and - help gauge user-interest in having such capabilities proceed past the prototype phase. (It will not however, provide a tutorial on actual coding techniques.)

A starting form to create queries on the fly. (

Request a frequency table/chart

The field name pulldown menu

The resulting frequency chart

The cumulative frequency chart

Request a chart of partitioned field values

Resulting chart of partitioned field values

Cross tabulation on 10 partitions of 2 fields

Cross tabulation set-up(second half)

Resulting cross tabulation

User-specified partitions

Resulting table using user-specified partitions

Request a simple correlation/regression

Resulting regression

A correlation matrix (resulting from specifying a table)

Request user-defined displays via URLs You can invoke these applications via statically coded URLs. These examples will only work if you are logged in to the target CTSI hub instance in the same browser. Cross tablulate Retention with "Age category" (Teen or Adult) (1996) (show only the HTML table to avoid formatting imposed by the DataViewer display) Cross tablulate Retention with "Age category" (Teen or Adult) (1996) (show only the HTML table to avoid formatting imposed by the DataViewer display) Cross tabulate Retention with "Age category" (Teen or Adult) (1996) (show BOTH the HTML and DataView tables) Cross tabulate Retention with "Age category" (Teen or Adult) (1996) (show BOTH the HTML and DataView tables) Cross tabulate 10 Alk Phos categories with 10 Retention categories (HTML table only) Cross tabulate 10 Alk Phos categories with 10 Retention categories

Request user-defined displays via URLs Cross tabulate unique Retention values with unique Retention values (both cases interpreted as integers) (HTML table only) Cross tabulate unique Retention values with unique Retention values (both cases interpreted as integers) Retention cross-tabulated with FecCa (Fecal Calcium) Age by Pre-specified Age Month categories (There are some missing Age values.) Age by Pre-specified Age Month categories Correlate and regress Retention as a function of Post Menarcheal age (1996) Correlate and regress Retention as a function of Post Menarcheal age (1996) Correlation matrix composed of all numeric fields in the table cca_weaver_db1_1_1996 showing only values greater than.5 Correlation matrix composed of all numeric fields in the table cca_weaver_db1_1_1996 showing only values greater than.5

Summary Prototypes for 4 tools for (very) simple data exploration were built over 2 existing Purdue DB technology tools. (Correlation matrix tool can now run independently of the DataViewer.) These provide on-demand frequency charts, cross- tabulations, and simple correlation/regression output. These tools can help users get a “feel” for the data, but are not suitable for serious statistical analysis. Can these tools be helpful in the Hub context?

Sources of additional information – Tutorial: Setting up a spreadsheet for use with the cceHub DataViewer Tutorial: Setting up a spreadsheet for use with the cceHub DataViewer – Tutorial: Using the Web form component to build scripts to modify databases Tutorial: Using the Web form component to build scripts to modify databases – Tutorial: Using the Web form and DataViewer components for to request user-specified frequency tables/charts, cross-tabs, and regressions Tutorial: Using the Web form and DataViewer components for to request user-specified frequency tables/charts, cross-tabs, and regressions – Putting a Joomla! component into SVN for the development cycle Putting a Joomla! component into SVN for the development cycle

License terms Please cite as: Grobe, D.M. Using the Purdue DB Technology to build simple on-demand data exploration tools. (Presentation) Hubbub 2012 (Indianapolis, IN, 21 September 2012). Items indicated with a © are under copyright and used here with permission. Such items may not be reused without permission from the holder of copyright except where license terms noted on a slide permit reuse. Except where otherwise noted, contents of this presentation are copyright 2011 by the Trustees of Indiana University. This document is released under the Creative Commons Attribution 3.0 Unported license ( This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.