Lecture 01: Introduction September 5, 2012 COMP 250-2 Visual Analytics and Provenance.

Slides:



Advertisements
Similar presentations
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
Advertisements

BTEC 149. Windows Desktop Click on the Start Button.
WHAT D IS RAW, UNPROCESSED FACTS AND FIGURES COLLECTED, STORED AND PROCESSED BY COMPUTERS.
Parts of a Computer.
Master Informatique 1 Semantic Technologies Part 0Course Organization Semantic Technologies Werner Nutt.
LECTURE 10: ANALYTIC PROVENANCE April 6, 2015 COMP Topics in Visual Analytics Note: slide deck adapted from R. Chang.
1 CS 501 Spring 2002 CS 501: Software Engineering Lecture 11 Designing for Usability I.
Copyright © 2015 Pearson Education, Inc. Systems Documentation Techniques Chapter
Tailoring Needs Chapter 3. Contents This presentation covers the following: – Design considerations for tailored data-entry screens – Design considerations.
The Experience Factory May 2004 Leonardo Vaccaro.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
VisTrails: Overview Juliana Freire University of Utah Joint work with: Erik Andersen, Steven P. Callahan, David Koop, Emanuele.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
Systems Analysis I Data Flow Diagrams
Interactive Whiteboards for Effective Primary Science Continuing Professional Development Unit Interactive Whiteboards for Effective Primary Science Start.
Course: Introduction to Computers
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
Trends (and Some Issues) in Electronic Records Management William Saffady, Professor College of Information and Computer Science Long Island University.
Software Development Unit 2 Databases What is a database? A collection of data organised in a manner that allows access, retrieval and use of that data.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
ACCB 133 Information Technology and Accounting Applications Lecture 6: Application Software.
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
January, 23, 2006 Ilkay Altintas
Idea-garden.org SOCIAL SEMANTIC INFORMATION SPACE An Interactive Learning Environment Fostering Creativity Grant agreement no: nd CIDOC CRM-SIG.
Topics Covered: Data preparation Data preparation Data capturing Data capturing Data verification and validation Data verification and validation Data.
Managing Business Data Lecture 8. Summary of Previous Lecture File Systems  Purpose and Limitations Database systems  Definition, advantages over file.
Systems Development Life Cycle Dirt Sport Custom.
Databases C HAPTER Chapter 10: Databases2 Databases and Structured Fields  A database is a collection of information –Typically stored as computer.
Geovisualization for Constructing and Sharing Concepts Alan M. MacEachren, Mark Gahegan, & Bill Pike GeoVISTA Center Geography, Penn State
Semantic Publishing Update Second TUC meeting Munich 22/23 April 2013 Barry Bishop, Ontotext.
2/6/01D-1 © 2001 T. Horton CS 494 Object-Oriented Analysis & Design Using PARTS to Illustrate Requirements Concepts.
Usage of `provenance’: A Tower of Babel Luc Moreau.
Sharing lessons through effective modelling Hilary Dexter University of Manchester Tom Franklin Franklin Consulting.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Multimedia ITGS. Multimedia Multimedia: Documents that contain information in more than one form: Text Sound Images Video Hypertext: A document or set.
1 CS430: Information Discovery Lecture 18 Usability 3.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Search Engine Architecture
1 CS 430 Database Theory Winter 2005 Lecture 2: General Concepts.
A Short Tutorial to Semantic Media Wiki (SMW) [[date:: July 21, 2009 ]] At [[part of:: Web Science Summer Research Week ]] By [[has speaker:: Jie Bao ]]
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Grade Book Database Presentation Jeanne Winstead CINS 137.
ReproZip Packing Experiments for Sharing and Publication Fernando Chirigati, Juliana Freire | NYU-Poly Dennis Shasha | NYU.
Fall UI Design and Implementation1 Lecture 15: Undo.
LECTURE 12: ANALYTIC PROVENANCE November 16, 2015 SDS235: Visual Analytics Note: slide deck adapted from R. Chang.
VisTrails Second Provenance Challenge Tommy Ellkvist David Koop Juliana Freire Joint work with: Erik Andersen, Steven P. Callahan, Emanuele Santos, Carlos.
Certificate in Accounting NOS 116, 118, 213, 222 Lecture 1: Introduction.
Use Case Diagrams. Introduction In the previous Lecture, you saw a brief review of the nine UML diagrams. Now that you have the clear, you'll start to.
T EST T OOLS U NIT VI This unit contains the overview of the test tools. Also prerequisites for applying these tools, tools selection and implementation.
IntroGoalCrowdPredictionWrap-up 1/26 Learning Debugging and Hacking the User Remco Chang Assistant Professor Tufts University.
#1 Make sense of problems and persevere in solving them How would you describe the problem in your own words? How would you describe what you are trying.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Blue Ridge Unified School District.  How will this help me? My Family Link  Where do I get started? Personal Profile Page  How do I manage my webpage?
TechKnowlogy Conference August 2, 2011 Using GoogleDocs for Collaboration.
Systems Documentation Techniques
Chapter 2: Database System Concepts and Architecture - Outline
Sharing lessons through effective modelling
Lecture 15: Analytic Provenance
Semantic Web Project Status
Search Engine Architecture
Databases.
CSc4730/6730 Scientific Visualization
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Search Engine Architecture
Automation of Control System Configuration TAC 18
Education & AI High level discussion points relating ‘techniques / tools’ to possible ‘projects’
Presentation transcript:

Lecture 01: Introduction September 5, 2012 COMP Visual Analytics and Provenance

Motivation: What is in a User’s Interactions? Types of Human-Visualization Interactions Text editing (input heavy, little output) Browsing, watching a movie (output heavy, little input) Visual Analysis (closer to 50-50) VisualizationHuman Output Input Keyboard, Mouse, etc Images (monitor)

Provenance: Definitions Provenance (according to Webster): 1.origin, source 2.the history of ownership of a valued object or work of art or literature Example: Has anyone traced the provenances of these paintings?

Provenance: Computer Science Examples in CS relating to provenance: Undo / Redo Diff / Revision history History logging (e.g., Wikipedia) Interaction logging Data annotation Database queries and results (Oracle Flashback) Screen capturing? Knowledge management Etc.

Problems and Challenges There are lots of people working on problems pertaining to provenance. For example: Open Provenance Model (OPM): common framework for describing information provenance (e.g., Wikipedia) Database: (1) how data is derived from large computation, (2) how data is copied / synthesized from one part of the database to another Knowledge management / Ontology: the study of relationships between data, concepts, processes, information, etc. (e.g., Scientific Workflow)

What is “Analytic Provenance”? Analytic Provenance (AP): similar to workflow, is the provenance of the analysis (the steps of the analysis). This includes: The record of the data and data manipulation The record of the user’s interactions The record of what the user sees The record of the computational products (both the visualization and the data) The record of the user’s analysis results How is provenance research in visual analytics similar and different from the others?

Similarities At a high level, both are attempts at keeping track of the changes of data and information over time. Along the same line, one common goal is to verify and validate the current information or the results of a query / analysis. Others?

Differences Exploration with high interactivity. Many provenance systems utilize state diagrams Similar to depicting “scientific workflow” What is a “state” in a highly exploratory system? Often, the “states” are data dependent Computation can be costly Storing only the procedures do not always lead to the same result due to computational powers Storing of “insight” How to determine which interactions are useful? (the labeling problem…)

Why Study Analytic Provenance? Damn good question… A problem that has plagued me for years Validation (of the results) –kind of lame Verification (of the process) – kind of lame Training – Less lame, but still not very sexy These can all be (somewhat) solved with video-capturing the screen (with some annotation and/or microphone)

Why Study Analytic Provenance? More interestingly, as a post hoc analysis: Redo of a certain task For example, I just did some analysis on the Census of Massachusetts. Can I reapply the same analysis to Rhode Island Census? What analysis steps are useful? For example, in solving a difficult problem, is there a key step (or steps) that lead to success (and to failure)? Building a knowledge-base If we believe that interactions encodes reasoning and analysis, can I aggregate all interactions/analyses into a knowledge-base?

Why Study Analytic Provenance? As an ad hoc analysis (that is, in real time), determine a user’s “analytical needs”: Related to “adaptive” visualization or interfaces, but goes beyond adapting the UI. For example: Analysis of search space: what data space has the user explored (and not explored)? Analysis of bias: is the user favoring some hypothesis and ignoring evidence? Essentially, anything that can give us some indication of who the user is (in terms of “ICD 3 ”)

Challenges? What are some challenges that need to be solved in order to accomplish the previously stated goals? How would you categorize them? CHI 2010 Analytic Provenance workshop

Questions?

Provenance and Scientific Workflows: Challenges and Opportunities Susan B.Davidson (Upenn) Juliana Freire (Utah/NYU) SIGMOD 2008

Prospective Provenance: Steps taken in the scientific workflow Retrospective Provenance: The environment in which the steps were taken Annotated Provenance: Some notes entered by the user Note the small squares in the upper left corner of each box (representing each step)

Ways to Store Provenance RDF (resource description file) SPARQL (query language for RDF)

Ways to Store Provenance OWL (Web Ontology Language)

Yuck… Knowledge vs. information vs. data Example about grass, rabbit, and wolf

Opportunities 1.Provenance and Scientific Publications: Reproducibility for scientific experiments 2.Provenance and Data Exploration: Simplify exploratory processes (graph reduction) 3.Social Network Analysis: Not sure how this is different from 1, but applied to SNA (and crowd sourcing) 4.Provenance in Education: “Teaching is one of the killer applications of provenance-enabled workflow systems”

User starts with an analogy template (left) The user then reapplies the workflow to a different dataset (right). The system automatically figures out inconsistencies or problems with the reapplication and either (a) fixes it or (b) warns the user

Open Problems 1.Information Management Infrastructure: Usability issue relating to how to use information management systems 2.Provenance Analytics and Visualization: Visualize and data-mine provenance. 3.Interoperability: Steps of workflow generated from different software and computers 4.Connecting Database and Workflow: Does provenance make sense without access to the (a) original or the (b) derived data

Questions?

Date and Time When and where should we meet??

Structure of the Course Open-ended The topic of provenance is both old and new… Read existing papers (that I think are important), and You suggest new ones Identify research opportunities Daily schedule (tentative): 1:30 – 2: presentations of 15 minutes each 2:30 – 2:45 break 2:45 – 4:00 group discussion

Time Line See course website Sign up for: 2 scribes positions (on different weeks) 2 papers to present (on different weeks) And on those weeks, you will lead the discussions 1 week where your group (of 3) will identify papers and lead discussions

Questions?