GenSpace: Exploring Social Networking Metaphors for Scientific Collaborative Work Gail Kaiser, Swapneel Sheth, Chris Murphy {kaiser, swapneel, cmurphy}

Slides:



Advertisements
Similar presentations
Chapter 14: Usability testing and field studies
Advertisements

GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
The Access Grid Ivan R. Judson 5/25/2004.
Common Instrument Middleware Architecture and Federation of Instrument Resources for X-ray Crystallography Rick McMullen Indiana University.
Systems Development Environment
BlogMyData A Virtual Research Environment for collaborative visualization of environmental data Andrew Milsted | 14 September 2010.
Practical Object-Oriented Design with UML 2e Slide 1/1 ©The McGraw-Hill Companies, 2004 PRACTICAL OBJECT-ORIENTED DESIGN WITH UML 2e Chapter 5: Restaurant.
1 genSpace: Community- Driven Knowledge Sharing for Biological Scientists Gail Kaiser’s Programming Systems Lab Columbia University Computer Science.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
GenSpace: Exploring Social Networking Metaphors for Knowledge Sharing and Scientific Collaborative Work Chris Murphy, Swapneel Sheth, Gail Kaiser, Lauren.
Federated Hierarchical Filter Grids STTR-funded project with Indiana, Caltech and Deep Web Technologies A Grid infrastructure for Data Analysis Integrates.
Distributed In Vivo Testing of Software Applications Matt Chu, Christian Murphy, Gail Kaiser Columbia University.
Chapter 8 Management Support and Coordination Systems.
ETH Initiative – Project FoodCASE Research on Data Quality of Scientific Database Systems by Example of Food Composition Databases Karl Presser.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
University of Michigan MCommunity Project Liz Salley Product Manager, Michigan Administrative Information Services Luke Tracy
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
The Design Discipline.
Chapter 13 Starting Design: Logical Architecture and UML Package Diagrams.
EUBA: The Emory User Behavior Analysis System Eugene Agichtein, Qi Guo and Ryan Kelly Intelligent Information Access Lab
Visualizing Cyber Security: Usable Workspaces Glenn A. Fink, Christopher L. North, Alex Endert, Stuart Rose.
IT Terminology Quiz VSB 1002: Business Dynamics II Spring 2009.
Information Architecture The science of figuring out what you want your Web site to do and then constructing a blueprint before you dive in and put the.
Investigating Trust Factors in Computer Mediated Group Collaboration Xusen Cheng Manchester Business School The University of Manchester, UK.
Chapter 1 Introduction to Data Mining
The Information Environment for Neuroscientists David R Newman
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Chapter © 2006 The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/ Irwin Chapter 7 IT INFRASTRUCTURES Business-Driven Technologies 7.
Semantically Enhanced Model Experiment Evaluation Process (SeMEEP) within the Atmospheric Chemistry Community Chris Martin 1,2, Mo Haji 2, Peter Dew 2,
Programming in Java Unit 3. Learning outcome:  LO2:Be able to design Java solutions  LO3:Be able to implement Java solutions Assessment criteria: 
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
Crystal-25 April The Rising Power of the Web Browser: Douglas du Boulay, Clinton Chee, Romain Quilici, Peter Turner, Mathew Wyatt. Part of a.
Object Management Group (OMG) Specifies open standards for every aspect of distributed computing Multiplatform Model Driven Architecture (MDA)
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Information System Development Courses Figure: ISD Course Structure.
7-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 7 IT Infrastructures.
CONCLUSION & FUTURE WORK Given a new user with an information gathering task consisting of document IDs and respective term vectors, this can be compared.
Illustrations and Answers for TDT4252 exam, June
Introducing HingX now with Capacity Development Network.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
NA2 objectives The „v.E.R.C.“ (virtual Earth System Resource Center) v.E.R.C is: A (technical) infrastructure to improve the utilization of the ESM core-infrastructure,
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
CS 772: Global Knowledge Networks V. “Juggy” Jagannathan CSEE, West Virginia University.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
Recording the Context of Action for Process Documentation Ian Wootten Cardiff University, UK
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Usability Lab 2002 Cascade Kick-Off Meeting User Requirements - Web Site Design Multimedia Interface to Material Databases Flavio Fontana (Ulab)
Diagrams. Typically, we view the static parts of a system using one of the four following diagrams. 1. Class diagram 2. Object diagram 3. Component diagram.
A Collaborative e-Science Architecture towards a Virtual Research Environment Tran Vu Pham 1, Dr. Lydia MS Lau 1, Prof. Peter M Dew 2 & Prof. Michael J.
Thomas Gutberlet HZB User Coordination NMI3-II Neutron scattering and Muon spectroscopy Integrated Initiative WP5 Integrated User Access.
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
Business Intelligence Overview. What is Business Intelligence? Business Intelligence is the processes, technologies, and tools that help us change data.
Business System Development
SOFTWARE DESIGN AND ARCHITECTURE
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Data Warehouse.
INQUIRY PROCESSES.
Unified Modeling Language
Gail Kaiser, Swapneel Sheth, Chris Murphy
Starting Design: Logical Architecture and UML Package Diagrams
OGCE Portal Applications for Grid Computing
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Federated Hierarchical Filter Grids
ODV beta Multiple views, new views bar
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Presentation transcript:

genSpace: Exploring Social Networking Metaphors for Scientific Collaborative Work Gail Kaiser, Swapneel Sheth, Chris Murphy {kaiser, swapneel, cs.columbia.edu Programming Systems Lab Department of Computer Science Columbia University

genSpace Goals Prior “collaboratories” for bioinformatics scientists  Tool sharing (e.g. BioCoRE from UIUC)‏BioCoRE  Data sharing (e.g. BSC from PNNL)‏BSC  Instead, we seek to introduce knowledge sharing  What tools / datasets should I use to investigate this problem?  Who do I know who also uses this tool / dataset?  Which tools and datasets work nicely together?  Where does this tool / dataset fit in a typical workflow?  When did I previously use this tool / dataset?  How can I get help (from an expert who is online right now)?

genSpace Goals Social Networking facilitates Collaborative Filtering  What movies would I like?  Who also likes this book?  Which food and wine go together?  Where does this song fit in the playlist?  When was this restaurant last reviewed?  How can I get help about ? We investigate Social Networking Models as an approach to Scientific Knowledge Sharing We are implementing a prototype for geWorkbenchgeWorkbench

genSpace Technical Details genSpace client is a geWorkbench “component”, separate centralized genSpace server Instrument geWorkbench to capture and record analysis events Aggregate event logs for communities of users Data-mine event patterns Automatically construct networks of “similar” users Automatically construct implicit workflows consisting of sequences of analysis tools Query via Instant Messaging (IM) “bots” Visualize networks and workflows

genSpace Architecture

genSpace plugin for geWorkbench

IM based query interface (jclaim)

Single Workflow Visualization

Workflow Relationship Visualization

genSpace Research Questions We do not aim to (directly) produce new results answering bioinformatics research questions easier Instead, we seek to make it easier for bioinformatics researchers to find answers to their own questions We do, however, seek to produce (direct) new research results of interest in software systems  Adapting a “social” metaphor for collaborative work  Data mining implicit workflows  Publish/subscribe based on implicit participation in tool and dataset use communities  Security/privacy implications and amelioration