Fusion Tables.

Slides:



Advertisements
Similar presentations
Darrell W. Gunter EVP / CMO Collexis Holdings, Inc. March 23, 2010 Spring Conference CONTENT: Uncovering the Value and Benefits of Semantic Technology.
Advertisements

OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Motif Space Database Design Kiranjit Sidhu. 2 Outline  Schema Design  Content of Database  Functionality  Future Plans.
An Architecture for Creating Collaborative Semantically Capable Scientific Data Sharing Infrastructures Anuj R. Jaiswal, C. Lee Giles, Prasenjit Mitra,
Synthesis of Incomplete and Qualified Data using the GCE Data Toolbox Wade Sheldon Georgia Coastal Ecosystems LTER University of Georgia.
The IATI Data Store. In the beginning... IATI does not propose the development of a new aid database, and it does not seek to develop parallel standards.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Conceptual Modeling Issues in Web Applications enhanced with Web services Sara Comai, Politecnico di Milano In collaboration with:
CountryData Development Improving the collation, availability and dissemination of development indicators (including the MDGs) Nairobi, 27 November 2013.
Data Management David Nathan & Peter Austin & Robert Munro.
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
KMS Products By Justin Saunders. Overview This presentation will discuss the following: –A list of KMS products selected for review –The typical components.
Access 2013 Platform Overview Access Low up-front investment Easy to evolve and iterate Easy adoption One version of the truth Easy to collaborate.
EXTENDING DATABASE USABILITY Michelle Brown, MSc. Student.
The Glance Project ATLAS Management January 2012.
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
Access Forms and Queries. Entering Data in Your Table  You can add data to your table in Datasheet view, by typing in the columns and rows.  This.
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
TopCAT Use Cases Priorities User Interface 1 ICAT developer workshop, August 2009 Laurent Lerusse – STFC
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
Steven Seida How Does an RDF Knowledge Store Compare to an RDBMS?
GOOGLE FUSION TABLES: WEB- CENTERED DATA MANAGEMENT AND COLLABORATION HectorGonzalez, et al. Google Inc. Presented by Donald Cha December 2, 2015.
Prizms for Data Publication and Management May 9, 2014 Katie Chastain.
#SummitNow CMIS in our Research Group Ian Wright University of
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
1 SQL SERVER 2005 Express CE-105 SPRING 2007 Engr. Faisal ur Rehman.
THE ANCHOR DASHBOARD Envisioning an online hub for anchor/community metrics May 20, 2015 Anchor Dashboard Cohort Meeting Baltimore John Duda, PhD Communications.
Developing Online Tools To Support The Visualization Of Ocean Data For Educational Applications Poster #1767 Michael Mills, S. Lichtenwalner,
Florida Technical College
Scholarly Workflow: Federal Prototype and Preprints
What is sql?.
Datab ase Systems Week 1 by Zohaib Jan.
BI tools: Excel’s Pivot table
The importance of being Connected
So, what was this course about?
Using Partitions and Fragments
Integrating Data for Archaeology
Building a CMMI Data Infrastructure
Applied CyberInfrastructure Concepts Fall 2017
ICT Database Lesson 1 What is a Database?.
Data Warehouse.
Lesson 1: Introduction to Trifacta Wrangler
Tutorial 8 Objectives Continue presenting methods to import data into Access, export data from Access, link applications with data stored in Access, and.
CS 174: Server-Side Web Programming February 12 Class Meeting
Principles of report writing
RichAnnotator: Annotating rich (XML-like) documents
Dynamic Data Access and Dynamically Generated WMS Layers
Lecture 5: Leave no relevant data behind: Data Search
TRAINING OF FOCAL POINTS on the CountrySTAT SYSTEM based on FENIX
WEBINAR: Test Automation & Robotic Automation of Dynamics AX with Rapise October 18th, 2018 – Adam
Data Model.
OOA&D II Bo Wang, Kan Qi Adapted from Alexey Tregubov’s Slides.
FOCOS 2 Meet A brand new version of FOCOS is now available
Dr. Bhavani Thuraisingham The University of Texas at Dallas
LOD reference architecture
Relational Database Design
Database Management System
BI tools: Excel’s Pivot table
Student Organizations
Databases This topic looks at the basic concept of a database, the key features and benefits of a Database Management System (DBMS) and the basic theory.
Programming with Data Lab 7
Reportnet 3.0 Database Feasibility Study – Approach
Vancouver Public Library
9/8/ :03 PM © 2006 Microsoft Corporation. All rights reserved.
Dynamic Data Access and Dynamically Generated WMS Layers
The Data of Visualization
Database management systems
Presentation transcript:

Fusion Tables

Takeaways Relatively “light” paper on a real-world public facing system Clearly useful to some people and organizations – many users… Companion paper in SOCC talks about details of implementation most are standard adaptations of existing techniques

Target Users are Data Enthusiasts Also called “factivists”: those who know nothing about DBMS They need to find good data, do meaningful data integration, and tell compelling stories Allow them to upload, collaborate, visualize data Also combine them with existing datasets Need to understand semantics of datasets

Goals of Fusion Tables Goal 1: Easy to use database system integrated with the web Support common workflows Easy upload Sharing Visualizations Publishing Goal 2: Fusion with other datasets; find others and combine with yours

Any thoughts about the first goal? Who are the target users for tools like this? We saw some examples…

Any thoughts about the first goal? Who are the target users for tools like this? We saw some examples… People who want to store and study small datasets But need something more powerful than excel Joins, Selects, aggregates (visualizations) e.g., Journalists, scientists, governments, non-profits

Let’s talk about each of these steps.. Data Acquisition Primarily work on CSVs They don’t require a schema in advance Automatically infer schemas Is this sufficient in practice?

Not really! Studies state that data acquisition accounts for 80% of the development time and cost in data science What if data is not in csv, but in JSON, or XML? How would you clean it then? Thoughts?

Other Recent Work There has been some recent work on cleaning data automatically with humans.. (there’s other work on this as well) http://vimeo.com/19185801

Drawbacks?

Drawbacks? If you make a mistake, hard to go back. Requires expertise on the part of users Can you think of other ways to do this acquisition without programming?

Drawbacks? If you make a mistake, hard to go back. Requires expertise on the part of users Can you think of other ways to do this acquisition without programming? Examples? Highlight regions vertically? Use semantic knowledge?

Sharing Fusion tables supports sharing and collaboration on tables; What are the issues that come up when multiple users are collaborating on tables?

Sharing Fusion tables supports sharing and collaboration on tables; What are the issues that come up when multiple users are collaborating on tables? What are the issues that come up when there are visualizations that derive from tables?

Sharing Fusion tables supports sharing and collaboration on tables; What are the issues that come up when multiple users are collaborating on tables? Need for coordination: conflicts. What are the issues that come up when there are visualizations that derive from tables?

Other issues What if users make mistakes while collaborating? What can we do then?

Our recent work: Datahub: Github for data

Eventually… Hopefully Powerful versioning query language

Users of Fusion Tables Examples…

Goals of Fusion Tables Goal 1: Easy to use database system integrated with the web Support common workflows Easy upload Sharing Visualizations Publishing Goal 2: Fusion with other datasets; find others and combine with yours

Fusion Tables Implementation Simple search and suggestion based on existing data What other use-cases could you see for web-data integration (i.e., finding data on the web to “mesh” with your data)?

Other Usecases Row or column augmentation Join augmentation Missing value augmentation Accuracy augmentation

For example

First source of data Many open data initiatives Governments Non-profits Collaborations and academic institutions E.g., uci repository