Building Community Wikipedias: A Machine-Human Partnership Approach. Presented by Tianyi Zhang.


Motivation: Wiki portals are becoming more deeply and broadly involved in our lives. Existing approaches: (1) semi-automatic methods (such as Libra and Citeseer) that extract and integrate data from a multitude of data sources; (2) deploying an initial portal in wiki format, then inviting community members to revise and add material (such as Wikipedia).

The ultimate goal of this paper

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G? What should the view language be? How to export the structured data of the views into wiki pages? How to manage user contributions to the portal? What is a user U conceptually allowed to edit? How to efficiently infer such edits from what U has done to a wiki page W? Two complications: users can edit the schema, not just the data, and a user's edits, being limited to the wiki interface, are usually ambiguous.

Contributions of this paper: Introduces a novel hybrid approach that integrates users' insight with machine input. Provides a solution for modeling the underlying structured database and exporting its views as wiki pages. Employs an efficient solution for propagating user edits from a wiki page to the underlying database. Presents empirical results over a real-world implementation that demonstrate the promise of the approach.

Architecture

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G?

The model has to be intuitive. The authors define the schema Gs of G to consist of a set of entity types E1,…En and a set of relation types R1,…Rn. They define the data Gd of G to be a temporal ER data graph: a set of nodes that specify entity instances and a set of edges that specify relation instances.
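The schema/data split can be pictured with a minimal Python sketch. This is an illustration only, not the authors' implementation: the class names, the `valid_from` field (one possible way to make the graph "temporal"), and the sample entity and relation types are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Schema:
    """Gs: the allowed entity types and relation types."""
    entity_types: set
    relation_types: set


@dataclass
class Node:
    """An entity instance in Gd."""
    id: int
    etype: str
    attrs: dict = field(default_factory=dict)
    valid_from: int = 0  # assumed timestamp; the slides do not say how time is stored


@dataclass
class Edge:
    """A relation instance in Gd, connecting two entity instances."""
    id: int
    rtype: str
    src: int
    dst: int
    valid_from: int = 0  # assumed timestamp


class ERGraph:
    """Gd: nodes and edges, checked against the schema Gs."""

    def __init__(self, schema):
        self.schema = schema
        self.nodes = {}
        self.edges = {}

    def add_node(self, node):
        if node.etype not in self.schema.entity_types:
            raise ValueError(f"unknown entity type: {node.etype}")
        self.nodes[node.id] = node

    def add_edge(self, edge):
        if edge.rtype not in self.schema.relation_types:
            raise ValueError(f"unknown relation type: {edge.rtype}")
        if edge.src not in self.nodes or edge.dst not in self.nodes:
            raise ValueError("edge endpoints must be existing nodes")
        self.edges[edge.id] = edge


# Hypothetical sample data.
schema = Schema({"person", "paper"}, {"authored"})
graph = ERGraph(schema)
graph.add_node(Node(1, "person", {"name": "Ann"}))
graph.add_node(Node(2, "paper", {"title": "P1"}))
graph.add_edge(Edge(10, "authored", src=1, dst=2))
```

Validating every node and edge against the schema keeps Gd consistent with Gs, which matters once users are allowed to edit both.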

Example

Storing G in an RDBMS: Create several tables to store the definitions of entity types, relationship types, and their attributes (for example, a person has the attributes name, title, and organization). Create the tables Entity_ID(id, etype) and Relationship_ID(id, rtype, eid1, eid2). Two further tables, A_M and A_U, keep track of all values that come from machines and from users, respectively. Human-first policy.
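A small sketch of this storage layout, using Python's built-in `sqlite3` module. Only the table names and the columns of Entity_ID and Relationship_ID come from the slide. The columns of A_M and A_U, the helper `current_value`, and the reading of "human-first" as "a user-supplied value overrides a machine-supplied one" are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Entity_ID (
    id    INTEGER PRIMARY KEY,
    etype TEXT NOT NULL
);
CREATE TABLE Relationship_ID (
    id    INTEGER PRIMARY KEY,
    rtype TEXT NOT NULL,
    eid1  INTEGER NOT NULL REFERENCES Entity_ID(id),
    eid2  INTEGER NOT NULL REFERENCES Entity_ID(id)
);
-- Assumed layout: one row per (object, attribute) value.
CREATE TABLE A_M (id INTEGER, attr TEXT, value TEXT, PRIMARY KEY (id, attr));
CREATE TABLE A_U (id INTEGER, attr TEXT, value TEXT, PRIMARY KEY (id, attr));
""")


def current_value(conn, obj_id, attr):
    """Return the user value if one exists, else the machine value, else None.

    This encodes one possible reading of the human-first policy.
    """
    for table in ("A_U", "A_M"):  # user table is consulted first
        row = conn.execute(
            f"SELECT value FROM {table} WHERE id = ? AND attr = ?",
            (obj_id, attr),
        ).fetchone()
        if row is not None:
            return row[0]
    return None


# Hypothetical sample data: the machine extracted two values, a user corrected one.
conn.execute("INSERT INTO Entity_ID VALUES (1, 'person')")
conn.execute("INSERT INTO A_M VALUES (1, 'name', 'Ann')")
conn.execute("INSERT INTO A_M VALUES (1, 'title', 'Professor')")
conn.execute("INSERT INTO A_U VALUES (1, 'title', 'Associate Professor')")
```

Keeping machine and user values in separate tables means a later machine re-extraction can refresh A_M without overwriting what users have entered.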

Example

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G? What should the view language be?

View language requirements: Every wiki page describes one entity or relationship. Views must materialize quickly. Views must be quick to modify.

A path-based view language: Views are built from data paths and schema paths. A data path starts at one entity and leads to another entity, retaining only certain attributes for each node and edge along the path.
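The data-path idea can be sketched in a few lines of Python. This is not the view language from the paper: the dictionary-based graph, the step format `(rtype, edge_keep, node_keep)`, and the function names are hypothetical, and edges are followed in one direction only.

```python
def project(record, keep):
    """Retain only the listed attributes of a node or edge."""
    return {k: record[k] for k in keep if k in record}


def eval_data_path(nodes, edges, start_id, start_keep, steps):
    """Follow a path from one entity, keeping selected attributes along the way.

    steps is a list of (rtype, edge_keep, node_keep) triples: follow edges of
    type rtype, keep edge_keep on the edge and node_keep on the node reached.
    Returns one list per matching path: [node, edge, node, edge, node, ...].
    """
    partial = [([project(nodes[start_id], start_keep)], start_id)]
    for rtype, edge_keep, node_keep in steps:
        extended = []
        for path, current in partial:
            for e in edges:
                if e["rtype"] == rtype and e["src"] == current:
                    step = [project(e, edge_keep), project(nodes[e["dst"]], node_keep)]
                    extended.append((path + step, e["dst"]))
        partial = extended
    return [path for path, _ in partial]


# Hypothetical sample graph.
nodes = {
    1: {"etype": "person", "name": "Ann", "title": "Professor"},
    2: {"etype": "paper", "title": "P1", "year": "2008"},
    3: {"etype": "paper", "title": "P2", "year": "2007"},
}
edges = [
    {"rtype": "authored", "src": 1, "dst": 2, "position": "1"},
    {"rtype": "authored", "src": 1, "dst": 3, "position": "2"},
]

# "From person 1, follow 'authored' edges and keep only the paper titles."
view = eval_data_path(nodes, edges, 1, ["name"], [("authored", [], ["title"])])
```

Because a path touches only the edges leaving the nodes reached so far, a view like this can be materialized without scanning unrelated parts of the graph, which fits the requirement that views materialize quickly.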

Example

A path-based ER view

Example

Creating views over the ER graph: Wikify all entities and relationships. Specify a default view schema Ve that specifies a neighborhood of instances of E.

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G? What should the view language be? How to export the structured data of the views into wiki pages?

Converting views to wiki pages: The straightforward solution is to convert Vd into a set of natural-language sentences. However, it is then difficult to extract and update the structured data from those sentences.

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G? What should the view language be? How to export the structured data of the views into wiki pages? How to manage user contributions to the portal? What is a user U conceptually allowed to edit?

Problems on the path to the goal: How to build an initial wiki portal? How to model and implement the structured database G? What should the view language be? How to export the structured data of the views into wiki pages? How to manage user contributions to the portal? What is a user U conceptually allowed to edit? How to efficiently infer such edits from what U has done to a wiki page W? Two complications: users can edit the schema, not just the data, and a user's edits, being limited to the wiki interface, are usually ambiguous.

Infer & Execute Structured Edits

Example

Mapping User Edits into Sequence of Actions
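The transcript gives only the title of this step, so the following Python sketch is a generic illustration rather than the paper's algorithm. It assumes, hypothetically, that the structured part of a wiki page can be read as an attribute-to-value dictionary before and after the edit, and it maps the difference to a sequence of insert, update, and delete actions. The function name and action format are made up for this example, and the sketch does not handle schema edits or ambiguous edits.

```python
def infer_actions(old, new):
    """Compare two attribute dictionaries and list the edits between them.

    Returns tuples ("insert", attr, value), ("update", attr, value) or
    ("delete", attr, old_value), ordered by attribute name.
    """
    actions = []
    for attr in sorted(old.keys() | new.keys()):
        if attr not in new:
            actions.append(("delete", attr, old[attr]))
        elif attr not in old:
            actions.append(("insert", attr, new[attr]))
        elif old[attr] != new[attr]:
            actions.append(("update", attr, new[attr]))
    return actions


# Hypothetical page state before and after a user's edit.
before = {"name": "Ann", "title": "Professor", "office": "R101"}
after = {"name": "Ann", "title": "Associate Professor", "organization": "U1"}
actions = infer_actions(before, after)
```

Each resulting action could then be applied to the user-value table of the underlying database, leaving machine-supplied values untouched.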

Empirical Evaluation

Thanks. Questions?