Exploring Personal CoreSpace For DataSpace Management Li Yukun and Xiaofeng Meng WAMDM Lab Renmin University of China.

Slides:



Advertisements
Similar presentations
2 Introduction A central issue in supporting interoperability is achieving type compatibility. Type compatibility allows (a) entities developed by various.
Advertisements

BI Web Intelligence 4.0. Business Challenges Incorrect decisions based on inadequate data Lack of Ad hoc reporting and analysis Delayed decisions.
Personalized Navigation in the Semantic Web: An Enhanced Faceted Browser Michal Tvarožek FIIT STU BA.
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Methodology Conceptual Database Design
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
BIOCMS: Resource Integration and Web Application Framework for Bioinformatics DHUNDY R BASTOLA †, *, ANIL KHADKA †, MOHAMMAD SHAFIULLAH † AND HESHAM ALI.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
© 2011 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 1 August 15th, 2012 BP & IA Team.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Creating Business Workflow Using SharePoint Designer 2007 Presented by Tarek Ghazali IT Technical Specialist Microsoft SQL Server MVP Microsoft SQL Server.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
Social scope: Enabling Information Discovery On Social Content Sites
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Methodology - Conceptual Database Design Transparencies
Database Systems: Design, Implementation, and Management Ninth Edition
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Geospatial Technical Support Module 2 California Department of Water Resources Geospatial Technical Support Module 2 Architecture overview and Data Promotion.
1 A Theoretical Framework for Association Mining based on the Boolean Retrieval Model on the Boolean Retrieval Model Peter Bollmann-Sdorra.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Satoshi Oyama Takashi Kokubo Toru lshida 國立雲林科技大學 National Yunlin.
Adaptive Hypermedia Tutorial System Based on AHA Jing Zhai Dublin City University.
1-1 System Development Process System development process – a set of activities, methods, best practices, deliverables, and automated tools that stakeholders.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Methodology: Conceptual Databases Design
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
A Model for Fast Web Mining Prototyping Nivio Ziviani UFMG – Brazil Álvaro Pereir a Ricardo Baeza-Yates Jesus Bisbal UPF – Spain.
Methodology - Conceptual Database Design
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Algorithmic Detection of Semantic Similarity WWW 2005.
Software Prototyping. Introduction Builds an approximation to some system Builds an approximation to some system Easy to learn and understand different.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
An approach for Framework Construction and Instantiation Using Pattern Languages Rosana Teresinha Vaccare Braga Paulo Cesar Masiero ICMC-USP: Institute.
Introduction to the Semantic Web and Linked Data
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
1 Active Directory Service in Windows 2000 Li Yang SID: November 2000.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 SWE Introduction to Software Engineering Lecture 14 – System Modeling.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Data mining in web applications
Database Systems: Design, Implementation, and Management Tenth Edition
Using E-Business Suite Attachments
CCNT Lab of Zhejiang University
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
OrientX: an Integrated, Schema-Based Native XML Database System
Social Knowledge Mining
Database.
Evaluating Compuware OptimalJ as an MDA tool
Assoc. Prof. Dr. Syed Abdul-Rahman Al-Haddad
Data Model.
Magnet & /facet Zheng Liang
Semantic Markup for Semantic Web Tools:
On the Designing of Popular Packages
Research on Personal Dataspace Management
Information Networks: State of the Art
DATABASES WHAT IS A DATABASE?
Presentation transcript:

Exploring Personal CoreSpace For DataSpace Management Li Yukun and Xiaofeng Meng WAMDM Lab Renmin University of China

Outline  Introduction  CoreSpace Overview  CoreSpace Design  CoreSpace Implementation  Conclusion

Motivation Query Find a pdf file I downloaded from a web page and stored in a directory of D drive. Revisit a picture I developed for MDM2008 one years ago. Background With increasing of personal data set, PIM becomes a serious problem and a hot research issue; The current tools can not work well in some cases.

Related work  Current solutions Traditional tools  Folder explorer, Desktop Search DataSpace Support Platforms (DSSPs)  Personal data integration (Xin Dong,etc.)  Association-based query ( Salles MAV, etc. )  Data Resource Model RSM, SLN (Hai Zhuge, etc.)  Our solution Based on user features  Users play a key role  Revisit is an popular access style Research focuses  Highlight the role of users  Produce an effective approach for exploring PDS

Problem Definition Personal DataSpacePersonal CoreSpace Classify Exploring -Modeling user features -Exploring based on user features

Contributions  Propose CoreSpace Model Divide the semantic links among PDS into two classes:  Objective Semantic Link(OSL)  Memory-based Semantic Link(MSL) Describe Personal CoreSpace(PCS) based on Resource Space Model (RSM).  An ontology of Personal CoreSpace Discover several types of meaningful MSLs Design an ontology of PCS based on the MSLs  A facet-based search interface of PCS Propose a method to translate the PCS ontology into a facet-based search interface. Validate the effectiveness of our methods by implementing a prototype system.

Outline  Introduction  CoreSpace Overview  CoreSpace Design  CoreSpace Implement  Conclussion

Features of personal data  Features of personal data Versatile, heterogeneous, personalized, complex, evolutionary  Features of personal data operations Pay-Go Integration Known-item relocation-- “revisit” Multiple query methods Simple interface

Resource Space Model  A resource space is a n-dimensional space Axis : Xi is the name of an axis. Xi = (Ci1;Ci2;...;Cin) represents an axis with its coordinates and the order between them. Coordinate: C denotes the coordinate name in form of a noun or a noun phrase. Point: determines one or a set of entities, we denote it as PCS(X1;X2;...;Xn).  Data operation [1] H.Zhuge, Communities and Emerging Semantics in Semantic Link Network: Discovery and Learning, IEEE Transactions on Knowledge and Data Engineering, vol.21, no.6, 2009, pp [2] H. Zhuge. Resource space model, its design method and applications. The Journal of Systems and Software 72 (2004) [3] H.Zhuge, The Web Resource Space Model, Springer, 2008.

Personal CoreSpace Model  Personal DataSpace Data item  Attribute Owner Relationship  Personal CoreSpace A n-dimensional space Axis : Attributes of personal data items. Coordinate: Values of a certain attribute, which can be a tree structure. Point: A personal item or a set of personal items.

Outline  Introduction  CoreSpace Overview  CoreSpace Design  CoreSpace Implementation  Conclussion

Personal CoreSpace Ontology  Two type of attributes Natural attributes  Name, Type,Access time, Directory, Size, Source User-based attributes  Access frequency, access type, related task

 Type: { , Web pages, Picture, Documents,…}  Access time {”Today”,”Yesterday”,”Last week”,”Last month”,”Last year”,”One year ago”}  Directory A Tree structure  Size {(0,10K]; (10K,100K]; (100K,1M]; (1M,10M]; (10M,-)}  Sources {Self-developed, Cloned}  Access frequency {(1,5]; (6,15]; (16,50]; (50,-]}  Access type: {Read-only, Modified}  Related tasks A personal task set Personal CoreSpace Ontology

Outline  Introduction  CoreSpace Overview  CoreSpace Design  CoreSpace Implementation  Conclussion

CoreSpace Implementation  System Framework User behavior monitor Storage agent Item identify agent Query processor  Features PayGo evolution From CoreSpace to facet search Extendability

From CoreSpace to facet search  Method Take each coordinate Xi as a facet Fi, and take its coordinates as the options of facet Fi. Based on the hierarchical structure of PCS, we can easily construct a facet-based search interface.  Facet-based query logical Let X and Y’ be two selected nodes of facet tree, and they can be regarded as two conditional expressions. Our method is detailed as below.  If X is parent of Y, it means X and Y = Y;  If X is brother of Y, it means X or Y;  If X and Y are neither parent relation nor brother relationship, it means X or Y.

An example of query algebra The red nodes represents those options selected by user According to the rules we can get the logical expression R = {Xi | (Xi. type = JPG ∨ Xi.type =VSD) ∧ Xi. place = ”D : \Picture”}

Implementation

Outline  Introduction  CoreSpace Overview  CoreSpace Design  CoreSpace Implementation  Conclussion

Conclusion  This is just a preliminary work Propose a CoreSpace model Propose a method to explore PDS based on CoreSpace  Future work Try to discover more rules of user memory Enrich the ontology of PCS

Thanks