Presentation is loading. Please wait.

Presentation is loading. Please wait.

Dataspace: a new concept of data management

Similar presentations


Presentation on theme: "Dataspace: a new concept of data management"— Presentation transcript:

1 Dataspace: a new concept of data management
Li Yukun

2 Outline From database to dataspace PDS/PIM Related work
Challenge issues Our work on dataspace

3 Traditional RDBMS 先有格式, 支持有限的数据格式 关注数据的稳定性 需要的时候集中建成 面向应用主题

4 Query1:Please tell me all the information
in my dataspace about a conference Query2: please tell me the s and persons on a event 先有格式, 支持有限的数据格式 关注数据的稳定性 需要的时候集中建成 面向应用主题

5 Description of dataspace
淡化形式 开放的,支持多种不同的数据格式 强调数据的可关联性和可演化性 具有Pay-As-You-Go 特性 面向实体需要

6 From Database to Dataspace
The advantages of traditional model should be kept. New characters of data should be mapped. Focus

7 Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace

8 PDS and PIM

9 Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace

10 Related work on PIM/PDS
Memex——1945 (Vannevar Bush ) Lifestreams——1996 From Database to Dataspaces——2005 SIGIR PIM workshop 2005/2006 iDM——2006 (JensPeter Dittrich, Marcos Antonio Vaz Salles ) Indexing dataspace Resource space model

11 Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace

12 Challenge issues on the topic
INPUT Profile OUTPUT

13 Challenge issues on the topic
Searching\Encountering\keeping\Extraction\ObjectIdentity\Evaluation INPUT Profile OUTPUT

14 Challenge issues on the topic
Model/Index /Store/Query/System INPUT OUTPUT

15 Challenge issues on the topic
INPUT OUTPUT Finding/Refining /Reminding/HCI/QL

16 Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work

17 Our work and proposal 1. Read related papers 2
Our work and proposal 1. Read related papers 2. Survey From Database to dataspace, from for enterprise to for people. (IDKE Report2006) PIM: 一个新的研究焦点(IDKE Report2006) 数据空间:一种新的数据管理技术,(计算机通讯, 07.8) 张相於毕业论文 3. Automatic content extraction from paper of PDF style. 4. Proposal for research of our group.

18 About the Proposal General Topic:
Related technology on content management Subtopic: Model of content management (classify\content-formalization\Query\importance\urgency) EMIEX: Object extraction based on content (Personal name\Location name\Event\Time\...). EMSN: Socal network construction and mining on log Intelligent reminding based on log (from to schedule) From to blog \ chatting\ phone-note log Demo development tasks: Read papers on content extraction\ personal recommendation \ user profile Read papers on management Prepare dataset (English \Chinese ) and classify Arithmetic and Policy

19 Motivation & Challenge
has become more popular and play an important role in work and daily life. We can get data for experiment. It has a more formal stytle. It’s characters is similar to Blog\BBS\Chating data. Challenge IR is a new area to us. Data collection is a hard process. A more detailed plan will be formed later

20 U H A N O Y K T


Download ppt "Dataspace: a new concept of data management"

Similar presentations


Ads by Google