Download presentation
Presentation is loading. Please wait.
Published byΔιοκλῆς Παπαδόπουλος Modified over 5 years ago
1
Dataspace: a new concept of data management
Li Yukun
2
Outline From database to dataspace PDS/PIM Related work
Challenge issues Our work on dataspace
3
Traditional RDBMS 先有格式, 支持有限的数据格式 关注数据的稳定性 需要的时候集中建成 面向应用主题
4
Query1:Please tell me all the information
in my dataspace about a conference Query2: please tell me the s and persons on a event 先有格式, 支持有限的数据格式 关注数据的稳定性 需要的时候集中建成 面向应用主题
5
Description of dataspace
淡化形式 开放的,支持多种不同的数据格式 强调数据的可关联性和可演化性 具有Pay-As-You-Go 特性 面向实体需要
6
From Database to Dataspace
The advantages of traditional model should be kept. New characters of data should be mapped. Focus
7
Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace
8
PDS and PIM
9
Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace
10
Related work on PIM/PDS
Memex——1945 (Vannevar Bush ) Lifestreams——1996 From Database to Dataspaces——2005 SIGIR PIM workshop 2005/2006 iDM——2006 (JensPeter Dittrich, Marcos Antonio Vaz Salles ) Indexing dataspace Resource space model
11
Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work on dataspace
12
Challenge issues on the topic
INPUT Profile OUTPUT
13
Challenge issues on the topic
Searching\Encountering\keeping\Extraction\ObjectIdentity\Evaluation INPUT Profile OUTPUT
14
Challenge issues on the topic
Model/Index /Store/Query/System INPUT OUTPUT
15
Challenge issues on the topic
INPUT OUTPUT Finding/Refining /Reminding/HCI/QL
16
Outline From database to dataspace PDS/PIM Related work on dataspace
Challenge issues Our work
17
Our work and proposal 1. Read related papers 2
Our work and proposal 1. Read related papers 2. Survey From Database to dataspace, from for enterprise to for people. (IDKE Report2006) PIM: 一个新的研究焦点(IDKE Report2006) 数据空间:一种新的数据管理技术,(计算机通讯, 07.8) 张相於毕业论文 3. Automatic content extraction from paper of PDF style. 4. Proposal for research of our group.
18
About the Proposal General Topic:
Related technology on content management Subtopic: Model of content management (classify\content-formalization\Query\importance\urgency) EMIEX: Object extraction based on content (Personal name\Location name\Event\Time\...). EMSN: Socal network construction and mining on log Intelligent reminding based on log (from to schedule) From to blog \ chatting\ phone-note log Demo development tasks: Read papers on content extraction\ personal recommendation \ user profile Read papers on management Prepare dataset (English \Chinese ) and classify Arithmetic and Policy
19
Motivation & Challenge
has become more popular and play an important role in work and daily life. We can get data for experiment. It has a more formal stytle. It’s characters is similar to Blog\BBS\Chating data. Challenge IR is a new area to us. Data collection is a hard process. A more detailed plan will be formed later
20
U H A N O Y K T
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.