Technology of Semantic Structuring of the Digital Library Content I. Filozova JINR LIT, Dubna LIT JINR (DUBNA), JULY 18, 2012 V International Conference.

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

Open repositories: value added services The Socionet example Sergey Parinov, CEMI RAS and euroCRIS.
Mitsunori Ogihara Center for Computational Science
TU/e eindhoven university of technology PACIS'03 July Engineering Semantic Web Information Systems Richard Vdovjak Flavius Frasincar Geert-Jan Houben.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
METS: An Introduction Structuring Digital Content.
Project Proposal.
Service-based architecture for personalized and adaptive access to the knowledge in digital library Desislava Paneva Institute of Mathematics and Informatics.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
JINR DOCUMENT SERVER: Current Status and Future Plans I. Filozova 1, S. Kuniaev 2, G. Musulmanbekov 1, R. Semenov 1, G. Shestakova 1, P. Ustenko 2, T.Zaikina.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
FRAD: Functional Requirements for Authority Data.
A CRIS driven by research community: benefits and perspectives Sergey Parinov, CEMI RAS, Moscow, Russia euroCRIS DRIS-BP Task Group Leader.
Easy-to-Understand Tables RIT Standards Key Ideas and Details #1 KindergartenGrade 1Grade 2 With prompting and support, ask and answer questions about.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Configuration Management (CM)
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
JINR DOCUMENT SERVER: Current Status and Future Plans (From Open Access Repositories to Digital Libraries and to the Knowledge Infrastructure) I.Filozova.
Funded by the Library of Congress.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
CORPORUM-OntoExtract Ontology Extraction Tool Author: Robert Engels Company: CognIT a.s.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
Aquenergy Portal Elisabetta Zuanelli, University of Rome “Tor Vergata”, Italy E-Age 2014 Muscat december.
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Grade 8 – Writing Standards Text Types and Purposes (1b) Write arguments to support claims with clear reasons and relevant evidence. Support claim(s) with.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
Technology of Semantic Structuring of the Digital Library Content I.Filozova JINR, Dubna JINR (DUBNA), MAY 18, 2012 III JINR/CERN School of Information.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Sergey Gromov Yulia Krasilnikova Vladimir Polyakov (NRTU MISIS, Moscow) KNOWLEDGE BASE CREATION FOR NATIONAL NANOTECHNOLOGY NETWORKS «CONSTRUCTIONAL NANOMATERIALS»
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
OWL Representing Information Using the Web Ontology Language.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
XXIII International Symposium on Nuclear Electronics & Computing NEC’11 JINR DOCUMENT SERVER: Current Status and Future Plans I.Filozova, S.Kuniaev, G.Musulmanbekov,
WEB PAGE CONTENTS VERIFICATION AGAINST TAGS USING DATA MINING TOOL IKNOW VІI scientific and practical seminar with international participation "Economic.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Formal Specification: a Roadmap Axel van Lamsweerde published on ICSE (International Conference on Software Engineering) Jing Ai 10/28/2003.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
RDA: history and background Ann Huthwaite Library Resource Services Manager, QUT ACOC Seminar, Sydney, 24 October 2008.
Archives, Libraries, Museums: Possibilities of Co-operation within the Enwirinment of the Global Information Infrastructure - Croatian experience Vlatka.
Systems Analysis and Design in a Changing World, Fifth Edition
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
Chapter 3: Curriculum © VAN SCHAIK PUBLISHERS Chapter 3: Curriculum.
International Research and Development Institute Uyo
Exploring Scholarly Data with Rexplore
Independent work of students
AMGA Web Interface Vincenzo Milazzo
Introduction of KNS55 Platform
Objectives, activities, and results of the database Lituanistika
Presentation transcript:

Technology of Semantic Structuring of the Digital Library Content I. Filozova JINR LIT, Dubna LIT JINR (DUBNA), JULY 18, 2012 V International Conference Distributed Computing and Grid-technologies in Science and Education

Contents Current Trends Problematic Situation Research Lines Realization Ideas QA-System on the Logic-Semantic Network Basis Summary

CURRENT TRENDS Traditional Publishing  Digital Archive-based approach; Accumulation by the scientific community the expansive digital information arrays → content integration on the metadata level → common Data and Information Spaces; The growth number of institutional repositories in the open access form. Repositories Number — Records Number ~ 40,000,000 according to ROAR statistics (ROAR -

HOW TO FIND

PROBLEMATIC SITUATION CREATION OF the EFFECTIVE MECHANISMS FOR the ANSWERS SEARCH TO QUESTIONS IN the DIGITAL INFORMATION FUNDS CREATION OF the EFFECTIVE MECHANISMS FOR the ANSWERS SEARCH TO QUESTIONS IN the DIGITAL INFORMATION FUNDS – ACTUAL PROBLEM FIND the INFORMATION ( INFORMATION SOURCE AND/OR INFORMATION ITSELF) QUESTION (V) ANSWERS SET (Q V ) MECHANISMS METHODS AND MECHANISMS FOR EFFECTIVE SEARCH (SEACRH TECHNOLOGY) DIGITAL INFORMATION FUND (INFORMATION SOURCERS) INFORMATION LAWS ? PERTINENCE (P) Q V = Q V R U Q V N P =

Cognitive Function of the Question Question  a thought query as the interrogative sentence. Answer  a realization of the cognitive function of the question as a new obtained judgment. Question TO DEVELOP THE KNOWLEDGE (TO EXTEND, TO PRODUCE A NEW) TO REFINE THE KNOWLEDGE TO SUPPLEMENT THE KNOWLEDGE Cognitive Indeterminacy UNKNOWN KNOWN

Process of Asking Questions and Search Answers Ask Question Find Answer Set Adequacy Question - Answer Search Scope Conformity Rules Search Technology Answer Technology of Conformity Setting Technology of Question Asking The Object and Subject of Research Question Answer Datum Question

RESEARCH LINES (1) Development of the method and mechanism for effective search of the set of the relevant answers to the questions. (2) Technology development for the creation and support of the catalog service of the information fund for providing an efficient search of the answers to the questions. (3) Software development  cataloguer workstation for the structuring of the information fund.

REALIZATION IDEAS OF RESEARCH LINES

The method basis is a way to describe the scientific and technical information by set of logic-semantic networks Question-Answer-Reaction (LSN QAR). The basis for the search engine are: motion way along LSN, controlled by the user; choice of LSN nodes (questions or answers) based on an ontological model of user question. The basis of the technology is a way of the description of the subject domain by LSN QAR set. Mechanism of technology is a workstation of the cataloguer (LSN QAR developer)

Formal Structure of Question, Answer, Reaction The logical structure of the question (Q): QUESTION = {QUESTION THEME (QT), QUESTION CONTENT (QC), QUESTION VOLUME (QV)} The logical structure of the answer (A): ANSWER = {ANSWER THEME (AT), ANSWER CONTENT (AC), ANSWER VOLUME (AV)} The logical structure of the reaction (R): REACTION= {REACTION THEME (RT), REACTION CONTENT (RC), REACTION VOLUME (RV)}

Logic-Semantic Network Question-Answer-Reaction Logic-semantic network  a set of the questions, answers and relationships between them forming an uniform system. Question  query expressed in the interrogative sentence aimed at the development, refinement or supplement of the knowledge. Answer  a realization the cognitive function of the question in the form of the new obtained judgment. Answer must be built in accordance with the content and structure of the asked question. Only in this case, the answer is regarded as relevant. Reaction  a semantic description of the question and answer. Types of reactions: 1. Question Reaction  a description of the datum question (to understand the enviroment and causes of the question and to establish the semantic adequacy with the answer scope). 2. Answer Reaction  a description of the answer scope (to understand the question semantics and relationship with answer).

Reaction Example (1) Logical unit Question-Answer-Reaction: Question 1 (Q1). What is a JAVA? Question 1 Reaction 1 (QR11). With respect pronunciation formed two different standards - borrowed from the English / d ʒɑ :və / and traditional «Ява» (on russian), corresponding to the traditional pronunciation of the Java name island. Question 1 Reaction 2 (QR12). Java (Indonesian: Jawa) is an island of Indonesia with a population of 135 million. Square  k 2 … Question 1 Reaction 3 (QR13). S lide show, photo-collage with the views of Java island.

Reaction Example (2) Answer 1 to Question 1 (A11). Java – an object-oriented programming language developed by Sun Microsystems. Reaction 1 of the Answer 1 to the Question 1 (RA11). Why is the language called JAVA? There is a version that language got its name from coffee grown on the same island. As you know, this drink is hot like some programmers. Therefore, a cup of steaming coffee is displayed on logo.

Reaction Example (3) Reaction 2 of the Answer 1 to the Question 1 (R2A11). Sun Microsystems, Inc (now part of Oracle Corporation) — U.S. company that produces software and hardware… Answer 2 to Question 1. Java — not only the language itself, but also a platform for development and execution of the applications based on this language.

Graph LSN QAR A 22 A 23 Q 31 Q 32 Q 33 Q 34 A 41 A 42 A R 10 R 21 A 21 Q 10 R 23

Analysis Method of Scientific Texts The document is studied by the expert in terms of: 1. Semantic matching title and content; 2. Set of filters: Filter 1 (F1) - General Part. F1 includes an analysis of the problem, its history, overview, topicality. Filter 2 (F2) - Author concept. F2 includes new terms introduced by the authors, traditional terms with the author's interpretation, the narrowing semantics. Filter 3 (F3) - Examples and illustrations. To clarify difficult places in the text, reduce the text size under stringent restrictions on the volume. Filter 4 (F4) - The idea of the author. Describes and explains the author's main idea. 3. Markup text (formulation of the basic questions, answers and reactions).

Navigation on LSN № edge Way 11,4,11 21,4,12 31,5,13 51,5,14 61,6,15 71,6,16 82,7,13 92,7,14 102,8,17 113,9,15 123,9,16 133,10,17 A 22 A 23 Q 31 Q 32 Q 33 Q 34 A 41 A 42 A R 10 R 21 A 21 Q 10 R 23

Multilayer Related Set of Graphs

List of Available Questions and Card of Selected Question ( fragment ) Answer Reaction 1 Question Reaction Next Level Questions Question Answer 1 Answer 2 (Это интересно …) Answer Reaction 2

Card of Question Reaction

LSN + Visualization Questions Answers

Summary It’s proposed:  "Catalog Service" creation and support for the funds-corpuses,  Question-Answer Navigator creation that provides such features: - the ability of the refinement and deepening of the understanding the question meaning; - the ability of the refining, deepening, expansion of the knowledge or the obtaining a new knowledge during the answer to question search process. Realization of such "Catalog Service" and Navigator allows to study the DL content by the natural mode for the human: refinement, generalization and obtaining a new knowledge ̶ question-answer mode. The main problem of the proposed question-answer system is a maximal automation of the process of the creation and support of the fund service catalog.

Even the most foolish idea can be implemented masterfully. Leszek Kumor Even the most foolish idea can be implemented masterfully. Leszek Kumor