Presentation by Yuri de Lugt. Presentation structure Definitions of knowledge management Forms of knowledge Knowledge infrastructure Collexis background.

Slides:



Advertisements
Similar presentations
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Advertisements

1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Metadata Strategies Alternatives for creating value from metadata Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
Knowledge Portal: An Innovative Approach to Libraries Presented at NACLIN New Delhi By Sharad Kumar Sonker Department of Lib. & Info. Sci. Babasaheb Bhimrao.
Semantic Content Infrastructure for Knowledge Applications Tools of Change 2011 Thane Kerner, CEO Silverchair.
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
UpdataCapital The Context The Application Latest technical developments.
1 Transportation Librarians Roundtable Transportation Research Thesaurus: WSDOT Use Cases February 14, 2008 Andy Everett Metadata Repository Administrator.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
University Library Internet searching: getting the best from Outline The Web – the good, the bad and the ugly Search engines and Google Getting the best.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Libraries and Institutional Content Management Systems
What are search engines? Tools used for locating web pages Automated software programs known as spiders or bots to survey the Web and build their databases.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Yuri de Lugt Collexis Karin Clavel TU Delft Library.
International Atomic Energy Agency INIS Training Seminar Principles of Information Retrieval and Query Formulation 07 – 11 October 2013 Vienna, Austria.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Web 2.0 for Government Knowledge Management Everyone benefits by sharing knowledge March 24, 2010 Emerging Technologies Work Group Rich Zaziski, CEO FYI.
SharePoint Users Group Content Classification Step by Step SharePoint 2007 and 2010.
Query Expansion.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
H. Lundbeck A/S3-Oct-151 Assessing the effectiveness of your current search and retrieval function Anna G. Eslau, Information Specialist, H. Lundbeck A/S.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Chapter Chapter 3 Internet Agents. Chapter Contents Background Web Search Agents Information Filtering Agents Notification Agents Other Service.
Flexible Text Mining using Interactive Information Extraction David Milward
Themes Architecture Content Metadata Interoperability Standards Knowledge Organisation Systems Use and Users Legal and Economic Issues The Future.
НИУ ВШЭ – НИЖНИЙ НОВГОРОД EDUARD BABKIN NIKOLAY KARPOV TATIANA BABKINA NATIONAL RESEARCH UNIVERSITY HIGHER SCHOOL OF ECONOMICS A method of ontology-aided.
Knowledge Management and the Information Specialist: dancing in step or missing a beat? Hettie Groenewald Erica van der Westhuizen Johann van Wyk Academic.
Planning a search strategy.  A search strategy may be broadly defined as a conscious approach to decision making to solve a problem or achieve an objective.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
WEB PAGE CONTENTS VERIFICATION AGAINST TAGS USING DATA MINING TOOL IKNOW VІI scientific and practical seminar with international participation "Economic.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Developing GRID Applications GRACE Project
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
GUIDE. P UB M ED
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Experience Management
Thanks to Bill Arms, Marti Hearst
KNOWLEDGE MANAGEMENT (KM) Session # 34
Introduction to Information Retrieval
KNOWLEDGE MANAGEMENT (KM) Session # 37
AI Discovery Template IBM Cloud Architecture Center
Presentation transcript:

Presentation by Yuri de Lugt

Presentation structure Definitions of knowledge management Forms of knowledge Knowledge infrastructure Collexis background Collexis application examples Demonstrations (optional)

Definitions Knowledge Management

Collexis definitions on KM Data “Structured data” (data, stored set of a meaningful combination of characters and symbols) Information “Data with ADDED value for the receiver” Knowledge “Combined information and experience in the minds of people” “Knowing how to act to retrieve optimal added value” Competence Combination of Knowledge, skills and behavior that leads to an essential contribution to achieving the goals of the business

Collexis definitions on KM Knowledge management: “Creating an enviornment in wich knowledge will retrieve maximum added value” “Actions and Rules based on a consistent strategy witch enables an organisation and her employees to use the available knowledge as a strategic production factor for optimal performance.”

Forms of Knowledge To perform ‘Knowledge Management’ one needs to know what forms of Knowledge are there.

‘Forms’ of Knowledge Explicit Knowledge –Stored, Transferable Knowledge. –Handbooks, Information Systems, Procedures, etc. –Information Tacit (implicit) Knowledge –In peoples minds –Improved by experience –Not stored, hard to store

Knowledge transformation Tacit Explicit From To Social Internal External Combine Copy by seeing Imitate Master-student Report Visualize Modelling Merge Recombine & systemize Learn by doing

Collexis definitions on KM Thus: Knowledge is not manageable Only the circumstances in which Knowledge can be explored best are manageable Only explicit knowledge can be stored Implicit knowledge can be made accessible ► Collexis facilitates the optimum circumstances to explore and develop knowledge.

The Knowledge infrastructure To find out where Collexis can be of use, we must explore the Knowledge infrastructure

Knowledge infrastructure Structure & processes IT ManagementCulture Knowledge flow Develop Share Use Evaluate

3 Instrument Groups © Human Connection 2000

Collexis & Knowledge management Collexis facilitates: Retrieving information (portals, search and retrieval) Merging information to support Knowledge processes (dynamic categorization, heat maps, content graphs, etc.) Access knowledge by identifying experts (expert finder) Analyzing information to retrieve knowledge (semantic web) Information mining (semantic web, heat maps, content graphs) ►How ?

Collexis Functions Content Resources Business Cases Tools Collexis Approach

Collexis Functions Business Cases Tools Collexis Approach Documents WebXML Databases content

Business Cases Tools Collexis Approach Documents WebXML Databases content Fingerprinting Aggregation Associative Concept Graphs Homonym detection Concept Maps Structured data

Collexis Approach Fingerprinting Aggregation Associative Concept Graphs Homonym detection Concept Maps Structured data Vocabulary search, textual search Who is Who Semantic Networks Specialized Search Dynamic Portals Gap analysis Competitor Analysis applications information mining information matching / searching Documents WebXML Databases content

Collexis Approach Fingerprinting Aggregation Associative Concept Graphs Homonym detection Concept Maps Structured data Vocabulary search, textual search Who is Who Semantic Networks Specialized Search Dynamic Portals Gap analysis Competitor Analysis applications information mining information matching / searching level 1: level 2: level 3: levels of information management Documents WebXML Databases content

Collexis Approach Fingerprinting Aggregation Associative Concept Graphs Homonym detection Concept Maps Structured data Vocabulary search, textual search Who is Who Semantic Networks Specialized Search Dynamic Portals Gap analysis Competitor Analysis applications information mining information matching / searching level 1: use of normalization, language independency, synonyms, matching, thesaurus / vocabulary level 2: use of explicit links, thesaurus hierarchy, contextual information level 3: use of cooccurrence, concept clustering, ontologies levels of information management Documents WebXML Databases content

What are we going to show? Documents WebXML Databases content Fingerprinting Aggregation Associative Concept Graphs Homonym detection Concept Maps Structured data Vocabulary search, textual search Who is Who Semantic Networks Specialized Search Dynamic Portals Gap analysis Competitor Analysis applications information mining information matching / searching level 1: use of normalization, language independency, synonyms, matching, thesaurus / vocabulary level 2: use of explicit links, thesaurus hierarchy, contextual information level 3: use of cooccurrence, concept clustering, ontologies levels of information management

Collexis Why it was created and how it works

One Million Hit Syndrome Billions of gigabytes of information is available Is this bad news or good news? How to manage? –Human indexing is too expensive –Automatic indexing is usually not advanced enough –Keyword search provides too little or too much

Information Abundance Increasing amount of digital information in organizations Stored in different forms and different formats Spread over a variety of databases and archives Internet adds staggering volumes of information

The Human Touch Store Explicit Knowledge in (relational) thesauri Use free texts and content relations Explore and use linguistic techniques Searching for documents or knowledge? –Knowledge is embedded in people –Collexis behaves like a (human) expert –Collexis finds information, experts and organizations –Collexis supports knowledge exploration

The power of Fingerprints Collexis is based on the principle of Fingerprinting Fingerprint: a profile of a piece of information A Fingerprint contains a list of weighted concepts Concepts are derived from a Thesaurus Fingerprint characteristics: unique and small 100% Malaria 35% Agencies 30% Enthusiastic 28% Collaboration 27% Funding 27% Africa 25% Science 15% Dedications 15% Applaud 15% agenda 14% Inaccurate 14% advocacy 13% hope 13% research funding 13% Fund Raising

What is a thesaurus? A thesaurus is a specialized vocabulary (“repository of knowledge”) of a particular domain, such as medicine, energy or ICT. It contains selected words, terms and concepts with their semantic relations in a hierarchical structure and can also contain synonyms What is a thesaurus? Aircraft Airplane Means of transport Means of transport Train Automobile Car Truck Lorry Motor Vehicle Plane Simplified Thesaurus example

Collexis ®, the concept Word-based Searching What? Why? How? Who? indexing  Concept matching 

The magic of Fingerprinting content fingerprints Jobs CV’s, Skills Articles books s Word RFP’s people fingerprints average organization fingerprints average

Concept Fingerprints Free text Fingerprints Query text or document Multi concept & free text + Σ MATCH Result list

Collexis  characteristics Accurate and sensitive Collexis Fingerprints are highly sensitive and accurate, and can be manipulated to optimize search results –Precision: only relevant documents shown –Recall: all relevant documents shown, even when narrowing the search Performance even in millions of documents, search results should be provided instantly Human approach not only documents, but also experts and organizations can be the result of a search

Collexis  characteristics Open architecture Easy integration by the use of an API Omnivore Collexis processes structured and unstructured information; this is possible in one action Adaptable Collexis respects existing databases and does not require large hardware investments Fast in any language & language independent Collexis works across languages.

Some Collexis  markets and applications Publishing (portals) Scientific organizations (Referee finder) Biotechnology (Genes and protein identification) Pharmaceutical & Chemical (Research and development) Library (Research) Healthcare (Intranet) Legislation & Jurisdiction (Legal Intelligence) Universities (Research and Portals) International Authorities (Information Mining)

Collexis Application Examples

Search and Retrieval

Indexing

Search results

Experts retrieved

Refine a search

Use the thesaurus

Information sharing Add2Collexis

Find information and experts; Search assistance through so called “proposed concepts” Available for domains Life Sciences, ICT, food and agriculture (other knowledge domains on request); Supported formats: MS-Word, RTF,.txt, HTML and.pdf; Fully web enabled; Application based on Microsoft®.Net technology.

Add2Collexis

Search using thesaurus intelligence

e-Vamp Automatically enriches existing web pages with hyperlinks; Use thesaurus for concept recognition ; Allows the user to apply a Collexis match for related documents, experts or external database information; Outlinking to other search engines; Configured for different domains

Original document

e-Vamped document

Search using e-Vamp

ClipFinder small client application easy to use complete texts as query input makes use of clipboard fast

Meta-Analysis

Knowledge maps

Networks for Meta-Analysis

Demonstrations (optional)

Thank you!

Choice of word and measured phrase above the reach of ordinary men. William Wordsworth Choice of word and measured phrase above the reach of ordinary men. (William Wordsworth)