Building a Top Down Ontology From the Bottom Up Step by Step Approach for Identifying & Constructing Dimensions of an Ontology draft (v0.8): DeniseBedford.

Slides:



Advertisements
Similar presentations
GEOSS StP Browse Scenario Doug Nebert 13Jun2011. Support rapid discovery of data in support of critical EO priorities The GEO Web Portal supports search.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Ontology Assessment – Proposed Framework and Methodology.
DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen
How to use the DET (Data Entry Tool) Core data Set H.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Foundational Objects. Areas of coverage Technical objects Foundational objects Lessons learned from review of Use Case content Simple Study Simple Questionnaire.
Content Categorization A Road Map Julia Marshall USAID (Bridgeborn Inc.)
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
DARE Domain Analysis and Reuse Environment סמינר: נושאים מתקדמים בהנדסת תכנה מרצה: ד"ר איריס ריינהרץ- ברגר סמסטר א', תשס"ז אהרוני ענת ברזני ערבה.
PIM Platform Free text search. When you type in the search field a suggestion tool helps you to find a concept from the ontology.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
Terminology and Controlled Vocabulary Efforts at the U.S. Environmental Protection Agency Richard Huffine Federal Manager, EPA National Library Network.
Oregon Spatial Data Library Partnership Metadata Training OU Knight Library Eugene, Oregon December 3, 2009 Kuuipo Walsh Institute for Natural Resources.
A Virtual Organisation for e-Learning Nicola Capuano, Pierre Carrolaggi, Jerome Combaz, Fabio Crestani, Matteo Gaeta, Erich Herber, Enver Sangineto, Krassen.
Taxonomy Boot Camp Panel Text Analytics Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Automating Keyphrase Extraction with Multi-Objective Genetic Algorithms (MOGA) Jia-Long Wu Alice M. Agogino Berkeley Expert System Laboratory U.C. Berkeley.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Neural Technology and Fuzzy Systems in Network Security Project Progress Group 2: Omar Ehtisham Anwar Aneela Laeeq
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
1 Archive-It Training University of Maryland July 12, 2007.
Enterprise Search/ Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics And Text Mining Best of Text and Data
1 Developing an Ontolog Ontology Denise A. D. Bedford April 13, 2006.
The Future of Metadata Denise Bedford World Bank Presentation to Fall Metadata Forum November 2, 2005 Department of Homeland Security.
Using Windows Firewall and Windows Defender
Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
12 July 2002Colloquim on Applications of Natural Langauge Corpora, Saarland University Domain-specific Web Corpora and their Applications Gregor Erbach.
What You Need before You Deploy Master Data Management Presented by Malcolm Chisholm Ph.D. Telephone – Fax
An Ontology-Based Approach for Sharing Digital Resources in Teacher Education 7 th International Workshop on Ontologies and Semantic Web for E-Learning.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials Arwen Hutt, University of Tennessee.
VIKEF – Take the VIKEF train towards smart services …
Black Box Software Testing Copyright © 2003 Cem Kaner & James Bach 1 Black Box Software Testing Spring 2005 PART 7 -- FUNCTION TESTING by Cem Kaner, J.D.,
Content analysis and CERN Roman Chyla. Artificial intelligence Natural language processing Web of data Content analysis.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
1 Everyday Requirements for an Open Ontology Repository Denise Bedford Ontolog Community Panel Presentation April 3, 2008.
Inventory Labels WcBc is configured for Multiple types of Labels and here are a few of them. WcBc is configured for Multiple types of Labels and here are.
Article by Dunja Mladenic, Marko Grobelnik, Blaz Fortuna, and Miha Grcar, Chapter 3 in Semantic Knowledge Management: Integrating Ontology Management,
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
May 2007 Registration Status Small Group Meeting 1: August 24, 2009.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Draft School Education Ontology (SEO) Fumiaki Toyoshima.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Taxonomy and Text Analytics Case Studies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston.
Semblog Project Personal Information Distribution with Social Network
Information Organization: Overview
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Metadata Editor Introduction
ece 627 intelligent web: ontology and beyond
Domain Modeling.
Information Organization: Overview
Marine Career Discovery
Extracting Information from Diverse and Noisy Scanned Document Images
Presentation transcript:

Building a Top Down Ontology From the Bottom Up Step by Step Approach for Identifying & Constructing Dimensions of an Ontology draft (v0.8): DeniseBedford /

Step by Step Approach Step 1: Identify the boundaries of the ontology –What will be ontologized (broad definition of content)? –Who will use the ontology? –How they will use the ontology? Following steps pertain to creating one dimension of an ontology to apply to content -- Step 2: Create a content inventory –Identify the sources of content –Use an inventory tool (COAST) to generate a full inventory –Working group weeds/selects from the inventory to create a core set of content to work with Step 3. Extract list of concepts from the content –Use the inventory to capture content items as a training set –Identify the types of concepts to be extracted – noun-phrase descriptors, entity identifiers, names, institutions, etc. –Configure and run the concept extraction

Step by Step Approach Step 4. Review the list of concepts –Quickly scan the concept list to determine concentrations of concepts –Check whether these concentrations make sense in terms of categories –If so, begin to build a categorization profile and organize the concepts within –Determine whats missing from the list of concepts (domain experts help us here..) –Determine what is in the list that is not pertinent to the topic (peripheral or out of bounds for the topic) – (domain experts here us here, too) –Prune the list of concepts – in some cases find new content and repeat the process Step 5x. Build the categorizer profile –Build a rule-based categorizer around the concept clusters (manual bunching at a very coarse level) –Or,…check clustering of concepts using a clustering engine (here you can feed the refined list of concepts back into a clustering engine and run them against the training set)

Questions for Domain Expert Review 1.If you were talking about ontology with an expert, are all of the concepts you would use included in the list? If not, what is missing? 1.Are there a few concepts missing, or is there a larger domain or knowledge area that is missing? 1.What is in the list that doesnt pertain to ontologies? 1.If you were looking for information about ontologies – from an expert point of view – would you use any of these concepts to search? Which ones are missing? What shouldnt be in the list? 1.If you were looking for information about ontologies from a novices point of view – what is missing from the list of concepts? What shouldnt be there?

Step by Step Approach Step 6. Test the Categorization Profile against the content –Define the xml output structure for the metadata –Run the profile against the content –Review the categorization results –Accept/refine the profile Other steps to creating the full ontology –Determining what kind of functionality you need to support use of the ontologized content –…search & discovery system –…browse of categories of content –…reporting –…recommender engines