Sunday May 4 – 5 PM Bradford, Hlava, McNaughton

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Taxonomy as Content Outline, Site Map and Search Aid SLA NWR Vancouver October 6, 2006 Marjorie M.K. Hlava President
Jone Garmendia, Head of Cataloguing 25 November 2011 The National Archives Taxonomy.
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
Knowledge is Empowerment Guide no. 5 Searching MEDLINE Full Text: by Subject, & by Publications. Register in My Ebsco Host & Create Alerts.
Controlling values The equivalence relationship. The vocabulary problem What is this?
Access Innovations Presents: Data Harmony version 3.8 New Features.
PubMed and its search options Jan Emmerich, Sonja Jacobi, Kerstin Müller (5th Semester Library Management)
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group.
Access Innovations, Inc. Marjorie M.K. Hlava Jay Ven Eman.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Implementing a Taxonomy in a Content Management Portal Content Week 2005 Miami, Florida Monday, January 31, 2005 Workshop H 2:45pm – 4:45 pm Marjorie M.K.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Standards for networked knowledge organisation systems Ron Davies European Library Automation Group Bucharest, April 2006.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
Reengineering AGROVOC to Ontologies Step towards better semantic structure NKOS Workshop 31 May 2003 Rice University Houston, Texas, USA Frehiwot Fisseha.
Knowledge organisation and information architecture, Nils Pharo Knowledge organisation and the Web Nils Pharo, 6th November 2002.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Libraries and Institutional Content Management Systems
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
Long-Term Ecological Research working_groups/controlled_vocabulary Working Group: “Synthesis through data.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
ROI & Impact: Quantitative & Qualitative Measures for Taxonomies Wednesday, 11 February :00 – 12:30 PM MST Presented by Jay Ven Eman, Ph.D., CEO.
Taxonomies: Hidden but Critical Tools Marjorie M.K. Hlava President Access Innovations, Inc.
Copyright © 2006 Access Innovations, Inc. 1 Building Taxonomies Part 3 Alice Redmond-Neal Access Innovations, Inc. Enterprise Search Summit New York City,
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Indexing Knowledge Daniel Vasicek 2014 March 27 Introduction Basic topic is : All Human Knowledge Who Cares? Simple Examples.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Copyright C.M. Mitchell Consulting 2005 Taxonomy 101 – Why is it so Important? Presented by: Carol Mitchell.
Copyright © 2006 Access Innovations, Inc. 1 Building Taxonomies Part 5 Alice Redmond-Neal Access Innovations, Inc. Enterprise Search Summit New York City,
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
Controlled Vocabulary Working Group Virtual Water Cooler Session April 6-7, 2009 Moderator: John Porter rm.action?confKey=jhp7e.
Controlled Vocabulary & Thesaurus Design Hierarchies & Taxonomies.
The UNESCO Thesaurus Meeting for Managers of UNESCO Documentation Networks Meron Ewketu UNESCO Library June
TOPIC: Transportation Research Thesaurus: Taxonomy Development and Use Cases 14 February :00 PM EST Presented by Jay Ven Eman, Ph.D., CEO Access.
Copyright © 2006 Access Innovations, Inc. 1 Building Taxonomies Part 2 Alice Redmond-Neal Access Innovations, Inc. Enterprise Search Summit New York City,
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Evolution of a production pipeline Marjorie M.K. Hlava President Access Innovations.
INFO Week 8 Subject Indexing & Knowledge Representation Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Thesauri usage in information retrieval systems: example of LISTA and ERIC database thesaurus Kristina Feldvari Departmant of Information Sciences, Faculty.
Controlled Vocabulary & Thesaurus Design Hierarchies.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Text Analytics in Action: Using Text Analytics as a Toolset TBC 4:15 p.m. - 5:00 p.m. Marjorie Hlava Semantic enrichment / Semantic Fingerprinting.
APS Taxonomy Project Arthur Smith, American Physical Society April 2014.
Implementing Linked Open Data in a Controlled Vocabulary Marjorie M.K. Hlava President Access Innovations Inc
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
Oxlip+. What is Oxlip+? A tool for finding & linking to databases – Online collections of (scholarly) materials – Includes full text / indexes / range.
Charlyn P. Salcedo Instructor Types of Indexing Languages.
Controlling values for information organization 384C – Organizing Information Spring 2016 Karen Wickett School of Information University of Texas at Austin.
1 How do we describe something? n What something is about? –What the content of an object is “about”? n Different methods (Wilson, 1968) –counting terms.
Implementing Taxonomy Taxonomy Talk from the Publishing World Special Libraries Association Philadelphia, Pennsylvania 14 June 2016.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Information Organization
Information Retrieval
Over 1,000 books, journals, videos and reference material
Taxonomies, Lexicons and Organizing Knowledge
Information Retrieval
Database Design Hacettepe University
Chapter 31: Information Retrieval
Chapter 19: Information Retrieval
Presentation transcript:

Sunday May 4 – 5 PM Bradford, Hlava, McNaughton Taxonomies CSE Sunday May 4 – 5 PM Bradford, Hlava, McNaughton

Presenters - Taxonomies Marjorie Hlava Access Innovations Monica Bradford American Association for the Advancement of Science AAAS Charlotte McNaughton ASCE

What is a Taxonomy? ANSI/ NISO Z39.19-2010 controlled “A collection of controlled vocabulary terms organized into a hierarchical structure.” Yes! Missing: equivalence, homographic, and associative relationships and notes

Structure Of Controlled Vocabularies Lists Synonyms Taxonomy Thesaurus Ontology INCREASING COMPLEXITY and CONTROL Ambiguity Ambiguity Ambiguity Specifies a KOS Synonym Synonym Directionality in Hierarchy Hierarchy Relationships relationships Copyright © 2013 Access Innovations, Inc.

Taxonomy? Thesaurus? TAXONOMY OWL can specify THESAURUS Main Term (MT) = subject term, heading, node, category, descriptor, class Main Term (MT) Top Term (TT) Broader Terms (BT) Narrower Terms (NT) Related Terms (RT) See also (SA) Non-Preferred Term (NP) Used for (UF), See (S) Scope Note (SN) History (H) TAXONOMY OWL can specify THESAURUS Copyright © 2013 Access Innovations, Inc.

How Do Terms Relate? Hierarchical relationships -- Parents and their children Equivalence relationships -- Aliases, synonyms Associative relationships -- Cousins TAXONOMY THESAURUS

Disambiguation Bridge Structure Bridge Dentistry Bridge Game Bridge Concept

Achieving Synonymy Find like concepts Merge the terms Choose a preferred form Build term record Hierarchy Equivalence Associative

Taxonomy

Linked Data Assert that the AIP Thesaurus term “Nonlinear optics” refers to the same concept as the dbpedia page “Nonlinear optics” by putting links in both places.

Content Recommender Thesaurus terms Similar content The more terms in common, the higher the recommendation of content as similar.

7. Content Recommender Grants available Selected Article Search “thin film sputtering” Grants available Upcoming conferences on this topic More Articles on the same topic Authors working in this space

Journal Profile Pages Each journal can be characterized by the most frequently used indexing terms. 2014 JLAPEN (Journal of Laser Applications) 13 most frequently used indexing terms:

Index the image by analyzing the text associated with the image: Image Indexing Index the image by analyzing the text associated with the image: caption

Author Submission/Reviewer Tools Add a box: “Suggest New terms” Image: Courtesy AACR and EJPress

Reports and Research Tools for Internal Use

Reports and Research Tools for Internal Use

The Workflow Build Search inverted index Create user interface Gather source data Tag and Create metadata Put in data base with tags Client Data Full Text HTML, PDF, Data Feeds, etc. Automatic Summarization Search Presentation Layer Increases accuracy Browse by Subject Auto-completion Broader Terms Narrower Terms Related Terms Machine Aided Indexer (M.A.I.™) Database Repository Search Software Inline Tagging Client taxonomy Client Taxonomy Metadata and Entity Extractor Thesaurus Master

Taxonomy Driven Search Presentation Auto-completion using the taxonomy NICEM, the National Information Center for Educational Media, is a database of over 640,000 audio and visual items in all subject areas that apply to learning, from preschool through professional. Access Innovations applies subject terms from the NICEM taxonomy to each bibliographic record. In addition to that backend data work, we also created a search and presentation layer for the website, which we call Search Harmony. Here are some of the user-friendly features that are included in Search Harmony: [Click] Users can navigate the site by browsing the full taxonomy, and see the number of records tagged with each subject [CLICK] Auto-completion of search terms, which is a common feature of many search engines, but in this case the user is assisted in formulating a search by seeing a pick list of terms from the taxonomy, including all synonyms – even if they appear in the middle of a phrase. [CLICK] The user is also guided to expand the topic with broader terms or related terms from the taxonomy, or narrow the search to find more precise information. [CLICK] The resulting site has been recognized as an indispensable resource for educators. Guide the user Navigate the full taxonomy “tree” BROWSE

Taxonomy Thesaurus view Term Record view Special attention to Non-Preferred Term -- goldmine Copyright © 2005 - Access Innovations, Inc.

Knowledge Organization Systems Linked Entities Contextual Specificity Complex High value Semantic network Ontology Thesaurus Taxonomy Controlled vocabulary Synonym set/ring Name authority file Uncontrolled list Uncontrolled list has the Highest Cost over Time! Simple Low Value Unrelated Entities Ambiguity 22

Thanks! Questions after all three speakers Marjorie M K Hlava, President Access Innovations, Inc. 4725 Indian School Rd, Ste. 100 Albuquerque, NM 87110 +1-505-998-0800 www.accessinn.com www.dataharmony.com Email: info@accessinn.com

About Access Innovations Access Innovations are experts in content creation, enrichment, and conversion services. We provide services to semantically enrich and tag raw text into highly structured data. We deliver clean, well-formed, metadata-enriched content so our clients can reuse, repurpose, store, and find their knowledge assets. We go beyond the standards to build taxonomies and other data control structures as a solid foundation for your information. Our services and software allow organizations to use and present their information to both internal and external constituents by leveraging search, presentation, and e-commerce. We change search to found! Quick Facts Founded in 1978 Headquartered in Albuquerque, NM Privately held Delivered more than 2000 engagements

Suggested taxonomy descriptors

Normal text extraction

Near conceptual synonyms

Nonsensical suggestions

Small Taxonomy Near synonym, conceptual duplicate

Refined presentation

Dependent concepts

Ontological dependencies