HIVE as a Machine-aided Indexing Tool Personal Keyword use without vocabulary control Machine-aided indexing term extraction Participant relevant and not.

Slides:



Advertisements
Similar presentations
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Advertisements

Subject Analysis: An Introduction Based on BASIC SUBJECT CATALOGING USING LCSH edited by Lori Robare.
Library website Online (Database subscription) Read/ evaluate Use HOW TO FIND ONLINE JOURNALS/ARTICLES ACCESS WITHIN CAMPUS Online databases are accessible.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Helping Helping Interdisciplinary Vocabulary Engineering Ryan Scherle – National Evolutionary Synthesis Center Jose Aguera – University of North Carolina.
PubMed and its search options Jan Emmerich, Sonja Jacobi, Kerstin Müller (5th Semester Library Management)
PubMed: Outline Coverage MeSH, mapping and subheadings Simple search Limits Displaying and managing results MeSH database Single citation matcher.
Thesaurus-Based Index Term Extraction Olena Medelyan Digital Library Laboratory.
Library Class for TCM Medline & AMED. Medline MEDLINE® is the U.S. National Library of Medicine's® (NLM) premier bibliographic database that contains.
Helping Interdisciplinary Vocabulary Engineering (HIVE) OCTOBER 31, 2011 Joan Boone Nico Carver Jane Greenberg Lina Huang Robert Losee Mady Madhura José.
Page 1 June 2, 2015 Optimizing for Search Making it easier for users to find your content.
Medical Knowledge Watch at the Belgium Poison Centre Christophe Dupriez 26 June 2007.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
Search Strategies Online Search Techniques. Universal Search Techniques Precision- getting results that are relevant, “on topic.” Recall- getting all.
Thesaurus Design and Development
Approaches to automatic summarization Lecture 5. Types of summaries Extracts – Sentences from the original document are displayed together to form a summary.
Multi-Concept Alignment and Evaluation Shenghui Wang, Antoine Isaac, Lourens van der Meij, Stefan Schlobach Ontology Matching Workshop Oct. 11 th, 2007.
Indexing 1/2 BDK12-3 Information Retrieval William Hersh, MD Department of Medical Informatics & Clinical Epidemiology Oregon Health & Science University.
Psychology 214.3: Finding the Research For Your Research Proposal.
1 DATABASES By: Hanna Ben-Or Phone: October 2011.
BME1450: Biomaterials and Biomedical Research Michelle Baratta Engineering & Computer Science Library Maria Buda Dentistry Library.
Databases Indexes & Abstracts. Indexes & Abstracts = Serials When most librarians think about science and technology they think about serials and the:
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Multilingual Information Exchange APAN, Bangkok 27 January 2005
Medline on OvidSP. Medline Facts Extensive MeSH thesaurus structure with many synonyms used in mapping and multidatabase searching with Embase Thesaurus.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
HIVE: Enabling Common Language and Interdisciplinarity EPA-NIEHS Advancing Environmental Health Data Sharing and Analysis: Finding a Common Language June.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Mark M Hall Information School / Computer Science Sheffield University Sheffield, UK EuropeanaTech 2011, Vienna, 4 th - 6 th October 2011 Aggregating Cultural.
Dryad Management Board Meeting Friday, May 22 1:30 p.m. Session 3: Software development timeline and priorities Slides pprepared by the Dryad development.
Psychology (02): Finding the Research For Your Literature Review & Research.
Caroline Williams, Executive Director of Intute Andy Priest, Intute Technical Co-ordinator
Shelly Warwick, MLS, Ph.D – Permission is granted to reproduce and edit this work for non-commercial educational use as long as attribution is provided.
1 CS 430: Information Discovery Lecture 25 Cluster Analysis 2 Thesaurus Construction.
Resources for Biological Research Catherine Dockerty and Sophie Wilcox February 2008.
INFO Week 8 Subject Indexing & Knowledge Representation Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
Translating Dialects in Search: Mapping between Specialized Languages of Discourse and Documentary Languages Vivien Petras UC Berkeley School of Information.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
1 Automatic indexing Salton: When the assignment of content identifiers is carried out with the aid of modern computing equipment the operation becomes.
WISER : The Ovid databases Ovid is the platform for searching many of the life science and medicine databases. Juliet Ralph, Radcliffe Science Library.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Reference Collections: Collection Characteristics.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
ALA Annual Meeting Claire Cocco Global Product Manager CONTENTdm Users Group June 30th, 2008.
Jane Greenberg & the Dryad Team The DRYAD Repository ~~~~~~ INLS 720 visit to NESCent November 17, 2008.
Sources of Clinical Effectiveness Information & Finding the Evidence Presenter Contact details.
Automatic vs manual indexing Focus on subject indexing Not a relevant question? –Wherever full text is available, automatic methods predominate Simple.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
HIVE-DRYAD Integration. For Curators Use HIVE to generate subject, taxon, and spatial terms suggestion. Curator’s needs: – Get terms suggestion from HIVE.
Exploiting information: getting the most from the OU Library March 2016.
Chelcie Rowell Jane Greenberg Metadata Research Center UNC-Chapel Hill CONTROLLED VOCABULARY STATUS & POTENTIAL IN DATA REPOSITORIES Authority Control.
Major Issues n Information is mostly online n Information is increasing available in full-text (full-content) n There is an explosion in the amount of.
Knowledge is Empowerment Tutorial Guide no. 28 EBSCO ACADEMIC SEARCH PREMIER AND USE OF SUBJECT TERMS.
BME1450: Biomaterials and Biomedical Research
System for Semi-automatic ontology construction
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Evgeniy, Tretyakov Alexey, Artamonov
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Writing a Research Abstract
EDS Discovery Health & EBSCO eBooks Workflow Optimization
Applying Key Phrase Extraction to aid Invalidity Search
Cataloging the Internet
DATABASES By: Hanna Ben-Or Phone:
Advanced search techniques in databases
Searching with context
LEADS-4-NDP: Fellowship
Presentation transcript:

HIVE as a Machine-aided Indexing Tool Personal Keyword use without vocabulary control Machine-aided indexing term extraction Participant relevant and not relevant judgments Inter-indexing consistency Rolling’s Measure Hooper’s Measure

Organizing Scientific Data Sets

HIVE/Dryad Evaluation Questions – Given Dryad article metadata (title, abstract, depositor-supplied keywords), what are the best approaches for term suggestion from selected controlled vocabularies (MeSH, ITIS, TGN)? – Can one approach be used for subject, taxonomic and geographic indexing? Method – Create “gold standard” of manually index records based on mapping of Dryad, MEDLINE and BIOSIS Previews to MeSH, TGN, ITIS – Evaluate state-of-the-art techniques for automatic subject, and taxonomic, and geographic indexing Preliminary results – For taxonomic name indexing, untrained KEA++ performs almost as well as state-of-the-art taxonomic name extraction (FindIt) – For geographic name indexing with TGN, simple graph-based ranking algorithm outperforms KEA++. Craig Willis, Hollie White, Lee Richardson, Casey Rawson Jane Greenberg, Bob Losee, Ryan Scherle, Todd Vision

Thesaurus Walking: Automatic Indexing with Controlled Vocabularies Questions – Starting from the location of terms in a document and moving to the indexer assigned controlled terms, how do indexers navigate in a thesaurus? – How can this knowledge be used to improve techniques for automatic indexing with controlled vocabularies? – How can this knowledge be used to improve thesauri? Methodology – Unsupervised, graph-based approach using random walks on thesauri Preliminary results – Indexer assigned controlled terms are identified at a rate much higher than random, but far from perfect. – Suggests that this method could best be used in combination with other dissimilar automatic indexing methods. Craig Willis, Bob Losee, Jane Greenberg