University of Florida CTSI: Consuming and disambiguating publications data from Microsoft Academic Search in VIVO


Nicholas Rejack 1, Erik Schmidt 1, Michael Conlon 1
1 Clinical and Translational Science Institute, University of Florida

Disambiguating publication authorships is a well-recognized problem in academic publishing. The task is further complicated by the welter of available sources of publications data. UF has entered publication data into VIVO both by hand and through automated ingests of Thomson Reuters data. Adding new data sources, such as Microsoft Academic Search, will enrich the publications data, with the end goal of creating complete, fully disambiguated publication records for each author.

Linking to a New Data Source

This project was intended as a proof of concept: demonstrate that we can create a programmatic link between VIVO and the Microsoft Academic Search API, then retrieve publications data about University of Florida investigators. Project work was performed on a small subset of investigators based in our CTSI. Additional efforts have centered on providing a list of publications involving University of Florida authors back to the Microsoft Academic Search team. Future work is expected to include correcting or merging the details attached to publications that are present in both systems.

What is Academic Search?

Microsoft Academic Search is a free service developed by Microsoft Research to help scholars, scientists, students, and practitioners quickly and easily find academic content, researchers, institutions, and activities. Microsoft Academic Search takes full advantage of results from the Bing search engine, indexing thousands more publications than can be found at any other single source (almost 39 million publications for 20 million authors at this time).

Using the Academic Search Data

The Microsoft Academic Search API is accessed with an API key, which can be requested from the Academic Search web site. On the Academic Search side, we use Python to retrieve JSON objects from the RESTful interface. On the VIVO side, we retrieve JSON objects using SPARQL. We then construct a hybrid record consisting of data elements from both services. Future work will involve serializing the data back out to VIVO-compliant RDF/XML to enrich the VIVO publication record.

Microsoft Academic Search uses machine-learning algorithms to disambiguate authorships, which sometimes leads to papers being incorrectly attributed or grouped. Because UF's data is hand-curated and covers authors we have a direct interest in, future work should involve sorting out incorrectly attributed papers. We believe a hybrid approach (automation plus hand entry) is needed to cover all the cases.

Did it Work?

We consider the project a success: we have been able to retrieve data from Academic Search, compare it to existing VIVO data in order to match or otherwise disambiguate, then ingest any new data into VIVO. We are also able to produce a list of missing publications for the Academic Search team, and are working on a process to provide this data to them. We believe that evaluating publication details is now a matter of developing the proper code, likely in Python, since all connections are already in place and the required data is available.

Please contact any of the authors of this poster regarding this work. All authors can be found in UF VIVO.

Figures: Fetching data from two sources; Reading JSON objects
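
The workflow described above (JSON from Academic Search, JSON from a VIVO SPARQL query, then a hybrid record) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code. The Academic Search payload layout, its field names ("publications", "title", "year", "citations"), the example URIs, and the function names are all assumptions made for the example, since the poster does not show the real response format. The VIVO side follows the standard SPARQL JSON results layout. Matching on a normalized title alone is a simplification; real disambiguation would likely also compare authors, year, and identifiers.

```python
import json
import re

# Hypothetical Academic Search response; the real field names are not given in the poster.
ACADEMIC_SEARCH_JSON = """
{"publications": [
  {"title": "Semantic Data Integrity in VIVO", "year": 2012, "citations": 4},
  {"title": "A New Paper Not Yet in VIVO", "year": 2013, "citations": 0}
]}
"""

# SPARQL SELECT results in the standard JSON results format (head/results/bindings).
# The URIs are made-up examples.
VIVO_SPARQL_JSON = """
{"head": {"vars": ["pub", "title"]},
 "results": {"bindings": [
   {"pub": {"type": "uri", "value": "http://vivo.example.edu/individual/n1"},
    "title": {"type": "literal", "value": "Semantic data integrity in VIVO."}},
   {"pub": {"type": "uri", "value": "http://vivo.example.edu/individual/n2"},
    "title": {"type": "literal", "value": "A Paper Only VIVO Knows About"}}
 ]}}
"""


def normalize_title(title):
    """Lowercase and collapse punctuation and whitespace so near-identical titles compare equal."""
    return " ".join(re.sub(r"[^a-z0-9]+", " ", title.lower()).split())


def build_hybrid_records(academic_json, vivo_json):
    """Return (hybrid records, titles new to VIVO, titles missing from Academic Search)."""
    academic = {}
    for pub in json.loads(academic_json)["publications"]:
        academic[normalize_title(pub["title"])] = pub

    vivo = {}
    for row in json.loads(vivo_json)["results"]["bindings"]:
        title = row["title"]["value"]
        vivo[normalize_title(title)] = {"vivo_uri": row["pub"]["value"], "vivo_title": title}

    # Hybrid record: data elements from both services for publications found in both.
    hybrid = [{**vivo[key], **academic[key]} for key in academic if key in vivo]
    # Candidates for ingest into VIVO.
    new_for_vivo = [academic[key]["title"] for key in academic if key not in vivo]
    # Candidates to report back to the Academic Search team.
    missing_from_academic = [vivo[key]["vivo_title"] for key in vivo if key not in academic]
    return hybrid, new_for_vivo, missing_from_academic


if __name__ == "__main__":
    hybrid, new_for_vivo, missing = build_hybrid_records(ACADEMIC_SEARCH_JSON, VIVO_SPARQL_JSON)
    print("Hybrid records:", hybrid)
    print("New for VIVO:", new_for_vivo)
    print("Missing from Academic Search:", missing)
```

In this sketch the two output lists correspond to the two directions of the exchange described in the poster: new publications to ingest into VIVO, and publications to send back to the Academic Search team.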