Who’s who? Author identification in INSPIRE - Heath O’Connell, Fermilab. November 2012, AAHEP6.

Presentation transcript:

What’s the problem?
Author search is the most popular search
Names are not unique
– Denis Bernard (theory)
– Denis Bernard (BABAR)
– Bernard Denis (accelerators)
– David Nathan Brown and David Norvil Brown (both BABAR)
2,800+ authors on ATLAS
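To make the ambiguity concrete, here is a minimal sketch that reduces the names on this slide to a search key of surname plus first initial. The `search_key` function is a hypothetical illustration, not INSPIRE's actual name normalization.

```python
from collections import defaultdict


def search_key(full_name: str) -> str:
    """Hypothetical search key: lower-cased surname plus first initial."""
    parts = full_name.split()
    return f"{parts[-1].lower()}, {parts[0][0].lower()}."


# (name, context) pairs taken from the slide
people = [
    ("Denis Bernard", "theory"),
    ("Denis Bernard", "BABAR"),
    ("Bernard Denis", "accelerators"),
    ("David Nathan Brown", "BABAR"),
    ("David Norvil Brown", "BABAR"),
]

# Group people by key; any key with more than one entry is ambiguous.
by_key = defaultdict(list)
for name, context in people:
    by_key[search_key(name)].append((name, context))

collisions = {key: group for key, group in by_key.items() if len(group) > 1}

for key, group in collisions.items():
    print(key, "->", group)
```

Under this toy key, both Denis Bernards collide and both David Browns collide, so the name alone cannot tell the authors apart.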

How do we deal with this?
HEPNames database to collect information on scientists
– Establish identity of author as a person
– 99,000 records managed by 1 FTE
– 34,000 INSPIRE ID numbers assigned. Record checked for duplicates, etc.
Bib Author Identify (BAI): computer algorithm to identify author profile based on publication info such as affiliation and co-authors
– Establish BAI profile, may or may not correspond to a unique person

INSPIRE ID vs. ORCID ID
INSPIRE ID gives us immediate control
– New ATLAS member can be assigned an ID that day by us, do not have to wait for person
– HEPNames record curated for that person
IDs are all “one-to-one” and an association can be made at a later date (ask users?)
Mark Doyle:
– ORCID | INSPIRE
Start promoting ORCID with button to ORCID in our system

Adding authors and affiliations to HEP records
1-10 authors
– Add by hand using an auto-suggest script which guesses the affiliation based on older records.
More than 10 authors (typically experimental)
– Did they use an authors.xml file?
– Yes: extract authors and affiliations cleanly in a few seconds.
– No: use a script that extracts authors and affiliations from the TeX file and matches each author's ID number based on name and experiment, e.g. “d. denisov” + “FNAL-E-0823” = INSPIRE ID
Affiliations matched with INSTITUTIONS database
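The name-plus-experiment matching step can be sketched as a dictionary lookup. This is only an illustration under stated assumptions: the table `KNOWN_AUTHORS`, the helpers `normalize` and `match_id`, and the placeholder ID strings are all hypothetical and do not reflect INSPIRE's actual script or real INSPIRE IDs.

```python
from typing import Optional

# Hypothetical lookup table: (normalized name, experiment) -> author ID.
# The ID strings are placeholders, not real INSPIRE IDs, and every
# experiment code except FNAL-E-0823 (from the slide) is invented.
KNOWN_AUTHORS = {
    ("d. denisov", "FNAL-E-0823"): "PLACEHOLDER-ID-1",
    ("d. bernard", "HYPOTHETICAL-EXPT-A"): "PLACEHOLDER-ID-2",
}


def normalize(name: str) -> str:
    """Reduce 'Denisov, D.' or 'D. Denisov' to the form 'd. denisov'."""
    if "," in name:
        surname, given = [part.strip() for part in name.split(",", 1)]
    else:
        given, _, surname = name.strip().rpartition(" ")
    if not given:
        # Single-token name: nothing to abbreviate.
        return surname.lower()
    return f"{given[0].lower()}. {surname.lower()}"


def match_id(name: str, experiment: str) -> Optional[str]:
    """Return the ID for this name on this experiment, or None if unknown."""
    return KNOWN_AUTHORS.get((normalize(name), experiment))


print(match_id("D. Denisov", "FNAL-E-0823"))
print(match_id("Denisov, D.", "FNAL-E-0823"))
print(match_id("D. Denisov", "SOME-OTHER-EXPERIMENT"))
```

A key of this kind still cannot separate two collaborators who share both initial and surname on the same experiment, so such cases would need extra information or manual curation.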

Authors.xml file
The Authors.xml file was proposed by INSPIRE and developed in partnership with arXiv.org and publishers such as the APS to enable collaborations to ensure all authors are properly specified.

Helping the Smaller Collaborations
people
– Big enough to be a problem
– Small enough to have no system in place
INSPIRE has created a system these collaborations can use to manage their authors and create authors.xml and LaTeX files

Author management for collaborations

Let’s get automated
Bib Author Identify (BAI): 12,000 lines of code that use metadata to create likely author profiles to identify a person
6.7M “signatures” on 1M papers in HEP
270,000 author profiles created
– cf. HEPNames: 100,000 records
On average each profile has 25 papers
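The idea of grouping signatures into likely profiles from metadata can be illustrated with a toy clustering. This is not the BAI algorithm: the matching rule (same name plus a shared affiliation or co-author), the function names, and all of the data below are invented for the sketch.

```python
from itertools import combinations

# Each signature is one author name on one paper. All data is invented.
signatures = [
    {"paper": 1, "name": "a. martin", "affiliation": "Univ A",
     "coauthors": {"x. one", "y. two"}},
    {"paper": 2, "name": "a. martin", "affiliation": "Univ A",
     "coauthors": {"z. three"}},
    {"paper": 3, "name": "a. martin", "affiliation": "Lab B",
     "coauthors": {"y. two", "q. four"}},
    {"paper": 4, "name": "a. martin", "affiliation": "Lab C",
     "coauthors": {"r. five"}},
]


def same_person_likely(s, t):
    """Toy rule: same name plus a shared affiliation or a shared co-author."""
    if s["name"] != t["name"]:
        return False
    return (s["affiliation"] == t["affiliation"]
            or bool(s["coauthors"] & t["coauthors"]))


def build_profiles(sigs):
    """Merge linked signatures with union-find; return paper lists per profile."""
    parent = list(range(len(sigs)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(sigs)), 2):
        if same_person_likely(sigs[i], sigs[j]):
            parent[find(i)] = find(j)

    profiles = {}
    for i, sig in enumerate(sigs):
        profiles.setdefault(find(i), []).append(sig["paper"])
    return sorted(sorted(papers) for papers in profiles.values())


profiles = build_profiles(signatures)
print(profiles)
```

Papers 1 to 3 end up in one profile through chained evidence, while paper 4 stays alone. Whether that lone profile is a second person or the same person after a move is exactly the kind of case where a profile may not correspond to a unique person.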

For people with very common names the algorithm naturally has some difficulties. These are cleaned up by a combination of user and operator effort. The algorithm will get smarter so that A.J. Martin and A.D. Martin aren’t in the same profile.

How to reach users
Use the HEPNames database to identify candidates for a mailout.
Look for people who have verified their HEPNames record (we know they respond).
10,000 emails have been sent out.

Author Publication Profile Page

Login page: arXiv or “guest”

Claim your papers, remove others

Claiming results versus total

                   Papers      Signatures            Author Profiles
Total in HEP       1,000,000   6,000,000             270,000
Claiming actions   151,000     350,000 (4,000,000)   5,000 (100,000)

N.B. Very high number of signatures (4,000,000) on small number of papers (151,000). Probably an effect of newer papers being claimed, hence more signatures from big collaborations.
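The ratios behind the N.B. can be checked with a few lines of arithmetic. The sketch uses only the figures from the table and assumes, as the N.B. states, that the 4,000,000 signatures sit on the 151,000 claimed papers; the variable names are illustrative.

```python
# Figures taken from the table above.
total_papers = 1_000_000
total_signatures = 6_000_000
claimed_papers = 151_000
signatures_on_claimed_papers = 4_000_000  # figure quoted in the N.B.

# Share of all HEP papers touched by a claiming action.
share_of_papers_claimed = claimed_papers / total_papers

# Average number of signatures per paper, overall and on claimed papers.
avg_sigs_all = total_signatures / total_papers
avg_sigs_claimed = signatures_on_claimed_papers / claimed_papers

print(f"{share_of_papers_claimed:.1%} of papers have a claiming action")
print(f"{avg_sigs_all:.1f} signatures per paper overall")
print(f"{avg_sigs_claimed:.1f} signatures per claimed paper")
```

Claimed papers carry roughly four times as many signatures each as the average paper, which fits the explanation that recent large-collaboration papers dominate the claims.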

Summary
98,000 records in HEPNames
– 34,000 with INSPIRE ID (real, unique people)
Will integrate ORCID and INSPIRE
Created authors.xml format for collaborations and system for them to manage authors
BAI algorithm created 270,000 author profiles
10,000 solicitation emails → 5,000 responses
150,000 papers claimed (out of 1,000,000)