Presentation is loading. Please wait.

Presentation is loading. Please wait.

The High Energy Physics information platform: Introduction

Similar presentations


Presentation on theme: "The High Energy Physics information platform: Introduction"— Presentation transcript:

1 The High Energy Physics information platform: Introduction
Annette Holtkamp CERN CERN-UNESCO School on Digital Libraries, Kumasi, Nov 2016

2 The HEP community Close-knit community ~30,000 active HEP researchers
50% experimentalists 50% theorists very international (even small author groups) ~40,000 papers/year Long Open Access tradition Community based information services arXiv, INSPIRE Kumasi, Nov 2016

3 INSPIRE overview Kumasi, Nov 2016

4 Comprehensive HEP information platform
conceived in 2007 In production since 2012 Invenio Evolution of SPIRES ( ) high data quality, manually curated comprehensive coverage high acceptance, user involvement run by Kumasi, Nov 2016

5 https://inspirehep.net (Invenio 1)
Kumasi, Nov 2016

6 Kumasi, Nov 2016

7 INSPIRE content HEP literature Jobs Jobs Network of collections
Conferences Data HEP literature Institutions Experiments HepNames HepNames Jobs Jobs Kumasi, Nov 2016

8 HEP literature Kumasi, Nov 2016

9 literature collection
1,2 million records (Nov 2016) Preprints journal articles conference papers books + proceedings theses Metadata enrichment Affiliations, keywords, conference + publication info, experiments … 1 search/second Kumasi, Nov 2016

10 Fulltext repository >50% of HEP collection with fulltext
All OA material arXiv, theses, preprints, OA journal articles esp “endangered” material (conf procs) Access restricted articles hidden archive of journal articles searchable Historical material – scanning important preprint/conference series a few journals Kumasi, Nov 2016

11 Kumasi, Nov 2016

12 Kumasi, Nov 2016

13 Kumasi, Nov 2016

14 Kumasi, Nov 2016

15 Kumasi, Nov 2016

16 Kumasi, Nov 2016

17 Kumasi, Nov 2016

18 Kumasi, Nov 2016

19 Kumasi, Nov 2016

20 Kumasi, Nov 2016

21 Plot thumbnails Kumasi, Nov 2016

22 Plots Kumasi, Nov 2016

23 Searchable captions caption:<searchterm> Kumasi, Nov 2016

24 Kumasi, Nov 2016

25 Kumasi, Nov 2016

26 Kumasi, Nov 2016

27 Reference correction: crowd sourcing
Kumasi, Nov 2016

28 Search Kumasi, Nov 2016

29 Kumasi, Nov 2016

30 Kumasi, Nov 2016

31 Advanced search Kumasi, Nov 2016

32 Search syntax Options : Google-like freetext search Invenio syntax
searches in title, abstract, keywords… “CMS Higgs” Invenio syntax “collaboration:CMS title:Higgs” Kumasi, Nov 2016

33 Which syntax to use? Free text search: simple, works in most cases
Invenio syntax: more specific search results Kumasi, Nov 2016

34 Invenio syntax Many abbreviations for search terms
author au, a title ti, t collaboration cn May be mixed with free text search cn:cms higgs Kumasi, Nov 2016

35 Fulltext search Kumasi, Nov 2016

36 second-order search operators
refersto refersto:affiliation:CERN All papers citing articles written by CERN authors citedby citedby:author:… All papers cited by articles written by … Kumasi, Nov 2016

37 Complex search example
Find the most influential CERN papers on the Higgs particle written before 2000 that don‘t cite any paper by Peter Higgs Kumasi, Nov 2016

38 Complex search example
Find the most influential CERN papers on the Higgs particle written before 2000 that don‘t cite any paper by Peter Higgs Affiliation:CERN title:Higgs cited:100->5000 -refersto:author:Higgs date:1900->1999 Kumasi, Nov 2016

39 Search help Kumasi, Nov 2016

40 Citation analysis Kumasi, Nov 2016

41 Kumasi, Nov 2016

42 Kumasi, Nov 2016

43 Citesummary: author Kumasi, Nov 2016

44 Citesummary: any search
Kumasi, Nov 2016

45 Author profiles Kumasi, Nov 2016

46 Who’s who? The INSPIRE search for Y Wang returns 3346 papers of at least 44 different authors How to find the papers of Yan Wang? Kumasi, Nov 2016

47 Author disambiguation
Goal: Unambiguously associate papers with their authors regardless of name variations Method: Algorithm based on metadata in Inspire coauthors, affiliation, collaboration… that clusters papers probably written by the same author Author Profile Pages Kumasi, Nov 2016

48 Kumasi, Nov 2016

49 Kumasi, Nov 2016

50 Crowdsourcing: manage profile
e.g. update affiliation history Kumasi, Nov 2016

51 Crowdsourcing: manage publications
e.g. claim or reject papers Kumasi, Nov 2016

52 INSPIRE: The future Kumasi, Nov 2016

53 Test version: qa.inspirehep.net
Invenio3 New data model Complete UI redesign Test version: qa.inspirehep.net Kumasi, Nov 2016

54 Kumasi, Nov 2016

55 Kumasi, Nov 2016

56 Machine learning Author disambiguation Content selection
Subject guessing Experiment guessing Metadata extraction from pdf Kumasi, Nov 2016

57 Don’t hesitate to contact me with any questions
Thank you for your attention! Don’t hesitate to contact me with any questions Kumasi, Nov 2016


Download ppt "The High Energy Physics information platform: Introduction"

Similar presentations


Ads by Google