Download presentation
Presentation is loading. Please wait.
Published byGabriel Neal Modified over 6 years ago
1
The High Energy Physics information platform: Introduction
Annette Holtkamp CERN CERN-UNESCO School on Digital Libraries, Kumasi, Nov 2016
2
The HEP community Close-knit community ~30,000 active HEP researchers
50% experimentalists 50% theorists very international (even small author groups) ~40,000 papers/year Long Open Access tradition Community based information services arXiv, INSPIRE Kumasi, Nov 2016
3
INSPIRE overview Kumasi, Nov 2016
4
Comprehensive HEP information platform
conceived in 2007 In production since 2012 Invenio Evolution of SPIRES ( ) high data quality, manually curated comprehensive coverage high acceptance, user involvement run by Kumasi, Nov 2016
5
https://inspirehep.net (Invenio 1)
Kumasi, Nov 2016
6
Kumasi, Nov 2016
7
INSPIRE content HEP literature Jobs Jobs Network of collections
Conferences Data HEP literature Institutions Experiments HepNames HepNames Jobs Jobs Kumasi, Nov 2016
8
HEP literature Kumasi, Nov 2016
9
literature collection
1,2 million records (Nov 2016) Preprints journal articles conference papers books + proceedings theses Metadata enrichment Affiliations, keywords, conference + publication info, experiments … 1 search/second Kumasi, Nov 2016
10
Fulltext repository >50% of HEP collection with fulltext
All OA material arXiv, theses, preprints, OA journal articles esp “endangered” material (conf procs) Access restricted articles hidden archive of journal articles searchable Historical material – scanning important preprint/conference series a few journals Kumasi, Nov 2016
11
Kumasi, Nov 2016
12
Kumasi, Nov 2016
13
Kumasi, Nov 2016
14
Kumasi, Nov 2016
15
Kumasi, Nov 2016
16
Kumasi, Nov 2016
17
Kumasi, Nov 2016
18
Kumasi, Nov 2016
19
Kumasi, Nov 2016
20
Kumasi, Nov 2016
21
Plot thumbnails Kumasi, Nov 2016
22
Plots Kumasi, Nov 2016
23
Searchable captions caption:<searchterm> Kumasi, Nov 2016
24
Kumasi, Nov 2016
25
Kumasi, Nov 2016
26
Kumasi, Nov 2016
27
Reference correction: crowd sourcing
Kumasi, Nov 2016
28
Search Kumasi, Nov 2016
29
Kumasi, Nov 2016
30
Kumasi, Nov 2016
31
Advanced search Kumasi, Nov 2016
32
Search syntax Options : Google-like freetext search Invenio syntax
searches in title, abstract, keywords… “CMS Higgs” Invenio syntax “collaboration:CMS title:Higgs” Kumasi, Nov 2016
33
Which syntax to use? Free text search: simple, works in most cases
Invenio syntax: more specific search results Kumasi, Nov 2016
34
Invenio syntax Many abbreviations for search terms
author au, a title ti, t collaboration cn … May be mixed with free text search cn:cms higgs Kumasi, Nov 2016
35
Fulltext search Kumasi, Nov 2016
36
second-order search operators
refersto refersto:affiliation:CERN All papers citing articles written by CERN authors citedby citedby:author:… All papers cited by articles written by … Kumasi, Nov 2016
37
Complex search example
Find the most influential CERN papers on the Higgs particle written before 2000 that don‘t cite any paper by Peter Higgs Kumasi, Nov 2016
38
Complex search example
Find the most influential CERN papers on the Higgs particle written before 2000 that don‘t cite any paper by Peter Higgs Affiliation:CERN title:Higgs cited:100->5000 -refersto:author:Higgs date:1900->1999 Kumasi, Nov 2016
39
Search help Kumasi, Nov 2016
40
Citation analysis Kumasi, Nov 2016
41
Kumasi, Nov 2016
42
Kumasi, Nov 2016
43
Citesummary: author Kumasi, Nov 2016
44
Citesummary: any search
Kumasi, Nov 2016
45
Author profiles Kumasi, Nov 2016
46
Who’s who? The INSPIRE search for Y Wang returns 3346 papers of at least 44 different authors How to find the papers of Yan Wang? Kumasi, Nov 2016
47
Author disambiguation
Goal: Unambiguously associate papers with their authors regardless of name variations Method: Algorithm based on metadata in Inspire coauthors, affiliation, collaboration… that clusters papers probably written by the same author Author Profile Pages Kumasi, Nov 2016
48
Kumasi, Nov 2016
49
Kumasi, Nov 2016
50
Crowdsourcing: manage profile
e.g. update affiliation history Kumasi, Nov 2016
51
Crowdsourcing: manage publications
e.g. claim or reject papers Kumasi, Nov 2016
52
INSPIRE: The future Kumasi, Nov 2016
53
Test version: qa.inspirehep.net
Invenio3 New data model Complete UI redesign Test version: qa.inspirehep.net Kumasi, Nov 2016
54
Kumasi, Nov 2016
55
Kumasi, Nov 2016
56
Machine learning Author disambiguation Content selection
Subject guessing Experiment guessing Metadata extraction from pdf … Kumasi, Nov 2016
57
Don’t hesitate to contact me with any questions
Thank you for your attention! Don’t hesitate to contact me with any questions Kumasi, Nov 2016
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.