Arnold Rots SAO/ CXC 2013-09-28IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots1.

Slides:



Advertisements
Similar presentations
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Advertisements

Registries Work Package 2 Requirements, Science Cases, Use Cases, Test Cases Charter: Focus on science case scenarios, and use cases related specifically.
VOEvent - IVOA Interop Kyoto1 Open Issues for VOEvent Arnold Rots Harvard-Smithsonian CfA / CXC T HE US N ATIONAL V IRTUAL O BSERVATORY.
The VAO is operated by the VAO, LLC. Alternative Protocols for Discovery & Access Mike Fitzpatrick NOAO.
IVOA Interop Kyoto - VOTable1 Space-Time Coordinate Metadata for VOTable Arnold Rots Harvard-Smithsonian CfA / CXC T HE US N ATIONAL V IRTUAL.
SCAR Data Management SSG Plenary 30 th July 2010 Kim Finney (Manager, Australian Antarctic Data Centre & Chief Officer, SCAR Standing Committee on Antarctic.
Supplemental Data: Questions and Considerations Alexander ( Sasha ) Schwarzman Information Systems Analyst American Geophysical Union (AGU) Co-chair, TWG.
Supporting education and research Repositories in Context Digital repositories as components of an integrated infrastructure for education Leona Carpenter.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Principles of Personalisation of Service Discovery Electronics and Computer Science, University of Southampton myGrid UK e-Science Project Juri Papay,
SCIDIP-ES Components Oct ,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation.
Rots et al., Persistent Identifiers1 Associating Persistent Identifiers between Trustworthy Repositories Arnold Rots, Alberto Accomazzi, Günther.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
Resource Discovery Module DigiTool Version 3.0. Resource Discovery 2 Deposit Approval Search & Index Dispatcher & Viewers Single & Bulk Web Services DigiTool.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Creating a Secured and Trusted Information Sphere in Different Markets Giuseppe Contino.
 Keep it simple and sufficient (do not multiply it unnecessarily)  Make it intuitive and self-explanatory  Make it easily discoverable and accessible.
Regression testing Tor Stållhane. What is regression testing – 1 Regression testing is testing done to check that a system update does not re- introduce.
A Future Vision of Invisible And Rigorous Records Management Denise A. D. Bedford, Ph.D. Visionary Senior Information Officer Information Quality Group.
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
Innovation & Supplementary Material Eleonora Presani – Elsevier
Delivering business value through Context Driven Content Management Karsten Fogh Ho-Lanng, CTO.
March 2014 Basic Content Management Tuffolo Group Perspective TUFFOLO.
…using Git/Tortoise Git
What is a Database? SECTION 1. Database Technology and its Evolution Decades long evolution Early data processing systems Today's systems New technology.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
Access and Query Task Force Status at F2F1 Simon Miles.
EPA Enterprise Data Architecture Metadata Framework Assessment Kevin J. Kirby, Enterprise Data Architect EPA Enterprise Architecture Team
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
Virtual techdays INDIA │ august 2010 ENTERPRISE CONTENT MANAGEMENT WITH SHAREPOINT 2010 Naresh K Satapathy │ Solution Specialist, Microsoft Corporation.
DSpace vs Fedora Ralph LeVan OCLC Research. What Do You Want From a Repository? How do you create your metadata? How do you assemble your objects? How.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Lessons of the Square Watermelon Japanese grocery stores had a problem. They are much smaller than their US counterparts and therefore don't have room.
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
Persistent Identifiers (PIDs) & Digital Objects (DOs) Christine Staiger & Robert Verkerk SURFsara.
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
LCG – AA review 1 Simulation LCG/AA review Sept 2006.
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
Why RDA? A domain repository perspective George Alter ICPSR University of Michigan.
Arnold H. Rots & Sherry L. Winkelman Chandra Data Archive Smithsonian Astrophysical Observatory Rots & Winkelman - IAU XXIX 2015, FM31.
IVOA Interop, Beijing, China, May IVOA Data Access Layer Working Group Sessions Doug Tody (NRAO/NVO ) Markus Dolensky (ESO/EuroVO) Data Access Layer.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
Data Fabric IG From Testing to Recommendations Beth Plale.
ATLAS Distributed Computing Tutorial Tags: What, Why, When, Where and How? Mike Kenyon University of Glasgow.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Building a Data Warehouse
AP CSP: Finding a Data Story
RECENT TRENDS IN METADATA GENERATION
Model Governance Industry Evolution Beyond Model Accuracy
Flexible Extensible Digital Object Repository Architecture
Cloud based Open Source Backup/Restore Tool
Flexible Extensible Digital Object Repository Architecture
Doron Goldfarb & Yann LE FRANC
Linking persistent identifiers at the British Library
VI-SEEM Data Repository
Persistent identifiers in VI-SEEM
Using Excel to Graph Data
WGISS-WGCV Joint Session
LESSON 13 – INTRO TO ARRAYS
WG/IG Collaboration Meeting June Göteborg METADATA GROUPS PERSPECTIVE Keith G Jeffery & Rebecca Koskela.
Scott Thorne & Chuck Shubert
Using Excel to Graph Data
RDA uptake activities and plans: ESGF
Presentation transcript:

Arnold Rots SAO/ CXC IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots1

Discoverability Scope: higher level data products Scattered over many repositories Requires Persistent Identifiers (PID) Registry infrastructure Metadata standards Metadata extraction tools Provenance information But this is only part of the problem IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots2

Complex Cases Versioning Purists will insist that PIDs are version specific Growing realization that users often prefer current default Improved calibration, more complete datasets, etc. Version information can also be appended to the “root” PID Compound datasets or data objects Made up of smaller components Multiple files, multiple data objects, etc IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots3

Drilling Down into Compound Data Objects From the user’s perspective For instance, a user interested in masses of galaxies “Get me papers on galaxy mass estimates” Respond with a list of pointers to papers in ADS “Get me galaxy mass estimates” Don’t provide the list of papers, provide pointers to the electronic versions of the tables in those papers “Get me mass estimates for M81” Just provide the relevant number(s) IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots4

How to Drill Down? The obvious way is to implement this by introducing a hierarchical structure in the tokens tacked on to the end of the PIDs However, watch out for interference with versioning A table (figure, …) can easily be identified as a component of a paper, but a single cell in the table??? Need vastly more metadata about the content of compound data objects How do we know what users may want to retrieve? Would it be sufficient to store the quantities contained and the objects covered, then parameterize? IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots5

Forward At this time we are trying to solve the issue of how to cite simple datasets and data objects Next we will need to put effort into discoverability But I strongly believe the demand for more sophisticated data discovery, this drilling down into compound data objects, is just around the corner Several people at RDA seemed to be thinking in the same direction – that is a hopeful sign In whatever we are working on now, though, we need to keep the future perspectives in mind IVOA Interop Waikoloa/DCP: PID Granularity - Arnold Rots6