What can we do with controlled vocabularies? The PIMMS story. Charlotte Pascoe, May 11th 2012, Rutherford Appleton Laboratory.

Presentation transcript:


Portable Infrastructure for the Metafor Metadata System

Common Information Model

Packages: Software, Activity, Data, Grids, Quality, Shared, ISO.

Some concepts are shared.
We can record the quality of things.
We reuse various ISO classes.
We can talk about DataObjects collected together in any number of ways, stored in a particular medium.
We can talk about hierarchical ModelComponents with ModelProperties, some of which can be coupled together.
We can talk about Simulations run in support of Experiments. Experiments consist of Requirements; Simulations conform to Requirements.
A particular Activity uses a particular SoftwareComponent.
We can define a GridSpec or some other geometry.
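The relationships named on the Common Information Model slide can be sketched as a few Python data classes. This is only an illustration and not the CIM schema itself: the class names echo the slide, but every field name, the `walk` and `unmet_requirements` helpers, and the example values are assumptions made for this sketch.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ModelProperty:
    name: str
    value: str


@dataclass
class ModelComponent:
    """A software component that may contain child components."""
    name: str
    properties: List[ModelProperty] = field(default_factory=list)
    children: List["ModelComponent"] = field(default_factory=list)

    def walk(self):
        """Yield this component and every descendant, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


@dataclass
class Requirement:
    name: str


@dataclass
class Experiment:
    """An experiment consists of requirements."""
    name: str
    requirements: List[Requirement] = field(default_factory=list)


@dataclass
class Simulation:
    """An activity run in support of an experiment, using one model."""
    name: str
    supports: Experiment
    uses: ModelComponent
    conforms_to: List[Requirement] = field(default_factory=list)

    def unmet_requirements(self) -> List[str]:
        """Names of experiment requirements this simulation does not claim."""
        met = {r.name for r in self.conforms_to}
        return [r.name for r in self.supports.requirements if r.name not in met]


# Hypothetical example values, invented for illustration only.
radiation = ModelComponent("Radiation", [ModelProperty("SchemeType", "example value")])
atmosphere = ModelComponent("Atmosphere", children=[radiation])
forcing = Requirement("example forcing")
output = Requirement("example output period")
experiment = Experiment("example experiment", [forcing, output])
simulation = Simulation("example run", experiment, atmosphere, [forcing])

print([c.name for c in atmosphere.walk()])
print(simulation.unmet_requirements())
```

Keeping the requirement list on the experiment and the conformance list on the simulation mirrors the slide's wording: experiments consist of requirements, and simulations conform to them.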

Mind Maps

Mind maps are used to capture information requirements from domain experts and build a controlled vocabulary.

Python Parser

A Python parser processes the XML files generated by the mind maps.

Definition of component type Radiation required
Definition of property name RadiativeTimeStep required
Definition of property name SchemeType required
Definition of property name Method required
Definition of property name NumberOfSpectralIntervals required
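The parsing step can be sketched roughly as follows. This is not the Metafor parser: the XML layout (nested `<node TEXT="...">` elements) is an assumed mind-map export format, and `parse_vocabulary` and `report` are hypothetical helper names. Only the component and property names, and the wording of the messages, come from the slide.

```python
import xml.etree.ElementTree as ET

# Assumed mind-map export: one <node> per component type, with one
# child <node> per property. The real file layout may differ.
MIND_MAP_XML = """
<map>
  <node TEXT="Radiation">
    <node TEXT="RadiativeTimeStep"/>
    <node TEXT="SchemeType"/>
    <node TEXT="Method"/>
    <node TEXT="NumberOfSpectralIntervals"/>
  </node>
</map>
"""


def parse_vocabulary(xml_text):
    """Return {component type: [property names]} from the mind-map XML."""
    root = ET.fromstring(xml_text.strip())
    vocabulary = {}
    for component in root.findall("node"):
        vocabulary[component.get("TEXT")] = [
            prop.get("TEXT") for prop in component.findall("node")
        ]
    return vocabulary


def report(vocabulary):
    """Build messages in the style of the parser output on the slide."""
    lines = []
    for component, properties in vocabulary.items():
        lines.append(f"Definition of component type {component} required")
        for name in properties:
            lines.append(f"Definition of property name {name} required")
    return lines


vocabulary = parse_vocabulary(MIND_MAP_XML)
messages = report(vocabulary)
print("\n".join(messages))
```

The resulting dictionary is one possible in-memory form of a controlled vocabulary that a later step could use to build questionnaire fields.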

CMIP5 Questionnaire

CIM Document Viewer

GMD Journal Article

Chemical Tagger

ChemicalTagger is an open-source tool that uses OSCAR4 and NLP techniques for tagging and parsing experimental sections in the chemistry literature.

Chemical Tagger and PIMMS

An XSLT transform has been written to allow the Metafor atmosphere controlled vocabulary to be used by ChemicalTagger. The ChemicalTagger software then parsed a GMD abstract and experiment description, looking for Metafor controlled vocabulary terms. The software identified many useful phrases:

NN-MODEL template is called.
With a value of : generalcirculationmodel(AOGCM)
With domain (from preceding-sibling): atmosphere-ocean
ResolutionPhrase:
With a value of : HorizontalresolutionsettoT42, correspondingroughlytoagridsizeof2.8°
Vertical Resolution: 20verticalslevels
VERTICAL DETAILS: and the height of the model top isapproximately 30km.

Chemical Tagger and PIMMS

NN-MODEL template is called.
With a value of : oceangeneralcirculationmodel(OGCM)
NN-MODEL With domain: ocean
Equation Type : Primitive
Equation Type : hydrostatic
Equation Type : Boussinesq
ResolutionPhrase:
With a value of : zonalresolution isfixedat ° °
Horizontal Grid with value: 256equallyspacedgridpoints
Horizontal Grid with value: 192gridpoints
Vertical Resolution: 43verticallevels
VERTICAL DETAILS:, thetop8ofwhich areinσ-coordinates.
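The idea of scanning text for controlled vocabulary terms can be illustrated with plain pattern matching. This sketch does not use ChemicalTagger, OSCAR4, or the XSLT transform, which rely on proper NLP parsing. The `VOCABULARY` excerpt, the `find_terms` helper, and the sample sentence are all invented for illustration.

```python
import re

# Hypothetical excerpt of a controlled vocabulary: label -> phrases.
VOCABULARY = {
    "EquationType": ["primitive", "hydrostatic", "Boussinesq"],
    "ModelType": ["general circulation model"],
}


def find_terms(text, vocabulary):
    """Return (label, matched phrase) pairs in order of appearance."""
    hits = []
    for label, phrases in vocabulary.items():
        for phrase in phrases:
            # Whole-word, case-insensitive match of the vocabulary phrase.
            pattern = r"\b" + re.escape(phrase) + r"\b"
            for match in re.finditer(pattern, text, flags=re.IGNORECASE):
                hits.append((match.start(), label, match.group(0)))
    return [(label, found) for _, label, found in sorted(hits)]


# Invented sample sentence, not taken from any GMD article.
SENTENCE = (
    "The ocean general circulation model solves the primitive "
    "equations under the hydrostatic and Boussinesq approximations."
)

hits = find_terms(SENTENCE, VOCABULARY)
for label, phrase in hits:
    print(f"{label}: {phrase}")
```

Plain matching like this finds only exact phrases; it cannot attach a value to its context (for example a resolution to a domain), which is what the tagger's templates above do.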

CIM Document Viewer

Harvested Metadata vs Documented Metadata

CIM was designed to be populated by modellers, with the (probably over-simplistic) assumption that if something isn't in the CIM document then it either isn't in the model or isn't relevant. But CIM documents created by harvesting information from papers will naturally not cover everything about a model, so missing information doesn't mean that those things weren't included or aren't relevant. PIMMS will need to describe different protocols for interpreting CIM documents depending on how they were created, but we will also want to ensure that CIM accounts for missing data more intelligently in future releases.

In essence, the difference between journal article descriptions and metadata documentation is Narrative. Journal articles need to tell a story, so the information they include is only that which is relevant to the narrative, whereas metadata documentation is an attempt to include as much as possible across the board. The general nature of metadata documentation is probably why it has historically been perceived as such a boring task to complete.

PIMMS will make metadata documentation more fun by bringing back the Narrative. Once PIMMS is established at an institution, users will be able to create generalised metadata having only described those things that are relevant to the story of their experiment.
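One way to picture origin-dependent interpretation protocols is a small lookup that treats a missing property differently according to how the document was created. This is a sketch under stated assumptions, not a PIMMS or CIM feature: the `Origin` and `Status` enums, the `interpret` function, and the example documents are all hypothetical.

```python
from enum import Enum


class Origin(Enum):
    QUESTIONNAIRE = "completed by modellers"
    HARVESTED = "harvested from a paper"


class Status(Enum):
    STATED = "stated in the document"
    ABSENT_OR_IRRELEVANT = "not in the model or not relevant"
    UNKNOWN = "not mentioned in the source"


def interpret(document, origin, property_name):
    """Decide what a property's presence or absence means for this origin."""
    if property_name in document:
        return Status.STATED
    if origin is Origin.QUESTIONNAIRE:
        # Modeller-populated documents: absence is taken as meaningful.
        return Status.ABSENT_OR_IRRELEVANT
    # Harvested documents: the paper may simply not have mentioned it.
    return Status.UNKNOWN


# Hypothetical documents reduced to plain dictionaries for illustration.
questionnaire_doc = {"SchemeType": "example value"}
harvested_doc = {"SchemeType": "example value"}

print(interpret(questionnaire_doc, Origin.QUESTIONNAIRE, "Method").value)
print(interpret(harvested_doc, Origin.HARVESTED, "Method").value)
```

Recording the origin alongside each document would let a viewer show "unknown" rather than implying that a feature is absent from the model.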