November 1&2, 2010. Are we there yet? YES What to expect along the way A Brief History Some Jargon you may need to know First Detour: NLM DTD vs PMC.


November 1&2, 2010

Are we there yet? YES

What to expect along the way:
– A Brief History
– Some Jargon you may need to know
– First Detour: NLM DTD vs PMC
– The surveys
– DTD is Dead!
– A detour about the silly name
– Further down the road

A Brief History. Version 1 was released in December 2002 with the Archiving and Interchange DTD and the Journal Publishing DTD. Version 1 was based on work at NCBI to upgrade the PubMed Central DTD and a project at Harvard University funded by the Mellon Foundation to address the problems of archiving scholarly journals in electronic form (E-journals).

The initial meeting included participants from NCBI, Harvard, and the Mellon Foundation along with NCBI’s consultants, Mulberry Technologies, and Harvard’s consultants, Inera, Inc. But there was confusion about what the model should be.

Easy Target for Conversion? Should the new DTD be a broad, descriptive target into which articles from other SGML or XML models could be translated easily? A model like this would have many optional elements, few things in a prescribed order, and different ways to tag the same object.

Easy model to create content in? Or should the new DTD be a narrower, prescriptive target that would give creators of new XML articles guidance about how to make a valid article? A model like this would have more required elements and fewer choices about how to tag the same object.
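The difference between the two kinds of model can be sketched in a few lines of Python. This is only an illustration: the child element names and both content models are hypothetical and are not taken from any of the actual Tag Sets. The loose check behaves roughly like a DTD content model of the form `(title | author | abstract | keywords)*`, the strict one like `(title, author+, abstract, keywords?)`.

```python
import re

# Hypothetical child element names, chosen for illustration only.
ALLOWED = {"title", "author", "abstract", "keywords"}


def valid_descriptive(children):
    """Loose model: any allowed element, any order, any number of times."""
    return all(child in ALLOWED for child in children)


def valid_prescriptive(children):
    """Strict model: one title, one or more authors, one abstract,
    then an optional keywords element, in exactly that order."""
    pattern = r"title( author)+ abstract( keywords)?"
    return re.fullmatch(pattern, " ".join(children)) is not None


# An article converted from another model: unusual order, no abstract.
converted = ["author", "title", "keywords"]
# An article written directly against the strict model.
authored = ["title", "author", "author", "abstract"]

print(valid_descriptive(converted), valid_prescriptive(converted))
print(valid_descriptive(authored), valid_prescriptive(authored))
```

The converted article passes only the loose check, which is what makes a descriptive model an easy conversion target, while the strict check tells an author exactly what is missing or out of order.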

The DTD Spectrum: from "Optimized for Conversion to" at one end to "Optimized to Create Content in" at the other.

The DTD Spectrum. Conversion end: Archiving and Interchange DTD. Creation end: Journal Publishing DTD.

The Colors

Everything was fine, until

The two archiving strategies: archiving the intellectual content of the article, or archiving the article file?

If you need to archive the entire file, you need a way to keep those items in the file that the Archiving and Interchange DTD did not worry about.

Punctuation in Keywords. Keyword Group in Archiving 1.0:
As printed: Keywords: DNA analysis; gene expression; parallel cloning; fluid microarray.
As tagged (keywords only): DNA analysis | gene expression | parallel cloning | fluid microarray

Punctuation in Keywords. Keyword Group in Archiving 2.0:
As printed: Keywords: DNA analysis; gene expression; parallel cloning; fluid microarray.
As tagged (label and punctuation kept): Keywords: DNA analysis ; gene expression ; parallel cloning ; fluid microarray.
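The two keyword slides can be approximated with a short Python sketch. The markup below is an assumption about what the slides showed, since only the text survives here: `kwd-group`, `kwd`, `title`, and `x` are used as the element names, with `x` standing for the element that holds punctuation and other generated text. Treat the exact tagging as hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed Archiving 1.0-style tagging: only the keywords are captured.
V1 = """<kwd-group>
  <kwd>DNA analysis</kwd>
  <kwd>gene expression</kwd>
  <kwd>parallel cloning</kwd>
  <kwd>fluid microarray</kwd>
</kwd-group>"""

# Assumed Archiving 2.0-style tagging: the label and the punctuation
# from the article file are preserved alongside the keywords.
V2 = (
    "<kwd-group><title>Keywords:</title> "
    "<kwd>DNA analysis</kwd><x>; </x>"
    "<kwd>gene expression</kwd><x>; </x>"
    "<kwd>parallel cloning</kwd><x>; </x>"
    "<kwd>fluid microarray</kwd><x>.</x>"
    "</kwd-group>"
)


def keywords(xml_text):
    """Return the intellectual content: the keywords themselves."""
    return [kwd.text for kwd in ET.fromstring(xml_text).iter("kwd")]


def file_text(xml_text):
    """Return all character data in document order, whitespace-normalized."""
    raw = "".join(ET.fromstring(xml_text).itertext())
    return " ".join(raw.split())


print(keywords(V1) == keywords(V2))
print(file_text(V1))
print(file_text(V2))
```

Both versions yield the same list of keywords, but only the second can reproduce the line as it appeared in the article file, which is the distinction between archiving the intellectual content and archiving the file.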

The DTD Spectrum (Conversion to Creation). Is Journal Publishing meeting our needs? Not really. It is too restrictive for some users, and not prescriptive enough to be a good authoring model.

The DTD Spectrum (Conversion to Creation): Article Authoring DTD.

JATS? Journal Article Tag Suite. The Tag Suite is the collection of all Elements and Attributes. Each model (Archiving, Publishing, Authoring) is a Tag Set. Each schema (DTD, XSD, RELAX NG) represents a model or Tag Set.

NLM DTD vs PubMed Central. PubMed Central (PMC) is a user of the NLM DTD.

The JATS Survey. Half of the respondents said that they impose rules other than schema validation on their content.
To the question on using the DTDs as published or modified in some way:
– 9 as published
– 5 subset
– 5 superset
– 4 informed

To the question, “What form(s) of the Tag Set are you using?”
– 18 DTD
– 4 XSD
– 1 RELAX NG
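As an illustration of what a rule "other than schema validation" might look like, here is a minimal Python sketch. The rule itself (every article must carry a non-empty `article-id` whose `pub-id-type` is `doi`) is a hypothetical house rule, not one reported in the survey, and the sample documents are heavily simplified.

```python
import xml.etree.ElementTree as ET


def has_doi(article_xml):
    """Hypothetical house rule: at least one non-empty article-id
    with pub-id-type="doi" must be present somewhere in the article."""
    root = ET.fromstring(article_xml)
    return any(
        node.get("pub-id-type") == "doi" and (node.text or "").strip()
        for node in root.iter("article-id")
    )


# Two simplified sample articles; the identifiers are made up.
with_doi = (
    "<article><front><article-meta>"
    '<article-id pub-id-type="publisher-id">abc-123</article-id>'
    '<article-id pub-id-type="doi">10.9999/example.123</article-id>'
    "</article-meta></front></article>"
)
without_doi = (
    "<article><front><article-meta>"
    '<article-id pub-id-type="publisher-id">abc-123</article-id>'
    "</article-meta></front></article>"
)

print(has_doi(with_doi))
print(has_doi(without_doi))
```

A schema can require that the element and attribute are well formed, but a requirement such as "one of the identifiers must be a DOI" is the kind of rule that typically ends up in a separate check like this one.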

Dear Software manager, I am very upset that my tax money is supporting obsolete technology. Please translate your DTDs into XML schemas. XMLSpy can be used for this purpose. The schemas will then work with Microsoft Word 2003 and other modern software. Thank you. -- May 31, 2003

The NISO Standard. In late 2009, a new NISO Working Group was formed to address the “Standardized Markup for Journal Articles” project. The first task was to address the list of change requests that had accumulated since we released version 3.0 at the end of 2008.

v 3.1. A minor, backward-compatible update. Increased support for multi-language documents. Increased support for tagging document accessibility. Much of this will take the form of improved, explicit documentation on how to use the accessibility elements and attributes.

Back to NISO. The Standard will describe the elements and attributes in the JATS. And then it will describe each of the three article models:
– Archiving and Interchange
– Journal Publishing
– Article Authoring

The Standard will not include the schema files (DTDs, XSDs, or RELAX NG schemas). These will be non-normative supporting documents, as the Tag Libraries will be.

Book Review. We are assembling a group to review the current NCBI Book Tag Set and build a Tag Set for books from the JATS. Content should be scoped similarly to the journal work. We are not trying to model every book.

New Discussion List A new open discussion list for all things JATS. You can read about the JATS-List at: List/index.html and subscribe at: List/subscribe-unsubscribe.html

Or Google “jats-list”

So, we’re here. Time to get on with the conference.