Presentation is loading. Please wait.

Presentation is loading. Please wait.

Bookshelf Leafing through XML NLM Journal Article Tag Suite Conference 2010 Martin Latterner and Marilu Hoeppner National Center for Biotechnology Information.

Similar presentations


Presentation on theme: "Bookshelf Leafing through XML NLM Journal Article Tag Suite Conference 2010 Martin Latterner and Marilu Hoeppner National Center for Biotechnology Information."— Presentation transcript:

1 Bookshelf Leafing through XML NLM Journal Article Tag Suite Conference 2010 Martin Latterner and Marilu Hoeppner National Center for Biotechnology Information National Library of Medicine next><prev

2 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 NLM BOOK DTD v2.3

3 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 NLM Collection Catalog PubMed Abstracts Electronic Literature Archive Books, Monographs, Reports Journals Other publication formats Book chapters, Monographs, Reports Books in PubMed Non-PubMed Books User guides, Documentation Journal articles PMC Journals PubMed Central Bookshelf Entrez Literature Resources

4 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Features of the Book DTD Books and journals within PubMed Central Bookshelf Workflows Integration of information between databases

5 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Modifications Allowed icon as a child of exlnk. Allowed pre as a child of entry. Allowed glossary as a child of chapter. Added type: ppt. Added attributes id and BID to. Added attribute id to. Added, child of. Added, and as children of. Added as child of. … NCBI Book DTD 1.0 Based on ISO 12083 Article DTD

6 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 March 2003 v1.0 December 2004 v2.0 November 2005 v2.1 BOOKSHELF XML DATA NCBI BOOK DTD

7 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Book DTD of the NLM Journal Article Tag Suite

8 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Designed to capture the semantic elements of the content, not form e.g. bibliographic metadata

9 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 CONFLICT OF INTEREST IN MEDICAL RESEARCH Committee on Conflict of Interest in Medical Research Board on Health Sciences Policy INSTITUTE OF MEDICINE OF THE NATIONAL ACADEMIES THE NATIONAL ACADEMIES PRESS Washington, D.C. THE NATIONAL ACADEMIES PRESS 500 Fifth Street, N.W. Washington, DC 20001 ISBN 978-0- 309-13188-9 (hardcover) Copyright 2009 by the National Academy of Sciences. All rights reserved. Printed in the United States of America

10 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Conflict of Interest in Medical Research Institute of Medicine (US) Committee on Conflict of Interest in Medical Research, Education, and Practice National Academies Press (US) Washington (DC) 978-0-309-13188-9 2009 Copyright © 2009, National Academy of Sciences 2009

11 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 More granular text descriptions are handled at attribute level e.g. preface, foreword

12 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010

13 ArticleBook DTD v3.0 Elements

14 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 XML XHTML

15 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 XML IDDM2 Bookshelf PubMed Central …

16 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 ArticleBook abbrev-type article-type response-type alternate-form-type book-id book-part-number book-part-type graphic-type (obsolete) indexed map-alt map-coords map-name map-shape primary qualifier taxonomic-id DTD v3.0 Attributes

17 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Books & Journals in PubMed Central

18 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Source Conversion (1)Third-party vendor services: Tagging rules for journals can be applied to book content, especially, for lower level document objects. Citations Figures Tables (2)In-house conversion: For content submitted in external DTDs, code reuse of PMC journal modules for handling: Dates Strings CALS to XHTML table conversion

19 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Data Processing and Ingest Software to lookup PubMed IDs in citations Imaging resizing software and validation checks for graphics and supplementary data files such as PDF Loading code for the extraction of key information, such as dates, subject categories, etc

20 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 CHOP-IT-UP

21 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Output Formats HTML Uses base XSLT Article rendering rules for conversion of XML to HTML; book- specific overwrites or modifications PDF Uses XSL-FO base code for articles; book-specific overwrites or modifications

22 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Advantages of using a Shared Tag Set Share XSLT modules during ingest, conversion processes, and rendering Use similar database infrastructure Enables closer integration for a variety of processes, such as PubMed submission and indexing

23 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Bookshelf Workflows

24 Submission of Content to Bookshelf PDF or Word XML in NLM Book DTD XML in external DTDs Word authoring followed by conversion to XML (in- house)

25 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Submitted Files PDF Word XML (External DTD) NLM Book DTD XML Third-party vendor or In-house Converters Requirements Pass validation Pass stylecheck

26 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 PMC CMS CHOP-IT-UP

27 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010

28 NCBI Word converter XML Instant HTML Preview Publish to Bookshelf Microsoft Word document Word Authoring Followed by Conversion to XML

29 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Stylechecker Check business rules Goal: one set of rendering rules for uniform source XML data 2 Checkpoints Whole book (modified article stylechecker) Individual book-part (article stylechecker)

30 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 Integrating Content from Different Databases

31 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010

32 Information in the Molecular Genetics and OMIM tables may differ from that elsewhere in the GeneReview: tables may contain more recent information. — ED. Table A. Polycystic Kidney Disease, Autosomal Recessive: Genes and Databases Gene Symbol Chromosomal Locus Protein Name Locus Specific HGMD Data in the JATS Book DTD Delivered from External Database Processing Instruction in Source XML

33 Latterner M and Hoeppner MA. Bookshelf: Leafing through XML. JATS-CON 2010 next><prev


Download ppt "Bookshelf Leafing through XML NLM Journal Article Tag Suite Conference 2010 Martin Latterner and Marilu Hoeppner National Center for Biotechnology Information."

Similar presentations


Ads by Google