Plazi: Prospects for Markup of Legacy and New Taxonomic Literature Terry Catapano TDWG Fremantle, WA October 21, 2008.

Slides:



Advertisements
Similar presentations
TRAINING OF TRAINERS AGRIS AP. AGRIS ORIGIN In 1975, FAO set up AGRIS to improve access and exchange of information on agricultureAGRIS The largest cooperative.
Advertisements

NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
A Common Standard for Data and Metadata: The ESDS Qualidata XML Schema Libby Bishop ESDS Qualidata – UK Data Archive E-Research Workshop Melbourne 27 April.
Forest Markup / Metadata Language FML
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
Pensoft Writing Tool (PWT) Lyubomir Penev ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussles ViBRANT.
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith ViBRANT.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
Taxonomic Literature Standards and Synergies TDWG 2006 Anna L. Weitzman & Christopher H. C. Lyal.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
TaxPub: An Extension of JATS for Taxonomic Descriptions Terry Catapano
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
NATIONAL LIBRARY OF MEDICINE NLM Journal Archiving and Interchange Tagset Jeff Beck National Center for Biotechnology Information National Library of Medicine.
The XML mark up process from the viewpoint of a biodiversity publisher Lyubomir Penev, Donat Agosti, Teodor Georgiev, Terry Catapano, Vladimir Blagoderov,
Bookshelf Leafing through XML NLM Journal Article Tag Suite Conference 2010 Martin Latterner and Marilu Hoeppner National Center for Biotechnology Information.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
Contents and Formats Existing Digital Sources Gertraud Griepke Cornell University, July 26th 2002.
1 Workshop on Metadata Interoperability for Electronic Records Management November 15, 2001 Archives II, College Park, MD.
Link yourself or perish? PhytoKeys, the next generation journal in systematic botany Lyubomir Penev 1, W. John Kress 2, Sandra Knapp 3, De-Zhu Li 4, Susanne.
Copyright © 2003 Pearson Education, Inc. Slide 1-1 Created by Cheryl M. Hughes, Harvard University Extension School — Cambridge, MA The Web Wizard’s Guide.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
A METS Application Profile for Historical Newspapers
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Luc Audrain Hachette Livre Head of digitalization
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Scientific Markup Languages Birds of a Feather A 10-Minute Introduction to XML Timothy W. Cole Mathematics Librarian & Professor of.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
The Pensoft Journal System and XML-based workflow Lyubomir Penev Life and Literature Conference, Chicago 2011 ViBRANT Virtual Biodversity.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Patrick Leary 23 October, 2008 TDWG Fremantle Experiences With Species Profile Model.
J-STAGE, NOW NEXT STAGE large scale scholarly e-journal platform of Japan.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
TaxonX : A mark-up schema and approach for systematics literature American Museum of Natural History and University of Karlsruhe in collaboration with.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
5.2 Scope: This standard defines common data interchange formats for event records for voting systems. Voting systems, including election administration.
The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September.
The PLAZI Markup System Donat Agosti Terry Catapano Robert “Bob“ Morris Guido Sautter Universität Karlsruhe (TH) Research University – founded 1825.
SCORM Course Meta-data 3 major components: Content Aggregation Meta-data –context specific data describing the packaged course SCO Meta-data –context independent.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Literature & interoperability: a working example using ants Donat Agosti, Terry Catapano, Guido Sautter, Christiana Klingenberg & Christie Stephenson TDWG.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Feb 21-25, 2005ICM 2005 Mumbai1 Converting Existing Corpus to an OAI Compliant Repository J. Tang, K. Maly, and M. Zubair Department of Computer Science.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
Mechanisms for coordination and delivery of taxon profiles in Australia Longitudinal use case scenarios from primary data custodians and roles for data.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
International Congress of Entomology, Orlando
Repository Software - Standards
Jenn Riley Metadata Librarian Digital Library Program
Rebecca Lawrence Managing Director, F February 2018
PREMIS Tools and Services
Jenn Riley Metadata Librarian Digital Library Program
Publishing and Mark-up of Collection Data
Presentation transcript:

Plazi: Prospects for Markup of Legacy and New Taxonomic Literature Terry Catapano TDWG Fremantle, WA October 21, 2008

NSF/DFG Grant (AMNH/University of Karlsruhe)‏ XML Markup of taxonomic publications for extraction of: Treatments Scientific Names Morphological Characters Distribution Data Collection locales/events For: Open Access Submission to db's Retrieval Ontology development

Markup Languages Provides grammar to define document types Delineate & identify document elements (atoms) in text Syntax: Structural relationships between elements (parent/child, cardinality, ordinality, id/idref, key/keyref)‏ Beyond the PDF‏

TaxonX schema Golden Gate Editor 250 Docs/7500 Treatments DSpace-based Digital Object Repository (handles)‏ SRS TAPIR (specimen data)‏ Species Profile Model/RDF (descriptive data)‏

Wildly heterogeneous Requires lax structuring of documents Need for regularization Requires editorial policy (reproduction: text of work or text of document) Defers much work of interoperability Benefits Treatments +names, subsections, localities, bibliographic references Extraction & representation in other services Costs GoldenGate configured for testbed: 3 minutes per page $5 page(?)‏

New Literature Different markup activity Different markup activity Prospective not Retrospective More optimal cost/benefit ratio? Strict modeling for consistent documents/data Increased regularization Increased sharing, re-use Decreased costs (potentially)‏: Application QC Adoption

TDWG Vocabularies supply many concepts NLM Journal Archiving and Interchange Tag Suite DTD's for markup of journal articles Archiving, Publishing, Authoring, other modules possible Wide adoption by publishers and aggregators; LOC Actively maintained Module for taxonomic treatments in Publishing

Inherit generic features from existing Tag Set Bibliographic references Tables Linking supporting material/data (xlink)‏ Linking to graphic and media objects (xlink)‏ Treatments Treatment sections Scientific names, Geographic names, Characters/States Specimens and other materials citations

Plazi: NLM conversion of Zootaxa and PLOS One articles Apply markup at earliest stage possible Develop tools to assist (probably easier than for “pure” legacy literature)‏ Extend codes and structures to handle electronic publication Shifts “illustrated narrative” complex digital objects METS, OAI-ORE, MPEG-21/DIDL

Text Materials Description Treatment Image Data Nomenclature

Linked Data Machines > Documents > Data Open documents, free data Reduced costs of use/re-use (e.g., SPM for EOL)‏ Broaden scope of application Accelerate velocity of information exchange