NewsML™, NITF & NewsCodes The winning triple Michael Steidl IPTC Managing Director ANSA/FIEG meeting 19 April 2006, Rome.

Slides:



Advertisements
Similar presentations
Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.
Advertisements

OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
XP New Perspectives on Microsoft Office Word 2003 Tutorial 7 1 Microsoft Office Word 2003 Tutorial 7 – Collaborating With Others and Creating Web Pages.
XBRL Distribution Using NewsML Ken Wolf XBRL Software Architect, BusinessWire businesswire.com April 2005.
IPTCs RightsML Rights expressions for the news industry ODRL Meeting 28 September 2011 Barcelona, Spain Michael Steidl, Managing Director IPTC.
CREATING WEB PAGES INTERNET IN THE CURRICULUM MODULE 8:
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
1. Content – Collective term for all text, images, videos, etc. that you want to deliver to your audience. 2. Structure – How the content is placed on.
Google Chrome & Search C Chapter 18. Objectives 1.Use Google Chrome to navigate the Word Wide Web. 2.Manage bookmarks for web pages. 3.Perform basic keyword.
Semantically meaningful elements Elements that are self-descriptive; they describe the purpose of the content they contain Examples: – element defines.
Microsoft Office Suite Microsoft PowerPoint
CONCEPTS FOR FLUID LAYOUT Web Page Layout. Website Layouts Most websites have organized their content in multiple columns (formatted like a magazine or.
Publishing Workflow for InDesign Import/Export of XML
Practical Object-Oriented Design with UML 2e Slide 1/1 ©The McGraw-Hill Companies, 2004 PRACTICAL OBJECT-ORIENTED DESIGN WITH UML 2e Chapter 5: Restaurant.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Glencoe Digital Communication Tools Create a Web Page with HTML Chapter Contents Lesson 4.1Lesson 4.1 Get Started with HTML (85) Lesson 4.2Lesson 4.2 Format.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
Web Content Management at GCN.com The Gilbane Conference: Content Technologies for Government Alec Dann SVP of Internet Publishing PostNewsweek Tech Media.
ASIDIC Spring Conference ‘Smart Content’ Uncovering the Value and Benefits of Semantic Technology Richard C. Fusco Director, Content Strategy – McGraw-Hill.
Meaning Through Design © M. Grazia Busà Functions of design  Attracts audiences  Guides readers through the publication  Communicates how to.
Chapter 9 Collecting Data with Forms. A form on a web page consists of form objects such as text boxes or radio buttons into which users type information.
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
Slide 1 Today you will: think about criteria for judging a website understand that an effective website will match the needs and interests of users use.
Chapter Objectives Discuss the relationship between page length, content placement, and usability Complete Step 4: Specify the website’s navigation system.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
GMMP 2009/2010. Follow the GMMP Monitoring Methodology Guide Monitoring involves: Quantitative analysis: the numbers of women and men in the world's news,
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
Metadata: first principles Pat Bell Knowledge, Analysis and Intelligence.
GET YOUR BOOK LISTED ON SEARCH ENGINES AND BOOK INDUSTRY DATABASES Renée RegisterRebecca Albani.
HTML (HyperText Markup Language)
IPTC Semantic Web Working Group Stuart Myles Associated Press 7 th March 2011.
The Lifecycle of Embedded Image Metadata within Digital Photographs: Challenges and Best Practices. - or - The Secret Life of Photo Metadata To promote,
Maintaining Your Website Using Cascade CMS Presented by Chris Cheung and Marketing & Communications.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Website Development with Dreamweaver
Learning Web Design: Chapter 4. HTML  Hypertext Markup Language (HTML)  Uses tags to tell the browser the start and end of a certain kind of formatting.
Web Accessiblity Carol Gordon SIU Medical Library.
Cascading Style Sheets by Pavlovic Nenad by. Presentation Contents  What is CSS?  Why CSS?  Types of Style Sheets  Style Sheets Syntax  Box Formatting.
Copyright © 2013 MyGraphicsLab / Pearson Education STRUCTURE AND HTML TAGS MyGraphicsLab: Adobe Dreamweaver CS6 ACA Certification Preparation for Web Communication.
 Attractive page layout  The contrast and blend of clolours is well balanced  Legible fonts  Headlines, brief news items, photos and videos provided.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
© 2010 Delmar, Cengage Learning Chapter 8 Collecting Data with Forms.
XML and Digital Libraries M. Zubair Department of Computer Science Old Dominion University.
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
1 XML An Overview Roger Debreceny University of Hawai`i Skip White University of Delaware XBRL Workshop, August 2006.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Chapter 7.  Feature sidebars ▪ A piece that is used to support or accompany another story that has been written  In Magazines ▪ Usually written over.
Standards-Based Knowledge Systems using NewsML and Topic Maps Presented by Daniel Rivers-MooreDaniel Rivers-Moore Director of New Technologies, RivComRivCom.
Core Publisher: Station Administrator Tools. Training 1: Site Administration Training 2: Programs Training 3: Content Tagging Training 4: Creating Posts.
Presented By:- Thomas Steiner Raphael Troncy Michael Hausenblas Reviewed By:- Sudeep Malik Professor :- Chris Mattmann.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
“I don’t want to get my copyright stripped off” Michael Steidl, Privacy & Security Workshop 13 October 2015, Brussels (Belgium)
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
National Panchayat Portal ( Offering Dynamic Website for each Panchayat Helping them to manage Content.
UI's for inputting and presenting the metadata of hypermedia documents Kai Kuikkaniemi HUT T
Today’s Lesson….. 1.Formative Assessment Given Back – Go through Answers. 2.Webpage Design.
Microsoft Expression Web 3 – Illustrated Unit D: Structuring and Styling Text.
XP Review 1 New Perspectives on JavaScript, Comprehensive1 Introducing HTML and XHTML Creating Web Pages with HTML.
Basic HTML Document Structure. Slide 2 Goals (XHTML HTML5) XHTML Separate document structure and content from document formatting HTML 5 Create a formal.
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
Conceptual Overview For Understanding the New Paradigm Provided by: Web Services Section.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
CONCEPTS FOR FLUID LAYOUT Web Page Layout. Essential Questions What challenges do mobile devices present to Web designers? What are the basic concepts.
Adobe Dreamweaver CS4 OPEN YOUR NOTEBOOK FILES
Attributes and Values Describing Entities.
OCR Level 02 – Cambridge Technical
HTML / CSS Mai Moustafa Senior Web Designer eSpace eSpace.
Attributes and Values Describing Entities.
Presentation transcript:

NewsML™, NITF & NewsCodes The winning triple Michael Steidl IPTC Managing Director ANSA/FIEG meeting 19 April 2006, Rome

© 2006 IPTC All rights reserved2 Who is what NewsML 1: News Markup Language for managing and packaging of news –allows versioning of news items: easy tracking of breaking news = evolving stories. –rich set of management metadata: publishing status (“usable”, ”embargoed”, “canceled”, …) why updated, links to other news items (like “see also”) –packaging of news items of different media types (text, photo, …) NITF: News Industry Text Format for marking up text news –inline markup of text –structure for semi-layout (e.g. tables) NewsCodes: for proper categorisation –Subject NewsCodes with about 1300 terms, in three levels

© 2006 IPTC All rights reserved3 NewsML™ version 1 How a NewsML instance is built: Structured content: story package Top Content Container = the NewsItem

© 2006 IPTC All rights reserved4 NewsML™ version 1 How a NewsML instance is built: Structured content: story package Top Content Container text / role = interview Content Component

© 2006 IPTC All rights reserved5 NewsML™ version 1 How a NewsML instance is built : Structured content: story package Top Content Container text / role = interview text / role = background

© 2006 IPTC All rights reserved6 NewsML™ version 1 How a NewsML instance is built : Structured content: story package Top Content Container text / role = interview text / role = background photo / role = pic of person

© 2006 IPTC All rights reserved7 NewsML™ version 1 How a NewsML instance is built : Structured content: web page package Top Content Container text / role = main story

© 2006 IPTC All rights reserved8 NewsML™ version 1 How a NewsML instance is built : Structured content: web page package Top Content Container text / role = main story text / role = tickerline1 text / role = tickerline2 text / role = tickerline3 text / role = tickerline4

© 2006 IPTC All rights reserved9 NewsML™ version 1 How a NewsML instance is built : Structured content: web page package Top Content Container text / role = main story text / role = tickerline1 text / role = tickerline2 text / role = tickerline3 text / role = tickerline4 text / role = sidebar photo role = pic text / role=newsaudio / role = sound

© 2006 IPTC All rights reserved10 NewsML™ version 1 Versioning: The original version is circulated at 11:32 ANSA Italy wins world championship ID: abc123 Version: 1 Another news 5 Another news 4 Another news 3 Another news 2 Another news 1 Italy wins world championship ID: abc123 Version: 1

© 2006 IPTC All rights reserved11 NewsML™ version 1 Versioning: An updated version is circulated at 13:43 ANSA Italy wins world championship ID: abc123 Version: 2 Another news 9 Another news 8 Another news 7 Another news 6 Italy wins world championship ID: abc123 Version: 1 Italy wins world championship ID: abc123 Version: 2 This is an update Italy wins world championship ID: abc123 Version: 1

© 2006 IPTC All rights reserved12 NewsML™ version 1 Summary NewsML provides a rich, well designed and extensible set of metadata to enhance routing and selecting. NewsML allows to manage items: –each item has a unique identifier –each item has a distinct version NewsML allows to package several pieces of content into one item – content of various media types NewsML adds value to packaging: –“roles” identify why the content is there –groups of packages enhance the structure

© 2006 IPTC All rights reserved13 NITF Feature “inline mark up”: one can add metadata to portions of the news text: The weather was superb today in Norfolk, Virginia. Made me want to take my boat, manufactured by the Acme Boat Company. This inline mark up may be used to add linked information to the final rendition: like identifying information about entities (“what company is that exactly?”) or a link to a background story. and to add layout “recommendations” (e.g. emphasised)

© 2006 IPTC All rights reserved14 NITF feature: “structure/layout mark up” today tide tomorrow next day third day beach high low ….  this sequence of strange looking code translates into a decent table (▼) and into a even more fashionable version on a layout system for newspapers.

© 2006 IPTC All rights reserved15 NITF Summary NITF is a kind of “HTML for all kinds of media” – it delivers the features of easy web publishing also to the print layout. Inline mark up allows to link to reference information and to background information Structure mark up allows to convey layout information from the maker of the news to its users.

© 2006 IPTC All rights reserved16 IPTC metadata codes The challenge: “The most effective communication occurs when all parties involved agree on the meaning of the terms being used.” (Fast,Leise & Steckel, “Boxes and Arrows”)

© 2006 IPTC All rights reserved17 IPTC metadata codes The challenge: “The most effective communication occurs when all parties involved agree on the meaning of the terms being used.” (Fast,Leise & Steckel, “Boxes and Arrows”) The solution: IPTC’s controlled vocabularies = Managed lists of codes (= abstract notations) with names (in different languages) with explicit explanations ( ≈ encyclopaedia) (in different lang.) each of the 28 for a specific scope to navigate content

© 2006 IPTC All rights reserved18 IPTC NewsCodes The common name for ALL controlled vocabularies maintained by the IPTC is IPTC NewsCodes (More info at

© 2006 IPTC All rights reserved19 IPTC NewsCodes Currently the IPTC maintains 28 sets of NewsCodes IPTC NewsCodes break out into groups:

© 2006 IPTC All rights reserved20 IPTC NewsCodes What the content is about –Subject-NewsCodes: ~ 1300 terms at 3 levels –SubjectQualifier-NewsCodes: men, women, age groups, sports specific qualifiers, …

© 2006 IPTC All rights reserved21 IPTC NewsCodes Formal attributes of the content –Genre-NewsCodes like current, update, wrap-up, background, feature, interview, review … –Scene-NewsCodes for photos like head-/half-/full-shot, interior/exterior, single/two/group … –Importance-NewsCodes identifying 6 levels –Location-NewsCodes are location qualifiers from “WorldRegion” to “Sublocation”

© 2006 IPTC All rights reserved22 IPTC NewsCodes Formal attributes of the media data –Format (mimetype, mediatype) –Encoding –Encoders –Physical Characteristics –Colourspace

© 2006 IPTC All rights reserved23 IPTC NewsCodes Codes to manage news exchange –(news) Provider-NewsCodes – already registered with the IPTC? –Status-NewsCodes (usable, embargoed …) –Priority-NewsCodes (9 levels) –Urgency-NewsCodes (9 levels) –Of interest to-NewsCodes identifying groups of the audience the content is aimed at –Relevance-NewsCodes identifying journalistic relevance –Role-NewsCodes to provide semantics to news package components (NewsML!)

© 2006 IPTC All rights reserved24 IPTC NewsCodes In depth … IPTC’s huge taxonomy to describe content The Subject NewsCodes

© 2006 IPTC All rights reserved25 IPTC NewsCodes The Subject NewsCodes Three level tree structure ~ 1300 terms in total 17 top level Subjects (Broadest term) for art, crime/law, disaster, economy/business, education, environment, health, human interest, labour, lifestyle, politics, religion, science/technology, social issues, sports, unrest/war, weather ~ 350 intermediate level terms (Narrow term, NT) ~ 900 third (= lowest) level terms (most NT)

© 2006 IPTC All rights reserved26 IPTC NewsCodes The Subject NewsCodes Term structure: each term has … a Code: 8 digits (e.g ) a Name: language specific string (e.g: weather/forecast or Meteorología/Pronósticos) an Explanation: short text describing the concept of this Subject- NewsCode term management data (versioning)

© 2006 IPTC All rights reserved27 IPTC NewsCodes The Subject NewsCodes Where to apply … Explicit tags are provided by: NITF NewsML IIM (aka “IPTC Headers” for images) “IPTC Core” Scheme for XMP (for Adobe CS products)

© 2006 IPTC All rights reserved28 IPTC NewsCodes The Subject NewsCodes How to apply … manually by editors (pick lists) automatically by categorization engines “mixed mode”: suggested by categorizer, changed/approved by editor

© 2006 IPTC All rights reserved29 IPTC NewsCodes A Subject NewsCodes example: “IPTC gave a presentation about their news technology at an ANSA/FIEG meeting in Rome” would e.g. resolve to: – (Technology/IT) – (Economy/Computing and IT) – (Economy/Media/News agency)

© 2006 IPTC All rights reserved30 IPTC NewsCodes The Subject NewsCodes You are in control: you can make your own subset select the Subject Codes you want to use for your agency select sets of Subject Codes for the various desks in your agency (e.g. economy, sports …)

© 2006 IPTC All rights reserved31 IPTC NewsCodes The Subject NewsCodes Additional refinement: Qualifiers –primarily used for sports –adds facets to the content like men/women, individual/team, indoor/outdoor …

© 2006 IPTC All rights reserved32 Thank you for your time