PC in TB Manfred Thaller PLANETS TB meeting, DenHaag, Sept 28th. '06.

Slides:



Advertisements
Similar presentations
XCEL / XCDL Tools Jan Schnasse PLANETS: Den Haag,
Advertisements

PC/4 Manfred Thaller PLANETS TB meeting, DenHaag, Sept 29th. '06.
Characterisation Adrian Brown The National Archives, UK.
XML III. Learning Objectives Formatting XML Documents: Overview Using Cascading Style Sheets to format XML documents Using XSL to format XML documents.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Designing Websites Using HTML and FrontPage A Typical Webpage View Source A webpage is a text file containing instructions to tell a computer how the.
Web Development & Design Foundations with XHTML
XML Technology in E-Commerce
What is a text within the Digital Humanities, or some of them at least? Manfred Thaller, Universität zu Köln Digital Humanities 2012, July 20 th 2012.
SMPTE Timed Text in the UltraViolet™ Common File Format Mike Dolan (TBT)
The Concept of Computer Architecture
Content Types: Markup and Multimedia. Introduction Markup languages use extra textual syntax to encode: –Formatting / display information –Structure information.
EE442—Multimedia Networking Jane Dong California State University, Los Angeles.
WMES3103 : INFORMATION RETRIEVAL
Media: Text “Words and symbols in any form, spoken or written, are the most common system of communication.” ~ unknown.
XHTML and CSS Overview. Hypertext Markup Language A set of markup tags and associated syntax rules Unlike a programming language, you cannot describe.
Developing a Basic Web Page with HTML
11 Data Interface Standard for Accounting Software Project Progress Report China National Audit Office June, 2015.
Text. Graphics Images – photos Animation Video Audio Text Copyright issues.
The PLANETS-Ontology in the context of the PLANETS-Testbed and the XCL-Software.
Prepared by George Holt Digital Photography BITMAP GRAPHIC ESSENTIALS.
Chapter 6 Text and Multimedia Languages and Properties
Naresuan University Multimedia Paisarn Muneesawang
ACM 511 HTML Week -1 ACM 511 Course Notes. Books ACM 511 Course Notes.
1 Web Developer Foundations: Using XHTML Chapter 2 Key Concepts.
HTML 4 Foundation Level Course HyperText Markup Language Most common language used in creating Web documents. You can use HTML to create cross-platform.
The XCL Languages Digital Preservation – The Planets Way Dresden, April 23 rd 2010 Manfred Thaller, Universität zu Köln.
EXtensible Characterisation Languages (XCL) Manfred Thaller, (University at Cologne) DPP meeting, Glasgow, Nov. 23 rd 2006.
Object Orientated Data Topic 5: Multimedia Technology.
Multimedia Specification Design and Production 2012 / Semester 1 / L3 Lecturer: Dr. Nikos Gazepidis
File Formats, Significant Properties Manfred Thaller Universität zu* Köln February 19 th, 2009 *University at not of Cologne.
Information Processes and Technology Multimedia: Graphics.
Graphics. Graphic is the important media used to show the appearance of integrative media applications. According to DBP dictionary, graphics mean drawing.
SEC (1.4) Representing Information as bit patterns.
1 Text Reference: Warford. 2 Computer Architecture: The design of those aspects of a computer which are visible to the programmer. Architecture Organization.
HTML Basics Computers. What is an HTML file? *HTML is a format that tells a computer how to display a web page. The documents themselves are plain text.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
XCL-Tools in relation to Significant characteristics in Planets Manfred Thaller Universität zu* Köln *University at not of Cologne.
HTML Basics. HTML Coding HTML Hypertext markup language The code used to create web pages.
Sampling Design & Measurement Scaling
MULTIMEDIA Multimedia is the field concerned with the computer- controlled integration of text, graphics, drawings, still and moving images (Video), animation,
Microsoft Expression Web 3 – Illustrated Unit D: Structuring and Styling Text.
Digital Graphics for Computer Games Pixels Types of Digital Graphics (Raster and Vector) Compression.
1 Problem Solving using Computers “Data....Representation, and Storage.
Chapter 11 File Systems and Directories. 2 File Systems (Chapter 11.1) File: 1. A named collection of related data. 2.smallest amount of information that.
Lesson 5 MULTIMEDIA. Multimedia on the Web has expanded rapidly as broadband connections have allowed users to connect at faster speeds. Almost all Web.
XP 2 HTML Tutorial 1: Developing a Basic Web Page.
CS 101 – Sept. 11 Review linear vs. non-linear representations. Text representation Compression techniques Image representation –grayscale –File size issues.
XML Extensible Markup Language
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
XML 1.Introduction to XML 2.Document Type Definition (DTD) 3.XML Parser 4.Example: CGI Gateway to XML Middleware.
Blended HTML and CSS Fundamentals 3 rd EDITION Tutorial 1 Using HTML to Create Web Pages.
Text and Images Key Revision Points.
AP CSP: Encoding and Sending Formatted Text
File Compression 3.3.
XML QUESTIONS AND ANSWERS
Representing Information as bit patterns
Chapter 3:- Graphics Eyad Alshareef Eyad Alshareef.
Overview What is Multimedia? Characteristics of multimedia
Representing Images 2.6 – Data Representation.
Database Systems Instructor Name: Lecture-3.
File Analysis with MicroSoft DEBUG
Assist. Lecturer Safeen H. Rasool Collage of Science Department of IT
CS3220 Web and Internet Programming HTML and XML Basics
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting
Presentation transcript:

PC in TB Manfred Thaller PLANETS TB meeting, DenHaag, Sept 28th. '06

PC * in TB * as represented by PC/2, PC/4 and PP/5 or: The XCEL / XCDL concept.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06

Building block I A language, which allows a program to read "any file specification" based on ==> "eXtensible Characterisation Extraction Language" Formulate the humanly readable specifications of TIFF, RTF, WAV …in a language, which a general purpose program can read. General enough that any existing format specification can be expressed in it. (LATeX, MAX, VRML …)

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block I - Warning After the alphabet had been designed...

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block I - Warning After the alphabet had been designed somebody had still to write all those books.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block I - Warning After the alphabet had been designed somebody had still to write all those books.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block II A language, which allows a program to describe "any file content" using a ==> "eXtensible Characterisation Definition Language" Formulate the content of any file in an abstract language, which captures the complete information contained in it. General enough that any existing content can be expressed in it.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block III A program, which is able to interpret a format description in XCEL, and, using that, extracts from any file of that format a XCDL description of its content. Production level quality. Indicative performance: <= 1 second / file.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Building block IV A program, which takes two XCDL descriptions and delivers a statement about the similarity of the information described.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Relationship to DOW PC/2 defines the languages. (Starting: month 1 – [ finished month 18 ]. ) Deliverable: End month 5. Reuses PRONOM / DROID. PC/4 implements the extraction mechanism (Starting: month 1, ups, 4 – [ finished month 18 ]. ) Reuses any existing tools. PP/5 implements comparison mechanism and metrics of similarity of "information". (Starting: month 15.)

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Metadata Derivation File format A: # of color bands File format B: depth

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Metadata Derivation From observed file properties ==> Property Ontology

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Basic Elements: Byte Order Encodings Position Types... Structuring Elements: Item (logical unit that contains at least one sub-item) Symbol (smallest logical unit) Image Schema: Colour Type Width Height Bit Depth … Text Schema: Font-Style Font-Family Size Language … Multimedia Schema: Pitch Samplerate Channels Framerate... PNG Instance RTF Instance TIFF Instance PDF Instance WAV Instance MPEG4 Instance Processing Instructions: filepointers symbol-counters … Schema Architecture

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Metrics of Comparison I "Information" will be grouped according to three levels: – Descriptive (width, height,photogrammetric interpretation, aka 1 = red ) – History (compression,photogrammetric interpretation, aka 1 = red) – Content (bytestream)

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Metrics of Comparison II – Descriptive (width, height,photogrammetric interpretation, aka 1 = red ) Can this be the same object? – History (compression,photogrammetric interpretation, aka 1 = red) Can this have been the same object? – Content (bytestream) Is this the same object?

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 Metrics of Comparison III – Is the sequence of (UTF16) characters the same? – Are properties with the same symbolic name applied to the same areas within the UTF16 sequence? – Are the properties related to the same objects?

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 XCDL: Observation An XCDL description at the content level is actually a "universal virtual file format" … … though inflated to about 210 % of the original size.

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 PC (XCEL/XCDL) ==> TB Provide: comparison tool. [ profiling tool. ] [ validation. ] [ identification. ]

Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06 TB ==> PC (XCEL/XCDL) Quis custodiet ipsos custodes? Or: Who tests the testing tool? Or: Beta (and possibly pre-Beta) testing. Behaviour. Performance. Calibration. Reference objects.

The end Manfred Thaller PLANETS TB, Den Haag, Sept. 28 th '06