Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.

Slides:



Advertisements
Similar presentations
Introducing the ELAR information system architecture
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
The way to open resources Laurent Romary CNRS. Two aspects of scientific communication Research papers –All types (Conferences, journals, grey literature.
IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
Accessing Distributed Resources Information: An OLAC perspective Steven Bird Gary Simons Chu-Ren Huang Melbourne SIL Academia Sinica ENABLER/ELSNET Workshop.
CLARIN Metadata & ISO DCR Daan Broeder. Max-Planck Institute for Psycholinguistics TKE ES05 Workshop, August 14th Dublin.
Dr. Bruce A. Scharlau, AHDIT, ES2002 E-Business Workshop AHDIT: Ad Hoc Data Interoperability Tool Dr. Bruce A. Scharlau Dept. of Computing Science University.
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Interoperability aspects in the The Virtual Language Observatory Dieter Van Uytvanck Max Planck Institute for Psycholinguistics
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Advanced Metadata Usage Daan Broeder TLA - MPI for Psycholinguistics / CLARIN Metadata in Context, APA/CLARIN Workshop, September 2010 Nijmegen.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
© Copyright 2012 STI INNSBRUCK Apache Stanbol.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
The Language Archive – Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Metadata Component Framework Possible Standardization Work.
The current state of Metadata - as far as we understand it - Peter Wittenburg The Language Archive - Max Planck Institute CLARIN Research Infrastructure.
1 Workshop Goals DELAMAN and DAM-LR Peter Wittenburg MPI for Psycholinguistics Access Management Nijmegen November 2004.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
What Linguists Want (we think) Helen Aristar Dry & Anthony Aristar LINGUIST List & E-MELD.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Agenda CMDI Workshop 9.15 Welcome 9.30 Introduction to metadata and the CLARIN Metadata Infrastructure (CMDI) 10.15Coffee 10.30Use of ISOCat within CMDI.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
June 20, 2006E-MELD 2006, MSU1 Toward Implementation of Best Practice: Anthony Aristar, Wayne State University Other E-MELD Outcomes.
The TARO Project Texas Archival Resources Online Fred Gilmore Sr Operating Systems Specialist UT Austin General Libraries April.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
CLARIN Metadata Infrastructure Component Metadata and intermediate solutions Daan Broeder Claus Zinn Dieter van Uytvanck - Max-Planck Institute for Psycholinguistics.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
MPEG-7 Interoperability Use Case. Motivation MPEG-7: set of standardized tools for describing multimedia content at different abstraction levels Implemented.
1 DOBES/MPI Archive - architecture - Paul Trilsbeek, Roman Skiba, Peter Wittenburg MPI for Psycholinguistics Access Management Nijmegen November 2004.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Metadata Helen Aristar Dry Eastern Michigan University LINGUIST List.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Vocabularies for Description of Accessibility Issues in MMUI Željko Obrenović, Raphaël Troncy, Lynda Hardman Semantic Media Interfaces, CWI, Amsterdam.
Metadata for the GPII Liddy Nevile. DRD metadata.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
LEXUS a flexible web based lexicon tool LEXUS a flexible web based lexicon tool, august 21 th, 2005 Marc Kemps-Snijders Peter Wittenburg
CLARIN Issues Peter Wittenburg MPI for Psycholinguistics Nijmegen, NL.
A Data Category Registry- and Component- based Metadata Framework Daan Broeder et al. Max-Planck Institute for Psycholinguistics LREC 2010.
XML Extras Outline 1 - XML in 10 Points 2 - XML Family of Technologies 3 - XML is Modular 4 - RDF and Semantic Web 5- XML Example: UK GovTalk Group’s Schema.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
Johannes Keizer Food and Agriculture Organization of the UN Library and Documentation Systems Division Slide 1 AGRIS the next steps of the network
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
SemAF – Basics: Semantic annotation framework Harry Bunt Tilburg University isa -6 Joint ISO - ACL/SIGSEM workshop Oxford, January 2011 TC 37/SC.
Formats, interoperability and standards Marc Kemps-Snijders.
A Data Category Registry- and Component- based Metadata Framework Daan Broeder et al. Max-Planck Institute for Psycholinguistics LREC 2010.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Toward Best Practice for Language Resource Conversion
Software Engineering (CSI 321)
PREMIS Tools and Services
Session 2: Metadata and Catalogues
Presentation transcript:

Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies Thomas Soddemann:Middleware Concepts Egon Verharen:Application Concepts Peter Wittenburg: Interoperability Issues

Interoperability Aspects excellent paper by Bird and Simons many discussions about this topic during the last years (Semantic Web, eScience) it will keep us busy despite all standardization efforts D. Whalen: take all resources about EL you can get some general statements (short since we know about the problems) more elaboration on metadata since MD is important to get archives together MD is not a central topic at this workshop

Interoperability Aspects technical encoding differences are still a problem for texts (UNICODE not yet 100% coverage) well-known to relevant boards need better web-services for character conversion media shooting on a moving target due to technological development (storage/network + encoding/formats) but conversion algorithms are available in video area we miss same stability as in the audio area

Interoperability Aspects differences in structure and structure description will remain still many non-XML formats in use (legacy CHAT, WORD, …) still many resources are created without constraints (lots of errors) XML does not solve the issue, however, it makes structure explicit need agreements on general underlying models (AG, LAF, LMF, …) some conversions require interpretation (what to do with inheritance and conditions) debated in several meetings and initiatives (E-Meld, OLAC, …) need improved format conversion web-services

Interoperability Aspects linguistic encoding differences will remain different theories and languages ad hoc needs during fieldwork also debated broadly during last years formal frameworks are available (RDF, RDF-S, OWL) concrete activities to tackle these issues GOLD ontology (E-Meld) ISO TC37/SC4 Data Category Registry IMDI-OLAC Gateway ECHO DORA domain (10 repositories, 5 disciplines) …

Metadata – Classical View relevant for integrating archives I I MD search … content search combined search content view content play MD browse metadata domain resource domain

Metadata Classical View I Data provider Service provider MD search OAI PMH wrapper many examples: IMDI-OLAC IMDI-Ethnology IMDI-HoA IMDI-HoS …

Metadata Future View I MD Descriptions are abstractions MD Descriptions are fingerprints small handy informative with limitations metadata domain resource domain

Metadata Future View I MD Descriptions are abstractions MD Descriptions are fingerprints small handy informative with limitations metadata domain so – let’s see what we can do with them and what the requirements will be

Metadata Future View II users can collect MD search on MD browse in MD run statistics enhance them add private resources … basket has a new temporary personalized view on archival resources it’s a private workspace maintain all references

Metadata Future View II how can it work? either remain within one domain such as IMDI or convert all MD at the receiver side (ECHO solution) ontology

Metadata Future View III MD providers offer services structure is made explicit pick what you want and the way you want ontology still needed to do searches etc who does what? will providers use ontologies? ontology

Ontology Debate I rich ontology vs. flat concept registry (incl. is_a relation) in case of flat registry: where to put all relations such as is_similar, has_a, … centralized ontologies vs. practical ontologies GOLD starts with SUMO ISO DCR is central – agreement on personal DCRs

Ontology Debate II how long will it take to be there? nevertheless – have to start now! central ISO DCR MPI DCR personal DCR Search Engine relations Domain of Ontologies there will be many knowledge sources

Metadata Concepts MDlanguagebundlinghierarchiesbrowsingannotations MPEG7 √√√√√ METS √√√√ IMDI √√√√ OLAC √√ DC √ different container types are used (files, DB, CMS, …) different shells/services necessary for exploration “all” are schema based

Metadata Future did not speak about content integration problems are similar – text is not so much data what we want is clear or ? is it realistic? many questions to be answered it is an interesting time