Special applications for Digital Libraries: computer-aided philological and linguistic analysis of digital documents Istituto di Linguistica Computazionale.

Presentation transcript:

Special applications for Digital Libraries: computer-aided philological and linguistic analysis of digital documents
Istituto di Linguistica Computazionale – Pisa
Andrea Bozzi
NEH/CNR Meeting, Washington DC, October 5, 2007

Presentation contents
1. An EU-supported system for Greek papyrology;
2. A special application for browsing and searching demotic documents on ostraka;
3. A philological workstation for digital medieval manuscripts;
4. CHLT-LEMLAT (EC-NSF project) to perform lemmatization of Latin texts;
5. How to integrate all these modules in a web-based open source application.

The philological workstation: image and text transcription

Image segmentation and semi-automatic word linking

Annotations and critical apparatus

Wordforms list and specific indexes
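
A word-form list of this kind can be thought of as an index from each distinct form to the positions where it occurs in the transcription. The following is a minimal sketch of that idea, not the workstation's actual code: the class name `WordformIndex`, the tokenization rule (split on any non-letter character) and the lower-casing of forms are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper: builds an alphabetical word-form list with token positions.
public class WordformIndex {

    // Maps each lower-cased word form to the 1-based positions of its occurrences.
    public static Map<String, List<Integer>> build(String text) {
        Map<String, List<Integer>> index = new TreeMap<>(); // TreeMap keeps forms sorted
        int position = 0;
        // Assumption: any run of non-letter characters separates two word forms.
        for (String token : text.split("\\P{L}+")) {
            if (token.isEmpty()) {
                continue; // split() yields an empty first token if the text starts with punctuation
            }
            position++;
            String form = token.toLowerCase(Locale.ROOT);
            index.computeIfAbsent(form, k -> new ArrayList<>()).add(position);
        }
        return index;
    }

    public static void main(String[] args) {
        String sample = "Omnis homines, qui sese student praestare ceteris animalibus";
        build(sample).forEach((form, positions) ->
                System.out.println(form + " " + positions));
    }
}
```

A sorted map is used so that iterating over the index directly yields an alphabetical list; specific indexes (for example by frequency) could be derived from the same structure.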

The web-based philological workstation for managing documents of the Istituto Papirologico Vitelli in Florence (restricted use)

Special system for teaching and retrieving linguistic information from demotic texts on ostraka
OMM 1381: E. Bresciani, S. Pernigotti, M.C. Betrò, Ostraka demotici da Narmuti, Pisa, 1983, pp ;
OMM 300: P. Gallo, Ostraca demotici e ieratici dall’archivio bilingue di Narmouthis, Pisa, 1997, pp ;
OMM 393: R. Pintaudi, P.J. Sijpesteijn, Ostraka greci da Narmuthis, Pisa, 1993, p. 40.

The digital image archive and the table of demotic signs

Search results: the areas highlighted in blue (see arrow) show where the selected sign has been found

Textual criticism for medieval manuscripts: link to the list of collated sources

Selection of the variant "eixens"; evaluation of the variant reading in the collated source

Recording of the variant "eixens" in the critical apparatus

Variant search across different early printed editions of the same work; link to the list of collated books

Image of the corresponding page

Lemmatization results (C. Sallustius Crispus, De coniuratione Catilinae, 1-2)

Lemmatization results of selected wordforms
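
The shape of such a result, one word form mapped to one or more candidate lemmas, can be illustrated with a toy lookup. This is only a sketch and is not how LEMLAT works: a real lemmatizer analyses morphology, whereas the hypothetical `ToyLemmatizer` below uses a small hand-written table whose entries were chosen for illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical toy: a hand-written form-to-lemma table, not a morphological analyser.
public class ToyLemmatizer {

    private final Map<String, List<String>> lexicon = new HashMap<>();

    public ToyLemmatizer() {
        // Illustrative entries only; a real lexicon would be far larger.
        lexicon.put("homines", Arrays.asList("homo"));
        lexicon.put("student", Arrays.asList("studeo"));
        lexicon.put("animalibus", Arrays.asList("animal"));
        // An ambiguous form: the noun "amor" or the passive of the verb "amo".
        lexicon.put("amor", Arrays.asList("amor", "amo"));
    }

    // Returns every candidate lemma for a word form, or an empty list if the form is unknown.
    public List<String> lemmasOf(String wordform) {
        List<String> lemmas = lexicon.get(wordform.toLowerCase(Locale.ROOT));
        return lemmas == null ? Collections.<String>emptyList() : lemmas;
    }

    public static void main(String[] args) {
        ToyLemmatizer lemmatizer = new ToyLemmatizer();
        for (String form : new String[] {"homines", "student", "amor", "praestare"}) {
            System.out.println(form + " -> " + lemmatizer.lemmasOf(form));
        }
    }
}
```

Returning a list rather than a single lemma keeps ambiguous forms visible, so the choice can be left to the philologist.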

Pinakes
Aim: web-based open source application to manage historical cultural heritage data in digital format.
Partners:
– Fondazione Rinascimento Digitale, Florence
– Istituto e Museo della Storia della Scienza, Florence
– Ministero per i Beni Culturali, Rome
– CNR, Istituto di Linguistica Computazionale, Pisa

Technology
– Programming language: Java (JDK 1.5)
– Servlet engine: Tomcat 5.5.x + Apache HTTP connectors
– Web server: Apache httpd server 2.2.x
– Web application framework: Jakarta Struts
– Web service framework: Apache Axis 1.4
– Database engine: PostgreSQL 8.1
– Programming environment: NetBeans
– Final development: Hibernate

Standards
– DCMI (Dublin Core Metadata Initiative)
– TEI (Text Encoding Initiative)
– OWL (Web Ontology Language)
– RDF/XML (Resource Description Framework, XML syntax)
– SPARQL (query language for RDF)
– UTF-8 (Unicode Transformation Format, 8-bit)
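
As an illustration of how two of these standards fit together on the Java stack listed above, the sketch below serializes a minimal Dublin Core description as UTF-8 XML using only the JDK's DOM and transformer APIs. It is not taken from Pinakes: the class name `DublinCoreRecord` and the wrapper element `record` are assumptions, while the `dc:` element names and the namespace URI come from the Dublin Core element set.

```java
import java.io.StringWriter;
import javax.xml.XMLConstants;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper: writes a tiny Dublin Core description as an XML string.
public class DublinCoreRecord {

    static final String DC_NS = "http://purl.org/dc/elements/1.1/";

    public static String toXml(String title, String creator, String date) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            Document doc = factory.newDocumentBuilder().newDocument();

            // "record" is an assumed wrapper element; Dublin Core does not prescribe one.
            Element root = doc.createElementNS(null, "record");
            root.setAttributeNS(XMLConstants.XMLNS_ATTRIBUTE_NS_URI, "xmlns:dc", DC_NS);
            doc.appendChild(root);

            append(doc, root, "dc:title", title);
            append(doc, root, "dc:creator", creator);
            append(doc, root, "dc:date", date);

            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
            transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException("Could not serialize the record", e);
        }
    }

    // Adds one Dublin Core element with a text value to the parent element.
    private static void append(Document doc, Element parent, String qName, String value) {
        Element element = doc.createElementNS(DC_NS, qName);
        element.setTextContent(value);
        parent.appendChild(element);
    }

    public static void main(String[] args) {
        System.out.println(toXml(
                "Special applications for Digital Libraries", "Andrea Bozzi", "2007-10-05"));
    }
}
```

The values in `main` are taken from this presentation's own title slide. In a fuller system the same description could also be expressed in RDF/XML and queried with SPARQL, which this sketch does not attempt.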