Sharing repositories for editorial scholarship of digital texts. The Pinakes 3.0 Open Source Project Andrea Bozzi - CNR-ILC,

Slides:



Advertisements
Similar presentations
PUMA & MetaPub Open Access to Italian CNR Repositories in the Perspective of the European Digital Repository Infrastructure GL9 - NINTH INTERNATIONAL CONFERENCE.
Advertisements

Daniela Luzi, Rosa Di Cesare, Roberta Ruggieri, Loredana Cerbara Consiglio Nazionale delle Ricerche, Istituto di Ricerche sulla Popolazione e le Politiche.
Strategies and activities undertaken in Italy for diffusion and dissemination of Minerva products Ministerial NEtwoRk for Valorising Activities in digitisation.
Rossella Caffo Ministero per i beni e le attività culturali Istituto centrale per il catalogo unico delle biblioteche italiane (ICCU) World Digital Library.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LSA Symposium: The Open Language Archives Community.
Pisa Research Area National Research Council Computer Science Institutes ERCIM Italian Partners Norma Lijtmaer.
DILIGENT Digital libraries powered by the Grid Peter Fankhauser
The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
DRIVER Step One towards a Pan-European Digital Repository Infrastructure Norbert Lossau Bielefeld University, Germany Scientific coordinator of the Project.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Special applications for Digital Libraries: computer-aided philological and linguistic analysis of digital documents Istituto di Linguistica Computazionale.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
1 Workshop on Novel Technologies for Digital Preservation of Cultural Heritage Collections, Ormylia, 21-22/5/2004 LABORATORIES ON SCIENCE AND TECHNOLOGY.
Digital Libraries of the Future – and the Role of Libraries Donatella Castelli ISTI-CNR.
Torrossa The Casalini Libri Full Text Platform Featuring ebook and ejournal content from Romance language countries Moscow International.
MICHAEL and the Italian Culture Portal: a cooperation model among national, regional, and local institutions The MICHAEL Project is funded under the European.
Why search again and again? Encore and next-generation searching at UQ Keith Webster University Librarian & Director of Learning Services.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Gabriella Pardelli, Manuela Sassi, Sara Goggi Istituto di Linguistica Computazionale “Antonio Zampolli”, ILC Consiglio Nazionale delle Ricerche CNR- Pisa,
The OpenAIRE Project Open Access Infrastructure for Research in Europe Stefania Biagioni, Donatella Castelli, Paolo Manghi CNR - ISTI GL11 - Library of.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Vesna Župan, M.Sc. informator-adviser The “Svetozar Marković” University Library Belgrade.
GOLD: a ten-year database of school best practices Agenzia Nazionale per lo Sviluppo dell’Autonomia Scolastica (former INDIRE) Antonella.
LIDA May 2009 Considering the humanities scholars perspectives of digital libraries: an Italian case study Anna Maria Tammaro University of Parma.
The physics departments and documents network EUNIS Conference, Bled, June 29 th -July 2 nd 2004 Michael Schlenker: Dynamic.
Save time. Reduce costs. Find and reuse interoperability solutions on Joinup for developing European public services Nikolaos Loutas
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
Developments of the Semantic Web at the Museum of the History of Science Florence, 16 June 2003 Marco Berni.
EContentplus BERNSTEIN – THE MEMORY OF PAPERS Collaborative systems for paper expertise and history (targeted project) max. EU funding: 1,6 Mill EURO project.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Database Management System Prepared by Dr. Ahmed El-Ragal Reviewed & Presented By Mr. Mahmoud Rafeek Alfarra College Of Science & Technology- Khan younis.
Business Modeling of the Application Architecture of the Bulgarian Folklore Artery Business Modeling of the Application Architecture of the Bulgarian Folklore.
GREY LITERATURE AND COMPUTATIONAL LINGUISTICS: FROM PAPER TO NET Claudia Marzi, Gabriella Pardelli, Manuela Sassi Istituto di Linguistica Computazionale.
EVA Workshop, 26 March 2003, Florence, Italy1 COINE Cultural Objects In Networked Environments Anthi Baliou University of Macedonia,Library Thessaloniki,
Project of a European Doctorate in Digital Culture “Comunitatis Europeae Doctor” Francesca Bocchi Francesca Bocchi - Doctorate in Digital Culture Maastricht,
DILIGENT A step towards a knowledge infrastructure.
ON-line SERVICES based on DIGITAL DOCUMENTS Prof. Doina Banciu ROCS Bucharest, 2008.
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
NOV-3261-SL-3699 v.1.0 The DeSurvey website Véronique BRUNIQUEL First Annual Meeting – April 4-7, 2006 Vasto, Italy.
Examples for Open Access Scholar Electronic Repository by New Bulgarian University IP LibCMASS Sofia 2011 Contract № 2011-ERA-IP-7 Sofia, September,
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Why to care about research?
GL15 Grey Literature Bratislava 2-3 december 2013 Industrial Philology: problems and techniques of data and archives preservation for future generations.
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
Digital University of Pisa Alessandro Lenci CoLing Lab – Laboratorio di Linguistica Computazionale Università di Pisa Aix-Marseille Université.
LIALIA The LIA Project Italian Accessible Books London Book Fair– April, 11 th.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Faculty of Education, Language and Community Services Stavroula Tsembas Marketing and Distribution: Metadata Linkages What is metadata? information about.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Our Digital Showcase Scholars’ Mine Annual Report from July 2015 – June 2016 Providing global access to the digital, scholarly and cultural resources.
Share.TEC: Sharing Digital Resources in the Teacher Education Community Fred de Vries Open Universiteit, Centre for Learning Sciences and Technologies.
e-Education and Knowledge Society
Task 2.6 Eric Delory PLOCAN
An Approach to Software Preservation
At the fringes of the Republic of Scientists:
Project 1 Introduction to HTML.
VI-SEEM Data Repository
DRIVER Digital Repository Infrastructure Vision for European Research
Darja Fišer CLARIN ERIC Director of User Involvement
Istituto di Linguistica Computazionale – Pisa
Malte Dreyer – Matthias Razum
Objectives, activities, and results of the database Lituanistika
Culture Statistics: policy needs
IDRP: The first distributed data management infrastructure for nanoscience Rossella Aversa Karlsruhe Institute for Technology (KIT) – Steinbuch Center.
SDMX IT Tools SDMX Registry
Presentation transcript:

Sharing repositories for editorial scholarship of digital texts. The Pinakes 3.0 Open Source Project Andrea Bozzi - CNR-ILC, Pisa, Italy & Andrea Scotti - IMSS/FRD, Florence, Italy The Marriage of Mercury and Philology: Problems and Outcomes in Digital Philology Edinburgh

Summary 1.Actors (a), objectives (b), state of the art (c) and current user group (d). 2.PKA: Powered Knowledge Architecture: overall view and generalistic function of all modules (a-b). 3.PK Main: Dynamic modeling of schema/s and data (on web direct). 4.PK Text: Methods and functionalities and subset modules (a-g). 5.Towards a possible co-ordination act for a European Consortium in Digital Humanistic & Linguistic Scholarship.

1. Actors, objectives, state of the art and future development (a). Individuals: Author and project leader: Andrea Scotti (Fondazione Rinascimento Digitale, Florence; Istituto e Museo di Storia della Scienza, Florence) Co-authors and developers: Fabrizio Butini and Corrado Veser (Istituto e Museo di Storia della Scienza, Florence) Pinakes Text - Author: Andrea Bozzi (ILC - Consiglio Nazionale delle Ricerche, Pisa) Search engine, Pinakes Text, and PK Advanced Edition - Development: Paolo Ruffolo (ILC - Consiglio Nazionale delle Ricerche, Pisa) & Engineering Faculty for IT, Valeriano Sandrucci, Luca Romano Dep. for Software Architecture and Validation, University of Florence Project Duration: since 2005 Institutions: 1.The project is financially supported by the Fondazione Rinascimento Digitale which has been created as a no-profit institution within the framework activities of the Ente Cassa di Risparmio in Florence. 2.The promoter is the Institute and Museum for the History of Science in Florence in coordination with the Libraries Directorate of the Italian Ministry for Cultural Heritage.

1.The overall goal is to facilitate both understanding the epistemological relevance of computational methodology in the humanities research/studies and to offer there within a feasible way to deploy it across different disciplines. 2.This implies to make available a public and customable set of tools to describe, manage, structure and publish all kind of information & research results concerning the cultural heritage in general and the humanities studies in particular. 3.To offer a coordination between existing repositories and develop a set of services to make accessible and moveable research results within a new perspective of authorship and intellectual property using a generalized standard model of metadata description. 1. Actors, objectives, state of the art and current user group (b).

1. Actors, objectives, state of the art and current user group (c). 1.Pinakes 3.0 Base Edition that includes: a)PK Schema and Project administration Alpha version published in 2007 and available at the home page. The Beta version will be published within April b)PK dynamic Input interface Alpha Version published in 2007 and available at the home page. The Beta version will be published within April 2008 c)PK Text experimental version. The Alpha version will be available within April Pinakes 3.0 Base Edition documentation and code is published since 2006 both on the home page pinakes.imss.fi.it and on SourceForge.org. Currently all code is visible and accessible also on the Italian National Observatory for Open Source at the (Public Administration main home page).

1. Actors, objectives, state of the art and current user group (d). 1.All Pinakes 2.0 projects published on the web since 1996 and visible on the web at the address: will be brought into the current version. Among them already: a)Panopticon Lavoisier (Works and Life of ) b)Parnassus Scientiarum (The Waller Collection) c)Theatre of Nature: works and life of Ulisse Aldovandi Are already transferred and undergoing a significant test. 2.Candidates that have submitted a cooperation act are: a)National Edition of G. Galilei including Iconography and scientific instruments - IMSS/MIBAC b)Work of Dante Alighieri - SDI c)Liz - Letteratura Italiana Zanichelli from University of Rome d)Uffizi Library - Florence e)Gabinetto Viessieux - Palazzo Strozzi, Florence f)University of Siena - All archeological excavation, research data sets concerning Tuscany. g)The Medieval Philosophical Texts and manuscirpt catalogue - SISAL, Fodazione Franceschini, Florence

1. Actors, objectives, state of the art and current user group (d continued). Since end 2007 and 2008 a new set of tools, produced in cooperation with other research bodies, will be included in Pinakes 3.0 OSI package. Among them we can list: The morpho-syntactical analyzer engine produced within in the cooperation activities of “Padre Brusa Center”, Universita´Cattolica il Sacro cuore, Milan, by Marco Passarotti - a TreeBank application - and the computational linguistic activities of the Karl University of Prague (Dep. of IT & Linguistics). This model will be crossed with that of TigerSearch by the University of Stuttgart and currently used from the unit of Greek Language Analysis at the Universita´ Ca´Foscari, Venezia, by Citti´s group. The applications resulting from a EU Project called “Beyond Text” which cross the morpho-syntactical analysis with the pattern recognition method in order to make also non-natural languages searchable (see ahead in this presentation).

2. PKA: Powered Knowledge Architecture: overall view and generalistic function of all modules (a).

2. PKA: Powered Knowledge Architecture: overall view and generalistic function of all modules (b).

PK Main: Dynamic modeling of schema/s and dataPK Main: Dynamic modeling of schema/s and data.

Pinakes3 Text BrowserSearch Engine PinakesText DB - Tag + position storage Input application XMl/TEI Loader CVS Repository of digital objects 4. PK Text: Methods and functionalities and subset modules (a).

4. PK Text: Methods and functionalities and subset modules (b). Main working interface

4. PK Text: Methods and functionalities and subset modules (c). Text variants finder

4. PK Text: Methods and functionalities and subset modules (d). Word and position finder

4. PK Text: Methods and functionalities and subset modules (e). Image selection, texts selection, tagging and tag qualification

4. PK Text: Methods and functionalities and subset modules (f).Tag menu of sources and word values

4. PK Text: Methods and functionalities and subset modules (g).View of all transcription and variants with tag included

PK Text: Methods and functionalities and subset modules (h).Sample of a TreeBank search/analytic result - syntactic tdependence tree of a sentence of the Index Thomisticus -annotation tool by : TrEd (UFAL – Praga) -on top: the sentence - to each word of the sentence corrisponds a node in the tree and to each word is accociated a lemma (vd. Producit/produco) and a number of Tags concerning the attibute time, gender, number etc. to each word is associated a “afun” (analytic function) : Obj, Sb, etc.

PK Text: Methods and functionalities and subset modules (i).Sample of a TreeBank NetGragh

Towards a possible co-ordination act for a European Consortium in Digital Humanistic & Linguistic Scholarship. A list key points to be discussed (a)

Towards a possible co-ordination act for a European Consortium in Digital Humanistic & Linguistic Scholarship. A list key points to be discussed (b)

Towards a possible co-ordination act for a European Consortium in Digital Humanistic & Linguistic Scholarship. A list key points to be discussed (c)