Tools and resources supporting the cultural tourism Istituto di Linguistica Computazionale “Antonio Zampolli” CNR - Pisa GL14: November 28, 2012 1 Sassolini.

Slides:



Advertisements
Similar presentations
a Terminological and Statistical Approach
Advertisements

PUMA & MetaPub Open Access to Italian CNR Repositories in the Perspective of the European Digital Repository Infrastructure GL9 - NINTH INTERNATIONAL CONFERENCE.
Development Data Platform Vilas Mandlekar DECDG e-DevelopmentThematic Group March 24, 2005.
Pisa Research Area National Research Council Computer Science Institutes ERCIM Italian Partners Norma Lijtmaer.
A platform of for knowledge and services sharing Fernando Ferri IRPPS-CNR.
MultiMatch: Multilingual / Multimedia Access to Cultural Heritage Carol Peters ISTI – CNR.
From Web Archiving services to Web scale data processing platform Internet Memory Research GA IIPC, Paris, May 19th 2014.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
Multilingual multimedia thesaurus for conservation and restoration collaborative networked model of construction Lucijana Leoni University of Dubrovnik.
Innovative technologies for customized and dynamic multimedia content production for tourism applications F. Spadoni, F. Tariffi*, E.Sassolini° Rigel Engineering.
Human Language Technologies. Issue Corporate data stores contain mostly natural language materials. Knowledge Management systems utilize rich semantic.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen D ö rre, Peter Gerstl, and Roland Seiffert.
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
Advanced Distributed Learning. Conditions Before SCORM  Couldn’t move courses from one Learning Management System to another  Couldn’t reuse content.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Funded by: European Commission – 6th Framework Project Reference: IST WP8: Exploitation and Dissemination Mondeca return of experience on transitioning.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Building an Ontological Base for Experimental Evaluation of Semantic Web Applications Peter Bartalos, Michal Barla, Gyorgy Frivolt, Michal Tvarožek, Anton.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Co-funded by the European Union Semantic CMS Community Design of Semantic CMS From free text input to automatic entity enrichment Copyright IKS Consortium.
KNOWLEDGE DATABASE Topics inside  Document sharing  Event marketing  Web content.
Claudia Marzi Institute for Computational Linguistics (ILC) National Research Council (CNR) - Italy.
Taylor Trayner. Definition  Set of business processes developed in an organization to create, store, transfer, and apply knowledge  Knowledge is a firm.
Marty Harris aka TEXT QUERY SYSTEM Marty Harris Mgr TRD.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
FP OntoGrid: Paving the way for Knowledgeable Grid Services and Systems WP8: Use case 1: Quality Analysis for Satellite Missions.
Laboratory for Internet Computing Harnessing Distributed, Heterogeneous Information Sources –Data integration with different formats –Extraction of information.
Claudia Marzi Institute for Computational Linguistics, “Antonio Zampolli” – Italian National Research Council University of Pavia – Dept. of Theoretical.
Gabriella Pardelli, Manuela Sassi, Sara Goggi Istituto di Linguistica Computazionale “Antonio Zampolli”, ILC Consiglio Nazionale delle Ricerche CNR- Pisa,
Caught in the Web: Web Archiving at U of A Libraries Geoff Harder and Kenton Good Digital Preservation Seminar | March 5, 2010 | University of Alberta.
GEO: a special collection for Earth Science community *Stefania Biagioni, *Silvia Giannini, **Cecilia Giussani *CNR-ISTI, **CNR-IGG Pisa, Italy GL13 Conference,
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
VIRTUAL INFORMATION AND KNOWLEDGE ENVIRONMENT FRAMEWORK IP-FP
1 Caselex: an e-Government Application favouring Interoperability Roberta Nannucci ITTIG/CNR Supported by the European Commission.
Webarchivering in het Audiovisuele Domein Web archiving in the audiovisual Domain Julia Vytopil- Nederlands Instituut voor Beeld en Geluid Netherlands.
GREY LITERATURE AND COMPUTATIONAL LINGUISTICS: FROM PAPER TO NET Claudia Marzi, Gabriella Pardelli, Manuela Sassi Istituto di Linguistica Computazionale.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Background and the importance of the studyPurposes of the studyFramework of the studyResearch methodologyResults of the researchConclusion Outline of.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Ontology-Centered Personalized Presentation of Knowledge Extracted from the Web Ralitsa Angelova.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. How valuable is medical social media data? Content analysis of the medical web Presenter :Tsai Tzung.
Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Trevor Crum 04/23/2014 *Slides modified from Shamil Mustafayev’s 2013 presentation * 1.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
CALIMERA: Co-ordination Action Cultural Applications: Local Institutions Mediating Electronic Resource Access.
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
PLANETS - DP sustainability 1 PLANETS Workshop on Sustainable Models for Digital Preservation Nov Brussels Carlos Oliveira (Deputy Head of Unit)
GL15 Grey Literature Bratislava 2-3 december 2013 Industrial Philology: problems and techniques of data and archives preservation for future generations.
Digital University of Pisa Alessandro Lenci CoLing Lab – Laboratorio di Linguistica Computazionale Università di Pisa Aix-Marseille Université.
By R. O. Nanthini and R. Jayakumar.  tools used on the web to find the required information  Akeredolu officially described the Web as “a wide- area.
Human-Assisted Machine Annotation Sergei Nirenburg, Marjorie McShane, Stephen Beale Institute for Language and Information Technologies University of Maryland.
Enriching Europeana Newspapers Nuno Freire, Europeana Foundation/The European Library Clemens Neudecker, Staatsbibliothek.
Semantic networks for improved access to biomedical databases Istituto di Linguistica Computazionale “Antonio Zampolli” CNR – Pisa Sassolini Eva, Cucurullo.
Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Shamil Mustafayev 04/16/
WP1: Plan for the remainder (1) Ontology –Finalise ontology and lexicons for the 2 nd domain (RTV) Changes agreed in Heraklion –Improvement to existing.
METADATA MANAGEMENT AT ISTAT: CONCEPTUAL FOUNDATIONS AND TOOLS Istituto Nazionale di Statistica ITALY.
DALOS DrAfting Legislation with Ontology-based Support E. Francesconi, D. Tiscornia CNR-ITTIG Istituto di Teoria e Tecniche dell’Informazione Giuridica.
Enhancing the Search for Math Materials Abundance of math materials available online  Compiled knowledge collections, paper database, other math related.
Role of Metadata in dissemination of census data Regional Seminar on dissemination and spatial analysis of census data, Nairobi, September, 2010.
DALOS DrAfting Legislation with Ontology-based Support E. Francesconi, D. Tiscornia CNR-ITTIG Istituto di Teoria e Tecniche dell’Informazione Giuridica.
Thai AGROVOC Ontology Base for Agricultural Information Retrieval
Institute of Informatics & Telecommunications NCSR “Demokritos”
Market Research: Types of Data Mr. Singh.
NewCronos what policy and architecture contents consultation evolution
CSE 635 Multimedia Information Retrieval
Infrastructrural Language Resources and International Cooperation
Deep SEARCH 9 A new tool in the box for automatic content classification: DS9 Machine Learning uses Hybrid Semantic AI ConTech November.
Presentation transcript:

Tools and resources supporting the cultural tourism Istituto di Linguistica Computazionale “Antonio Zampolli” CNR - Pisa GL14: November 28, Sassolini Eva, Cinini Alessandra, Sbrulli Stefano, Picchi Eugenio

Cultural Heritage domain Cultural Heritage is an open domain – large amount of information available from heterogeneous sources – hard to classify information through hierarchical criteria – approaches based on criteria of "semantic similarity“ are desirable 2 GL14: November 28, 2012

ICT for Cultural Heritage high diffusion on the Internet continuous development of information technologies advanced NLP technologies to discover domain-specific knowledge text annotation by means of terminology and named entities for: – {text} analysis – “ browsing – “ categorization 3 GL14: November 28, 2012

Specific tools and Resources creation of linguistic resources – reference corpus – repository of terms pertaining to the cultural heritage domain extraction of relevant information from texts – named entities – single term – multi-words exploitation of the linguistic resources extracted text browsing 4 GL14: November 28, 2012

Background/experience Linguistic Miner – system for the automatic extraction of linguistic knowledge from text Text Power – building of terminological resources – enrichment and annotation of textual material GL14: November 28, 20125

Text acquisition strategy (1) Two phases for the automatic text acquisition: 1.spidering tools on institutional websites of historical, cultural, artistic and naturalistic interest first Corpus building generation/enrichment of linguistic resources GL14: November 28, 20126

Text acquisition strategy (2) 2.specialized crawlers that work on a bulk of text material available on- line using the extracted knowledge as basis for a new search strategy of text materials and increase the available information Reference (text) Corpus building GL14: November 28, 20127

projects & applications Online dissemination of the historical, artistic and landscaped regional heritage Partner: ILC, Basilicata APT (i.e. Agenzia di Promozione Territoriale della regione Basilicata) Objectives: – strategies implementation for the regional heritage promotion and dissemination – definition of a model for relevant information acquisition – automatic extraction of semantic information and terminology for the text categorization GL14: November 28, SMARTCITY: new solutions for content engineering and ambient intelligence as support of cultural tourism Partner: ILC, consortium of companies (Space, Rigel Engineering, Meta) Objectives: – technologies for the cultural heritage preservation and enhancement; – methodologies and solutions to meet new demand of cultural spaces in particular for tourist purposes

GL14: November 28, 20129

Purposes A very significant issue for e-Participation and e- Government is the need to know salient facts and features, hidden amidst a very large quantity of data. The collection of relevant information is useful in decision-making: – for a general user, to implement a service to the citizen ensuring transparency and the right to information; – creating a specialized database of valuable help for expert users GL14: November 28,