Visual Text Mining with SWAPit Detection of semantic relationships among text documents and associated data sources Andreas Becks Fraunhofer-Institute.

Slides:



Advertisements
Similar presentations
Final Project Instructor: Nguyen Anh Tu Students: Tran Tien Tai Tran Tien Tai Tran Ngoc Mai Tran Ngoc Mai Tu Kim Tuan Tu Kim Tuan Nguyen Ngoc Phuong Nguyen.
Advertisements

IST SEWASIE 16 May 2002 Sonia Bergamaschi Università di Modena e Reggio Emilia.
Chapter 1: The Database Environment
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Simile and the Semantic Web Draft Presentation for the W3C Technical Plenary Cannes, March 1-5, 2004.
1 Integrating user environments and data liquidity to improve the research experience.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Multilinguality & Semantic Search Eelco Mossel (University of Hamburg) Review Meeting, January 2008, Zürich.
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
An overview of collection-level metadata Applications of Metadata BCS Electronic Publishing Specialist Group, Ismaili Centre, London, 29 May 2002 Pete.
Wincite Knowledge Warehousing and Networking Sophisticated Simplicity.
Top Tips Enterprise Content Management Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Ontology-based User Modeling for Web-based Information Systems Anton Andrejko, Michal Barla and Mária Bieliková {andrejko, barla,
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Unveiling ProjectWise V8 XM Edition. ProjectWise V8 XM Edition An integrated system of collaboration servers that enable your AEC project teams, your.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
Tom Sheridan IT Director Gas Technology Institute (GTI)
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
1 Knowledge Management Session 4. 2 Objectives 1.What is knowledge management? Why do businesses today need knowledge management programs and systems.
W w w. f a c t i v a. c o m © 2002 Dow Jones Reuters Business Interactive LLC (trading as Factiva). All rights reserved. The Keys to Successful Strategic.
Faceted Navigation: Search and Browse Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen D ö rre, Peter Gerstl, and Roland Seiffert.
IST SEWASIE general meeting Aachen, March 14, 2005 System Evolution Tools Maurizio Vincini and Enrico Franconi.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
IASW – 2005, Jyväskylä, FinlandUniversity of Vaasa, Department of Computer Science, Finland INFORMATION ARCHITECTURES FOR SEMANTIC WEB APPLICATIONS Kimmo.
Eleventh Edition 1 Introduction to Essentials for Information Systems Irwin/McGraw-Hill Copyright © 2002, The McGraw-Hill Companies, Inc. All rights reserved.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Libraries and Institutional Content Management Systems
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Midwest Documentum User Group Harley-Davidson Documentum WCM 10/10/2006.
Marko Grobelnik Jasna Škrbec Jozef Stefan Institute Social Context as a part of News-Archive-Explorer Web application for exploratory browsing of news.
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
1 Building Semantic Applications Paul Warren
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Organizational Memory: Issues in Design & Implementation Sree Nilakanta May 1, 2000.
IST SEWASIE SEWASIE 3rd Review March 14, 2005 SEWASIE Value Proposition and End User Demo Andreas Becks.
Case Study – Venture Portfolio Tracking and Competitive Intelligence Sam Knox - Director of Analyst Services Christopher Cho - Consulting Analyst.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Ontology-Centered Personalized Presentation of Knowledge Extracted from the Web Ralitsa Angelova.
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Personalized Recommendation of Related Content Based on Automatic Metadata Extraction Andreas Nauerz 1, Fedor Bakalov 2, Birgitta.
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Virtual Information and Knowledge Environments Workshop on Knowledge Technologies within the 6th Framework Programme -- Luxembourg, May 2002 Dr.-Ing.
DITA: Not just for Tech Docs Ann Rockley The Rockley Group.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Semantic (web) activity at Elsevier Marc Krellenstein VP, Search and Discovery Elsevier October 27, 2004
© CGI Group Inc. User Guide PrimePortal – General.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
INTAROS WP5 Data integration and management
Knowledge Management Tools
Searching and browsing through fragments of TED Talks
Web Mining Department of Computer Science and Engg.
Magnet & /facet Zheng Liang
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

Visual Text Mining with SWAPit Detection of semantic relationships among text documents and associated data sources Andreas Becks Fraunhofer-Institute of Applied Information Technology Sankt Augustin & Aachen, Germany Aachen St.Augustin Roma, 24 novembre 2005

2 © Fraunhofer-FIT 2005 Lost in the Ocean of Text Documents? Text Mining helps to explore and analyse natural-language texts uncover relationships, recognize trends group, condense pieces of knowledge categorize text information A huge amount of organisational knowledge is stored in text documents 85 to 90 percent of all corporate data according to Merrill Lynch and Gartner studies Even when DMS and desktop search are used, a huge amount of time is necessary to find important information 80% of companies and 40% of public administrations need more than one day [Zylab survey]

3 © Fraunhofer-FIT 2005 SWAPit Helps You to Navigate Through Your Text Data The tool visualises semantic relationships among text documents... X-ray view for document archives

4 © Fraunhofer-FIT 2005 SWAPit Integrates Text and Data Mining... and allows to navigate, search, browse and analyse text documents and associated data and metadata text documents catalogue of text categories related structured data Similarity View Category View Tools for analysis and search Fact View categorization associations

5 © Fraunhofer-FIT 2005 Application Example: Document Management New text documents Protocollazione Titolario Information about type, AOO/UO, Fascicoli, etc. Project selection Document similarity helps to create fascicoli and find misclassified documents DL-based categorization

6 © Fraunhofer-FIT 2005 Application Example: Monitoring News in the Textile Sector Whats up with competitors, collaborators, markets, materials, …? news ticker news categories Get a quick overview of business- relevant text information Explore documents, understand their relevance to the company

7 © Fraunhofer-FIT 2005 Esempio applicativo: Monitoraggio delle notizie nel settore tessile Cosa succede riguardo la concorrenza, collaboratori, mercati, materiale,..? sorgente delle notizie categorie di notizie Si ottiene un rapido panorama delle informazioni testuali rilevanti per il business Si esplorano documenti, si capiscono importanza e rilevanza per lazienda

8 © Fraunhofer-FIT 2005 Application Example: CRM in an Insurance Company Which customer type does complain about what? Which types of problems lead to contract cancellations? customer complaints categories of complaints customer and contract databases Group customer complaints based on their content Detect relationships and patterns in customer and contract data

9 © Fraunhofer-FIT 2005 Esempio applicativo: CRM in una compagnia assicurativa Che tipo di cliente fa reclami e a proposito di cosa? Quali tipi di problemi portano ad una risoluzione dei contratti? reclami dei clienti categorie di reclami database clienti e contratti Si raggruppano i reclami dei clienti in base al contenuto Si individuano le correlazioni e le caratteristiche comuni nei dati dei clienti e dei contratti

10 © Fraunhofer-FIT 2005 SWAPit as a Single Point of Access operational databases text documents user-specific schema & integrated access DL-based integration Virtual Integrated Database From scattered information......to integrated information multi-schema databases, distributed & data-centred access intuitive, user-centred access DL-based categorization

11 © Fraunhofer-FIT 2005 Monitoring Documents with SWAPit and DL unfiltered and unstructured text documents DL-based filter conceptually filtered, relevant text documents DL-based catalogue builder 3 news in 1 minute 1 document map per day From information overflow... intuitively structured text documents...to information overview

12 © Fraunhofer-FIT 2005 Displaying XML Documents in SWAPit From complex, machine-readable documents......to a human-oriented presentation data with technically rich structural annotation customized, task-oriented view web ontology metadata (selected attributes and elements) text content from specified attributes and elements XML ontology-context of specified elements

13 © Fraunhofer-FIT 2005 Conclusion: Visual and Intuitive Text Mining with SWAPit SWAPit combines views on text documents and associated data sources on a single sreen Overview instead of overflow Improves quality of text access tasks Leverages knowledge sources Flexible architecture Designed to integrate Semantic Web technology Derives additional power from integration of DL technologies Can be integrated easily into existing infrastructures or company portals Can be tailored to specific needs of different market segments Long-standing experience in research and practical applications Document Management, Business Intelligence, Customer Relationship Management,... Main sectors: Insurance, Textile, Engineering, Social Science Technology has been extended in a joint project with Maurizio Lenzerini (SEWASIE)

14 © Fraunhofer-FIT 2005 Grazie dellattenzione!