SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Jan Rauch, Milan Šimůnek University of Economics, Prague, Czech.

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Groupe de travail athérosclérose 1 Project STULONG Project STULONG Some analytical work at EuroMISE and University of Economics, Prague Jan Rauch EuroMISE.
SDS-Rules and Association Rules March 17, 2004Nicosia, Cyprus Tomáš Karban 1 Jan Rauch 2 Milan Šimůnek 2 1 Charles University, Prague Dept. of Software.
CASE STUDY: IMPLEMENTING SDMX EXCHANGE WITH MEMBER COUNTRIES IN SHORT-TERM ECONOMIC STATISTICS (STES)
Working visit at KDD Laboratory University of North Carolina Charlotte Web Intelligence / Intelligent Agent Technology / Granular Computing / Bioinformatics.
University of Economics, Prague MLNET related activities of Laboratory for Intelligent Systems and Dept. of Information and Knowledge Engineering
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
United Nations Statistics Division Principles and concepts of classifications.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
Delmar Learning Copyright © 2003 Delmar Learning, a Thomson Learning company Nursing Leadership & Management Patricia Kelly-Heidenthal
Chapter 2 Data Models Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Caro-COOPS Data Management: Metadata. Cast-Net addresses the need for improved connectivity among coastal observing systems by creating a regional framework.
Software Metrics II Speaker: Jerry Gao Ph.D. San Jose State University URL: Sept., 2001.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Chapter - 10 Principles of management. Principles of management originate and grow as a result of past experiences and study.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
RESEARCH FRAMEWORK Yulia Sofiatin Department of Epidemiology and Biostatistics 2012 YS 2011.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
2 1 Chapter 2 Data Model Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
Research in Computing Discipline Prabhas Chongstitvatana.
9 th CSO Forum on ACRWC – Dakar, Senegal (2 – 4 April 2014) Edmund Amarkwei Foley (Technical Advisor on Children, Ministry of Gender, Children and Social.
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
Ciarán O’Leary Wednesday, 23 rd September Ciarán O’Leary School of Computing, Dublin Institute of Technology, Kevin St Research Interests Distributed.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Attribute Assessment Implementation – ME 4R03 Saeid Habibi.
Analysis of Death Causes in the STULONG Data Set Jan Burian, Jan Rauch EuroMISE – Cardio University of Economics Prague.
Development in the Ferda project December 2006 Martin Ralbovský.
Ferda Visual Environment for Data Mining Martin Ralbovský.
Akershus University College, Nov 2007, Carn conference AKERSHUS UNIVERSITY COLLEGE Research based implementation of the new corriculum frame work in vocational.
Modifying Logic of Discovery for Dealing with Domain Knowledge in Data Mining Jan Rauch University of Economics, Prague Czech Republic.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Panel Discussion on Foundations of Data Mining at RSCTC2004 J. T. Yao University of Regina Web:
Self-Organised Data Mining – 20 Years after GUHA-80 Martin Kejkula KEG 8 th April 2004
An approach for Framework Construction and Instantiation Using Pattern Languages Rosana Teresinha Vaccare Braga Paulo Cesar Masiero ICMC-USP: Institute.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
Two Paradigms for Official Statistics Production Boris Lorenc, Jakob Engdahl and Klas Blomqvist Statistics Sweden.
Linguistic summaries on relational databases Miroslav Hudec University of Economics in Bratislava, Department of Applied Informatics FSTA, 2014.
Introduction to Comparative Education
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
How to measure the impact of R&D on SD ? Laurence Esterle, MD, PhD Cermes and Ifris France Cyprus, 16 – 17 October L. ESTERLE Linking science and.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Of An Expert System.  Introduction  What is AI?  Intelligent in Human & Machine? What is Expert System? How are Expert System used? Elements of ES.
Application Individualization and Differentiation in Czech Primary Schools - One of the Characteristic Features of Inclusion Application Individualization.
UNDAF Info Presentation to UNDAF Programming Network April Claes Johansson UNICEF.
Lecture №4 METHODS OF RESEARCH. Method (Greek. methodos) - way of knowledge, the study of natural phenomena and social life. It is also a set of methods.
TO FACILITATE REVIEW OF STATE SCIENCE JJANAN Conceptual Mapping to Facilitate Review of State Science Standards Jane Heinze-Fry, Museum Institute for Teaching.
Ecological Interface Design Overview Park Young Ho Dept. of Nuclear & Quantum Engineering Korea Advanced Institute of Science and Technology May
MDD-Kurs / MDA Cortex Brainware Consulting & Training GmbH Copyright © 2007 Cortex Brainware GmbH Bild 1Ver.: 1.0 How does intelligent functionality implemented.
SDS-Rules and Classification Tomáš Karban ECML/PKDD 2003 – Dubrovnik (Cavtat) September 22, 2003.
Representing and Reasoning with Heterogeneous, Modular and Distributed ontologies UniTN/IRST contribution to KnowledgeWeb.WP 2.1.
TAIEX-REGIO Workshop on Applying the Partnership Principle in the European Structural and Investment Funds Bratislava, 20/05/2016 Involvement of Partners.
A Solution to the Recall Problem using Rough Set Theory Professor Djamel Bouchaffra (Advisor) Tarek Dakhlallah (Ph.D. Student) Computer Science & Engineering.
Independent Study of Ontologies
RIA: Communication – building credibility
School of Information Management Nanjing University China
Dr. Sudha Ram Huimin Zhao Department of MIS University of Arizona
Generic Statistical Business Process Model (GSBPM)

Web Mining Department of Computer Science and Engg.
Motivation Semantic Transformation Module Most of the existing works neglect the semantic relationship between the visual feature and linguistic knowledge,
Nursing Informatics and the Foundation of Knowledge
Presentation transcript:

SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Jan Rauch, Milan Šimůnek University of Economics, Prague, Czech Republic

SEWEBAR2 SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Starting points Principles (as seen now) Simple examples First steps

SEWEBAR3 SEWEBAR – Starting points (1) Several similar mining problems a la STULONG: ADAMEK, TINITUS HEPATITIS, SOCIOLOGY, …:  Cca attributes  thousands of objects (usually patients)  domain expert (non informatics) available  some (this time relatively simple) background knowledge available Reasonable result form is a well structured analytical report that must be  created  stored  retrieved  disseminated  used to answer more complex analytical questions

SEWEBAR4 SEWEBAR – Starting points (2) Some results concerning partial related projects  Report assistant (it works)  AR2NL (successful experiment)  EverMiner (considerations)  SEWEBAR (considerations)  observational calculi Grants: LISp, Czech Science Foundation (GAČR), Kontakt, CBI, ?? Students can contribute (4IZ460, 4IZ210, ? ) Dealing with knowledge and semantics „is in“ (see e.g. „10 Challenging problems in Data Mining Research“ -

SEWEBAR5 SEWEBAR – inspiration by Semantic Web (SEmantic WEB and Analytical Reports)

SEWEBAR6 SEWEBAR – Principles (1) There is a structured set of (types of) patterns of local analytical questions  What strong relations (  *,  *, …) are valid in given data?  What strong known relations are not valid in given data?  What exceptions from … are valid in given data?  …. There are various items of background knowledge in easy understandable form  Bier consumption  BMI  Mother hypertension  + Hypertension  ,  -, …. Application of the pattern of analytical question to a given item of background knowledge and to a given data matrix leads to a concrete analytical question.

SEWEBAR7 SEWEBAR – Principles (2) To each local analytical question there is type of local analytical report answering the question The concrete local analytical question can be answered by the GUHA procedures implemented in the LISP-Miner system The corresponding analytical report can be automatically created There is a similar structured set of patterns of global analytical questions (concerning several similar data matrices) that can be automatically answered on the basis of the local analytical reports

SEWEBAR8 SEWEBAR – Principles From local analytical question to analytical report

SEWEBAR9 SEWEBAR – simple examples Pattern of analytical question – mutual influence of attributes Pattern of analytical question – groups of attributes Answering „analytical question – groups of attributes“ by 4ft-Miner Analytical report

SEWEBAR10 SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Starting points Principles (as understood now) Simple examples First steps

SEWEBAR11 SEWEBAR – Principles for first steps To implement soon first version (simplified if necessary) of support for the whole process dealing with local and global analytical reports. The whole process covers:  Formulation of reasonable local analytical questions using background knowledge  Creation of analytical reports answering particular analytical questions  Formulating and answering reasonable global analytical questions Use the first version to  Gradually improve and enhance particular parts  Develop corresponding theory using observational calculi

SEWEBAR12 Control panel – tool for first steps

SEWEBAR13 SEWEBAR – First steps (1) Background knowledge and local analytical questions: 1. We start with ADAMEK and STULONG data sets 2. Background knowledge – we use current version of Knowledge Base 3. To define first version of the set of LAQ - Local Analytical Questions 4. To implement LAQPA - Local Analytical Question Patterns Administrator 5. To implement LAQA - Local Analytical Questions Administrator

SEWEBAR14 SEWEBAR – First steps (3) Local analytical reports: 6. Enhancement of 4ft-Miner by filtering out of uninteresting rules 7. EverMiner modules 8. To define skelets of analytical reports 9. Generator of analytical reports

SEWEBAR15 SEWEBAR – First steps (4) Global analytical reports - implemented using ?Topic Maps Content management system? 9. To define rules for indexing analytical reports by Topic Maps 10. To implement tool for automated indexing analytical reports for Topic Maps 11. To define first version of a set of global analytical questions 12. To implement tool for automated answering global analytical reports 13. ??IGA grant??

SEWEBAR16 Thank you for your attention

SEWEBAR17 Thank you for your attention