Ferda Visual Environment for Data Mining Martin Ralbovský.

Slides:



Advertisements
Similar presentations
Ch 3 System Development Environment
Advertisements

Computers Are Your Future
KOS and the Conduct of Science© Straits Knowledge 2011 Knowledge Organisation Systems as Enablers to the Conduct of Science Patrick Lambe.
The Last Procedure Before First Functional Prototype Grant Boomer, Brett Papineau, Tanis Lopez, Archana Shrestha CS 383.
Information Systems Analysis and Design
Siemens Applied Automation Process Gas Chromatography NeSSI – Lets Do It.pptSlide 1; IFPAC; February 22, 2006 Let’s Do It! Using NeSSI IFPAC: February.
T-FLEX DOCs PLM, Document and Workflow Management.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Realizing OPM Philosophy in the Context of Full Life- Cycle Support Avi Soffer Technion, Israel Institute of Technology Thesis Advisor: Prof. Dov Dori.
Chapter 1 The Systems Development Environment
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design Third Edition.
1 Kharkiv National University of Radioelectronics, Ukraine Ontology-Based Portal for National Educational and Scientific Resources Management Masha Klymova.
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
Copyright 2004 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Second Edition Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter.
Software Testing Name: Madam Currie Course: Swen5431 Semester: Summer 2K.
Tiered architectures 1 to N tiers. 2 An architectural history of computing 1 tier architecture – monolithic Information Systems – Presentation / frontend,
Introduction to Rule-Based Systems, Expert Systems, Fuzzy Systems Introduction to Rule-Based Systems, Expert Systems, Fuzzy Systems (sections 2.7, 2.8,
SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Jan Rauch, Milan Šimůnek University of Economics, Prague, Czech.
Robots at Work Dr Gerard McKee Active Robotics Laboratory School of Systems Engineering The University of Reading, UK
Architectural Design.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 1.1.
® IBM Software Group © 2006 IBM Corporation PRJ480 Mastering the Management of Iterative Development v2 Module 3: Phase Management - Inception.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter 1 The Systems.
Rapid Prototyping Model
WP6: Grid Authorization Service Review meeting in Berlin, March 8 th 2004 Marcin Adamski Michał Chmielewski Sergiusz Fonrobert Jarek Nabrzyski Tomasz Nowocień.
Martin Ralbovský KIZI FIS VŠE The GUHA method Provides a general mainframe for retrieving interesting information from data Strong foundations.
Software Configuration Management
Karolina Muszyńska. Reverse engineering - looking at the solution to figure out how it works Reverse engineering - breaking something down in order to.
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design.
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
Software development process ธนวัฒน์ แซ่ เอียบ. The development process Process –set of rules which define how a development project. Methodology and.
Development in the Ferda project December 2006 Martin Ralbovský.
Ontology-Driven Data Preparation for Data Mining Martin Zeman, KSI MFF UK Martin Ralbovský, KIZI FIS VŠE.
An Approach To Automate a Process of Detecting Unauthorised Accesses M. Chmielewski, A. Gowdiak, N. Meyer, T. Ostwald, M. Stroiński
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Architectural Design Identifying system components and their interfaces.
KNOWLEDGE GRIDS Akshat Mishra GRID SEMINAR WINTER 2008 Feb 2008.
SYSTEMS ANALYSIS AND DESIGN LIFE CYCLE
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Software Engineering Prof. Ing. Ivo Vondrak, CSc. Dept. of Computer Science Technical University of Ostrava
Overview Of Expert System Tools Expert System Tools : are all designed to support prototyping. Prototype : is a working model that is functionally equivalent.
Self-Organised Data Mining – 20 Years after GUHA-80 Martin Kejkula KEG 8 th April 2004
COMM89 Knowledge-Based Systems Engineering Lecture 8 Life-cycles and Methodologies
Chapter 6 CASE Tools Software Engineering Chapter 6-- CASE TOOLS
Service Level Management with Agent Technology Torsten Bissel, Manfred Bogen, Christian Bonkowski, Volker Hadamschek, Dieter Strecker GMD - German National.
Volgograd State Technical University Applied Computational Linguistic Society Undergraduate and post-graduate scientific researches under the direction.
Software Engineering and Object-Oriented Design Topics: Solutions Modules Key Programming Issues Development Methods Object-Oriented Principles.
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design.
Chapter 4 Automated Tools for Systems Development Modern Systems Analysis and Design Third Edition 4.1.
Inria Rhône-AlpesEMGnet meeting - December 98 1 A Platform for EMG Studies Danielle Ziébelin, Martine Maume and Philippe Genoud INRIA Rhône-Alpes Projet.
2 Software CASE tools state-of-the-art UML modeling Partially automatic code generation Refactoring browsers (occasionally) Context-sensitive search and.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
 System Requirement Specification and System Planning.
C_ITIP211 LECTURER: E.DONDO. Unit 1 : The Systems Development Environment.
Modern Systems Analysis and Design Third Edition
PLM, Document and Workflow Management
Chapter 11: Software Configuration Management
Modern Systems Analysis and Design Third Edition
System Design and Modeling
Maintaining software solutions
Chapter 4 Automated Tools for Systems Development
Modern Systems Analysis and Design Third Edition
Modern Systems Analysis and Design Third Edition
.NET vs. J2EE Architecture
Chapter 11: Software Configuration Management
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Members: Keshava Shiva Sanjeeve Kareena
Modern Systems Analysis and Design Third Edition
T-FLEX DOCs PLM, Document and Workflow Management.
Presentation transcript:

Ferda Visual Environment for Data Mining Martin Ralbovský

Ferda History 1 LISp-Miner System – Implementation of several GUHA procedures + more LISp-Miner System – Implementation of several GUHA procedures + more 2003: Idea of creating a new Clementine- like visual interface for LISp-Miner 2003: Idea of creating a new Clementine- like visual interface for LISp-Miner 2003: Ferda project started based on this idea 2003: Ferda project started based on this idea subject Softwarový projekt at MFF UK

Ferda History – 2006: Development of Ferda project 2004 – 2006: Development of Ferda project February 2006: Ferda presented at Znalosti 2006 conference February 2006: Ferda presented at Znalosti 2006 conference April 2006: Ferda became a approved software project at MFF UK April 2006: Ferda became a approved software project at MFF UK Now: Further development of Ferda system, master theses of Ferda creators Now: Further development of Ferda system, master theses of Ferda creators

Ferda Advantages Modular and extensible architecture, usage of middleware, support for distributed computing Modular and extensible architecture, usage of middleware, support for distributed computing Ferda’s box model: ability implement and include new boxes, possible engine for EverMiner Ferda’s box model: ability implement and include new boxes, possible engine for EverMiner Comprehensive user interface including new features such as box archive Comprehensive user interface including new features such as box archive

Ferda Disadvantages Not so well tested (haven’t been used for education) Not so well tested (haven’t been used for education) Dependent on LISp-Miner modules and metabase Dependent on LISp-Miner modules and metabase Slower then LISp-Miner Slower then LISp-Miner

Future goals for Ferda “Spreading Ferda” “Spreading Ferda” Getting more people to work for Ferda – creation of new boxes, modules Getting more people to work for Ferda – creation of new boxes, modules Cooperation with other systems Cooperation with other systems Road to EverMiner Road to EverMiner

Master theses improvements for Ferda Reimplementing LISp-Miner procedures Reimplementing LISp-Miner procedures Relational versions of some procedures (SD4FT) Relational versions of some procedures (SD4FT) Domain knowledge support Domain knowledge support

Reimplementing LISp-Miner procedures 1 Not working with the metabase anymore– faster implementation Not working with the metabase anymore– faster implementation Modular implementation of data mining task - enables the full potential of the Ferda’s box module Modular implementation of data mining task - enables the full potential of the Ferda’s box module Open implementation of 4ft, SD4ft, KL, SDKL, CF and SDCF procedures Open implementation of 4ft, SD4ft, KL, SDKL, CF and SDCF procedures

Reimplementing LISp-Miner procedures 2 – further plans Enabling fuzzy computing Enabling fuzzy computing Data stream support – connecting Ferda to Sumatra TT Data stream support – connecting Ferda to Sumatra TT Distributed computing Distributed computing KL Collaps, 4ftUV Filter implementation KL Collaps, 4ftUV Filter implementation “little” improvements to task setup (literal, cedent…) “little” improvements to task setup (literal, cedent…)

Ontologies in Ferda Ontologies aid user in various phases of CRISP-DM cycle, planning to develop (semi)automated tools to help with: Identification of redundant attributes Identification of redundant attributes Creation of attributes Creation of attributes Creation of partial cedents Creation of partial cedents …

Field knowledge in Ferda Field knowledge – vague term, rules that are common knowledge, widely accepted in a domain Field knowledge – vague term, rules that are common knowledge, widely accepted in a domain Formalization of field knowledge using abstract attributes and quantifiers Formalization of field knowledge using abstract attributes and quantifiers Creation of boxes in Ferda that enable user to express field knowledge, veryfiing field knowledge against procedures’ output Creation of boxes in Ferda that enable user to express field knowledge, veryfiing field knowledge against procedures’ output