Presentation is loading. Please wait.

Presentation is loading. Please wait.

Ferda Visual Environment for Data Mining Martin Ralbovský.

Similar presentations


Presentation on theme: "Ferda Visual Environment for Data Mining Martin Ralbovský."— Presentation transcript:

1 Ferda Visual Environment for Data Mining Martin Ralbovský

2 Ferda History 1 LISp-Miner System – Implementation of several GUHA procedures + more LISp-Miner System – Implementation of several GUHA procedures + more 2003: Idea of creating a new Clementine- like visual interface for LISp-Miner 2003: Idea of creating a new Clementine- like visual interface for LISp-Miner 2003: Ferda project started based on this idea 2003: Ferda project started based on this idea subject Softwarový projekt at MFF UK

3 Ferda History 2 2004 – 2006: Development of Ferda project 2004 – 2006: Development of Ferda project February 2006: Ferda presented at Znalosti 2006 conference February 2006: Ferda presented at Znalosti 2006 conference April 2006: Ferda became a approved software project at MFF UK April 2006: Ferda became a approved software project at MFF UK Now: Further development of Ferda system, master theses of Ferda creators Now: Further development of Ferda system, master theses of Ferda creators

4 Ferda Advantages Modular and extensible architecture, usage of middleware, support for distributed computing Modular and extensible architecture, usage of middleware, support for distributed computing Ferda’s box model: ability implement and include new boxes, possible engine for EverMiner Ferda’s box model: ability implement and include new boxes, possible engine for EverMiner Comprehensive user interface including new features such as box archive Comprehensive user interface including new features such as box archive

5 Ferda Disadvantages Not so well tested (haven’t been used for education) Not so well tested (haven’t been used for education) Dependent on LISp-Miner modules and metabase Dependent on LISp-Miner modules and metabase Slower then LISp-Miner Slower then LISp-Miner

6 Future goals for Ferda “Spreading Ferda” “Spreading Ferda” Getting more people to work for Ferda – creation of new boxes, modules Getting more people to work for Ferda – creation of new boxes, modules Cooperation with other systems Cooperation with other systems Road to EverMiner Road to EverMiner

7 Master theses improvements for Ferda Reimplementing LISp-Miner procedures Reimplementing LISp-Miner procedures Relational versions of some procedures (SD4FT) Relational versions of some procedures (SD4FT) Domain knowledge support Domain knowledge support

8 Reimplementing LISp-Miner procedures 1 Not working with the metabase anymore– faster implementation Not working with the metabase anymore– faster implementation Modular implementation of data mining task - enables the full potential of the Ferda’s box module Modular implementation of data mining task - enables the full potential of the Ferda’s box module Open implementation of 4ft, SD4ft, KL, SDKL, CF and SDCF procedures Open implementation of 4ft, SD4ft, KL, SDKL, CF and SDCF procedures

9 Reimplementing LISp-Miner procedures 2 – further plans Enabling fuzzy computing Enabling fuzzy computing Data stream support – connecting Ferda to Sumatra TT Data stream support – connecting Ferda to Sumatra TT Distributed computing Distributed computing KL Collaps, 4ftUV Filter implementation KL Collaps, 4ftUV Filter implementation “little” improvements to task setup (literal, cedent…) “little” improvements to task setup (literal, cedent…)

10 Ontologies in Ferda Ontologies aid user in various phases of CRISP-DM cycle, planning to develop (semi)automated tools to help with: Identification of redundant attributes Identification of redundant attributes Creation of attributes Creation of attributes Creation of partial cedents Creation of partial cedents …

11 Field knowledge in Ferda Field knowledge – vague term, rules that are common knowledge, widely accepted in a domain Field knowledge – vague term, rules that are common knowledge, widely accepted in a domain Formalization of field knowledge using abstract attributes and quantifiers Formalization of field knowledge using abstract attributes and quantifiers Creation of boxes in Ferda that enable user to express field knowledge, veryfiing field knowledge against procedures’ output Creation of boxes in Ferda that enable user to express field knowledge, veryfiing field knowledge against procedures’ output


Download ppt "Ferda Visual Environment for Data Mining Martin Ralbovský."

Similar presentations


Ads by Google