SOFT COMPUTING TECHNIQUES FOR STATISTICAL DATABASES Miroslav Hudec INFOSTAT – Bratislava MSIS 2009.

Introduction. Soft computing (by fuzzy logic). Database query (SQL vs. fuzzy), case study. Data classification (usual vs. fuzzy), case study. Conclusion.

Soft computing. The essential property of soft computing (SC) is that it “softens” hard computing (HC) techniques in order to cope with imprecision, ambiguity and uncertainty. HC uses two-valued logic (e.g. an element either satisfies the criterion or does not). Fuzzy logic, as a part of SC, uses many-valued logic (e.g. an element can partly satisfy the criterion). Computing with words is inspired by the human capability to perform a wide variety of tasks without exact measurements and computations. (Flexible database queries: interesting for statistical information systems?)

Database queries (SQL), two-valued logic:
select * from Table where attribute_p > P and attribute_r < R

SQL and fuzzy queries.
SQL conditions: >=, <=, =. [Figure: fuzzy sets Big, Small, About]
Two-valued logic: 1 and 1 = 1; 0 and 1 = 0. One function each for the and and or operators.
Many-valued logic (fuzzy logical operators and, or): 0.7 and 0.358 = ? Minimum gives 0.358; product gives 0.2506.
For {0,1} logic, minimum and product both reduce to the ordinary and operator.
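The two candidate "and" functions named on this slide can be sketched in a few lines of Python. The function names `and_min` and `and_product` are illustrative only and do not come from the presentation; the numbers are the ones shown on the slide.

```python
def and_min(a: float, b: float) -> float:
    """Fuzzy 'and' as the minimum of two membership degrees."""
    return min(a, b)


def and_product(a: float, b: float) -> float:
    """Fuzzy 'and' as the product of two membership degrees."""
    return a * b


# Many-valued case from the slide: 0.7 and 0.358
print(and_min(0.7, 0.358))                 # minimum
print(round(and_product(0.7, 0.358), 4))   # product, rounded for display

# On {0, 1} both functions agree with the ordinary logical and.
for a in (0, 1):
    for b in (0, 1):
        assert and_min(a, b) == and_product(a, b) == (a and b)
```

Both functions give the same answer on crisp inputs, so the choice between them only matters for records that partly satisfy a condition.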

Case study.
select district, roads, area from T where roads is Big and area is Small
The road-length indicator is represented by the “Big value” fuzzy set with parameters L_d = 200 km and L_p = 300 km. The “Small value” fuzzy set, with parameters L_p = 450 km² and L_g = 650 km², describes the district-area attribute.
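A minimal sketch of how the two fuzzy sets could be evaluated, assuming linear (ramp-shaped) membership functions between the stated parameters; the slide gives only the parameters, so the linear shape, the function names and the sample district values are assumptions made for illustration.

```python
def big_value(x: float, l_d: float, l_p: float) -> float:
    """Degree to which x is 'Big': 0 up to l_d, rising linearly to 1 at l_p (assumed shape)."""
    if x <= l_d:
        return 0.0
    if x >= l_p:
        return 1.0
    return (x - l_d) / (l_p - l_d)


def small_value(x: float, l_p: float, l_g: float) -> float:
    """Degree to which x is 'Small': 1 up to l_p, falling linearly to 0 at l_g (assumed shape)."""
    if x <= l_p:
        return 1.0
    if x >= l_g:
        return 0.0
    return (l_g - x) / (l_g - l_p)


# Hypothetical district: 250 km of roads, 500 km^2 area (not from the presentation).
mu_roads = big_value(250.0, 200.0, 300.0)    # parameters L_d, L_p from the case study
mu_area = small_value(500.0, 450.0, 650.0)   # parameters L_p, L_g from the case study
degree = min(mu_roads, mu_area)              # 'and' taken as minimum
print(mu_roads, mu_area, degree)             # 0.5 0.75 0.5
```

A crisp query such as roads > 300 and area < 450 would reject this district outright, whereas here it is returned with a satisfaction degree of 0.5.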

Solution. If SQL were used, this additional valuable information (the records that almost meet the query criterion) would remain hidden.

Discussion. To obtain a very fine gradation with SQL, an infinite number of queries would be needed; with fuzzy queries, one query is sufficient. The advantages of this approach for users are as follows: the connection to the database (connection string) and the data access (SQL command) do not have to be modified; users do not need to learn a new query language; the interface supports (quasi) natural language; the retrieved data are presented in a similar way as for SQL, but with additional valuable information; users see data “around the corner” (colored areas in the table) and can take potentially interesting data into account.
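One way the unchanged connection and ordinary SQL command could work together with a fuzzy layer is sketched below: a plain SQL query fetches every record with a non-zero chance of matching, and the membership degrees are computed on the client. This is an assumption about the mechanism, not the implementation shown in the presentation; the in-memory SQLite table, the district rows and the linear membership functions are all hypothetical.

```python
import sqlite3

L_D, L_P = 200.0, 300.0   # "Big" roads parameters (km), from the case study
A_P, A_G = 450.0, 650.0   # "Small" area parameters (km^2), from the case study


def big(x: float) -> float:
    """Assumed linear 'Big' membership, clipped to [0, 1]."""
    return min(1.0, max(0.0, (x - L_D) / (L_P - L_D)))


def small(x: float) -> float:
    """Assumed linear 'Small' membership, clipped to [0, 1]."""
    return min(1.0, max(0.0, (A_G - x) / (A_G - A_P)))


conn = sqlite3.connect(":memory:")
conn.execute("create table T (district text, roads real, area real)")
conn.executemany(
    "insert into T values (?, ?, ?)",
    [("D1", 320.0, 400.0), ("D2", 250.0, 500.0), ("D3", 150.0, 300.0)],  # hypothetical rows
)

# Ordinary SQL selects only records whose degree can be greater than zero.
rows = conn.execute(
    "select district, roads, area from T where roads > ? and area < ?",
    (L_D, A_G),
).fetchall()

# Client side: attach the satisfaction degree and rank the records by it.
result = sorted(
    ((d, min(big(r), small(a))) for d, r, a in rows),
    key=lambda item: -item[1],
)
print(result)   # [('D1', 1.0), ('D2', 0.5)]
conn.close()
```

The database sees only a standard select statement, which is consistent with the claim that the relational side needs no modification.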

Data classification (two-valued logic). How to solve this problem without additional calculation? Approximate reasoning and fuzzy logic.

Data classification (many-valued logic). The same GLC:
classify_into [class Cx] select [attributes] from [tables, views]

Case study. In this case study, municipalities are classified according to the percentage of need for winter road maintenance. The example uses the following fuzzy rules: If Road is Small and Snow is Small Then Maintenance is Small; If Road is Small and Snow is Big Then Maintenance is Medium; If Road is Big and Snow is Small Then Maintenance is Medium; If Road is Big and Snow is Big Then Maintenance is Big. (0.1) (0.5) (0.9)

Case study.
classify_into S select * from Table where roads is Small and snow is Small;
classify_into M select * from Table where (roads is Small and snow is Big) or (roads is Big and snow is Small);
classify_into B select * from Table where roads is Big and snow is Big.
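The three classification queries can be read as the following calculation, sketched here under stated assumptions: "and" is taken as minimum and "or" as maximum, the input membership degrees are hypothetical, and the values 0.1, 0.5 and 0.9 are assumed to be the output levels of the classes Small, Medium and Big, combined by a weighted average. The presentation does not confirm these choices.

```python
def classify(road_small: float, road_big: float,
             snow_small: float, snow_big: float) -> dict:
    """Degree of membership of one municipality in classes S, M and B."""
    return {
        "S": min(road_small, snow_small),
        "M": max(min(road_small, snow_big), min(road_big, snow_small)),
        "B": min(road_big, snow_big),
    }


# Assumed output level of each class (reading of the "(0.1) (0.5) (0.9)" values).
LEVELS = {"S": 0.1, "M": 0.5, "B": 0.9}


def maintenance_score(degrees: dict) -> float:
    """Weighted average of the class levels; 0.0 if no rule fires."""
    total = sum(degrees.values())
    if total == 0:
        return 0.0
    return sum(degrees[c] * LEVELS[c] for c in degrees) / total


# Hypothetical municipality with partial membership in all four input sets.
degrees = classify(road_small=0.6, road_big=0.4, snow_small=0.3, snow_big=0.7)
score = maintenance_score(degrees)
print(degrees, round(score, 3))
```

A municipality can therefore belong to several classes at once, each with its own degree, instead of being forced into exactly one.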

Case study. If classical classification were used, this additional valuable information would remain hidden (softer classification between objects T1-T4).

Implementation

SQL and fuzzy approach. SQL queries are useful when a clear, exact boundary between selected and non-selected data is required (faster, fewer calculations). Fuzzy queries provide flexibility in defining the query and include records that almost meet the query criterion (more operations, more information). The user decides which type of query is better for each task.

Conclusion. This approach allows users of statistical information systems to apply their approximate reasoning when working with data. When users work with conventional software tools, they have to translate their many-valued logical thinking (approximate reasoning) into two-valued computer logic. The fuzzy approach supports work with linguistic expressions on the client side, yet it does not require any modification of relational databases.

Thank you for your attention