Martin Ralbovský Jan Rauch KIZI FIS VŠE. Contents Motivation & introduction Graphs of quantifiers Classes of quantifiers, tables of critical frequencies.

Slides:



Advertisements
Similar presentations
Jürgen Klüver Information Technologies and Educational Processes
Advertisements

Chapter 17 Multivariable Calculus.
Design of Experiments Lecture I
Statistically motivated quantifiers GUHA matrices/searches can be seen from a statistical point of view, too. We may ask ‘Is the coincidence of two predicates.
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
« هو اللطیف » By : Atefe Malek. khatabi Spring 90.
By : L. Pour Mohammad Bagher Author : Vladimir N. Vapnik
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Elementary hypothesis testing
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Reduced Support Vector Machine
PSY 307 – Statistics for the Behavioral Sciences
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Inferences About Process Quality
Generalized Linear Models
Chapter 9 Numerical Integration Numerical Integration Application: Normal Distributions Copyright © The McGraw-Hill Companies, Inc. Permission required.
Chapter 7 Inferences Regarding Population Variances.
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Hypothesis Testing.
Martin Ralbovský KIZI FIS VŠE The GUHA method Provides a general mainframe for retrieving interesting information from data Strong foundations.
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Education 793 Class Notes T-tests 29 October 2003.
Simple Linear Regression Models
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Modeling and simulation of systems Simulation optimization and example of its usage in flexible production system control.
Example We can also evaluate a definite integral by interpretation of definite integral. Ex. Find by interpretation of definite integral. Sol. By the interpretation.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Statistical Decision Theory
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Chapter 7 Estimates and Sample Sizes
Section 9.2 Testing the Mean  9.2 / 1. Testing the Mean  When  is Known Let x be the appropriate random variable. Obtain a simple random sample (of.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Classes of association rules short overview Jan Rauch, Department of Knowledge and Information Engineering University of Economics, Prague.
 In Chapter 10 we tested a parameter from a population represented by a sample against a known population ( ).  In chapter 11 we will test a parameter.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Estimates and Sample Sizes Chapter 6 M A R I O F. T R I O L A Copyright © 1998,
Chapter 8: Simple Linear Regression Yang Zhenlin.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Copyright ©2006 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
1 6. Mean, Variance, Moments and Characteristic Functions For a r.v X, its p.d.f represents complete information about it, and for any Borel set B on the.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
ENGR 610 Applied Statistics Fall Week 7 Marshall University CITE Jack Smith.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Hypothesis Testing Example 3: Test the hypothesis that the average content of containers of a particular lubricant is 10 litters if the contents of random.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Review: Discrete Mathematics and Its Applications
Chapter Nine Hypothesis Testing.
Inference about the slope parameter and correlation
Chapter Eight Estimation.
Two-Sample Hypothesis Testing
3. The X and Y samples are independent of one another.
Sampling Distributions and Estimation
SUBMITTED TO: Mrs. BIKRAMJEET KAUR
Community &family medicine
Review: Discrete Mathematics and Its Applications
Inferences Regarding Population Variances
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Martin Ralbovský Jan Rauch KIZI FIS VŠE

Contents Motivation & introduction Graphs of quantifiers Classes of quantifiers, tables of critical frequencies Graphs of tables of critical frequencies

Motivation Association measures = quantifiers are crucial for quality association mining They have been extensively studied The formulas are hard to comprehend Sometimes interesting results

Four-fold contingency table Mψ ψ abr cds kln

Considered quantifiers 1 Founded implication Lower critical implication Upper critical implication

Considered quantifiers 2 Above average dependence Fishers quantifier Simple deviation

Considered quantifiers 3 Founded equivalence Pairing

Contents Motivation & introduction Graphs of quantifiers Classes of quantifiers, tables of critical frequencies Graphs of tables of critical frequencies

Initial remarks We used the Maple software: possibility to graph 2 dimensions usage of animation parameter to graph 3 dimensions We wanted to see the graphs and compare them, to know more about quantifiers from the graphs.

Graph and animation examples Founded implication graph

Findings after graphing Most graphs difficult to interpret and - founded & critical implications One interesting result obtained – comparison of founded equivalence and pairing quantifiers

Founded equivalence & pairing – known facts Founded equivalence equivalence > implication founded equivalence > founded implication finds equivalent occurrence of ψ and (in terms of positive/negative examples) [Kupka] Pairing quantifier new quantifier [Kupka] pairing of tuple of examined items

Graphs of FE, Pairing Founded equivalencePairing

Learning from graphs Characterizing properties: Founded equivalence – the bigger a+d, the better Pairing – the more a=d, the better When shouldnt be used: Founded equivalencePairing How to help Look at the contingency tables Proper base settings could help Combining the quantifiers

Contents Motivation & introduction Graphs of quantifiers Classes of quantifiers, tables of critical frequencies Graphs of tables of critical frequencies by Jan Rauch

Contents Motivation & introduction Graphs of quantifiers Classes of quantifiers, tables of critical frequencies Graphs of tables of critical frequencies

Comparing implicational quantifiers Founded implication – confidence, basic measure for association mining, simple to comprehend Lower and upper critical equivalence – statistical binomial test, hard to comprehend, computationally demanding If and when can be critical implications replaced by founded implication? What is relation between them?

Using table of maximal bs Table of maximal b is another definition for the quantifier It reduces the dimension (b and p), can be used to compare the implicational quantifiers For founded implication table of maximal b can be written as a function For critical implications, we cannot separate the variables

Tables of maximal bs for implicational quantifiers

Learning from graphs Lower critical < founded < upper critical Founded implication graph – linear curve Critical implications graphs – ??? We examined slopes of graphs: Lower critical impl Upper critical impl

Learning from graphs II Seems that critical implications graphs are symmetric with respect to founded implication graph (slope 2.5) Our working hypothesis: For all natural a: lower critical < founded < upper critical

Creating tables of minimal |b-c| Constructing tables of minimal |b-c| for symmetrical quantifier with F property Algorithm: For given N, finding quadruples, contingency tables (a,b,c,d) for which the quantifier is valid For given a, d searching for maximal |b-c| when the quantifier is still valid Matrix indexed with a, d created Graphing the matrix

Construction

Fishers quantifier vs. simple deviation

For smaller n, the quantifiers are different For higher n, the quantifiers tend to be the same Why???

Above average quantifier

Above average quantifier - analysis In region with high a and low d with respect to n, the quantifier is not valid In this region the two fractions tend to have the same value, therefore it is hard to fulfill the inequality Again, not implicational behavior