SampleFK 111 232 371 481 591 6112 7232 8371 9392 10451 11461 12591 ChiMerge Discretization Statistical approach to Data Discretization Applies the Chi.

Slides:



Advertisements
Similar presentations
Exponential Distribution. = mean interval between consequent events = rate = mean number of counts in the unit interval > 0 X = distance between events.
Advertisements

Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
Business Statistics - QBM117 Revising interval estimation.
ChiMerge Technique Example
1 The Chi squared table provides critical value for various right hand tail probabilities ‘A’. The form of the probabilities that appear in the Chi-Squared.
More Discrete Probability Distributions
Selection Sort
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
© 2002 Thomson / South-Western Slide 8-1 Chapter 8 Estimation with Single Samples.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 11 Inferences About Population Variances n Inference about a Population Variance n.
Probability & Statistics I IE 254 Summer 1999 Chapter 4  Continuous Random Variables  What is the difference between a discrete & a continuous R.V.?
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
Selection Sort
Elementary Statistics Discrete Probability Distributions.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
4.3 More Discrete Probability Distributions NOTES Coach Bridges.
Data Mining Spring 2007 Noisy data Data Discretization using Entropy based and ChiMerge.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Discrete Probability Distributions Chapter 4. § 4.3 More Discrete Probability Distributions.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Discrete Probability Distributions Chapter 4. § 4.3 More Discrete Probability Distributions.
Distributions GeometricPoisson Probability Distribution Review.
MathematicalMarketing Slide 4b.1 Distributions Chapter 4: Part b – The Multivariate Normal Distribution We will be discussing  The Multivariate Normal.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Discrete Probability Distributions Chapter 4. § 4.3 More Discrete Probability Distributions.
Chapter 8 DISCRETIZATION Cios / Pedrycz / Swiniarski / Kurgan.
Chapter 7 Estimation. Chapter 7 ESTIMATION What if it is impossible or impractical to use a large sample? Apply the Student ’ s t distribution.
ChiMerge Discretization
Statistical Inferences for Population Variances
Inference concerning two population variances
The Exponential and Gamma Distributions
Objectives (BPS chapter 23)
Review 1. Describing variables.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Ch8.4 P-Values The P-value is the smallest level of significance at which H0 would be rejected when a specified test procedure is used on a given data.
Variance and Standard Deviation Confidence Intervals
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Merging Merge. Keep track of smallest element in each sorted half.
Confidence Intervals and Hypothesis Tests for Variances for One Sample
AP Statistics: Chapter 7
Section 6-4 – Confidence Intervals for the Population Variance and Standard Deviation Estimating Population Parameters.
Goodness-of-Fit Tests
Bivariate Testing (Chi Square)
Chapter 8 – Continuous Probability Distributions
Chi Squared Test.
Construction Engineering 221
POINT ESTIMATOR OF PARAMETERS
Statistical Process Control
Continuous Probability Distributions Part 2
Chapter 4 Discrete Probability Distributions.
Chi2 (A.K.A X2).
Data Analysis Module: Chi Square
Continuous Probability Distributions Part 2
A G L O R H I M S T A Merging Merge.
Discrete Probability Distributions
Continuous Probability Distributions Part 2
Chapter 7 Lecture 3 Section: 7.5.
Introduction to Probability Distributions
A G L O R H I M S T A Merging Merge.
A G L O R H I M S T A Merging Merge.
Introduction to Probability Distributions
Chapter 6 Confidence Intervals.
Continuous Probability Distributions Part 2
Statistical Inference for the Mean: t-test
Quadrat sampling & the Chi-squared test
Quadrat sampling & the Chi-squared test
Sampling Distributions Key to Statistical Inference
Chi - square.
Presentation transcript:

SampleFK ChiMerge Discretization Statistical approach to Data Discretization Applies the Chi Square method to determine the probability of similarity of data between two intervals.

SampleFK {0,2} {2,5} {5,7.5} {7.5,8.5} {8.5,10} {10,17} {17,30} {30,38} {38,42} {42,45.5} {45.5,52} {52,60} Intervals ChiMerge Discretization Example Sort and order the attributes that you want to group (in this example attribute F). Start with having every unique value in the attribute be in its own interval.

SampleFK SampleK=1K= total202 SampleK=1K= total112 ChiMerge Discretization Example Begin calculating the Chi Square test on every interval

SampleK=1K= total202 SampleK=1K= total112 E 11 = (1/2)*1 =.05 E 12 = (1/2)*1 =.05 E 21 = (1/2)*1 =.05 E 22 = (1/2)*1 =.05 E 11 = (1/2)*2 = 1 E 12 = (0/2)*2 = 0 E 21 = (1/2)*2 = 1 E 22 = (0/2)*2 = 0 X 2 = (0-.5) 2 /.5 + (0-.5) 2 /.5 + (0-.5) 2 /.5 + (0-.5) 2 /.5 = 2 X 2 = (1-1) 2 /1+(0-0) 2 /0+ (1-1) 2 /1+(0-0) 2 /0 = 0 Threshold.1 with df=1 from Chi square distribution chart merge if X 2 < ChiMerge Discretization Example

SampleFK {0,2} {2,5} {5,7.5} {7.5,8.5} {8.5,10} {10,17} {17,30} {30,38} {38,42} {42,45.5} {45.5,52} {52,60} Intervals Chi ChiMerge Discretization Example Calculate all the Chi Square value for all intervals Merge the intervals with the smallest Chi values

SampleFK {0,2} {2,5} {5,10} {10,30} {30,38} {38,42} {42,60} Intervals Chi 2 ChiMerge Discretization Example Repeat

SampleFK {0,5} {5,10} {10,30} {30,42} {42,60} Intervals Chi 2 ChiMerge Discretization Example Again

SampleFK {0,5} {5,10} {10,30} {42,60} Intervals Chi 2 ChiMerge Discretization Example Until

SampleFK {0,10} {10,30} {42,60} Intervals Chi 2 ChiMerge Discretization Example There are no more intervals that can satisfy the Chi Square test.