Solution 7 W=sum of ranks of group A (best student=rank 1, etc).

Slides:



Advertisements
Similar presentations
Chapter 16 Introduction to Nonparametric Statistics
Advertisements

Measures of Dispersion and Standard Scores
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Calculus Review Texas A&M University Dept. of Statistics.
Normal Distributions What is a Normal Distribution? Why are Many Variables Normally Distributed? Why are Many Variables Normally Distributed? How Are Normal.
Continuous Random Variables and Probability Distributions
Chap 6-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 6 Continuous Random Variables and Probability Distributions Statistics.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Random variables Petter Mostad Repetition Sample space, set theory, events, probability Conditional probability, Bayes theorem, independence,
Some Useful Continuous Probability Distributions.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Use of moment generating functions 1.Using the moment generating functions of X, Y, Z, …determine the moment generating function of W = h(X, Y, Z, …).
Chapter 5.6 From DeGroot & Schervish. Uniform Distribution.
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
INVESTIGATION 1.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Essential Statistics Chapter 21 Describing Distributions with Numbers.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
1 Uses both direction (sign) and magnitude. Applies to the case of symmetric continuous distributions: Mean equals median. Wilcoxon Signed-Rank Test.
Models for Continuous Variables. Challenge What to do about histograms describing distributions for continuous data? Especially for large collections.
LIS 570 Summarising and presenting data - Univariate analysis.
The Normal Distribution AS Mathematics Statistics 1 Module.
1 Nonparametric Statistical Techniques Chapter 18.
4.3 Probability Distributions of Continuous Random Variables: For any continuous r. v. X, there exists a function f(x), called the density function of.
1 2 Sample Tests – Small Samples 1.Small sample, independent groups a. Test of equality of population variances b. If variances are equal, t-test c. If.
Chapter 3 Probability Distribution Normal Distribution.
Exploratory Data Analysis
Continuous random variables
Where Are You? Children Adults.
Chapter 3 Probability Distribution
MATB344 Applied Statistics
Normal Distribution and Parameter Estimation
MTH 161: Introduction To Statistics
4.3 Probability Distributions of Continuous Random Variables:
Cumulative distribution functions and expected values
Column Sequences An Investigation.
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 7: Sampling Distributions
Lecture 16 Nonparametric Two-Sample Tests
Environmental Modeling Basic Testing Methods - Statistics
SA3202 Statistical Methods for Social Sciences
Lecture 13 The Quantile Test
Lecture 11 Nonparametric Statistics Introduction
Lecture 15 Wilcoxon Tests
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Lecture 12 Binomial Tests and Quantiles
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 5 Continuous Random Variables and Probability Distributions
Tutorial 7 Consider the example discussed in the lecture, concerning the comparison of two teaching methods A and B, and let W denote the sum of the.
Tutorial 9 Suppose that a random sample of size 10 is drawn from a normal distribution with mean 10 and variance 4. Find the following probabilities:
Texas A&M University Dept. of Statistics
Selecting Among 2-Sample Tests
4.3 Probability Distributions of Continuous Random Variables:
The Rank-Sum Test Section 15.2.
Click the mouse button or press the Space Bar to display the answers.
Lecture 12: Normal Distribution
CHAPTER 2: Basic Summary Statistics
ASV Chapters 1 - Sample Spaces and Probabilities
Nonparametric Statistics
Normal Distribution The Bell Curve.
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Measures of Variability
Distribution-Free Procedures
Chapter 12 Statistics.
Solution 8 1. (a). H0: x(.5)=63 vs H1: x(.5) not equal to 63 (x(.5) is the median of X) U=#{I|Xi
Chapter 5 Continuous Random Variables and Probability Distributions
Continuous Distributions
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Solution 7 W=sum of ranks of group A (best student=rank 1, etc). (a) (i). Reject H0 if W is too small. Reject H0 if W<=7, alpha=2/35=.0571; Reject H0 if W<=8, alpha=4/35=.1143. (ii). Reject H0 if W is too large. Reject H0 if W>=17, alpha=2/35=.0571; Reject H0 if W>=16, alpha=4/35=.1143. (iii). Reject H0 if W is too small or too large. Reject H0 if W<=6 or W>=18, alpha=2/35=.0571; Reject H0 if W<=7 or W>=17, alpha=4/35=.1143, Reject H0 if W<=6 or W>=17, alpha=3/35=.0857, (Not equal tailed) Reject H0 if W<=7 or W>=18, alpha=3/35=.0857. (Not equal tailed) Note: In each case, the nearest test to the nominal size 10%, has actual size 11.43%. My recommendation is not to exceed the nominal size alpha, and go for a test of smaller size. (b). W Probability Possible combination 6 1/35 (1,2,3) 7 1/35 (1,2,4) 8 2/35 (1,2,5) (1,3,4) 9 3/35 (1,2,6) (1,3,5) (2,3,4) 10 4/35 (1,2,7) (1,3,6) (1,4,5) (2,3,5) 11 4/35 (1,3,7) (1,4,6) (2,3,6) (2,4,5) 12 5/35 (1,4,7) (1,5,6) (2,3,7) (2,4,6) (3,4,5) 13 4/35 (1,5,7) (2,4,7) (2,3,6) (3,4,6) 14 4/35 (1,6,7) (2,5,7) (3,4,7) (3,5,6) 15 3/35 (2,6,7) (3,5,7) (4,5,6) 16 2/35 (3,6,7) (4,5,7) 17 1/35 (4,6,7) 18 1/35 (5,6,7) 1/18/2019 SA3202, Solution 7

Thus E(W)=12, Var(W)=8 Note: The distribution is symmetric; even for such small sample sizes (3 and 4) it is not very non-normal. (c ). The point for parts (c ) and (d) is this: for large sample sizes, we need to use normal approximation, for which we need to know the mean and variance. We can find the mean by noting that the distribution of W is symmetric. So the mean is the “middle” value, which is 12. This method can be applied for large samples also (find the min and max value and take average). But there is an easier way as described below. When ranking the 7 students, the average rank is (1+2+…+7)/7=7*8/2/7=4. If there is no difference between A and B, each student would get one of the ranks at random. So each student is expected to get a rank 4. So the sum of the ranks for the 3 students in group A is expected to be 4+4+4=12. The above argument can be used to find the mean of W in general, with sample sizes n1 and n2, and n=n1+n2. In fact, when ranking n students, the average rank is (n+1)/2. So for the n1 students in group A, the sum of the ranks is expected to be E(W)=n1(n+1)/2. More formally, W=R1+R2+R3 where R1, R2, and R3 are the ranks of the three students in group A. Each R takes one of the values 1,2, …,7 with equal probability: Pr(Ri=j)=1/7, j=1,2,…,7 From which, we have E(Ri)=4, Var(Ri)=4. Hence E(W)=E(R1+R2+R3)=3*4=12. 1/18/2019 SA3202, Solution 7

(d). Note that for different i and j : Pr(Ri=k, Rj=l)= Pr(Rj=l | Ri=k) Pr(Ri=k)=1/6*1/7=1/42. Thus for different i and j: E(RiRj)=46/3, E(Ri)E(Rj)=16, Cov(Ri,Rj)=E(RiRj)-E(Ri)E(Rj)=-2/3. Thus Var(W)=Var(R1+R2+R3)=Var(R1)+Var(R2)+Var(R3)+2Cov(R1,R2)+2Cov(R1,R3)+2Cov(R2,R3) =4*3-2*2/3*3=8. Note that the above argument can be used for the general case with sample sizes n1, n2, and n=n1+n2. Pr(Ri=j)=1/n, j=1,2,…,n Pr(Ri=k,Rj=l)=Pr(Ri=k|Rj=l)Pr(Rj=l)=1/(n-1)*1/n=1/[(n-1)n] E(Ri)=(1+2+….+n)/n=n(n+1)/2/n=(n+1)/2. E(Ri^2)=(1+2^2+…+n^2)/n=n(n+1)(2n+1)/6/n=(n+1)(2n+1)/6. Var(Ri)=(n+1)(2n+1)/6-(n+1)^2/4=(n^2-1)/12. E(RiRj)=[(1+2+…+n)^2-(1+2^2+…+n^2)]/[n(n-1)]=(n+1)(3n+2)/12 Cov(Ri,Rj)=(n+1)(3n+2)/12-(n+1)^2/4=-(n+1)/12. It follows that E(W)=E(R1+…+Rn1)=n1(n+1)/2 Var(W)=n1(n^2-1)/12-n1(n1-1)(n+1)/12=n1n2(n+1)/12=n1n2(n1+n2+1)/12. 1/18/2019 SA3202, Solution 7

(a) X~U[10,30], so F(x)=(x-10)/20, it follows that xp=10+20p. Thus, the median is Q2=10+20*.5=20. The quartiles are Q1=10+20*.25=15, Q2=20,Q3=25 The deciles are 12,14,16,…,28 (b) X~Exp(20). Then F(x)=1-exp(-x/20), x>=0. It follows that xp=-20 log(1-p). Using this formula, we can compute the median, quartiles and deciles respectively. (c) X~N(10,4), Z=(X-10)/2~N(0,1). F(x)=Phi((x-10)/2). Thus xp=10+2 zp where zp is the quantiles of Z. Using the normal cdf table, we can compute any quantile of X. For example, the median of X is 10+2 z(.5)=10. 1 , when x>2 3. X~U[0,2], so f(x)=1/2 , 0<=x<=2, and F(x)= x/2, when 0<=x<=2 0, when x<0. Then, the density function for X(r ) is for r=1,2,3,4,5, and n=5. 1/18/2019 SA3202, Solution 7