Presentation is loading. Please wait.

Presentation is loading. Please wait.

PNC/MLA 2018 Conference/Annual MeetingStatistics 101

Similar presentations


Presentation on theme: "PNC/MLA 2018 Conference/Annual MeetingStatistics 101"— Presentation transcript:

1 PNC/MLA 2018 Conference/Annual MeetingStatistics 101
Nairanjana (Jan) Dasgupta Professor, Dept. of Math and Stats Boeing Distinguished Professor of Math and Science Fellow, American Statistical Association Director, Center of Interdisciplinary Statistics Education and Research (CISER) Washington State University, Pullman, WA

2 Part 1: Data in general Data: Big or Small SOURCE of data
Types of Data Population versus Sample Experiment versus observational studies Exploratory and Confirmatory studies

3 Part 2: making sense of data
Distinction: Uni-variate, Bi-variate, Multi-variate, multiple Graphical Summary of data Numerical summary of data Measures of Center Measures of Spread Measures of the Dimensionality Summarizing multivariate data sets using clusters Population versus sample what do we have data on?

4 Part 3: Making decisions from data:
Going from sample to population Inference and decision making Estimation and Intervals Testing and Confidence Intervals Errors in testing: Type I and Type II Power Statistical significance P-value — good, bad or misused ASA’s statement about p-values

5 Part 1:Data and its collection
Types of Data: The good the bad and the ugly

6 Statistics and Data Statistics has been often defined as the Science (or art) of collecting, compiling, summarizing, analyzing and inferring from data. By this definition: it is the science (or art) that is meant to deal with data

7 Data: GOOD, BAD or the culprit?
I will misquote Samuel Taylor Coleridge here (his quote was about water), when the ancient mariner was stuck in the middle of the ocean: Data data everywhere, and it really makes us blink Data, data everywhere but we’ve got to stop and think… This is the theme of today’s lecture: understanding data, types of data and what we can and cannot do with data. The Mathematics/Statistics that we need to understand to deal with data…

8 How can data be used? Just because we have data does it mean we know something? Can we say anything at an individual level? I want to go to WH Auden and address this question…

9 THE UNKNOWN CITIZEN BY W. H
THE UNKNOWN CITIZEN BY W. H. AUDEN(To JS/07 M 378This Marble Monument Is Erected by the State) He was found by the Bureau of Statistics to be One against whom there was no official complaint, And all the reports on his conduct agree That, in the modern sense of an old-fashioned word, he was a saint, For in everything he did he served the Greater Community. Except for the War till the day he retired He worked in a factory and never got fired, But satisfied his employers, Fudge Motors Inc. Yet he wasn't a scab or odd in his views, For his Union reports that he paid his dues, (Our report on his Union shows it was sound) And our Social Psychology workers found That he was popular with his mates and liked a drink. …

10 Unknown Soldier… Contd
He was married and added five children to the population, Which our Eugenist says was the right number for a parent of his generation. And our teachers report that he never interfered with their education. Was he free? Was he happy? The question is absurd: Had anything been wrong, we should certainly have heard. From Another Time by W. H. Auden.

11 Data collected because or just because
There is a distinction whether we are collecting data with a specific object in mind or we are using existing data that is already available. Even a few years ago data was expensive and valuable and collected mostly with specific objects in mind Now, there is a deluge in terms of data that is available for use as it is being collected ANYWAY.

12 Data Source: Stats is well equipped to deal with

13 Sources of Opportunistic data:

14 Let us start with collecting some data
Let us start with some information about you: Where are you from? How many Statistics classes you have taken? On a scale of 1 to 5 rate your liking for Statistics Your average blood pressure when faced with a Stats problem There is no source that I know of where such data is available so let us physically collect it.

15 Types of data Discrete Numerical Continuous Data Categorical Nominal
Ordinal

16 What are these? Nominal: Name, category Discrete: what you count
Continuous: what you measure Ordinal is in some no-man’s land mostly categorical but with a numerical flavor. What is your unit? On a scale of 1 to 5 rate your liking for Stats How many Statistics classes you have taken? Your blood pressure when faced with a Stats problem

17 How to collect this data?
Questions 1-3 are self reported easily. How about question 4? Let us think about that

18 The big question is: WHY did I collect this information?
The reason we collected this information was to get some idea about all of you so I could come up with a plan of what to talk about, how much detail I go into etc. So, the idea is I take the data I collected and LEARN something from this data? Data by itself is just a bunch of numbers or categories and by itself it doesn’t mean much. What we need to do is figure out how we LEARN from data.

19 Why collect Data? In the past to answer a specific question or questions. However, nowadays data is collected without a specific object in mind, just because everything is data oriented, and collection is easy, internet, cell phone, credit card, Walmart etc. But in general we have a purpose for collecting data, a specific questions or questions. We want to learn something that we do not know from data.

20 Population versus Sample
Our question of interest is almost always about the big picture: something hard to study, but knowing would help us make decisions. Population: Sum total of all individuals and objects in a study Sample: part of the population selected for the study Idea: Get a good sample, study this sample carefully and infer about the population based on the sample. The data I just collected is this a population or a sample? Sample: make sure you stress that if we don’t study it it is not the sample

21 Population, Sample Inference

22 Where does statistical science come in?
If we could always study the population directly: we wouldn’t need Statisticians except for clerical jobs like summarizing… If we are relying on samples: we need to take a GOOD sample: the good is defined by Statistics What type of sample can I take? How exactly can I “infer” attributes about the population (parameter) based on a sample (statistic)? Caveat: the population needs to be a REAL representative population. The information I took on you, observation or experimental?

23 A few sample types

24 Experiments versus Observational Studies
Experiments: You change the environment to see the effect your change had, trying to control all other potential factors that might affect your study. Observational Study: You study the environment as is, and collect data on all possible factors that might be of interest to you. Questions 1-3 definitely observational Question 4 could have been an experiment but possibly observational Talk about designing the study where we are comparing three brands of cake mixes. Discuss Type of mix, temperature, oven effect, the placement effect

25 Experiment versus Observational study

26 Questions to think about:
If you had a choice: would you want to do an experiment or an observational study? Why? Is BIG data always better data?

27 What does it matter the type of study we conduct?
It matters because how we proceed to analyze the data should differ in terms of the type of study we had. Nowadays it is common to have data collected “just because” or “opportunistic data” and these are the extreme types of observational studies. As it wasn’t collected without any aim and it is hard to figure out if it is a population or a sample.

28 Exploratory versus Confirmatory studies
Exploratory studies: We do not have an idea about what we expect to find. So we study a bunch of factors to see how they affect what we are studying. It can be experimental or an observational study (though generally observational) Confirmatory Studies: Generally have an idea what we are expecting to find and do a very focused study to give credence to our beliefs. It can be experimental or an observational study (though generally experimental) Keep in my mind we cannot use data collected in an exploratory study to confirm our belief or hypothesis. Liken this to a fishing expedition and give examples of these kinds of study

29 Exploratory versus confirmatory studies
Exploratory studies generate hypothesis and often we do know/expect certain patterns We confirm these using confirmatory studies Though in general people often skip the confirmatory studies…

30 Part 1: recap It matters what TYPE of data we have.
It matters how the data were collected It matters whether we have a population or a sample It matters if you randomized the process of data collection If the population is studied all you need to do is to summarize, with a sample we need to think of inference.

31 Work sheet for Part 1 What type of data are the following:
Zip code, height, phone number, yearly income, size of family If we randomly choose 50 apple trees in an orchard and measure its height, canopy cover, number of apples: Would this be an observational study or experiment Would the data be univariate or multivariate Would this be an exploratory study or a confirmatory one?

32 Summarization: Making sense of numbers
Part 2 Summarization: Making sense of numbers

33 Summarizing Data Let us go back to the questions we started with:
Location? On a scale of 1 to 5 rate your liking for Stats How many Statistics classes you have taken? Your blood pressure when facing a Stats problem Let us think about answering these questions and address things like: univariate, bivariate and multivariate in this context.

34 Univariate, Bivariate and Multivariate
Univariate: ONLY one variable is measured or disseminated at a particular time Bivariate: Two variables that of equal importance are measured or discussed together Multivariate: Multiple variables are measured and discussed together and each of the variables are of equal importance and collected using the same general method. Most data we collect is multivariate in nature, but we can choose to discuss one variable at a time. Not a great idea, but often done in science. What we collected here is multivariate data

35 Response and Explanatory Variables
For the bivariate and multivariate case we can have two different types of scenarios: ALL variables are equally important We are REALLY interested in one variable but collect the others to understand the variable of interest. The one that we are really interested in is called the RESPONSE variable. The others are called Explanatory variables. It was collected to explain the response. Response variable is taken to be a RANDOM variable (or stochastic). Explanatory variables are assumed non- random.

36 Multiple versus Multivariate
These are associated with whether we have multiple responses or explanatory variables: If we have multiple response variables and we are equally interested in them: multivariate If we have ONE response variable and multiple EXPLANATORY variables: multiple

37 Analyzing UNIVARIATE data
With all the terminology intact, let us consider the most simple case. ONE response variable. We can graph it, and summarize it if it is a population from which we have the data. We can conduct inference if it is a sample. How and what we do, depends upon the data type we have. Hence, knowledge of data types is crucial

38 Summarizing Univariate Categorical Data
What would be some methods used to summarize categorical data? Graphical summary and numerical summary: What graphs are relevant for univariate categorical data? Pie chart Bar chart Line chart etc…

39 Numerical Data: Summary
Graphical summary: Box plots Histograms

40 Snap shot (out of 100 entries)
Location Classes_taken Scale Blood_Pressure F B B C B B B A D

41 Graphs and Charts for our data
Pie Chart for the locations Bar Chart for classes taken

42 Histogram and Box-plots
For Scale Summarizing BP

43 Summarizing Categorical data: Univariate
What numerical summaries would be relevant for categorical data: For example let us take your location How would you summarize all the information in our data by one (or a few numbers)? Idea of Central Tendency: Most naturally arising data has a tendency to clump in the middle of the range of possible values.

44 Measures of Central Tendency
How does one measure what is happening in the CENTER of the data? Thoughts? Mean Median Mode

45 Mean, Median, Mode What are the physical interpretations of these:
Center of Gravity Middle-most point Most frequently appearing number, category or group

46 Pictures of mean, median and mode

47 Measuring Central tendency for Categorical Data
When dealing with your majors: Does Mean make sense? Does Median make sense? How about mode?

48 Numerical Data Let us consider numerical data:
Which measure of central tendency makes sense here? Which would you prefer? Mean Median Mode

49 More Summarization: Measure of Center, provide us with a first step for summarizing data. But often it cannot differentiate between data sets. Consider the following data sets: Set1: Set2: Set3: They all have same mean and median=40. Are they identical? What makes them different?

50 Other summary measures
Measures of Spread Shapes of the distribution of data Where is the peak Measures of symmetry Percentiles

51 Measures of Spread Standard Deviation Variance Range
Inter-quartile range Median Absolute Deviation

52 Summarizing Bivariate and Multivariate data
Categorical: frequency tables Numerical: Scatter plots Many types of visualizations coming up because of “big data”.

53 Table of location and Scale
A B C D E F G H All All

54 Scatter Plot

55 Measures of Association
For bivariate data we can calculate measures of association Kendall’s tau (for ordinal data) Correlation Coefficient (for numerical data) For our data Kendall’s Tau for Major and Scale is .07 Correlation coefficient between BP and Classes .03

56 Summarizing multivariate data
Principal Component Analysis The following are loosely considered summarizing methods: Clustering

57 What is PCA? This is a MATHEMATICAL procedure that transforms a set of correlated responses into a smaller set of uncorrelated variables called PRINCIPAL COMPONENTS. Essentially gives us an idea about the dimensionality of the data Used for summarizing the overlap in the variables Uses: Data screening Clustering Discriminant Analysis

58 Objectives of PCA Reduce dimensionality, rather try to understand the TRUE dimensionality of the data Identify “meaningful” variables If you have a VARIANCE-COVARIANCE MATRIX, S: PCA returns new variables called Principal Components that are: Uncorrelated First component explains MOST of the variability The remaining PC explain decreasing amounts of variability

59 Variance-Covariance as a matrix
If we are interested in looking at several variables, we can construct an array or matrix where we look at the variance of each variable and the covariance about 2 variables. For example if we have 3 variables, the matrix will look like: Var (X1) Cov(X1,X2) Cov(X1,X3) Cov(X2,X1) Var(X2) Cov(X2,X3) Cov(X3,X1) Cov(X3,X2) Var(X3)

60 What is Correlation? Correlation is defined by:
Cov(X,Y)/(Std(X)Std(Y)) Where, Std=square root (Var)

61 Caveats The whole idea of PCA is to transform a set of correlated variables to a set of uncorrelated variables, hence if the data are already uncorrelated, not much additional advantage of doing PCA. One can do PCA on correlation matrix or the Covariance matrix.

62 What is Clustering? Clustering is an EXPLORATORY statistical technique used to break up a data set into smaller groups or “clusters” with the idea that objects within a cluster are similar and objects in different clusters are different. It uses different distance measures between units of a group and across groups to decide which units fall in a group.

63 Clustering in General Idea: to group observations that are “similar” based on predefined criteria. Clustering can be applied to rows and/or columns Clustering allows for reordering of the rows/columns to make it appropriate for visualization. Fundamentally an exploratory tool, clustering is firmly imbedded in many people’s minds as the statistical method for the analysis…

64 Why Cluster? There are very few formal theories about clustering though intuitively the idea is: cluster the internal cohesion and external isolation. Time-course experiments are often clustered to see if there are developmental similarities. Useful for visualization. Generally considered appropriate in typical clinical experiments.

65 Clustering How is “closeness decided”?
For clustering we generally need two ideas: Distance: the original distance used to measure the distance between two points Linkage: condensation of each group of observations into a single representative point

66

67 Types of Clustering Hierarchical and Non-hierarchical methods:
Non-Hierarchical (Partitioning): Have an initial set of cluster seed points and then build clusters around the point, using one of the distance measures. If the cluster is too large, it can split into smaller ones. Hierarchical: Observed data points are grouped into clusters in a nested sequence of groups.

68

69

70 Dendograms The dendogram should be interpreted with care, remember each branch of the dendogram is really like a mobile and can rotate, without altering the mathematical structure of the tree. Neighboring nodes are “close” ONLY if they lie on the same branch. It has been proposed one should slice the tree and look at the clusters produced therein. However, WHERE to cut the tree is subjective and there is no consensus about this. Issue: mistakes made early have no way of being corrected later in this approach.

71 Some remarks on clustering- 1
Simplistically, clustering cannot fail. That is, every clustering method will return clusters, whether the data are organized in clusters or not. Clustering helps to group / order information and is a visualization tool for learning about the data. However, clustering results do not provide any kind of “proof” of anything.

72 Some remarks-II One of the more paradoxical aspects of clustering is that it gets used in practice, even when class labels are available instead of using a discrimination method. The idea is: it is somehow seen as less “biased” to demonstrate the ability of the data to produce the class differences without using class labels. When the inferred clusters largely coincide with the known classes, this is thought to “validate” the class labels. The illogicality and inefficiency of this process does not seem to have become widely appreciated. One sees different “classifiers” (e.g. different gene sets) compared w.r.t their ability to separate known classes, simply by inspecting the clustering they produce, rather than by building classifiers.

73 Comparing clusters:

74 Back to Population versus Sample
The data that we collected, was it a population or was it a sample? IF it is a population ALL we need to do is to summarize it or graph it. We have studied the population hence we do not need to do any more. So, the question now is the data that we have what do we do with it. Stop now (we are done summarizing or should we do something more).

75 Recap of Part 2 Use Pie charts, bar graphs for visualizing univariate categorical data Use box plots, histograms for univariate numerical data For bivariate data we can do scatter plots For multivariate data we can do clusters For numerical data use mean, median as measures of center For categorical use mode Use standard deviation, iqr for spread for numerical data For categorical use the frequency plot or tables to summarize it Summarization allows us to make sense of raw data and is crucial before we analyze it.

76 Worksheet for Part 2 How would you summarize numerically and graphically the measures from the apple tree: Height, canopy cover, number of apples, type of apple There are 3 different types of apple in your data and when you cluster the apples you get the following dendogram. So you think your belief that Variety 2 and 3 were more similar than 1 and 3 are justified.

77 Dendogram

78 Inference and analysis: Making Decisions from data
Part 3 Inference and analysis: Making Decisions from data

79 When and Why we need inference
IF the data we collected was really a population we do not need to do any inference. If however, we assume that this was just a sample from a larger population of ALL medical librarians, then we need to take this sample and make inferences about this sample to relate back to the population.

80 Inference for univariate response
Let us again consider this from an univariate, bivariate and multivariate stand point for both categorical and numerical data. What are the questions we might want answered for the variable LOCATION. What about BP?

81 Inference: Use data and statistical methods to infer (get an idea, resolve a belief, estimate) something about the population based on results of a sample Inference Estimation Point Estimation Interval Estimation Hypothesis Testing

82 Estimation We have no idea at all about the parameter of interest and we use the sample information to get an idea (estimate) the population parameter Point Estimation: Using a single point to estimate the population parameter Interval Estimation: We use an interval of values to estimate the population parameter

83 Hypothesis Testing We have some idea about a population parameters.
We want to test this claim based on data we collect Probably one of the most used and most misunderstood method used in science. This provides us with the “dreaded p-value”.

84 Parameter To understand inference, we really need to get a very clear idea about what is a parameter. By definition: Parameter is a numerical characteristic (or characteristics if multivariate) of the population that is of interest. To make this specific: let us consider the following example Example: We are interested in the average number of statistics classes taken by medical librarians.

85 Population and Parameter:
Here the population is all medical librarians We have to be careful here: is it all CURRENT librarians as well as the past and future. To make matters easy let us say it is CURRENT medical librarians (ML) The parameter is: the average number of statistics classes taken by ML.

86 Sample and Statistic Our choices are to do a census and then compute the average from the entire census – this is the parameter Or to take a sample and calculate the number FROM the sample. If we use the sample and compute the number from the sample we call the sample average our STATISTIC.

87 How do we sample Here we need to think of how we sample this very well defined population. Thoughts? Hallmarks of a good sample: representative, unbiased, reliable

88 Estimation: Complete Ignorance about parameter
If we use the sample statistic to get an idea of the population sample, what we are doing is inference, specifically ESTIMATION What assures us that the sample statistic will be a good estimate of the population parameter? This leads us to Unbiasedness and Precision Do the bulls eye plot here

89 Where does Statistics come in?

90 Point Estimation The idea of point estimation seems intuitive: we use the sample value for the population value. The reason we can do this, is because we make certain assumptions about the probability distribution of the sample statistic Generally we assume that the sampling scheme we pick allows us an unbiased and high precision distribution of the statistic. If our method is indeed unbiased then the sample mean, should on average be a good estimator for the population mean.

91 Interval Estimation Even if we believe in the unbiasedness of our estimator, we still often want an interval rather than just a single value for the estimates. This allows us to have interval estimation. This technique takes into account the spread as well as the distribution in the estimation. It gives us an interval of values, in which we feel that our parameter is contained with high confidence. Talk about the fact that the interval is random rather than the parameter. Liken this to trying to capture a target with a horseshoe

92 Confidence Interval: In general a confidence interval for the population mean is given by: Sample mean ± margin of error Question is: how does one calculate “margin of error” Answer: we need distributions and random variables to do that. This means some mathematics and probability theory.

93 Confidence interval Used quite a bit
Gives similar information as hypothesis tests Often can be inverted to construct tests However, theoretically it is quite different, as here we talk about the SIZE of the effect rather than significance of the effect. With all the bad press that p-values have received this might make a come back.

94 Hypothesis Testing: some idea about the parameter
We have some knowledge about the parameter A claim, a warranty, what we would like it to be We test the claim using our data; First step: formulating the hypothesis Always a pair: Research and Nullification of research (affectionately called Ho and Ha)

95 How to formulate your hypothesis
First state your claim or research. Let us say we believe that the average number of stats classes taken by graduate students coming into WSU is greater than 2. Here our parameter is Population mean of statistics classes taken by graduate students at WSU Claim: Mean > 2 What nullifies this? Mean ≤ 2 (Remember the “=“ always resides with the null)

96 Logic of Testing To actually test the hypothesis, what we try to do is to disprove or reject the null hypothesis. If we can reject the null, by default our Ha (which is our research) is true. Think of how the legal system works: H0: Not Guilty Ha: Guilty

97 How do we do this? We take a sample and look at the sample values. Then we see if the null was true, would this be a likely value of the sample statistic. If our observed value is not a likely value, we reject the null. How likely or unlikely a value is, is determined by the sampling distribution of that statistic.

98 Example In our example we were interested in the hypothesis about the average score in their last Stats class: H0: µ≤2 Ha: µ > 2 If we observed a sample with a mean of 4 and a standard deviation of 2 from a sample of 100 would you consider the null likely?? How about if the mean was 4 and the standard deviation was 20 from a sample of 100?

99 Players in decision making
Your observed statistic Your sample size Your observed standard deviation Your capacity of being able to find probabilistically how likely our observed value is under the null.

100 Errors in testing: Since we take our decisions about the parameter based on sample values we are likely to commit some errors. Type I error: Rejecting Ho when it is true (False Positive) Type II error: Failing to reject H0 when Ha is true (False Negative) In any given situation we want to minimize these errors. P(Type I error) = a, Also called size, level of significance. P(Type II error) = b, Power = 1-b, HERE we reject H0 when the claim is true. We want power to be LARGE. Power is the TRUE Positive we want.

101 Example I am introducing a new drug into the market. The drug may have some serious side effects. Before I do so I will go through tests to see if is effective in curing disease. H0: not effective Ha: drug is effective What is Type I error and Type II error in this case? Which is worse? More importantly think of the consequence of these errors.

102 One more example: Ann Landers in her advice column on the reliability of DNA testing for determining paternity advises, “To get a completely accurate result you would have to be tested, so would the man and your mother. The test is 100% accurate if the man is NOT your father, and 99.9% accurate if he is. Consider the hypothesis: Ho: a particular man is the father Ha: a particular man is not the father. Discuss the chances of probability of Type I and II errors.

103 Decision Making using Hypotheses:
In general, this is the way we make decisions. The idea is we want to minimize both Type I and II errors. However, in practice we cannot minimize both the errors simultaneously. What is done, is we fix our Type I error at some small level, ie 0.1, 0.05 or 0.01 etc. Then we find the test that will minimize our Type II error for this fixed level of Type I error. This gives us the the most powerful test. So in solving a hypothesis problem, we formulate our decision rule using the fixed value of Type I error. The decision rule is also called the CRITICAL VALUE.

104 How does rejection of null work with Critical values?
Here we calculate the value we can find a distribution of from the sample. Then we look at this value and compare it with the distribution of the sample statistic to allow ourselves Type I error of alpha. Based on this, if our observed value is beyond our critical value, we feel justified in rejecting the null. CRITICISM: Choice of alpha is arbitrary. We can make alpha big or small depending on what we want our outcome to be… Draw the picture here.

105 P-values: Elephant in the room
Sometimes hypothesis testing can be thought to be subjective. This is because the choice of a-values may alter a decision. Hence it is thought that one should report p- values and let the readers decide for themselves what the decision should be. p-value or probability value is the probability of getting a value worse than the observed. If this probability is small then our observed is an unlikely value under the null and we should reject the null. Otherwise we cannot reject the null.

106 P-value for our example
For the hypothesis we talked about earlier: H0: µ≤2 Ha: µ > 2 If we observed a sample with a mean of 4 and a standard deviation of 2 from a sample of 100 would you consider the null likely?? How about if the mean was 4 and the standard deviation was 20 from a sample of 100? P-value = P(Z > (4-2)/(2)/sqrt(100) | µ≤2 ) <.001 P-value = P(Z > (4-2)/(20)/sqrt(100) | µ≤2 ) <.16

107 Criticism of p-values As more and more people used p-values and with an effort to guard against the premise that “we can fool some of the people some of the time”, journals started having strict rules about p-value. To publish you needed to show small p-values. No SMALL p-values no publication… This has often led to publication of ONLY significant results. Also, led to let us get the p-value small by hook or crook attitude.

108 ASA Statement about p-value
It really tells us how incompatible the data are with a specified statistical model. P-values do not measure the probability that studied hypothesis is true or that the data were produced by random chance alone. Scientific conclusions and business policy decisions should not be based on only whether a p-value passes a specific threshold. Proper inference requires full reporting and transparency P-value does NOT measure the size of the effect or the importance of the result. By itself the p-value cannot provide a good measure of evidence regarding the model.

109 A simple example: Term Coef SE Coef T-Value P-Value Constant X

110 Power: The other elephant in the room
Power is the TRUE positive In other words what is the probability you would reject the null under a specified value of the alternative. So first we need to figure out what value of the alternative we choose to calculate the power. This choice is up to us and we often call it the effect size.

111 Example of power For the hypothesis we talked about earlier: H0: µ≤2
Ha: µ > 2 If we observed a sample with a mean of 4 and a standard deviation of 2 from a sample of Calculate power when mu=2.5, 3, 3.5, 4 Power = P (Z> (2.5-2)/2/10 ) = P(Z > 2.5) = .0162 Etc.

112 Power in pictures

113 What are the players for power?
Sample size Effect size Standard deviation. So to really calculate power one needs to have data to understand the distribution and have a feel for the standard deviation. Pre-hoc power calculation is often “trying to fool some of the people some of the time”

114 Recap of Part 3 To make inferences our population of interest and parameter needs to be well defined. Errors exist in testing and have to be considered P-values are a measure of incompatibility of the existing data given the null hypothesis and cannot be used to PROVE anything. To calculate power we need to look into values under the alternative and this can be subjective.

115 Worksheet for Part 3: True or False: Type I error is always the worst hence w should focus on controlling it rather than Type II error. You believe that the average time it takes students to walk from one class to another at WSU is more than the 10 minutes you are allotted. Write out your null and alternate hypothesis Write down what the type 1 error would be in this context. You test it and get a p-value of .13, does this indicate that the null is true?

116 What we wanted to learn and what we did learn
Overview What we wanted to learn and what we did learn

117 Take home messages (hopefully)
There are several types of data and each type has its own nuances. The concept of population versus sample Experimental, observational and opportunistic studies Exploratory and confirmatory studies Distinction between univariate, multivariate To summarize type and dimension matters Graphs are worth a thousand words

118 More take home messages
Estimation and Testing Errors in testing Decision making, power and p-values


Download ppt "PNC/MLA 2018 Conference/Annual MeetingStatistics 101"

Similar presentations


Ads by Google