The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is.

The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is of 2 marks. Total 30 marks. Part II is of 15 questions. Answer any 10 questions Each question is of 1 mark. Total 10 marks. No MCQ’s. You should write the answers. No major calculations. No need to memorize the formulas. Bring your own calculator. Cell phones are not allowed to use as a calculator.

Study Designs Levels of measurements (Type of data) Measures of Location Measures of Dispersion Sampling Distribution of Means and proportions Normal Distribution Hypothesis testing & Z- test Student’s t-test Chi square test Confidence Intervals How to write a research paper ? ----- ( 40 marks)

 QUALITATIVE DATA  DISCRETE QUANTITATIVE  CONTINOUS QUANTITATIVE

 Nominal – qualitative classification of equal value: gender, race, color, city  Ordinal - qualitative classification which can be rank ordered: socioeconomic status of families  Interval - Numerical or quantitative data: can be rank ordered and sizes compared : temperature  Ratio - Quantitative interval data along with ratio: time, age.

CONTINUOUS DATA QUALITATIVE DATA wt. (in Kg.) : under wt, normal & over wt. Ht. (in cm.): short, medium & tall

A science called clinimetrics in which qualities are converted to meaningful quantities by using the scoring system. Examples: (1) Apgar score based on appearance, pulse, grimace, activity and respiration is used for neonatal prognosis. (2) Smoking Index: no. of cigarettes, duration, filter or not, whether pipe, cigar etc., (3) APACHE( Acute Physiology and Chronic Health Evaluation) score: to quantify the severity of condition of a patient

MEASURES OF Central Tendency Mean, Median, Mode Geometric Mean & Harmonic Mean

 A distribution is symmetrical if the frequencies at the right and left tails of the distribution are identical, so that if it is divided into two halves, each will be the mirror image of the other.  In a symmetrical distribution the mean, median, and mode are identical.

Mean= 13.4 Mode= 13.0

PPositively skewed distributions: distributions which have few extremely high values (Mean>Median) NNegatively skewed distributions: distributions which have few extremely low values(Mean<Median)

Mean=1.13 Median=1.0

Mean=3.3 Median=4.0

 IF variable is Nominal..  Mode  IF variable is Ordinal...  Mode or Median(or both)  IF variable is Interval-Ratio and distribution is Symmetrical…  Mode, Median or Mean  IF variable is Interval-Ratio and distribution is Skewed…  Mode or Median

 RANGE  INTERQUARTILE RANGE  VARIANCE  STANDARD DEVIATION

Quartiles, Deciles, Percentiles

50th percentile 80th percentile 50% included here 80% included here 25th percentile 25% included here Locating Percentiles in a Frequency Distribution

 Standard deviation: measures the variation of a variable in the sample.  Technically,

DISTRIBUTION OF DATA IS SYMMETRIC ---- USE MEAN & S.D., DISTRIBUTION OF DATA IS SKEWED ---- USE MEDIAN & QUARTILES

 Standard error of mean is calculated by:

 The standard deviation (s) describes variability between individuals in a sample.  The standard error describes variation of a sample statistic.  The standard deviation describes how individuals differ.  The standard error of the mean describes the precision with which we can make inference about the true mean.

 Standard error of the mean (sem):  Comments:  n = sample size  even for large s, if n is large, we can get good precision for sem  always smaller than standard deviation (s)

For a 0/1 variable, the standard deviation simplifies to a simple function of the proportion ones in the population: The standard deviation of the sampling distribution then simplifies as follows:

Two Steps in Statistical Inferencing Process 1.Calculation of “confidence intervals” from the sample mean and sample standard deviation within which we can place the unknown population mean with some degree of probabilistic confidence 2.Compute “test of statistical significance” (Risk Statements) which is designed to assess the probabilistic chance that the true but unknown population mean lies within the confidence interval that you just computed from the sample mean.

‘The mean sodium concentrations in the two populations are equal.’ Alternative hypothesis Logical alternative to the null hypothesis ‘The mean sodium concentrations in the two populations are different.’Hypothesis simple, specific, in advance

100 110 120 130 140 One-tail test Ho:μ= μo Ha: μ> μo or μ< μo Alternative Hypothesis: Mean systolic BP of Nephrology patients is significantly higher (or lower) than the mean systolic BP of normal patients. 0.05

Two-tail test Ho:μ= μo Ha: μ# μo Alternative Hypothesis : Mean systolic BP of Nephrology patients are significantly different from mean systolic BP of normal patients. 100 110 120 130 140 0.025

Every decisions making process will commit two types of errors. “We may conclude that the difference is significant when in fact there is not real difference in the population, and so reject the null hypothesis when it is true. This is error is known as type-I error, whose magnitude is denoted by the Greek letter ‘α ’. On the other hand, we may conclude that the difference is not significant, when in fact there is real difference between the populations, that is the null hypothesis is not rejected when actually it is false. This error is called type-II error, whose magnitude is denoted by ‘ β ’.

Disease (Gold Standard) Present Correct Negative Total Positive Test False Negative a+b a+b+c+d Total Correct a+cb+d c+d False Positive Result Absent a b cd

 This level of uncertainty is called type 1 error or a false-positive rate (   More commonly called a p-value  In general, p ≤ 0.05 is the agreed upon level  In other words, the probability that the difference that we observed in our sample occurred by chance is less than 5%  Therefore we can reject the H o

 Stating the Conclusions of our Results  When the p-value is small, we reject the null hypothesis or, equivalently, we accept the alternative hypothesis.  “Small” is defined as a p-value  , where  acceptable false (+) rate (usually 0.05).  When the p-value is not small, we conclude that we cannot reject the null hypothesis or, equivalently, there is not enough evidence to reject the null hypothesis.  “Not small” is defined as a p-value > , where  = acceptable false (+) rate (usually 0.05).

P-value A standard device for reporting quantitative results in research where variability plays a large role. Measures the dissimilarity between two or more sets of measures or between one set of measurements and a standard. “ the probability of obtaining the study results by chance if the null hypothesis is true” “The probability of obtaining the observed value (study results) as extreme as possible”

P-value - continued “ The p-value is actually a probability, normally the probability of getting a result as extreme as or more extreme than the one observed if the dissimilarity is entirely due to variability of measurements or patients response, or to sum up, due to chance alone”. Small p value - the rare event has occurred Large p value - likely event

-1.96 0 Area =.025 Area =.005 Z -2.575 Area =.025 Area =.005 1.96 2.575

 Many biologic variables follow this pattern  Hemoglobin, Cholesterol, Serum Electrolytes, Blood pressures, age, weight, height  One can use this information to define what is normal and what is extreme  In clinical medicine 95% or 2 Standard deviations around the mean is normal  Clinically, 5% of “normal” individuals are labeled as extreme/abnormal We just accept this and move on.

 Symmetrical about mean,   Mean, median, and mode are equal  Total area under the curve above the x-axis is one square unit  1 standard deviation on both sides of the mean includes approximately 68% of the total area  2 standard deviations includes approximately 95%  3 standard deviations includes approximately 99%

 Normal distribution is completely determined by the parameters  and   Different values of  shift the distribution along the x-axis  Different values of  determine degree of flatness or peakedness of the graph

Sample z = x - x s Population z = x - µ  Round to 2 decimal places Measures of Position z score

 The Z score makes it possible, under some circumstances, to compare scores that originally had different units of measurement.

- 3- 2- 101 23 Z Unusual Values Unusual Values Ordinary Values Interpreting Z Scores

1.Test for single mean Whether the sample mean is equal to the predefined population mean ?. Test for difference in means 2. Test for difference in means Whether the CD4 level of patients taking treatment A is equal to CD4 level of patients taking treatment B ? Test for paired observation 3. Test for paired observation Whether the treatment conferred any significant benefit ?

t is a measure of: How difficult is it to believe the null hypothesis? High t Difficult to believe the null hypothesis - accept that there is a real difference. Low t Easy to believe the null hypothesis - have not proved any difference.

 Student ‘s t-test will be used: --- When Sample size is small, for mean values and for the following situations: (1) to compare the single sample mean with the population mean (2) to compare the sample means of two indpendent samples (3) to compare the sample means of paired samples

BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence of a symptom, classification of pregnancy as ‘high risk’ or ‘non-high risk’, the degree of severity of a disease (mild, moderate, severe)

The measure computed in each instance is a proportion, corresponding to the mean in the case of quantitative data such as height, weight, BMI, serum cholesterol. Comparison between two or more proportions, and the test of significance employed for such purposes is called the “Chi-square test”

McNemar’s test Situation: Two paired binary variables that form a particular type of 2 x 2 table e.g. matched case-control study or cross-over trial

When both the study variables and outcome variables are categorical (Qualitative): Apply (i) Chi square test (ii) Fisher’s exact test (Small samples) (iii) Mac nemar’s test ( for paired samples)

Z-test: Study variable: Qualitative Outcome variable: Quantitative or Qualitative Comparison: two means or two proportions Sample size: each group is > 50 Student’s t-test: Study variable: Qualitative Outcome variable: Quantitative Comparison: sample mean with population mean; two means (independent samples); paired samples. Sample size: each group <50 ( can be used even for large sample size)

Chi-square test: Study variable: Qualitative Outcome variable: Qualitative Comparison: two or more proportions Sample size: > 20 Expected frequency: > 5 Fisher’s exact test: Study variable: Qualitative Outcome variable: Qualitative Comparison: two proportions Sample size:< 20 Macnemar’s test: (for paired samples) Study variable: Qualitative Outcome variable: Qualitative Comparison: two proportions Sample size: Any

1.Number of Observations that Are Free to Vary After Sample Statistic Has Been Calculated 2.Example Sum of 3 Numbers Is 6 X 1 = 1 (or Any Number) X 2 = 2 (or Any Number) X 3 = 3 (Cannot Vary) Sum = 6 degrees of freedom = n -1 = 3 -1 = 2

P S Investigation S Sampling P value Confidence intervals!!! Inference Results

Two forms of estimation  Point estimation = single value, e.g., x-bar is unbiased estimator of μ  Interval estimation = range of values  confidence interval (CI). A confidence interval consists of:

Mean, , is unknown PopulationRandom Sample I am 95% confident that  is between 40 & 60. Mean X = 50 Estimation Process Sample

 “ We are 95% sure that the TRUE parameter value is in the 95% confidence interval”  “If we repeated the experiment many many times, 95% of the time the TRUE parameter value would be in the interval”  “the probability that the interval would contain the true parameter value was 0.95.”

CI 90% corresponds to p 0.10 CI 95% corresponds to p 0.05 CI 99% corresponds to p 0.01 Note: p value  only for analytical studies CI  for descriptive and analytical studies

 RR = 5.6 (95% CI = 1.2 ; 23.7)  OR = 12.8 (95% CI = 3.6 ; 44,2)  NNT = 12 (95% CI = 9 ; 26)

If p value <0.05, then 95% CI:  exclude 0 (for difference), because if A=B then A-B = 0  p>0.05  exclude 1 (for ratio), because if A=B then A/B = 1,  p>0.05

--The (im) precision of the estimate is indicated by the width of the confidence interval. --The wider the interval the less precision THE WIDTH OF C.I. DEPENDS ON: ---- SAMPLE SIZE ---- VAIRABILITY ---- DEGREE OF CONFIDENCE

 p values (hypothesis testing) gives you the probability that the result is merely caused by chance or not by chance, it does not give the magnitude and direction of the difference  Confidence interval (estimation) indicates estimate of value in the population given one result in the sample, it gives the magnitude and direction of the difference

Wishing all of you Best of Luck !

The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is.

Similar presentations

Presentation on theme: "The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is.

Similar presentations

Presentation on theme: "The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is."— Presentation transcript:

Similar presentations

About project

Feedback