Basics of Statistical Analysis

Slides:



Advertisements
Similar presentations
Inference and Confidence Intervals. Outline Inferring a population mean: Constructing confidence intervals Examining the difference between two means.
Advertisements

The Independent- Samples t Test Chapter 11. Independent Samples t-Test >Used to compare two means in a between-groups design (i.e., each participant is.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Standard error of estimate & Confidence interval.
Chapter 6: Sampling Distributions
Confidence Intervals. Estimating the difference due to error that we can expect between sample statistics and the population parameter.
6 - 1 Basic Univariate Statistics Chapter Basic Statistics A statistic is a number, computed from sample data, such as a mean or variance. The.
Basics of Statistical Analysis. Basics of Analysis The process of data analysis Example 1: –Gift Catalog Marketer –Mails 4 times a year to its customers.
Relationships between Variables. Two variables are related if they move together in some way Relationship between two variables can be strong, weak or.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
By C. Kohn Waterford Agricultural Sciences.   A major concern in science is proving that what we have observed would occur again if we repeated the.
Statistics Numerical Representation of Data Part 2 – Measure of Variation.
© 2007 Prentice Hall16-1 Some Preliminaries. © 2007 Prentice Hall16-2 Basics of Analysis The process of data analysis Example 1: Gift Catalog Marketer.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
TEKS (6.10) Probability and statistics. The student uses statistical representations to analyze data. The student is expected to: (B) identify mean (using.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Basics of Statistical Analysis. Basics of Analysis The process of data analysis Example 1: –Gift Catalog Marketer –Mails 4 times a year to its customers.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Summary A confidence interval for the population mean, is constructed using the formula: sample mean ± z multiplied by σ/√n where σ is the population.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Measures of Dispersion Section 4.3. The case of Fred and Barney at the bowling alley Fred and Barney are at the bowling alley and they want to know who’s.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
© 2007 Prentice Hall16-1 Some Preliminaries. © 2007 Prentice Hall16-2 Basics of Analysis The process of data analysis Example 1: Gift Catalog Marketer.
Variability GrowingKnowing.com © 2011 GrowingKnowing.com © 2011.
Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis also helps distinguish among multiple.
Chapter 6: Sampling Distributions
And distribution of sample means
Inference about the slope parameter and correlation
Chapter 7 Review.
Bell Work
Some Preliminaries © 2007 Prentice Hall.
Measures of Dispersion
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Objectives The student will be able to:
Chapter 6: Sampling Distributions
PCB 3043L - General Ecology Data Analysis.
Summary descriptive statistics: means and standard deviations:
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
By C. Kohn Waterford Agricultural Sciences
BA 275 Quantitative Business Methods
A 95% confidence interval for the mean, μ, of a population is (13, 20)
Regression Computer Print Out
Inferential Statistics and Probability a Holistic Approach
Descriptive and inferential statistics. Confidence interval
Inference for Proportions
NUMERICAL DESCRIPTIVE MEASURES (Part B)
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Summary descriptive statistics: means and standard deviations:
Using Statistics in Biology
Statistical Process Control
Using Statistics in Biology
Statistics in Biology.
Random Variables 7.1.
Mean, Median, Mode The Mean is the simple average of the data values. Most appropriate for symmetric data. The Median is the middle value. It’s best.
Confidence intervals for the difference between two means: Independent samples Section 10.1.
Describing Data: Numerical Measures
Describing Data: Numerical Measures
GENERALIZATION OF RESULTS OF A SAMPLE OVER POPULATION
Interval Estimation Download this presentation.
Lecture 4 Psyc 300A.
MGS 3100 Business Analysis Regression Feb 18, 2016
Basics of Statistical Analysis
Business Statistics For Contemporary Decision Making 9th Edition
Presentation transcript:

Basics of Statistical Analysis

Basics of Analysis The process of data analysis Example 1: Gift Catalog Marketer Mails 4 times a year to its customers Company has I million customers on its file Observation Data Information Analysis Encode

Example 1 Cataloger would like to know if new customers buy more than old customers? Classify New Customers as anyone who brought within the last twelve months. Analyst takes a sample of 100,000 customers and notices the following.

Example 1 5000 orders received in the last month 3000 (60%) were from new customers 2000 (40%) were from old customers So it looks like the new customers are doing better

Example 1 Is there any Catch here!!!!! Data at this gross level, has no discrimination between customers within either group. A customer who bought within the last 11 days is treated exactly similar to a customer who bought within the last 11 months.

Example 1 Can we use some other variable to distinguish between old and new Customers? Answer: Actual Dollars spent ! What can we do with this variable? Find its Mean and Variation. We might find that the average purchase amount for old customers is two or three times larger than the average among new customers

Numerical Summaries of data The two basic concepts are the Center and the Spread of the data Center of data - Mean, which is given by - Median - Mode

Numerical Summaries of data Forms of Variation Sum of differences about the mean: Variance: Standard Deviation: Square Root of Variance

Confidence Intervals In catalog eg, analyst wants to know average purchase amount of customers He draws two samples of 75 customers each and finds the means to be $68 and $122 Since difference is large, he draws another 38 samples of 75 each The mean of means of the 40 samples turns out to be $ 94.85 How confident should he be of this mean of means?

Confidence Intervals Analyst calculates the standard deviation of sample means, called Standard Error (SE) Basic Premise for confidence Intervals 95 percent of the time the true mean purchase amount lies between plus or minus 1.96 standard errors from the mean of the sample means. C.I. = Mean (+or-) (1.96) * Standard Error

Confidence Intervals However, if CI is calculated with only one sample then Standard Error of sample mean = Standard deviation of sample Basic Premise for confidence Intervals with one sample 95 percent of the time the true mean lies between plus or minus 1.96 standard errors from the sample means.

Example 2: Confidence Intervals for response rates You are the marketing analyst for Online Apparel Company You want to run a promotion for all customers on your database In the past you have run many such promotions Historically you needed a 4.5% response for the promotions to break-even You want to test the viability of the current full-scale promotion by running a small test promotion

Example 2: Confidence Intervals for response rates Test 1,000 names selected at random from the full list. You construct CI based on required rate of 4.5% and n=1000 Confidence Interval= Expected Response ± 1.96*SE The SE=.00655, and CI is (.0322, .0578) In our case C.I. = 3.22 % to 5.78%. Thus any response between 3.22 and 5.78 % supports hypothesis that true response rate is 4.5% © 2007 Prentice Hall

Example 2: Confidence Intervals for response rates The list is mailed and actually pulls in 3.5% Thus, the true response rate maybe 4.5% What if the actual rate pulled in were 5% ? Regression towards mean: Phenomenon of test result being different from true result Give more thought to lists whose cutoff rates lie within confidence interval