Download presentation
Presentation is loading. Please wait.
1
Analysis and Empirical Results
Frequency Distribution, Basic statistical tools Software’s used for analysis
2
What is Statistics? Statistics is the science of collecting, organizing, analyzing, interpreting, and presenting data. A statistic is a single measure (number) used to summarize a sample data set. For example, the average height of students in this class. 5/13/2018 DR. MADHUKAR DALVI
3
Overview of Statistics
Describing Data Making Inferences from Samples Visual Displays Numerical Summaries Estimating Parameters Testing Hypotheses 5/13/2018 DR. MADHUKAR DALVI
4
Internet Usage Data 5/13/2018 DR. MADHUKAR DALVI
Respondent Sex Familiarity Internet Attitude Toward Usage of Internet Number Usage Internet Technology Shopping Banking 5/13/2018 DR. MADHUKAR DALVI
5
Frequency Distribution
In a frequency distribution, one variable is considered at a time. A frequency distribution for a variable produces a table of frequency counts, percentages, and cumulative percentages for all the values associated with that variable. 5/13/2018 DR. MADHUKAR DALVI
6
Frequency Distribution of Familiarity with the Internet
Table 15.2 5/13/2018 DR. MADHUKAR DALVI
7
Frequency Histogram Frequency Familiarity 8 7 6 5 4 3 2 1 2 3 4 5 6 7
2 3 4 5 6 7 Familiarity 5/13/2018 DR. MADHUKAR DALVI
8
Obtaining a histogram in Excel
5/13/2018 DR. MADHUKAR DALVI
9
Frequency Distributions and Histograms
Excel Histograms Specify a range of cells containing the bin limits or accept Excel’s default. 5/13/2018 DR. MADHUKAR DALVI
10
USING MEGASTAT In MegaStat, you can specify the interval width and lower limit of the first interval or accept the default INTERNETA USAGE DATA.xls 5/13/2018 DR. MADHUKAR DALVI
11
Statistics Associated with Frequency Distribution Measures of Location
The mean, or average value, is the most commonly used measure of central tendency. The mean, ,is given by Where, Xi = Observed values of the variable X n = Number of observations (sample size) The mode is the value that occurs most frequently. It represents the highest peak of the distribution. The mode is a good measure of location when the variable is inherently categorical or has otherwise been grouped into categories. X n S = X / n X i i = 1 5/13/2018 DR. MADHUKAR DALVI
12
Statistics Associated with Frequency Distribution Measures of Location
The median of a sample is the middle value when the data are arranged in ascending or descending order. If the number of data points is even, the median is usually estimated as the midpoint between the two middle values – by adding the two middle values and dividing their sum by 2. The median is the 50th percentile. 5/13/2018 DR. MADHUKAR DALVI
13
Statistics Associated with Frequency Distribution Measures of Variability
The range measures the spread of the data. It is simply the difference between the largest and smallest values in the sample. Range = Xlargest – Xsmallest. The interquartile range is the difference between the 75th and 25th percentile. For a set of data points arranged in order of magnitude, the pth percentile is the value that has p% of the data points below it and (100 - p)% above it. 5/13/2018 DR. MADHUKAR DALVI
14
Statistics Associated with Frequency Distribution Measures of Variability
The variance is the mean squared deviation from the mean. The variance can never be negative. The standard deviation is the square root of the variance. The coefficient of variation is the ratio of the standard deviation to the mean expressed as a percentage, and is a unitless measure of relative variability. n 2 ( X - X ) S s = i x n - 1 i = 1 5/13/2018 DR. MADHUKAR DALVI
15
Descriptive Statistics in Excel
Go to Tools | Data Analysis and select Descriptive Statistics 5/13/2018 DR. MADHUKAR DALVI
16
Highlight the data range, specify a cell for the upper-left corner of the output range, check Summary Statistics and click OK. 5/13/2018 DR. MADHUKAR DALVI
17
Here is the resulting analysis.
5/13/2018 DR. MADHUKAR DALVI
18
Descriptive Statistics in MegaStat
5/13/2018 DR. MADHUKAR DALVI
19
Here is the resulting MegaStat analysis:
5/13/2018 DR. MADHUKAR DALVI
20
Statistics Associated with Frequency Distribution Measures of Shape
Skewness. The tendency of the deviations from the mean to be larger in one direction than in the other. It can be thought of as the tendency for one tail of the distribution to be heavier than the other. Kurtosis is a measure of the relative peakedness or flatness of the curve defined by the frequency distribution. The kurtosis of a normal distribution is zero. If the kurtosis is positive, then the distribution is more peaked than a normal distribution. A negative value means that the distribution is flatter than a normal distribution. 5/13/2018 DR. MADHUKAR DALVI
21
Skewness of a Distribution
Figure 15.2 Symmetric Distribution Skewed Distribution Mean Median Mode (a) Mean Median Mode (b) 5/13/2018 DR. MADHUKAR DALVI
22
5/13/2018 DR. MADHUKAR DALVI
23
Line Charts Simple Line Charts
Two-scale line chart – used to compare variables that differ in magnitude or are measured in different units.CellPhones.xls 5/13/2018 DR. MADHUKAR DALVI
24
Scatter Plots A scatter plot shows n pairs of observations as dots (or some other symbol) on an XY graph. A starting point for bivariate data analysis. Allows observations about the relationship between two variables. Answers the question: Is there an association between the two variables and if so, what kind of association? 5/13/2018 DR. MADHUKAR DALVI
25
Scatter Plots A scatter plot shows n pairs of observations as dots (or some other symbol) on an XY graph. A starting point for bivariate data analysis. Allows observations about the relationship between two variables. Answers the question: Is there an association between the two variables and if so, what kind of association? 5/13/2018 DR. MADHUKAR DALVI
26
Select the XY (Scatter) option.
In Excel, highlight the two data columns, then click on the Chart Wizard icon on the toolbar. Select the XY (Scatter) option. 5/13/2018 DR. MADHUKAR DALVI
27
Scatter Plots Making a Scatter Plot in Excel
Click Next and then click the Series tab.BirthRates1.xls Excel assumes that the first column contain X-axis values and the second column contains Y-axis values. Alternatively, you can specify the data range explicitly for each variable. 5/13/2018 DR. MADHUKAR DALVI
28
Effective Excel Charts
Chart Wizard Click on the Chart Wizard icon on the toolbar to open a sequence of pop-up menus to guide you through the steps of creating a chart. Step 1: Select the Chart type and then click Next. 5/13/2018 DR. MADHUKAR DALVI
29
Regression Terminology
Fitting a Regression on a Scatter Plot in Excel 5/13/2018 DR. MADHUKAR DALVI
30
MEGA STAT 5/13/2018 DR. MADHUKAR DALVI
31
5/13/2018 DR. MADHUKAR DALVI
32
5/13/2018 DR. MADHUKAR DALVI
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.