Two Concepts of Probability

Slides:



Advertisements
Similar presentations
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Advertisements

Describing Data: Percentiles
Cumulative Frequency A cumulative frequency polygon shows how the cumulative frequency changes as the data values increase. The data is shown on a continuous.
Lecture 2 Summarizing the Sample. WARNING: Today’s lecture may bore some of you… It’s (sort of) not my fault…I’m required to teach you about what we’re.
Measures of Dispersion
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
1 Empirical and probability distributions 0.4 exploratory data analysis.
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
BOX PLOTS/QUARTILES. QUARTILES: 3 points in a set of data that separate the set into 4 equal parts. Lower Quartile: Q1 (The median for the lower half.
CHAPTER 39 Cumulative Frequency. Cumulative Frequency Tables The cumulative frequency is the running total of the frequency up to the end of each class.
(c) 2007 IUPUI SPEA K300 (4392) Outline: Numerical Methods Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion.
Drawing and comparing Box and Whisker diagrams (Box plots)
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Objectives 1.2 Describing distributions with numbers
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Displaying and Exploring Data Unit 1: One Variable Statistics CCSS: N-Q (1-3);
1 Further Maths Chapter 2 Summarising Numerical Data.
Engineering Statistics KANCHALA SUDTACHAT. Statistics  Deals with  Collection  Presentation  Analysis and use of data to make decision  Solve problems.
1 Chapter 2 Bivariate Data A set of data that contains information on two variables. Multivariate A set of data that contains information on more than.
1 WHY WE USE EXPLORATORY DATA ANALYSIS DATA YES NO ESTIMATES BASED ON NORMAL DISTRIB. KURTOSIS, SKEWNESS TRANSFORMATIONS QUANTILE (ROBUST) ESTIMATES OUTLIERS.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
By Christy Quattrone Click to View Types of Graphs Data Analysis, Grade 5.
StatisticsStatistics Unit 5. Example 2 We reviewed the three Measures of Central Tendency: Mean, Median, and Mode. We also looked at one Measure of Dispersion.
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
Normalizing Transformations and fitting a marginal distribution
Methods for Describing Sets of Data
Concepts in Probability, Statistics and Stochastic Modeling
Chapter 16: Exploratory data analysis: numerical summaries
BAE 6520 Applied Environmental Statistics
Get out your notes we previously took on Box and Whisker Plots.
CEE Water and Environmental Seminar
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
BAE 5333 Applied Water Resources Statistics
Engineering Probability and Statistics - SE-205 -Chap 6
Add a 3rd column and do a running total of the frequency column
Support for English, maths and ESOL Module 7: Developing the personal maths skills of teachers and assessors Master class 4: Handling data.
Statistics Unit Test Review
Chapter 16: Exploratory data analysis: Numerical summaries
Descriptive Statistics
Chapter 6 ENGR 201: Statistics for Engineers
Statistical Reasoning
Description of Data (Summary and Variability measures)
Warm – Up 1. Find the IQR and Variance of the following data set. Are there any outliers in the data set? How do you know? Grades: 86, 78, 97, 96, 89,
Alafia river: Autocorrelation Autocorrelation of standardized flow.
Measures of Central Tendency (center) Measure of Variability (spread)
Numerical Measures: Skewness and Location
DS4 Interpreting Sets of Data
Friday Lesson Do Over Statistics is all about data. There is a story to be uncovered behind the data--a story with characters, plots and problems. The.
Shape of Distributions
Exploratory data analysis: numerical summaries
Continuous Statistical Distributions: A Practical Guide for Detection, Description and Sense Making Unit 3.
Describing a Skewed Distribution Numerically
Box & Whiskers Plots AQR.
Boxplots.
Descriptive Statistics
Ten things about Descriptive Statistics
Box-And-Whisker Plots
Review on Modelling Process Quality
Box and Whisker Plots.
Statistics Vocab Notes
Cumulative Frequency and Box Plots
Descriptive Statistics Civil and Environmental Engineering Dept.
Presentation transcript:

Two Concepts of Probability Statistical Relative frequency in repeated experiments Inductive Subjective Based on incomplete information, judgment and logical reasoning Bayesian

Line Diagram From Kottegoda and Rosso, 1997 p3

Dot diagram From Kottegoda and Rosso, 1997 p4

Histogram of minimum annual flow in the Po river between 1918 and 1978

Minimum annual flow in the Po river between 1918 and 1978 Alternative histogram axis scaling - Relative Frequency - Density

Po River, Minimum annual flow cumulative relative frequency (number of values ≤ n)/n (KR p 8) qs=sort(q) n=length(q) crf=(0:(n-1))/n plot(qs,crf)

Po River, Minimum annual flow Quantile plot (Q-plot) qs=sort(q) n=length(q) crf=(0:(n-1))/n plot(crf,qs) 75% Quantile or quartile Interquartile range IQR Median 25% Quantile or quartile

Quantile Definition pi p qi x A quantile qi is the random variable value associated with a specific cumulative probability pi

Numerical Quantities

Helsel and Hirsch page 21

Box (Red Lines) enclose 50% of the values Time Series Box Plot Median Box (Red Lines) enclose 50% of the values

Box Plot Outliers: beyond 1.5*IQR Whiskers: 1.5*IQR or largest value Box: 25th %tile to 75th %tile Line: Median (50th %tile) - not the mean Note: The range shown by the box is called the “Inter-Quartile Range” or IQR. This is a robust measure of spread. It is insensitive to outliers since it is based purely on the rank of the values.

Horizontal Line is the mean Seasonality of Flow “Monthly Subseries Plot” - time series for each month Outliers Compare change in mean and median between Aug-Sep. Note Skew in September Flow (cfs) Horizontal Line is the mean Box Plots Flow (cfs)

Scatter Plot - Flow v. Water Level ASK How good is this relationship? Is it linear? What would you do next?

Multiple Scatterplots Flow = f(Pumping) Causality? Co-effect? OR Pumping = f(Flow) Water Level = f(Pumping) Logical relationship

Scatterplot - between raw x and y data Q-Q plot - between sorted x and y data Compares individual X and Y values Compares the distributions of X and Y

Quantiles to compare to theoretical distribution Rank the data Theoretical distribution, e.g. Standard Normal x1 x2 x3 . xn pi qi qi is the distribution specific theoretical quantile associated with ranked data value xi

Quantile-Quantile Plots QQ-plot for Raw Flows QQ-plot for Log-Transformed Flows ln(xi) qi xi qi Used as a basis for finding transformation to make the Raw flows Normally distributed.

Quantile plots and Probability Plots Q-Q Plot Probability Plot Theoretical quantile axis is relabeled with corresponding probability values