Typical logical expressions used in programs with if statements: If(EXPRESSION) { COMMANDS EXECUTED WHEN TRUE } else { COMMANDS EXECUTED WHEN FALSE } x<-3.

Slides:



Advertisements
Similar presentations
Sampling Distributions
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
THE CENTRAL LIMIT THEOREM
Sampling Distributions (§ )
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Statistics Lecture 20. Last Day…completed 5.1 Today Parts of Section 5.3 and 5.4.
BCOR 1020 Business Statistics
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Inference about a Mean Part II
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
The Lognormal Distribution
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
Objectives (BPS chapter 13) Binomial distributions  The binomial setting and binomial distributions  Binomial distributions in statistical sampling 
Standard error of estimate & Confidence interval.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
3/2003 Rev 1 I – slide 1 of 33 Session I Part I Review of Fundamentals Module 2Basic Physics and Mathematics Used in Radiation Protection.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Bayesian inference review Objective –estimate unknown parameter  based on observations y. Result is given by probability distribution. Bayesian inference.
Vectors in the R-Language R has not a function for this !? Exercise: Write your own function ‘magnitude’ x*y is not a dot-product operation in R.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
AP Statistics Section 9.3A Sample Means. In section 9.2, we found that the sampling distribution of is approximately Normal with _____ and ___________.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
10.1: Confidence Intervals – The Basics. Review Question!!! If the mean and the standard deviation of a continuous random variable that is normally distributed.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Chapter 12 Analysis of Variance. An Overview We know how to test a hypothesis about two population means, but what if we have more than two? Example:
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Tests of Hypotheses Involving Two Populations Tests for the Differences of Means Comparison of two means: and The method of comparison depends on.
Inference for 2 Proportions Mean and Standard Deviation.
Tests dealing with the mean of data samples Tests dealing with the variance of the samples Tests dealing with correlation coefficients Tests dealing with.
Section 9.3: Confidence Interval for a Population Mean.
: Chapter 3: Maximum-Likelihood and Baysian Parameter Estimation 1 Montri Karnjanadecha ac.th/~montri.
Winter-Quiz- Intermezzo On each slide you will find a screenshot from R-studio / or some previous theoretical statistical class material. You can find.
Report Writing. A report should be self-explanatory. It should be capable of being read and understood without reference to the original project description.
1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Topic 5 - Joint distributions and the CLT
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Section Parameter v. Statistic 2 Example 3.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Parameter, Statistic and Random Samples
Inference about the slope parameter and correlation
Two-Sample Hypothesis Testing
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Review Quiz.
Data Analysis.
Statistics PSY302 Review Quiz One Fall 2018
Histograms REVIEWED Histograms are more than just an illustrative summary of the data sample. Typical examples are shown below (in R: see help(hist) for.
The normal distribution
Winter-Quiz- Intermezzo
Sampling Distribution of a Sample Proportion
Statistics PSY302 Review Quiz One Spring 2017
Sampling Distributions (§ )
Sampling Distributions
Presentation transcript:

Typical logical expressions used in programs with if statements: If(EXPRESSION) { COMMANDS EXECUTED WHEN TRUE } else { COMMANDS EXECUTED WHEN FALSE } x<-3 y<-4.0 x==4.0 returns TRUE x==y returns TRUE y==y returns TRUE y==4 may return TRUE Better abs(y-4)< When the numbers are sufficently close, then TRUE

Typical logical expressions used in programs with if statements: If(EXPRESSION) { COMMANDS EXECUTED WHEN TRUE } else { COMMANDS EXECUTED WHEN FALSE } x<0 x>0 x<(-1) # negative numbers must be put in () Logical combinations: Logical AND: (x 3) Logical OR: (x 3) Logical NOT: !(x<y) Strings can also be compared for equivalence month<-”Dec” !(month==“Jan”) returns TRUE Note: Practice the logical operations: Be careful with vectors! Inform yourself about more operators and how they behave with vector objects! The proper use requires experience!

 albany_climatology_snow.R Changes: filename and the variable name (better called ‘object name’) Changes: all data and variables are derived from object ‘snow’: vector ‘time’

 albany_climatology_snow.R Changes: plot() function call: update the y-axis label Changes: ‘res’ must be assigned the monthly mean snow data! it is used below for plotting in the function lines()

 albany_climatology_snow.R Changes: use the year information from object snow Changes:mhelp is a new object thatstores the months data, but only for the selected years within our climatological period

 albany_climatology_snow.R Changes: buffer stores the monthly mean snow data of the climatological period Changes: snowclim A new object to calculate the seasonal cycle (climatological mean).

 albany_climatology_snow.R Changes:plot() function call with adjusted y-axis label string Changes: snowclim object to calculate the seasonal cycle (climatological mean). Changes: lines() plots values of snowclim

 albany_climatology_snow.R Outlier, Erroneous data or record snow?

 albany_climatology_snow.R 30-yr climatology No snow between May-September (but in previous years May had snow!)

 Statistical estimates are attempts to quantify the underlying true properties of the sample population  Random samples are incomplete description of the full population  The mean of a sample is an estimate of the true mean  The variance is also only an estimate of the true variance of the population  (Any other estimates, of course too)

 We have seen that the sample mean is the arithmetic mean of the observations Albany Airport Monthly mean temperature anomalies from the 30-yr climatology: Sample size: 360 Mean: 0 C (degree Celsius) Standard Deviation = C

 We have seen that the sample mean is the arithmetic mean of the observations New incoming data: anomaly with respect to (w.r.t.) previous estimated mean With larger sample size each individual sample becomes less influential for updating the mean and it converges to the true mean (for Independent Identically Distributed (IID) samples) ‘online algorithm’

Created with R-program scripts/online_average.R (needs data/USW _tavg_mon_mean_ano.csv

 Histograms give an overview of the sample distribution: mean and standard deviation describe only in parts of the character of sample distributions (we will learn about the skewness and tails of distributions in this course) Albany Airport Monthly mean temperature anomalies from the 30-yr climatology: Sample size: 360 Mean: 0 C (degree Celsius) Standard Deviation = C

 We have seen that the sample mean is the arithmetic mean of the observations Albany Airport Monthly mean temperature anomalies from the 30-yr climatology: Sample size: 360 Mean: 0 C (degree Celsius) Standard Deviation = C

 Estimate an unknown mean of the random process (a “population mean”):  we only have a sample with limited number of observations  Sample is drawn randomly from the population  The larger the sample size the better the estimate  That is if we repeated an experiment several times each time with new samples of size n, then the variance among the estimated means will decrease the larger the sample size n.