Statistical basics Marian Scott Dept of Statistics, University of Glasgow August 2008.

Slides:



Advertisements
Similar presentations
Some statistical ideas Marian Scott Statistics, University of Glasgow June 2012.
Advertisements

Statistical basics Marian Scott Dept of Statistics, University of Glasgow August 2010.
Some statistical ideas Marian Scott Statistics, University of Glasgow January 2014.
Sampling and monitoring the environment Marian Scott Sept 2006.
Some statistical basics Marian Scott. Why bother with Statistics We need statistical skills to: Make sense of numerical information, Summarise data, Present.
Some statistical ideas Marian Scott Statistics, University of Glasgow September 2011.
Beginning the Visualization of Data
1 BA 555 Practical Business Analysis Housekeeping Review of Statistics Exploring Data Sampling Distribution of a Statistic Confidence Interval Estimation.
Intro to Statistics for the Behavioral Sciences PSYC 1900
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Chapter 2 Simple Comparative Experiments
The discipline of statistics: Provides methods for organizing and summarizing data and for drawing conclusions based on information contained in data.
Experimental Statistics I.  We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without.
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 8: Quantitative.
Quantitative Skills: Data Analysis
PTP 560 Research Methods Week 8 Thomas Ruediger, PT.
Chapter 15 Data Analysis: Testing for Significant Differences.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Role of Statistics in Geography
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
NA387(3) Lecture 2: Populations, Samples, and Frequency Analysis (Devore, Ch )
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Chapter 21 Basic Statistics.
Planning and Data Collection
Statistics PSY302 Quiz One Spring A _____ places an individual into one of several groups or categories. (p. 4) a. normal curve b. spread c.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Applications of Spatial Statistics in Ecology Introduction.
Medical Statistics as a science
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 6 Putting Statistics to Work.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Lecture VI Statistics. Lecture questions Mathematical statistics Sampling Statistical population and sample Descriptive statistics.
Introduction To Statistics
1 PAUF 610 TA 1 st Discussion. 2 3 Population & Sample Population includes all members of a specified group. (total collection of objects/people studied)
LIS 570 Summarising and presenting data - Univariate analysis.
Statistics Module Statistics Statistics are a powerful tool for finding patterns in data and inferring important connections between events in.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Chapter 0: Why Study Statistics? Chapter 1: An Introduction to Statistics and Statistical Inference 1
1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.
Exploratory data analysis, descriptive measures and sampling or, “How to explore numbers in tables and charts”
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
WELCOME TO BIOSTATISTICS! WELCOME TO BIOSTATISTICS! Course content.
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Prof. Eric A. Suess Chapter 3
Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis also helps distinguish among multiple.
Data Analysis.
Lecture 1 Sections 1.1 – 1.2 Objectives:
Chapter 1 & 3.
Description of Data (Summary and Variability measures)
Introductory Statistical Language
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
Applied Statistical Analysis
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Basic Statistical Terms
Gathering and Organizing Data
Elementary Statistics (Math 145)
Welcome!.
Statistical Data Analysis
Lecture 1: Descriptive Statistics and Exploratory
Statistics PSY302 Review Quiz One Spring 2017
Gathering and Organizing Data
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
Collecting and Interpreting Quantitative Data
Presentation transcript:

Statistical basics Marian Scott Dept of Statistics, University of Glasgow August 2008

What shall we cover? Why might we need some statistical skills Statistical inference- what is it? how to handle variation exploring data probability models inferential tools- hypothesis tests and confidence intervals

Why bother with Statistics We need statistical skills to: Make sense of numerical information, Summarise data, Present results (graphically), Test hypotheses Construct models

statistical language variable- a single aspect of interest population- a large group of individuals sample- a subset of the population parameter- a single number summarising the variable in the population statistic- a single number summarising the variable in the sample

statistical language- Radiation protection- C-14 in fish variable- radiocarbon level (Bq/KgC) population- all fish caught for human consumption in W Scotland sample- 20 fish bought in local markets parameter- population mean C-14 level statistic- sample mean C-14 level

Variables- number and type Univariate: there is one variable of interest measured on the individuals in the sample. We may ask: What is the distribution of results-this may be further resolved into questions concerning the mean or average value of the variable and the scatter or variability in the results?

Bivariate Bivariate two variables of interest are measured on each member of the sample. We may ask : How are the two variables related? If one variable is time, how does the other variable change? How can we model the dependence of one variable on the other?

Multivariate Multivariate many variables of interest are measured on the individuals in the sample, we might ask: What relationships exist between the variables? Is it possible to reduce the number of variables, but still retain 'all' the information? Can we identify any grouping of the individuals on the basis of the variables?

Data types Numerical: a variable may be either continuous or discrete. For a discrete variable, the values taken are whole numbers (e.g. number of invertebrates, numbers of eggs). For a continuous variable, values taken are real numbers (positive or negative and including fractional parts) (e.g. pH, alkalinity, DOC, temperature).

categorical Categorical: a limited number of categories or classes exist, each member of the sample belongs to one and only one of the classes e.g. compliance status is categorical. Compliance is a nominal categorical variable since the categories are unordered. Level of diluent (eg recorded as low, medium,high) would be an ordinal categorical variable since the different classes are ordered

Inference and Statistical Significance Sample Population inference Is the sample representative? Is the population homogeneous? Since only a sample has been taken from the population we cannot be 100% certain Significance testing

what are your objectives? describing a characteristic of interest (usually the average, but could also be the variability or a high percentile), describing spatial patterns of a characteristic,mapping the spatial distribution, quantifying contamination above a background or specified intervention level detecting temporal or spatial trends, assessing environmental impacts of specific facilities, or of events such as accidental releases,

the statistical process A process that allows inferences about properties of a large collection of things (the population) to be made based on observations on a small number of individuals belonging to the population (the sample). The use of valid statistical sampling techniques increases the chance that a set of specimens (the sample, in the collective sense) is collected in a manner that is representative of the population.

Variation soil or sediment samples taken side-by- side, from different parts of the same plant, or from different animals in the same environment, exhibit different activity densities of a given radionuclide. The distribution of values observed will provide an estimate of the variability inherent in the population of samples that, theoretically, could be taken.

Representativeness An essential concept is that the taking of a sufficient number of individual samples should reflect the population. Representativeness of environmental samples is difficult to demonstrate. Usually, representativeness is considered justified by the procedure used to select the samples

What is the population? The population is the set of all items that could be sampled, such as all fish in a lake, all people living in the UK, all trees in a spatially defined forest, or all 20-g soil samples from a field. Appropriate specification of the population includes a description of its spatial extent and perhaps its temporal stability

What are the sampling units? In some cases, sampling units are discrete entities (i.e., animals, trees), but in others, the sampling unit might be investigator-defined, and arbitrarily sized. Example- technetium in shellfish The objective here is to provide a measure (the average) of technetium in shellfish (eg lobsters for human consumption) for the west coast of Scotland. Population is all lobsters on the west coast Sampling unit is an individual animal.

Summarising data- means, medians and other such statistics

plotting data- histograms, boxplots, stem and leaf plots, scatterplots

median lower quartile upper quartile

Preliminary Analysis There is considerable variation –Across different sites –Within the same site across different years Distribution of data is highly skewed with evidence of outliers and in some cases bimodality