Biostatistics A foundation for analysis in the health science

Slides:



Advertisements
Similar presentations
Lectures of Stat -145 (Biostatistics)
Advertisements

Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Chapter 3 Description of qualitative variable. Quantitative variable Qualitative variable Multiple categorical variable Ordinal variable Binary variable.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
Areej Jouhar & Hafsa El-Zain Biostatistics BIOS 101 Foundation year.
Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 1
POPULATION SURVEYS Evaluation the health status of a population (community diagnosis). Evaluation the health status of a population (community diagnosis).
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Postgraduate books recommended by Degree Management and Postgraduate Education Bureau, Ministry of Education Medical Statistics (the 2nd edition) 孙振球 主.
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Introduction to Biostatistics Lecture 1. Biostatistics Definition: – The application of statistics to biological sciences Is the science which deals with.
TYPES OF DATA Prof. Dr. Hamit ACEMOĞLU. The aim By the end of this lecture, students will be avare of types of data.
Statistics & Evidence-Based Practice
Elementary Statistics
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Statistics Statistics is that field of science concerned with the collection, organization, presentation, and summarization of data, and the drawing of.
Descriptive statistics (2)
What is Statistics? Introduction 1.
INTRODUCTION AND DEFINITIONS
Pharmaceutical Statistics
INTRODUCTION AND DEFINITIONS
Statistics in Management
Week one Introduction to Statistics Chs 221 Dr. wajed Hatamleh
Chapter(1) The Nature of Probability and Statistics
I. Introduction to statistics
Introduction to Statistics
Statistical tests for quantitative variables
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Introduction to Statistics
Medical Statistic.
Math 145 January 23, 2007.
Elementary Statistics MOREHEAD STATE UNIVERSITY
Chapter 5 STATISTICS (PART 1).
Basic Statistics Overview
statistics Specific number
Chapter 2 Describing Data: Graphs and Tables
Understandable Statistics
Statistical Techniques in Business & Economics
What Is Statistics Chapter 1.
Introduction Chapter 1.
Introduction to Statistics
Introduction to Statistics
The Nature of Probability and Statistics
Introduction to Statistics
Gathering and Organizing Data
statistics Specific number
The Nature of Probability and Statistics
Chapter 1 The Where, Why, and How of Data Collection
Biostatistics College of Medicine University of Malawi 2011.
Statistics Workshop Tutorial 1
Elementary Statistics MOREHEAD STATE UNIVERSITY
Elementary Statistics (Math 145)
Chapter 1 The Where, Why, and How of Data Collection
Introduction to Statistics
Statistical Data Analysis
Introduction to Business Statistics
The Nature of Probability and Statistics
15.1 The Role of Statistics in the Research Process
Gathering and Organizing Data
Displaying Data – Charts & Graphs
Math 145 September 5, 2007.
Statistical Techniques in Business & Economics
Statistical Techniques in Business & Economics
Applied Biostatistics
The Where, Why, and How of Data Collection
Introduction to Statistics
Chapter 1 The Where, Why, and How of Data Collection
Presentation transcript:

Biostatistics A foundation for analysis in the health science Yongli YANG Ph.D, Associate Professor Department of Biostatistics & Epidemiology, college of public health TEL: 67781249 E-mail: ylyang377@126.com

Statistics in life GDP in China increased 7.7% in 2013 from the report of State Statistical Bureau. Life expectancy is 74.83 year in 6th population census Weather forecast in Zhengzhou

week Theory course content 8 introduction 9 Description of quantitative variable 10 Description of qualitative variable . Statistical table and graph Exercise: statistical description 11 Normal distribution Sampling error and sampling distribution 12 The principle of hypothesis test t test 13 One-way analysis of variance Nonparametric test 14 Exercise: t test and ANOVA Chi-square test 15 Simple linear correlation analysis Simple linear regression analysis 16 Exercise: Chi-square test. Correlation and regression analysis

Chapter I introduction to biostatistics Some basic concepts Basic step of statistical work Review questions and exercises

Basic step of statistical work Be familiar with The definition: statistics and biostatistics Understand The definition: population, sample, probability, quantitative variable, qualitative variable Master

introduction We are frequently reminded of the fact that we are living in the information age. Appropriately, then, this subject is about information—how it is obtained, how it is analyzed, and how it is interpreted. The information about which we are concerned are called data, and the data are available to us in the form of numbers.

Question 1 We aim to explore whether smoking is harmful to your health. How to explore? Lung cancer, Heart disease, Other diseases?

smoking Lung cancer b c d a/(a+b) c/(c+d) compare conclusion non- Lung cancer b c d a/(a+b) c/(c+d) compare conclusion no lung cancer a

Smoking group Non-smoking group

Question 2 It is obvious that generally men are taller than women, while some other women are taller than men. Therefore, if you wanted to ‘prove’ that men were taller, you should measure many people of each sex. How many people should you measure?

Question 3 A doctor used a new drug to cure 5 AIDS patients. 4 of them are cured. Conclusion: The cured rate of this drug was 80%. ? Is his conclusion right? Why or why not?

A knowledge of statistics is like a knowledge of foreign languages or of algebra; it may prove of use at any time under any circumstances. A.L. Bowley

Some basic concepts Data Statistics and biostatistics Population and sample Variable Parameter and Statistic Probability

Data Definition: The raw material of statistics is data. For our purses we define data as numbers. Sources of data: Routinely kept records Surveys Experiments External sources

Data Routinely kept records. Hospitals keep day-to-day records, which contain immense amounts of information on patients. When the need for data arises, we should look for them first among routinely kept records.

Data Surveys If the data needed to answer a question are not available from routinely kept records, then logical source may be a survey. For example, the administration of the health department want to learn the numbers of hypertension in Zhengzhou, we may conduct a survey.

Data Experiments Frequently the data needed to answer a question are available only as the result of an experiment. For example, a nurse wish to know which of several strategies is best for maximizing patient compliance.

Data External sources The data needed to answer a question may already exist in the form of published reports. For example, statistical yearbook, population census……

statistics A science dealing with the collection, analysis, interpretation and presentation of masses of numerical data ----Webster’s international dictionary

statistics The science and art of dealing with variation in data through collection, classification and analysis in such a way as to obtain reliable results. —— John M. Last —— A Dictionary of Epidemiology

National economic statistics The tools of statistics are employed in many fields—demography, national economic, psychology, medicine…… Demographics National economic statistics Psychological statistics Biostatistics ……

Biostatistics When the data analyzed are derived from the biological sciences and medicine, we use the term “biostatistics” to distinguish this particular application of statistical tools and concepts.

Population and sample We want to learn the average income of Beijing doctors in 2010. Suppose there are 20,000 doctors in Beijing in 2010. To investigate all the doctors one by one (But it is consuming-time ) 500 are drawn from which randomly. Then generalize the population average income from the incomes of 500 doctors.

Population and sample Questions What is study aim? What is study population? What is our observational unit? What is sample? What is sample size?

Population and sample Answers To learn the average income of Beijing doctors in 2010 20,000 doctors’ income Individual 500 doctors’ income 500

Population and sample population Definition:Population is the largest collection of entities for which we have an interest at a particular time. For example, we are interested in the weights of all the children enrolled in a certain country elementary school system, our population consists of all these weights.

Population and sample population Population may be finite or infinite. If a population of values consists of a fixed number of these values, the population is said to be finite. If, on the other hand, a population consists of an endless succession of values, the population in an infinite one.

Population and sample Sample Definition: A sample is a random part of population. Suppose our population consists of the weights of all the elementary school children enrolled in a certain country school system. If we collect for analysis the weights of only a fraction of these children, we have only a part of our population of weights, that is, we have a sample.

Population and sample How to get a random part of population? Simple random sampling Systematic sampling Stratified sampling Cluster sampling

If a sample of size n is drawn from a population of size N in such a way that every possible sample of size n has the same chance of being selected, the sample is called a simple random sample 1 9 2 3 4 5 6 7 8 10 17 16 15 13 14 12 11 Sample

Variable If we observe a characteristic, we find that it takes on different values in different persons, places, or things, we label the characteristic a variable. Examples: heart rate, the heights of adult males, diastolic blood pressure, gender, blood type,treatment effect

Variable Binary Multiple categorical Ordinal Quantitative variable Qualitative Multiple categorical Ordinal Binary

Variable quantitative variable: also known as metric, or numerical is one that can be measured in the usual sense convey information regarding amount example:the weights of preschool children, diastolic blood pressure

Variable qualitative variable also known as categorical or nominal is one that can not be measured in the usual sense,only can be categorized convey information regarding attribute

Variable Binary variable: gender, live or death, yes or no. Multiple categorical variable blood types race A, B, AB, O white, black, yellow, brown Ordinal variable: there is an order in the categories Your opinion on something: unsatisfactory, normal, very satisfactory

Variable ID age gender Educational level occupation height weight 2025655 27 male graduate teacher 165 71.5 2025653 22 undergraduate doctor 160 74 2025830 25 female junior high school worker 158 68 2022543 23 senor high school students 161 69 2022466 159 62 2024535 elementary farmer 157 2025834 20 cadre 66 2019464 24 70.5 2025783 29 154 57

Data transformation Variable Numerical variable weight (kg) Ranked variable binary variable weight (kg) fat or overweight normal thin abnormal

quantitative variable qualitative variable example:WBC(1/m3)count of five persons: 3000 6000 5000 8000 12000 lower normal normal normal higher Binary variable : normal 3 persons; abnormal 2 persons Ordinal variable: lower 1 person normal 3 persons higher 1 person

Parameter and Statistic describe the characteristic of population. usually presented by Greek letter,such as μ. Usually unknown

Parameter and Statistic describe the characteristic of a sample usually presented by Latin letter,such as s and p.

Probability the possibility of occurrence of a random event. designated as P 0≤P≤1 P=0 impossible event P=1 certain event P≤0.05 small probability event Certain Impossible

Probability random event: The event may occur or may not occur in one experiment. Before one experiment, nobody is sure whether the event occurs or not. Throw the dice

Probability Frequency of an event------the number of times the event occurs in a sequence of repetition of the random phenomenon. Probability of an event----if in a long sequence of repetition, the relative frequency of an event approached a fixed number, that number is the probability of the event .

Probability Relative frequency 1.00 0.00 0.25 0.50 0.75 25 50 75 100 25 50 75 100 125

Probability n P=f=m/n ∝ The relationship between relative frequency and probability →Probability is the limit of frequency n P=f=m/n ∝

Examples of small probability event: Probability of traffic accident Serious adverse events happened after injecting hepatitis b vaccine Winning the lottery

Ⅲ Basic step of statistical work 4 Analysis of data 3 Sorting of data 2 Collection of data 1 Design

1 Design Professional design Study aim Study subject measures Statistical design Sampling method Allocation method Calculation of sample size Data processing What are you going to do?

2 Collection of data Source of data Routinely kept records Surveys Experiments External sources Principle:in time, accurate, complete

Checking: outlier, missing value, 3 Sorting of data Checking: outlier, missing value, Coding: Blood type A(1), B(2), AB(3), O(4); gender male(1), female(2) Grouping: DBP SBP Computing: weight height hypotension normal hypertension Body mass index

4 Analysis of data Statistical analysis is divided into two parts: descriptive statistics and inferential statistics

To teach the student to organize and summarize data Statistical description inference indicator Table and chart Parameter estimation Hypothesis testing analysis To teach the student how to reach decisions about a large body of data by examining only a small part of the data

Review questions and exercises Define: Quantitative variable Qualitative variable Population Sample probability

Review questions and exercises Explain the type of the following variables: Admitting diagnosis in a mental health clinic Weights of babies born in hospital during a year Gender of babies born in hospital during a year Under-arm temperature of patients with fever

Thank you