Stat 565- Lecture 0 Introduction and Map of this Class.

Slides:



Advertisements
Similar presentations
A PowerPoint®-based guide to assist in choosing the suitable statistical test. NOTE: This presentation has the main purpose to assist researchers and students.
Advertisements

Non-Parametric Statistics
Kruskal Wallis and the Friedman Test.
Process Improvement in Healthcare: Volunteer Clinic Case Study Nonparametric Statistics ISE 491 Fall 2009 Dr. Joan Burtner Associate Professor, Department.
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Regression and Correlation
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
1 Test of significance for small samples Javier Cabrera.
Chapter 19 Data Analysis Overview
Lecture 23 Multiple Regression (Sections )
Introduction to research Analyzing and interpreting research data
Lab 2 instruction.  a collection of statistical methods to compare several groups according to their means on a quantitative response variable  One-Way.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Copyright (c) Bani K. mallick1 STAT 651 Lecture #14.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 18-1 Chapter 18 Data Analysis Overview Statistics for Managers using Microsoft Excel.
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Statistical Analysis KSE966/986 Seminar Uichin Lee Oct. 19, 2012.
Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.
Statistics 501 Methods of Applies Statistics Using MINITAB Fall 2012 TuTh 11:15AM-12:30PM Rm219 Professor H. K. Hsieh (“Shay”)
Statistical Methods II
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
STATISTICAL TECHNIQUES FOR research ROMMEL S. DE GRACIA ROMMEL S. DE GRACIA SEPS for PLANNING & RESEARCH SEPS for PLANNING & RESEARCH.
Choosing and using statistics to test ecological hypotheses
Essential Statistics in Biology: Getting the Numbers Right
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
Parametric & Non-parametric Parametric Non-Parametric  A parameter to compare Mean, S.D.  Normal Distribution & Homogeneity  No parameter is compared.
 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.
Nonparametric Statistical Methods: Overview and Examples ETM 568 ISE 468 Spring 2015 Dr. Joan Burtner.
Research Methods in Human-Computer Interaction
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
Nonparametric Hypothesis tests The approach to explore the small-sized sample and the unspecified population.
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
A A R H U S U N I V E R S I T E T Faculty of Agricultural Sciences Introduction to analysis of microarray data David Edwards.
Nonparametrics.Zip (a compressed version of nonparamtrics) Tom Hettmansperger Department of Statistics, Penn State University References: 1.Higgins (2004)
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Academic Research Academic Research Dr Kishor Bhanushali M
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Analyzing and Interpreting Quantitative Data
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
Data Science and Big Data Analytics Chap 3: Data Analytics Using R
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Course Outline Presentation Reference Course Outline for MTS-202 (Statistical Inference) Fall-2009 Dated: 27 th August 2009 Course Supervisor(s): Mr. Ahmed.
Soc 3306a Lecture 7: Inference and Hypothesis Testing T-tests and ANOVA.
Value Stream Management for Lean Healthcare ISE 491 Fall 2009 Data Analysis - Lecture 7.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Nonparametric Statistics - Dependent Samples How do we test differences from matched pairs of measurement data? If the differences are normally distributed,
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Data Workshop H397. Data Cleaning  Inputting data  Missing Values  Converting String Variables  Creating Scales  Creating Dummy Variables.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Appendix I A Refresher on some Statistical Terms and Tests.
Lecture 15 Wrap up of class. What we intended to do and what we have done: Topics: What is the Biological Problem at hand? Types of data: micro-array,
MARCH 18, 2014 DATA ANALYSIS. WHAT TO DO WITH DATA Take a look at your data Histogram Descriptive statistics Mean, mode, range, standard deviation/standard.
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
Nonparametric tests, χ², logistic analysis
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
Comparing ≥ 3 Groups Analysis of Biological Data/Biometrics
Y - Tests Type Based on Response and Measure Variable Data
Part Three. Data Analysis
SA3202 Statistical Methods for Social Sciences
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Nonparametrics.Zip (a compressed version of nonparamtrics) Tom Hettmansperger Department of Statistics, Penn State University References: Higgins (2004)
Non – Parametric Test Dr. Anshul Singh Thapa.
Introductory Statistics
Presentation transcript:

Stat 565- Lecture 0 Introduction and Map of this Class

What are we trying to do: The main purpose of this class is to get: – Statisticians a flair for the specific issues in genomic data – Biologist an idea about how/why we analyze data the way we do – Computer scientist an idea about the biological (very briefly) and statistical issues with this data. Bottom Line: is to allow three groups of people to talk to each other.

My role versus your role in the class My role: to pose the problems and try to teach the logic behind the statistical procedures.

Your role: – If you are a Statistician: talk to your classmates about the Statistics. Some of the Statistics will be very familiar to you but NOT to your class mates. – If you are a biologist: to explain to me and the class what some of the issues are. Specific problems like hybridization, PCR and ideas that you are familiar with but not all your class-mates – If you are a computer scientist: to explain what some of the challenges are computationally and how we address them.

Statistical terms and topics we will use Descriptive Statistics Hypothesis Testing Multivariate Statistics Non-parametric Statistics Bayesian Statistics Design of Experiments: Optimal Design

Descriptive Statistics Terms we will see and use: – Histograms and Shapes – Boxplots – Scatter Plots – Mean, Median – Standard Deviation, Quartiles, Quantiles – Coefficient of Variation – Distribution plots: normal qq plots etc.

Hypothesis Testing Hypothesis Type I and Type II errors T-test F-test for ANOVA Chi-squares P-values Multiplicity (simultaneous testing/multiple comparison) – Error control, family-wise error rates, FWER, Bonferroni, FDR – Single step vs sequential methods

Multivariate Statistics EDA: Exploratory Data Analysis Cluster Analysis (hierarchical, non-hierarchical, distance metrics, types of clustering) Principal Components (Idea behind this) Discriminant Analysis Supervised vs Unsupervised Learning

Non-parametric Statistics How do non-parametric tests work in general Sign Test Wilcoxon Signed Rank Test Wilcoxon Rank Sum test (Mann Whitney Test) Tukey’s biweight algorithm Kruskal Wallis test

Bayesian Statistics How these work Empirical Bayes Methods Moderated t or F test

Design of Experiments Why design? Block designs Criteria for determining optimality Dye-swaps, block designs, loop designs

Structure of the Class We will use the basic definition of Statistics to define the structure of the class: Statistics comprises of methods for collecting, compiling, describing, analyzing and inferring from data.

Our steps We talk about the experiment that generates this data Specific nuances to the data collection, design issues, systematic effects Leads us to Normalization and issues therein Type of Data – description and compilation Analyze data for overall effects (Clustering etc) Inferring from data (hypothesis testing) Overall process

We will use R R is free-ware and we can access it readily. I will use this mainly in class. The version I have is Choose your computer and operating system and Download and Install R. The Binary Versions are the fastest. Also Install the packages as many as you can. It will ask for a CRAN site or mirror that’s close to you. I always use USA(WA) as my CRAN site. Will give you SAS code as well if you are interested.