Influenza Neuraminidase Inhibitor IC 50 Data: Calculation, Interpretation and Statistical Analyses.

Slides:



Advertisements
Similar presentations
Statistical Reasoning for everyday life
Advertisements

Descriptive Measures MARE 250 Dr. Jason Turner.
Measures of Dispersion
Introduction Data sets can be compared and interpreted in the context of the problem. Data values that are much greater than or much less than the rest.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter In Chapter 3… … we used stemplots to look at shape, central location, and spread of a distribution. In this chapter we use numerical summaries.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
NI Assays: Troubleshooting & Analysis of Curve Fitting Graph 1: Standard clinical isolate Good NA activity S shaped curve (observed points) IC50 in expected.
Understanding and Comparing Distributions
Statistics: Use Graphs to Show Data Box Plots.
Starter 1.Find the median of Find the median of Calculate the range of Calculate the mode.
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
The Five-Number Summary And Boxplots. Chapter 3 – Section 5 ●Learning objectives  Compute the five-number summary  Draw and interpret boxplots 1 2.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Box and Whisker Plots and Quartiles Sixth Grade. Five Statistical Summary When describing a set of data we have seen that we can use measures such as.
Describing Data: Numerical
Numerical Descriptive Measures
Enter these data into your calculator!!!
Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Objectives 1.2 Describing distributions with numbers
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Copyright © 2009 Pearson Education, Inc. Chapter 5 Understanding and Comparing Distributions.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Variation This presentation should be read by students at home to be able to solve problems.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Interpretation of Raw Data: NA Activity and IC50 Assays Angie Lackenby.
Box and Whisker Plots Measures of Central Tendency.
Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 5 – Slide 1 of 21 Chapter 3 Section 5 The Five-Number Summary And Boxplots.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Chapter 6: Interpreting the Measures of Variability.
Using Measures of Position (rather than value) to Describe Spread? 1.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chapter 4 Measures of Central Tendency Measures of Variation Measures of Position Dot Plots Stem-and-Leaf Histograms.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Updated Force Measurement Statistical Tool Jason Beardsley and Bob Fox.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
CHAPTER 4 NUMERICAL METHODS FOR DESCRIBING DATA What trends can be determined from individual data sets?
Chapter 2 Describing Data: Numerical
Exploratory Data Analysis
a graphical presentation of the five-number summary of data
Notes 13.2 Measures of Center & Spread
Introduction To compare data sets, use the same types of statistics that you use to represent or describe data sets. These statistics include measures.
Travelling to School.
1.3 Measuring Center & Spread, The Five Number Summary & Boxplots
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Describing Distributions Numerically
Averages and Variation
Chapter 3 Describing Data Using Numerical Measures
Analyze Data: IQR and Outliers
Unit 4 Statistics Review
Numerical Measures: Skewness and Location
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Warm-up 8/25/14 Compare Data A to Data B using the five number summary, measure of center and measure of spread. A) 18, 33, 18, 87, 12, 23, 93, 34, 71,
The absolute value of each deviation.
Day 52 – Box-and-Whisker.
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
Describing Data Coordinate Algebra.
Analyze Data: IQR and Outliers
Presentation transcript:

Influenza Neuraminidase Inhibitor IC 50 Data: Calculation, Interpretation and Statistical Analyses

Presentation Outline Determining IC 50 values Curve-fitting methods Sources of variation Identifying IC 50 outliers Determining cut-offs/thresholds Outlier values versus resistant viruses Monitoring trends over time in IC 50 data

Abbreviations NA: Neuraminidase NI: Neuraminidase Inhibitor RFU: Relative Fluorescence Units RLU: Relative Luminescence Units VC: Virus Control

Determining NI IC 50 Values IC 50 : The concentration of NI which reduces NA activity by 50% of the virus-control, or upper asymptote Estimated by: Measuring the NA activity (RFU or RLU) of an isolate against a range of dilutions of the drug, as well as without drug (virus-control)

Determining IC 50 Values: Calculation Options Curve fitting-Statistical software Graph Pad Prism ( $595 Grafit ( $ Jaspr (Developed by CDC) Contact CDC for suitability and further details Point-to-Point calculation Excel templates (Created by Health Protection Agency, UK)

IC 50 Calculation: Curve-Fitting (Graph-pad) Upper Asymptote Non linear regression analysis Sigmoidal dose response curve

IC 50 Calculation: Curve-Fitting (Grafit)

IC 50 Calculation: Point to Point

IC 50 Calculation: Comparison of IC 50 values from different calculation methods

Method of calculation Using point-to-point or curve fitting software Choice of curve fitting software used Intra-assay variation Difference between 2 or more replicates Inter-assay variation Difference in calculated value for a given isolate in multiple assays Determining IC 50 Values: Sources of Variation

IC 50 Calculation: Comparison of Point to Point versus curve-fitting Curve Fitting WILL give an IC 50 value by extrapolating the curve when drug dilutions do not reach a true end point: This does not necessarily give an accurate IC 50 value

IC 50 Calculation: Troubleshooting Low VC: technical errorPoor curve fit Drug titration error Important to examine curves carefully to ensure IC50 is valid

Comments: Choice of IC 50 Calculation Method Choice of IC 50 calculation method will make no more than about 5% difference to IC 50 value, for most samples Must be clear exactly how the curve fitting and calculation of IC 50 is working Is IC 50 based on 50% of RFU/RLU of VC or 50% of fitted upper asymptote. Regardless of method used, careful examination of the curve produced is required to identify technical issues. See presentation on validation and troubleshooting of IC50 testing methods

Analysis of intra-assay variation

Comments: Intra-assay Variation Variation between replicates in the same assay can be 15%-20%. This variation is greater than that seen with changes to curve-fitting method Using replicates and taking the average reduces this effect. A large difference between replicates (e.g. >30%) of a given virus indicates a technical issue In these instances repeat testing should be performed

Analysis of Inter-assay Variation Introduction of new drug batch

Analysis of Inter-assay Variation

Comments: Inter-assay Variation Variation in IC50 values for a virus in multiple assays can be 50%. Control viruses should be included in every assay to identify technical issues. Control viruses should be validated, and have a defined range between which the IC50 is valid. Assays in which the control virus IC50 falls outside the accepted range should be reaped in their entirety.

Conclusions: Determining IC 50 Values Several methods for IC 50 calculation available at a range of price and sophistication Variation due to choice of IC 50 calculation method is minimal (5- 10%) in comparison with intra-assay (20%) and inter-assay (50%) variation. Choice of curve-fitting method should be made based on individual laboratory circumstances All variation can be minimised using appropriate assay controls (reference/control viruses, validation of curves generated) Consistency in methodology used (statistical and laboratory) is important for long term analysis (time trends)

Identifying IC 50 outliers Aim: identify isolates with higher (or lower) than expected IC50 values (outliers) First determine the ‘normal range’ of IC 50 values Each Various statistical methods may be used Critical to ensure that any outliers do not unduly affect the cut-off/threshold Outlier does not equal resistant Identifies isolates that may be worth further investigation (retesting/sequencing)

Identifying IC 50 outliers: Commonly Used Statistical Methods SMAD Robust estimate of the standard deviation based on the median absolute deviation from the median Box and Whisker plots Graphical representation of the 5 number summary of the data (the sample minimum, the lower or first quartile, the median, the upper or third quartile, the sample maximum) Both methods require a minimum dataset to perform robust analyses (>20) Cut offs can be calculated mid-season, once a reasonable number of samples has been tested, to monitor outliers At the end of the season, cut offs can be updated and a retrospective analysis of all season data performed.

Using SMAD Analysis Create a scatter plot of all data Useful to see the spread and trend of the data Log transform the data Calculate a robust estimate of the standard deviation based on the median absolute deviation from the median using log10 data Templates for this analyses are available from HPA, UK Major outliers: all those with values more than 3SD above the median Minor Outliers: all those more than 1.65SD above the median

Using SMAD Analysis: Example Data Early-Mid Season Estimate Median0.97 Robust SD1.27 Minor Outlier (1.65SD)1.45 Outlier (3SD)1.99 Post Season Estimate Median1.1 Robust SD1.25 Minor Outlier (1.65SD)1.56 Outlier (3SD)2.11

Using Box and Whisker Analysis This analysis can be performed in Graphpad Prism, with the box and whisker plots drawn automatically Calculations can be done in excel, but drawing the box and whisker plots is more complicated A template for plotting the graphs is available from Adam Meijer Log transform the data Calculate the median, upper quartile, lower quartile, interquartile range, upper minor and major fences, and lower major and minor fences Mild outliers lie between the minor and major fences Extreme outliers lie outside the major fence

Box and Whisker Plots Principle Mild outlier Extreme outlier Equivalent values in SMAD analysis Excel Formulae Upper quartile (Q3): (QUARTILE(B2:B150,3) Lower quartile )Q1): (QUARTILE(B2:B150,1) IQR: Q3-Q1

Using Box and Whisker Analysis: Example Data Box and Whisker Median1.08 Mild outlier lower fence (1.5*IQR)0.55 Extreme outlier lower fence (3*IQR)0.34 Mild outlier higher fence (1.5*IQR)1.95 Extreme outlier higher fence (3*IQR)3.14 Graphpad Prism Output Excel Output

Do we need to log-transform? Most results from dilution assays produce ‘geometric results’ so likely to be sensible Sometimes data are skewed. (e.g. lower quartile much closer to median than upper quartile) Important to log transform as robust methods assume data are normal once outliers are removed.

Using Log10 versus non logged Data Non Log Data Median0.72 Robust SD 0.50 Minor Outlier (1.65SD) 1.55 Outlier (3SD) 2.23 Log10 Data Median0.72 Robust SD 0.50 Minor Outlier (1.65SD) 1.55 Outlier (3SD) 2.23

Impact of Excess Numbers of Outliers If the data have a large number of outliers, both SMAD and B+W struggle to determine sensible cut offs. As resistant virus is very clearly different, these values can be removed prior to analysis to allow sensible calculations of cut offs for the remaining data. Below, data is shown for H1N1 in 2007/8, when sensitive and resistant virus co-circulated. Cut offs calculated do not accurately apply to the sensitive IC50 data Resistant Isolates removed Sensitive and Resistant Isolates

Conclusions: Identifying Outliers Determining cut offs/thresholds identifies those isolates with IC50 values higher than the normal range Cut offs/thresholds need to be subtype specific Season specific cut offs are useful, if enough data is generated in one season, but data from multiple seasons can be merged to perform a more reliable analyses Box and whisker plots and SMAD analyses generate slightly different cut offs Q3+1.5xIQR (mild outlier cut off) is equivalent to 2.7SD from SMAD Q3+3xICR is equivalent to 4.7SD from SMAD. Cut offs calculated by box and whisker analyses are higher than those from SMAD analyses. Choice depends on individual laboratory preference Box and whisker plots present the data well Both methods minimise the impact of outlier values on the analyses, but both will fail once too many outliers are present Data begins to have two populations

Monitoring Trends Over Time The normal range of IC50 values for a particular subtype can change over time This could be seen by an increase in the number of outliers, or by changes in the median Simple to monitor, using the methods already described for identifying outliers scatter plots/box-whisker Other statistical methods can be used to further analyse data from several seasons

Trends in IC50 Data: Scatter Plot

Trends in IC50 Data: Box and Whisker

Summary Good use of statistical methods can help interpret the IC50 results and ensure assay results are reliable. Analyses of data not only identifies individual outliers, but allows continuous monitoring of trends Retrospective analyses of multiple seasons of data can identify changes in viral characteristics and susceptibilities Do not use statistics without first looking at the data by scatter plot to find obvious deviations which require an adapted statistical approach Challenge is to find explanations for trends