This presentation will be placed on the WWW at the following address:

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Introduction to Flow Cytometry
SUMMARIZING DATA: Measures of variation Measure of Dispersion (variation) is the measure of extent of deviation of individual value from the central value.
ACQUISITION RATES EFFECT ON MULTIPARAMETRIC RARE EVENT ANALYSIS : AN INSTRUMENT COMPARISON John Tigges, Vasilis Toxavidis, Heidi Mariani Beth Israel Deaconess.
World Health Organization
Advancements in FACS analyzers optical design leads to greater functionality and a smaller footprint. INTRODUCTION In the past decade, instrumentation.
COMPENSATION By: Ronald Mathieu. Compensation Why do we need compensation? –1) Because of long emission spectrum of dyes like FITC and PE.
Page 1 © J.Paul Robinson, Purdue University BMS 602 LECTURE 9.PPT BMS LECTURE 10x Flow Cytometry: Theory Bindley Bioscience Center Purdue.
Page 1 © J.Paul Robinson, Purdue University Cytometry Laboratories BMS 602 LECTURE 9.PPT BMS LECTURE 9 Flow Cytometry: Theory Hansen Hall,
CHAPTER 6 Statistical Analysis of Experimental Data
Flow Cytometry and Sorting Part 2 Lecture Notes for “Fluorescence Spectroscopy in Biological Research” Robert F. Murphy, October 1996.
8-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft.
Copyright ©2011 Pearson Education 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft Excel 6 th Global Edition.
Workshop Tutorial Polish Cytometry Society 1998 Analysis of flow cytometric data - data collection, principles of gating and histogram analysis This presentation.
Page 1 © J.Paul Robinson, Purdue University BMS 631 – LECTURE007.PPT BMS 602/631 - LECTURE 8 Flow Cytometry: Theory J. Paul Robinson Professor.
Basics of Flow Cytometry Holden Maecker. Outline Definitions, what can be measured by flow cytometry Fluidics: Sheath and sample streams, flow cells,
Introduction to Flow Cytometry
Chemometrics Method comparison
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Introduction To Flow Cytometry By Noha Kamel. Flow cytometry is a method of measuring multiple physical and chemical characteristics of particles by optical.
Basic Principles in Flow Cytometry
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Section 10.1 Confidence Intervals
FREQUANCY DISTRIBUTION 8, 24, 18, 5, 6, 12, 4, 3, 3, 2, 3, 23, 9, 18, 16, 1, 2, 3, 5, 11, 13, 15, 9, 11, 11, 7, 10, 6, 5, 16, 20, 4, 3, 3, 3, 10, 3, 2,
Building Simulation Model In this lecture, we are interested in whether a simulation model is accurate representation of the real system. We are interested.
Flow Cytometry Basic Training. What Is Flow Cytometry? Flow ~ cells in motion Cyto ~ cell Metry ~ measure Measuring properties of cells while in a fluid.
Flow Cytometry Becton Dickinson Asia Limited Company.
Chap 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers Using Microsoft Excel 7 th Edition, Global Edition Copyright ©2014 Pearson Education.
Quality Controls: Get your instruments under control!
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Stats Methods at IC Lecture 3: Regression.
Descriptive and Inferential Statistics
Lecture Slides Elementary Statistics Twelfth Edition
Logic of Hypothesis Testing
Chapter 7 Confidence Interval Estimation
Confidence Interval Estimation
Flowcytometry.
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 8 Confidence Interval Estimation.
Analyzing and Interpreting Quantitative Data
Chapter 8: Inference for Proportions
Elementary Statistics
COMPENSATION + FITC - PE - + PE - FITC - FITC detector FITC PE
Pulse Processing Chapter No. 17
Analyzing Reliability and Validity in Outcomes Assessment Part 1
MEASURES OF CENTRAL TENDENCY
Introduction to Summary Statistics
Introduction to Systems Analysis and Design
Introduction to Summary Statistics
Some Basic Physics & Optics
Flow Cytometry and Sorting Part 3
Inferential Statistics
Statistical Analysis Error Bars
Confidence Interval Estimation
Flow Cytometry Journal of Investigative Dermatology
Chapter 3: Central Tendency
Dr. Clincy Professor of CS
THE STAGES FOR STATISTICAL THINKING ARE:
The Examination of Residuals
A New Tool for Evaluating Candidate PM FEM and PM2.5 ARM Monitors
Displaying data Seminar 2.
Scatter Diagrams Slide 1 of 4
Presentation transcript:

This presentation will be placed on the WWW at the following address: Workshop Tutorial Polish Cytometry Society 1998 Analysis of flow cytometric data - data collection, principles of gating and histogram analysis J.Paul Robinson, Ph.D. Presentation given to the Polish Society for Cytometry meeting, Gdansk, 20 October, 1998. Material on this site is copyrighted by J.Paul Robinson, however, all the material may be freely used for non-commercial educational use. Examples include lectures, tutorials and lecture notes. If slides and material are used, appropriate acknowledgement should be as follows: Taken/modified from material prepared by J.Paul Robinson, Purdue University, West Lafayette, Indiana. For contact purposes: J.Paul Robinson, Ph.D. Professor of Immunopharmacology Purdue University, West Lafayette, In. Ph (765) 494-0757 Fax (765) 494-0517 http://www.cyto.purdue.edu email jpr@flowcyt.cyto.purdue.edu Date Published to the WEB 25 October, 1998 This presentation will be placed on the WWW at the following address: http://www.cyto.purdue.edu/educate

Data Analysis Data acquisition vs. data analysis Data analysis software Data display Establishing regions and gating Analysis methods that can change results Summary This workshop will concentrate on understanding the nature of data analysis for flow cytometry. We will discuss the structure of the files, both listmode and histograms, techniques for gating populations (forward and back gating), the use of time and histogram analysis. Various examples will be provided and routine problems in data analysis will be identified. More detailed notes will be provided for the workshop session.

Data Acquisition Each measurement from each detector is referred to as a “parameter” or “variable” Data are acquired as a “list” of the values for each “parameter” (variable) for each “event” (cell) .Data Collection Most flow cytometers collect between 3 and 10 "parameters" or "variables". Each variable can be used to discriminate some component of the cell populations. The standard for data collection has been known as the "Flow Cytometry Standard" or FCS data file structure. Data can be collected in either histogram mode or listmode. It is now routine to collect flow cytometry data in listmode because there are few constraints in the size of data collection files. Originally, flow cytometers did not have good network capability, small hard disks (5-20 Mb, and floppy drive that could save only about 1 Mb of data). With high capacity hard disks and Ethernet access now routine, listmode is the most useful collection modality. Data can be collected in a gated or ungated mode. If the data are ungated, all events registered by the instruments electronic circuits will be saved in listmode files. If gated collection is chosen, it is possible to collect a listmode file of only those events that satisfy certain criteria established by the operator. [RFM]

Data Analysis Software Instrument Software Elite 4.0 Coulter Bryte HS 2.0 Bio-Rad Lysis II Becton-Dickinson Commercial Sources WinList & Modfit LT Verity Software ListView & Multicycle Phoenix Software Free Flow Software WinMDI Web

Pros & Cons of Free Flow Software Advantages: works with listmode files from many types of aquisition software source code is available many people use these packages FREE!!!!! Disadvantages: little or no technical support little or no documentation often no tutorials BUGS!! Don’t forget to mention that the CDROM has all the free software on it!!! .Analysis Software There are many ways of analyzing flow cytometry data. The first and probably most important packages are those provided on the instruments used in the laboratory to collect the initial data. Often the manufacturer provides a limited analysis package, however, frequently a great deal of the needed analysis can be obtained directly from the instrument package. If more complex gating and comparison are required, it is mostly necessary to move the data to an off-line computer with an alternative data analysis package. Many freeware packages are available today that are excellent. Several examples are WinMDI written by Joe Trotter.

Flow Cytometer Computer Files Listmode files -correlated data file where each event is listed sequentially, parameter by parameter -large file size Histogram files uncorrelated data used for display only Flow cytometry standard (FCS 2.0) format used to save data use other software programs to analyze data

Ungated Listmode Gated Listmode collection Types of Listmode data Ungated Listmode Gated Listmode collection .Data structure As noted above the file structure of the collected data is normally saved in a format known as FSC (Flow Cytometry Standard). This was originally proposed by Murphy et al (ref) to encourage all manufacturers and software engineers to comply with the standard to allow relatively easy data exchange between users of different instruments. Most manufacturers comply with a reasonable extent and a number of conversion utilities exist in software packages to allow cross-analysis of data from different instruments. The current standard is FCS-2, which adds a number of additional components to the file structure. Each file collected has a number of keywords, PMT (Photomultiplier Tube) values and gating regions associated with it. Some listmode file structures also contain the instrument setup used to collect the original data. While each instrument collects files of different internal structure, in essence the information collected retains a relatively common format allowing 3rd party software manufacturers to create value added analysis packages.

Data Acquisition - Listmode .List Mode Data Essentially however, a listmode file is a series of numbers which sequentially list data for each variable collected on each cell. This is shown in the associated table provided. [RFM]

Listmode File FILEVERSION;1.15 BTRIEVE;7058418 IBA13-2 collected 6/27/95;28/6/1995;;2;1 PARAM;LS1;400;LS2;600;FL1;460;FL2;400;FL3;300;WID;TIM GAIN;1;1;2;2;0 THRES;0;10;0;5;0;5;1;11;0;5 FLUIDIC;3;4;1;7.20 FCM;357;2;2;1;1;0;0;1;1;1;1;0;1;1;0;1 SUBTRACT;15;0 AUTOCYCLE;1;0;0;0;0;0;0;-1;1;20000;7;0;1;0;1;0 PCBUFFER;374000;1 SERIAL;2 ADBOARD;1;0;0;30;0 PEAKAREA;0;0;0;0 CALIBRATION;300;300;300;300;0;0;0;0 MAINWINDOW;1 PRINTHEADER; Flow cytometry Report;Win-Bryte software - Bryte-HS flow cytometer PRINTFOOTER; Purdue University Cytometry Laboratories PRINTSTATISTICS;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;1;0;0;1;20;15;1;8;12;5;0;0;5 ROI;1;1;3;3;4;45;65;61;65;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;bird;1;1;1;0;1;1;1;0;1;1;0;;-1;-1;3;0;255;0;0 ROI;2;1;3;3;0;121;41;141;41;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;human;1;1;1;0;1;1;1;0;1;1;0;;-1;-1;3;0;255;0;0 ROI;7;3;5;1;0;0;18;24;63;52;48;16;2;0;2;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;0;;1;1;1;0;1;1;1;0;1;1;0;;-1;-1;3;5;255;0;0 ROIDATACYTO;7;20022;4845.7;98.7;0.0 ROIDATAHIST;1;2.4;16712;52;53;4044.6;2.4;53;83.5;0.0 ROIDATAHIST;2;2.2;1875;128;128;531.3;1.7;128;7.8;0.0 HISTOGRAM;FALS;24;49;290;320;256;0;Count;1;1;0;0;0;255;0;255;0;0;0;0;0;255;1;1;C;0;1;1;1;S;0;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;1;1;1;255;0;255 HISTOGRAM;SIDE-SCATTER;290;49;556;320;256;1;Count;1;1;0;0;0;255;0;255;0;0;0;0;0;255;1;1;C;0;1;1;1;S;0;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;1;1;1;255;0;255 CYTOGRAM;LS1-LS2;24;320;290;591;64;1;0;1;1;0;0;0;255;0;255;0;0;0;0;0;0;1;1;O;0;1;0;0;S;0;6;-1;-1;-1;-1;-1;-1;-1;-1;-1;6;-1;-1;1;0;1;0;A;B;C;D;1;1;255;0;0;32;32;0;255;0;255 HISTOGRAM;FL2;290;320;556;591;256;3;Count;1;1;0;0;0;255;0;255;0;0;0;255;0;0;1;1;C;10;1;1;1;S;0;0;1;-1;-1;-1;-1;-1;-1;-1;-1;0;7;-1;1;1;1;255;0;255 CYTOGRAM;FL2-TIM;556;320;822;591;64;6;3;1;1;0;0;0;255;0;255;0;0;0;0;0;0;1;1;I;0;1;0;0;S;0;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;-1;1;0;1;0;A;B;C;D;1;1;255;0;0;32;32;0;255;0;255 LISTDATA 226;0;426;426;243;81;0 412;225;8;426;426;239;0 380;262;2;427;427;245;0 578;348;0;412;412;239;0 529;377;0;431;431;240;0 420;203;0;438;438;243;0 444;221;0;427;0;240;0 383;215;0;421;421;238;0 735;716;0;1027;1027;280;0 499;228;0;431;431;239;0 531;328;0;433;433;241;0 520;218;0;423;423;243;0 471;252;0;425;425;244;0 652;298;0;441;441;240;0 561;291;0;421;421;240;0 608;307;0;415;415;241;0 541;231;0;424;424;245;0 Listmode File

Listmode data Analysis PARAM;LS1;400;LS2;600;FL1;460;FL2;400;FL3;300;WID;TIM 226;0;426;426;243;81;0 LS1 LS2 FL1 FL2 FL3 WID TIM 226 0 426 426 243 81 0

One parameter frequency histogram # of events for particular parameter .Histogram Data Histograms are by definition frequency distributions for whatever variable of interest. Frequently we use the term histogram to incorrectly describe various types of data representations. establish regions and calculate coefficient of variation (cv) cv = stdev/mean of half peak

Establishing Regions Establishing regions: -objective or subjective? -training/skill/practice Possible shapes: -rectangles -ellipses -free-hand -quadrants Statistics R1 .Gating: Principles and Terms There are a number of terms associated with flow cytometry analysis. One important concept is the idea of creating an electronic gate. A gate can also be referred to as a bitmap or region. Usually these terms are synonymous in usage, although they can mean slightly different things to different manufacturers.

Gating Real-time gating vs. software gating Establishing regions Gating strategies Quadrant analysis Complex or Boolean gates Back gating

Real-Time vs. Software Gating Real-time or live gating: -restrict the data that will be accepted by a computer (some characteristic must be met before data is stored) (This is not encouraged) Software or analysis gating: -excludes certain stored data from a particular analysis procedure .Forward Gating Forward gating occurs when we select criteria to be applied to a data set such as an initial light scatter bitmap.

Using Gates R1 log PE Region 1 established Gated on Region 1

Light Scatter Gating Side Scatter Projection Neutrophils Scale 1000 200 100 50 40 Monocytes 30 20 This cartoon indicates how data can be presented. This is a contour plot with a PROJECTION histogram of each of the variables shown on the contour plot. For example if contour plots are to be presented, contour levels must be provided (see box at left). Without the contour levels, there is no way to know how the data were actually derived - for example, the lowest level might be 1 cell or even 100 cells - so the data must be provided. The second issue of importance is the labeling of the axes. It is NOT acceptable to label axes as FL! or FL2. This shows an unscientific understanding of the nature of data presentation and should NEVER be used. It is best to identify the name of the parameter/variable represented and if applicable, the wavelength of the signal. If the signal is LOG data, this should be indicated also. 15 Lymphocytes 8 200 400 600 800 1000 90 Degree Scatter

Gating Forward gate Back gate 2P Scatter 1P Fluorescence 90 deg Scatter FALS Scatter Gating Forward gate 2P Scatter 1P Fluorescence 2P Fluorescence Back gate .Back Gating When we backgate - our goal is to identify certain endpoint criteria and apply them to the data set. As example in a whole blood assay, we can use scatter gating to identify which cells might be monocytes. However, there may be some overlap with other cell types making accurate analysis impossible. By adding a fluorescent marker that will identify those monocytes we can place a bitmap on the positive regions of the fluorescence histogram and backgate that to identify the scatter of only those cells that are positive with the fluorescence marker. This is now a backgated population. Backgating can be sequential based on several criteria. log PE

Quadrant Analysis The Square Cell Principle Is It necessary? Why is it used? .Histogram Analysis There are many ways to analyze a histogram. Some very simple methods are often satisfactory to obtain the required information. For example, there might be an interest in identifying positive from negative expressing cells. If there is a significant difference in the fluorescence distributions, a simple positive gate will give you the answer you need. If however, there is overlap between the populations a more sophisticated approach might be needed. This is the area where significant caution must be taken as analysis now becomes more complex. Comparison between histograms requires considerable care. Statistical methods must be used to determine significant difference between populations. The coefficient of variation (CV) is an important measure of the variation that exists between cells within a population. Other statistical components will be introduced.

Quadrant Analysis 2 1 3 4 (- -) (+ -) ( - +) (+ +) log PE Log PE (575 +/- 20 nm) 3 (- -) 4 (+ -) Log FITC (525 nm)

Complex or Boolean Gating With two overlapping regions, several options are available: R1 R2

Boolean Gating Not Region 2:

Boolean Gating Region 1 or Region 2:

Boolean Gating Region 1 and Region 2:

Boolean Gating Not (Region1 and Region 2):

Back Gating Region 4 established Backgating using Region 4 Back gate log PE .Back Gating When we backgate - our goal is to identify certain endpoint criteria and apply them to the data set. As example in a whole blood assay, we can use scatter gating to identify which cells might be monocytes. However, there may be some overlap with other cell types making accurate analysis impossible. By adding a fluorescent marker that will identify those monocytes we can place a bitmap on the positive regions of the fluorescence histogram and backgate that to identify the scatter of only those cells that are positive with the fluorescence marker. This is now a backgated population. Backgating can be sequential based on several criteria. Region 4 established Backgating using Region 4

Drawing Regions: Sample Preparation Sample Quality log log Spores Spores Vegetative Debris Debris log log B.subtilis spores B.subtilis veg. + spores

Multi Parameter Data Display FITC+ APC+ PE + APC+ FITC+ PE+

Methods that can change results: 1. Doublet discrimination 2. Time as a quality control parameter Example: DNA content -need to eliminate debris & clumps -need to gate out doublets -maintain constant flow rate

Doublet Discrimination Peak Fluorescence Integral Fluorescence - selection at the time of collection - 1:1 ratio

Time as a quality Control Parameter “blockage” data can be removed by gating the time histogram. A B Abnormal histogram caused by turbulence (not biological variation) Time was first proposed as a quality control parameter by Watson. IN the above example the DNA histogram (Top left) appears to have an abnormal G0-G1 population. By collecting time as a parameter, an observation can be made as to the rate of cell collection. As can be seen at the right of this histogram, the collection rate appears to have been significantly reduced at region B. This may have been caused by a blockage in the flow cell or some other non-biological problem. If this caused turbulence in the flow cell, a significant impact might be seen on the CV of the fluorescence histogram. By removing the data collected during the time gate B (by using a Boolean operator [a NOT B], only data collected at the appropriate rate is included in the data analysis as seen on the lower left histogram. A normal time histogram is shown on the lower right. Normalized histogram after subtraction of list mdoe data collected during the blockage shown at B on right.

This workshop has discussed the following ideas: Conclusion This workshop has discussed the following ideas: 1. Nature of data 2. Analysis software and techniques 3. Gating and region definition 4. Forward and Back gating 5. Boolean operators on flow data 6. Quality control systems This Workshop was entitled: Analysis of flow cytometric data - data collection, principles of gating and histogram analysis and was presented at the 4th Polish Cytometry Society Congress in Gdansk, Poland, 18 October, 1998. J. Paul Robinson Professor of Immunopharmacology Purdue University West Lafayette, USA Material copyright by J.Paul Robinson. See first slide notes for use information. Date Published to the WEB 25 October, 1998