Abstract Accurate determination of the molecular weight (MW) of a protein is an important step toward its isolation, purification and identification. Sodium.

Slides:



Advertisements
Similar presentations
Electrophoresis Theory
Advertisements

Objectives 10.1 Simple linear regression
Forecasting Using the Simple Linear Regression Model and Correlation
A Model to Determine Molecular Weights of Proteins from Gel Electrophoresis By Jose Ceja Kamyar Ghods CSUN/JPL-PAIR 2001.
Protein Purification Molecular weight Charge Solubility Affinity.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
1 Introduction to Inference Confidence Intervals William P. Wattles, Ph.D. Psychology 302.
Objectives (BPS chapter 24)
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Introduction to Linear Regression.
Regression and Correlation
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
PROTEOMICS: GROUP 7 Aaron Simo, Antolino Venegas, & Daniel Weisman.
Molecular Weight Determination of Unknown Proteins for NASA/JPL PAIR Program August 24, 2001 Barbara Falkowski Falgun Patel Celia Smith.
1 Time Series Analysis Thanks to Kay Smith for making these slides available to me!
Chapter Topics Types of Regression Models
Determining Molecular Weight of Unknown Proteins: NASA - PAIR Final Project August 24, 2001 Gregory Frye Jennifer Wright Jennifer Rosales.
Sea Urchin VE Removal…Prediction of Molecular Weights of unknown By:Michael Dinse Elizabeth Gutierrez Maria Uribe.
Review: The Logic Underlying ANOVA The possible pair-wise comparisons: X 11 X 12. X 1n X 21 X 22. X 2n Sample 1Sample 2 means: X 31 X 32. X 3n Sample 3.
NASA-PAIR/ Proteomics Projects Crystal Austin Gerardo Lopez Elham Sarabi.
Abstract Accurate determination of the molecular weight (MW) of a protein is a necessity toward its isolation, purification and identification. Sodium.
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Ch. 14: The Multiple Regression Model building
Project 1 Frog Gels: Analysis of Skin John Donich Claudia Morales Kevin Berry.
Protein Electrophoresis BIT 230. Electrophoresis Separate proteins based on Size (Molecular Weight - MW) SDS PAGE Isoelectric Point Isoelectric focusing.
SDS-Polyacrylamide Gel Electrophoresis
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Linear Regression/Correlation
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Gel filtration chromatography y/college/voet/ /animat ed_figures/ch05/f5-6.html.
12b. Regression Analysis, Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Electrophoresis PAGE Dr Gihan Gawish.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Linear Regression Least Squares Method: the Meaning of r 2.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Reasoning in Psychology Using Statistics
1.1 example these are prices for Internet service packages find the mean, median and mode determine what type of data this is create a suitable frequency.
© Copyright McGraw-Hill 2000
Statistics Bivariate Analysis By: Student 1, 2, 3 Minutes Exercised Per Day vs. Weighted GPA.
Ch 5-1 © 2004 Pearson Education, Inc. Pearson Prentice Hall, Pearson Education, Upper Saddle River, NJ Ostwald and McLaren / Cost Analysis and Estimating.
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
Sodium Dodecyl Sulfate-PolyacrylAmide gel Electrophoresis [SDS-PAGE] Experiment 7 BCH 333[practical]
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Confidence Intervals for a Population Proportion Excel.
Polyacrylamide Gel Electrophoresis. Electrophoresis Horizontal Agarose Gels Agarose forms a gel or molecular sieve that supports the movement of small.
1 Probability and Statistics Confidence Intervals.
Copyright © 2003, N. Ahbel Residuals. Copyright © 2003, N. Ahbel Predicted Actual Actual – Predicted = Error Source:
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
Univariate Point Estimation Confidence Interval Estimation Bivariate: Linear Regression Multivariate: Multiple Regression 1 Chapter 4: Statistical Approaches.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Chapter 4 More on Two-Variable Data. Four Corners Play a game of four corners, selecting the corner each time by rolling a die Collect the data in a table.
Linear model. a type of regression analyses statistical method – both the response variable (Y) and the explanatory variable (X) are continuous variables.
Stats Methods at IC Lecture 3: Regression.
Estimating standard error using bootstrap
SDS-Polyacrylamide Gel Electrophoresis Agarose vs. SDS-PAGE
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Statistical Methods For Engineers
Introduction to Inference
Inferences and Conclusions from Data
Introduction to Inference Confidence Intervals
EQ: How well does the line fit the data?
Reasoning in Psychology Using Statistics
Fig. 2. Sera IgE reactivity patterns and the frequency of reactivity to raw and cooked shrimp extracts by Western blot. (A, C) All sera were diluted at.
Molecular weight determination
Presentation transcript:

Abstract Accurate determination of the molecular weight (MW) of a protein is an important step toward its isolation, purification and identification. Sodium Dodecyl Sulfate Polyacrylamide Gel Electrophoresis (SDS-PAGE) in one dimension with single percentage gels is traditionally used for that process. Gradient gels that incorporate a range of percentages have been considered less accurate, in part due to a lack of reliable mathematical models. The purpose of this project was to develop statistical models to accurately predict protein MW's on gradient gels. Six mathematical models were applied to protein standards of previously identified MW's to determine the best fitting model. Relative mobility (R m ) of the protein standards were calculated and compared to the actual MW's to make this determination. The "Cubic Model" was determined to be the best fitting and will be used to identify unknown proteins that may be involved in amphibian fertilization. Goal To determine which model provides the best fit for determining the known protein standards Conclusions We examined 6 mathematical models to relate relative mobility to the molecular weights of known protein standards. The cubic model was determined best by examining the predicted weights, residuals, and R-squared values for each of the models. Then this model was used to estimate the molecular weights of the unknown proteins. Future The cubic model will be tested on proteins involved in frog fertilization. Other ways to reduce the error and improve the model will be studied. 4 Step Procedure Comparison of Mathematical Models to Determine Molecular Weight of Proteins: A Statistical Analysis 1 Jennifer Wright, 2 Edward J. Carroll, Jr., and 1 Lawrence Clevenson Departments of 1 Mathematics and 2 Biology California State University Northridge NASA/PAIR Program Fig. 1 Electrophoresis Gel of Raw Data Relative Mobility vs. Molecular Weight Plot of Raw Data used in Determining the Models Fig. 2 – Graph of relative mobility of raw data vs. log molecular weights starting with two 7.5% gels, two 10%, two 12% and two gradient gels. Fig. 3 Raw Standards Actual Molecular Weights vs. Predicted Molecular Weights of Standards Residuals for Cubic Model log(MW) = a + b * R m + c * R m 2 + d * R m 3 Table 2: Residuals and R-squared values for the Cubic model. The red numbers are negative and black are positive. Comparison of 3 models with a Standard Fig. 4 One set of raw data (Gel #2 VE) is set against 3 of the models tested (Log Linear, Quad., Cubic). Fig. 5 Raw Data Thanks to: Carol Shubin, Virginia Latham, Larry Clevenson, Edward Carroll, Gregory Frye, John Handy, Jennifer Rosales, Alicia Maravilla and Celia Smith. This work was supported by NASA CSUN/JPL PAIR. Grant #NASA-NCC5-489 Final Predicted Weights of Unknown Proteins Using Cubic Model Table 3: The Cubic model was applied to unknown proteins to predict their molecular weights. 1. Analysis of standards in the gels. 2. Test models on known protein standards. 3. Decide on best fitting model. 4. Apply model to unknown proteins. Determinations 1.) The R-Squared is good for most of the models, except for the SLIC model for which R-squared is a little low. R-squared is the ratio of predicted variation,  (û i - u) 2, to the total variation,  (u i - u) 2 where û i is the predicted value of u i for a particular model and u is the mean. The Cubic model produces the R-squared average with the closest fit of the 6 different models. Ideally, R-squared is equal to 1, meaning that the predicted values and the actual values are equal. 2.) The predictions of the MW are good for most of the models but the Cubic shows a smaller amount of variation. 3.) The residuals of the models show the differences between the actual data points and the predicted points. Examining the residuals (see example above) the Cubic model produces smaller residual values than the other 5 models R-Squared Ave. 9,211 10,4729,37410,63510,267 8,0076,500 11,494 12,82012,78913,43813,618 12,27714,400 17,906 17,96321,19720,51820,514 20,93321,500 36,594 29,50628,09028,73628,822 31,64731,000 57,331 50,02844,67944,24844,271 45,51945,000 81,441 78,95572,87669,99570,111 67,68966,200 97, ,609101,04997,28096,991 94,77597, , ,416120,246117,919118, ,949116, , ,210183,306197,683197, ,028200,000 Predicted MW Molecular Weights SLICLog LinearQuad.Log-Log-LN^2CubicActual Table 1: Comparison of the 6 models and the R-squared values produced by each model. Margin of Error Fig. 5 Graph of Standards and upper/lower predicted confidence interval at 95%. Table 4: Actual and predicted molecular weights of standards with a 95% C.I.. Actual and Predicted Standards with Confidence Interval (C.I.) Table 5: The predicted molecular weights of unknown proteins involved in frog fertilization with a 95% C.I.. Unknown Predicted Weights with C.I.