FM Model Assessment: Old scores and New Combinations ShuoYong Shi Nick Grishin Lab.

Slides:



Advertisements
Similar presentations
CHAPTER 24 MRPP (Multi-response Permutation Procedures) and Related Techniques From: McCune, B. & J. B. Grace Analysis of Ecological Communities.
Advertisements

Descriptive Statistics and the Normal Distribution HPHE 3150 Dr. Ayers.
Measures of Dispersion and Standard Scores
Statistics for the Social Sciences
Z - SCORES standard score: allows comparison of scores from different distributions z-score: standard score measuring in units of standard deviations.
Probability & Using Frequency Distributions Chapters 1 & 6 Homework: Ch 1: 9-12 Ch 6: 1, 2, 3, 8, 9, 14.
Non-parametric equivalents to the t-test Sam Cromie.
Reading Report Ce WANG A segment alignment approach to protein comparison.
Lecture 8: z-Score and the Normal Distribution 2011, 10, 6.
Hybrid Protein Model Quality Assessment Jianlin Cheng Computer Science Department & Informatics Institute University of Missouri, Columbia, MO, USA.
Identification of Domains using Structural Data Niranjan Nagarajan Department of Computer Science Cornell University.
z-Scores What is a z-Score? How Are z-Scores Useful? Distributions of z-Scores Standard Normal Curve.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Statistics and Quantitative Analysis U4320 Segment 4: Statistics and Quantitative Analysis Prof. Sharyn O’Halloran.
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
Chapter 5: z-scores.
Quiz 5 Normal Probability Distribution.
Chapter 5: z-Scores. 5.1 Purpose of z-Scores Identify and describe location of every score in the distribution Take different distributions and make them.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 5 Z-Scores. Review ► We have finished the basic elements of descriptive statistics. ► Now we will begin to develop the concepts and skills that.
From Last week.
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Statistics 1 The Basics Sherril M. Stone, Ph.D. Department of Family Medicine OSU-College of Osteopathic Medicine.
Modelling binding site with 3DLigandSite Mark Wass
Descriptive Statistics And related matters. Two families of statistics Descriptive statistics – procedures for summarizing, organizing, graphing, and,
Development of automatic score to simulate manual assessment for CASP FM targets Qian Cong from Grishin lab.
 Statistical Measures allow us to compare individual values to other values in a data set. They are things like:  Per capita  Percent change  Percentile.
Education 793 Class Notes Normal Distribution 24 September 2003.
Interpreting Performance Data
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Comp. Genomics Recitation 3 The statistics of database searching.
Distributed Information Retrieval Server Ranking for Distributed Text Retrieval Systems on the Internet B. Yuwono and D. Lee Siemens TREC-4 Report: Further.
Multiple alignment: Feng- Doolittle algorithm. Why multiple alignments? Alignment of more than two sequences Usually gives better information about conserved.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Z-Scores Standardized Scores. Standardizing scores With non-equivalent assessments it is not possible to develop additive summary statistics. –e.g., averaging.
Chapter 5: z-Scores x = 76 (a) X = 76 is slightly below average x = 76 (b) X = 76 is slightly above average 3 70 x = 76 (c) X = 76 is far.
Psychology 290 – Lab 9 January Normal Distribution Standardization Z-scores.
Hand out z tables Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015.
Pairwise Sequence Analysis-III
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Expected accuracy sequence alignment Usman Roshan.
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
PROBABILITY AND STATISTICS WEEK 1 Onur Doğan. What is Statistics? Onur Doğan.
THE NORMAL DISTRIBUTION AND Z- SCORES Areas Under the Curve.
The Normal distribution and z-scores
Statistics for Political Science Levin and Fox Chapter Seven
EE459 I ntroduction to Artificial I ntelligence Genetic Algorithms Practical Issues: Selection.
Today: Standard Deviations & Z-Scores Any questions from last time?
Chapter 3 Percentiles. Standard Scores A standard score is a score derived from raw data and has a known basis for comparison. A standard score is a score.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 16, 2009.
Expected accuracy sequence alignment Usman Roshan.
PUNCH: An Evolutionary Algorithm for Optimizing Bit Set Selection Adam J. Ruben, Stephen J. Freeland, Laura F. Landweber DNA7, 2001 Summarized by Dongmin.
Motif identification with Gibbs Sampler Xuhua Xia
Z-Scores. Histogram A bar chart of a frequency distribution. 0 — 2 1 — 3 2 — 0 3 — 2 4 — 4 5 — 3 6 — 3 7 — 5 8 — 3 9 — 2 10—2.
 A standardized value  A number of standard deviations a given value, x, is above or below the mean  z = (score (x) – mean)/s (standard deviation)
Z-Scores Quantitative Methods in HPELS HPELS 6210.
Chapter 5 z-Scores PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J. Gravetter and Larry.
Reasoning in Psychology Using Statistics Psychology
Scoring Sequence Alignments Calculating E
Numerical Measures of Position
Assessment of CASP11 Contact-Assisted Predictions
12.4 – Measures of Position In some cases, the analysis of certain individual items in the data set is of more interest rather than the entire set. It.
Increased Reliability of Nuclear Magnetic Resonance Protein Structures by Consensus Structure Bundles  Lena Buchner, Peter Güntert  Structure  Volume.
Volume 20, Issue 6, Pages (June 2012)
Normal Distribution.
High-Resolution Comparative Modeling with RosettaCM
Presentation transcript:

FM Model Assessment: Old scores and New Combinations ShuoYong Shi Nick Grishin Lab

Scores used in CASP5 Assessment*: Ten scores encompass structure and sequence characteristics Generate Scores GDT Distance DALI Qdali CE Qce Mammoth Qmm LGA Qlga SOV Q ratios: Correctly aligned residues/ target length *Kinch LN, et al. Proteins. 2003; 53 Suppl 6:

Structure Score Distributions: Target 531 GDT Distance Dali Mammoth

Structure Score Distributions: Target 531 GDT Distance Dali Mammoth 399_5113_1 360_4 399_5 113_2 119_1 215_1

Sequence Score Distributions: Target 531 Qlga SOV Qdali Qmm 399_5 1_3 55_1208_2

Bad Score Distributions: Target 531 CE Qce

Replace Score Distributions: Target 531 CE Qce Tmalign Qtm 113_4490_2

Combine Ten Scores: TenS Strategy to combine scores: 1)Transform raw scores to Z-scores 2) Sum of Z-scores using equal weight (TenS) Throw out zeros, calculate Z-score Throw out raw score with Z-score<-2, recalculate mean and std Recalculate Z-score on entire population, and assign Z-score <-2 to -2.

Combine Scores (TenS): Target 531 TenS GDT GDT winner: TS399_5 TenS winner: TS208_2

Score Used inComponents Raw Form TenS new 10 (GDT) Z score QCS new 6 (CS) % Max GDT CASP* % Max CS CASP8 % Max Strategy to combine scores: 1) Transform raw scores to Z-scores 2) Sum of Z-scores for each target (ComS) Combine Main Scores: TenS, QCS, GDT, and CS

Motivation: Top performing groups use similar strategies that rank server models with various scoring functions and refine top picks Who did better than servers? Score Strategy: Ratio of best group model scores to top server model for each of the main scores Additional Score: Comparison to top server Model

Server Ratio Score Combinations: 1)Ratio scores below 1 are ignored 2)Average Scores(4) for each target 3)Sum score averages Additional Score: Comparison to top server Model The Sum of average ratios (which are rarely much larger than 1) indicates the number of times each group outperformed servers

Additional Score: Comparison to top server Model (Target 531 Example) Top Servers

Additional Score: Comparison to top server Model (Combined Scores) Servers Outperformed servers 14 times

Report Scores: Many, Many, Many Sortable Tables and a Web Site

Report Scores: Many, Many, Many Sortable Tables and a Web Site

What’s the final ranking?