Towards a Model of Computer Systems Research Tom Anderson University of Washington.

Slides:

Advertisements

Similar presentations

Two Issues Concerning Research Conferences Dave Patterson October 2004.

Advertisements

Imbalanced data David Kauchak CS 451 – Fall 2013.

We are Under Attack. We are Under Attack By the Least Publishable Unit.

The General Linear Model Or, What the Hell’s Going on During Estimation?

Dimension reduction (1)

Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.

Standards for Qualitative Research in Education

Statistics for the Social Sciences

QUALIFYING EXAM PERFORMANCE EXPECTATIONS AND GRADING 2004 Ian Waitz.

Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.

Statistical Analysis of the Social Network and Discussion Threads in Slashdot Vicenç Gómez, Andreas Kaltenbrunner, Vicente López Defended by: Alok Rakkhit.

Moving from Conference Paper to Journal Article: Strategies for Success as an Author & Developing a Reputation as a Good Reviewer John Humphreys, Eastern.

ALEC 604: Writing for Professional Publication Week 11: Addressing Reviews/Revisions.

Significance Testing Difference between two means.

Meta-analysis & psychotherapy outcome research

Opportunistic Optimization for Market-Based Multirobot Control M. Bernardine Dias and Anthony Stentz Presented by: Wenjin Zhou.

Conceptual change. Conceptual reorganization in psychology students beliefs’ about the discipline. Eric Amsel & Adam Johnston Weber State University 10.

1 Today More on random testing + symbolic constraint solving (“concolic” testing) Using summaries to explore fewer paths (SMART) While preserving level.

Market-Based Management

Today Concepts underlying inferential statistics

Executive Dashboard Systems Secure CITI Adam Zagorecki April 30, 2004.

Current Situation Strong tradition going back to the 1980s (with very little changes even if community has exploded) Highly competitive/selective conferences.

Reliability, Validity, & Scaling

A preliminary analysis of AAAI-99 submissions Devika Subramanian Rice University.

Use a plan review Keep Score Provide incentives for positive behavior Manage knowledge retention & transfer Follow up for Sustainable Results Cox Ch 7.

Evaluating the Running Time of a Communication Round over the Internet Omar Bakr Idit Keidar MIT MIT/Technion PODC 2002.

Social Networking Techniques for Ranking Scientific Publications (i.e. Conferences & journals) and Research Scholars.

ASPLOS 2015 Debate Onur Mutlu

by B. Zadrozny and C. Elkan

Database Publication Practices Surajit Chaudhuri Microsoft Research.

So What? Operations Management EMBA Summer TARGET You are, aspire to be, or need to communicate with an executive that does not have direct responsibility.

TRAINING EVALUATION WHAT? WHAT? WHY? WHY? HOW? HOW? SO WHAT? SO WHAT?

Journal Impact Factors: What Are They & How Can They Be Used? Pamela Sherwill, MLS, AHIP April 27, 2004.

Corinne Introduction/Overview & Examples (behavioral) Giorgia functional Brain Imaging Examples, Fixed Effects Analysis vs. Random Effects Analysis Models.

1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.

10/7/14 Do Now: Take one of each of the handouts from the front and read the directions on the top of the page. Homework: - Finish reading chapters 9 &

Networked Information Resources SPARC, E-prints & Open Access initiatives.

6/6/01 1 Copyright 2001 by Ralph R. Young Effective Requirements Practices Designed to improve individual, project, and organizational effectiveness. Based.

Nigel Ward University of Texas at El Paso Fifth International Conference on Intelligent Technologies December 3, 2004 Dealing with Uncertainty in a Model.

SHOW US YOUR RUBRICS A FACULTY DEVELOPMENT WORKSHOP SERIES Material for this workshop comes from the Schreyer Institute for Innovation in Learning.

Faculty Satisfaction Survey Results October 2009.

3D Imaging: Literature Review By Gennifer Majors Conservation in Practice Sarah Foskett.

Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.

1 Choosing a Computer Science Research Problem. 2 Choosing a Computer Science Research Problem One of the hardest problems with doing research in any.

Test Case Designing UNIT - 2. Topics Test Requirement Analysis (example) Test Case Designing (sample discussion) Test Data Preparation (example) Test.

Multiple Regression. From last time There were questions about the bowed shape of the confidence limits around the regression line, both for limits around.

ANOVA, Regression and Multiple Regression March

Subject-specific content: A Generic scoring guide for information-based topics 4 The student has a complete and detailed understanding of the information.

Query Suggestions in the Absence of Query Logs Sumit Bhatia, Debapriyo Majumdar,Prasenjit Mitra SIGIR’11, July 24–28, 2011, Beijing, China.

1 Getting Up to Speed on Value-Added - An Accountability Perspective Presentation by the Ohio Department of Education.

Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,

Scientific Peer Review Yixin Chen, Associate Professor Computer & Information Science University of Mississippi April 9, 2013.

Diversity’s Role in Technology Thought Leadership Diverse teams Patenting Conference papers Open source software NCWIT tracks women’s participation in.

Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:

Some problems with systems research in Europe Miguel Castro.

Multi-Area Load Forecasting for System with Large Geographical Area S. Fan, K. Methaprayoon, W. J. Lee Industrial and Commercial Power Systems Technical.

Linear Regression 1 Sociology 5811 Lecture 19 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.

Computer aided teaching of statistics: advantages and disadvantages

Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis

Statistics in MSmcDESPOT

Week 3 Class Discussion.

Simon: Modeling and Analysis of Design Space Structures

STAR Element Current State Future State Gap

PERFORMANCE AND TALENT MANAGEMENT

Indented Tree or Graph? A Usability Study of Ontology Visualization Techniques in the Context of Class Mapping Evaluation 本体可视化技术在类型匹配评估中的可用性研究 Qingxia.

The application of the support needs paradigm in implementing Quality of Life: Introducing AAIDD’s 4 Newest White Papers on the Supports Intensity.

Confirmatory Factor Analysis

Scott Irwin University of Illinois at Urbana-Champaign

Amir Ghaisi Grigorios Fountas Panos Anastasopoulos Fred Mannering

Computer Science Publications

Presentation transcript:

Towards a Model of Computer Systems Research Tom Anderson University of Washington

2 P2P vs. Systems Research P2P  No centralized control  Emergent behavior  Heavy tailed distributions  Incentives matter  Randomness helps Systems Research  No centralized control  Emergent behavior  Heavy tailed distributions?  Incentives matter?  Randomness hurts? This talk: Explain systems research using tools from P2P systems research Suggest some mechanisms to better align author and conference incentives

3 Mean Score + StdDev NSDI 08

4 Mean Score + StdDev OSDI 06

5 Mean Score + StdDev SOSP 07

6 Randomness is Fundamental? Little consensus as to what constitutes merit − Importance of problem? − Creativity of solution? − Completeness of evaluation? − Effectiveness of presentation? − All of the above? Large #’s of submissions makes consistency hard to achieve − Small PC, huge workload, burnout, lack of attention to detail − Large PC, lower workload, less consistency

7 SIGCOMM 06 Experiment Manage randomness explicitly − Large PC, split between “light” and “heavy” − Light + heavy PC: bin into accept, marginal, reject With as few reviews as possible Add reviews for papers with high variance Add reviews for papers at the margin Program committee meeting (just heavy PC) − Pre-accept half the papers − Pre-select 2x to discuss − Each paper under discussion read by at least 5 from heavy PC − Result: success disaster Little basis for discriminating between papers at the boundary

8 Two Models of Distribution of Merit

9 Citation Distribution for SOSP

10 Incentives for Marginal Effort With unit merit and no noise: − Impulse function at accept threshold With unit merit and noise, single conference: − Gaussian function at accept threshold With unit merit, high noise, and multiple conferences: − Peak incentive well below accept threshold − Repeated attempts without improving paper We’d like effort to reflect the underlying merit of the idea − Good ideas are pursued, even after publication − Mediocre ideas are published, and the author quickly moves on

11 A Modest Suggestion Reward, like merit, should be a continuous function Publish rank and error bars for every paper accepted at a conference − Computed automatically from individual PC ranking − Post-hoc (benefit from perspectives of all reviewers) After some time has elapsed, re-rank − Encourage continued effort on good ideas − Like test in time, but applied to all published papers

12 Afternoon Discussion Topics  Double-blind vs. single-blind reviews  Should authors disclose previous reviews of the same paper?  Are author-rebuttals useful?  When should ``open reviews'' be used?  Should we review the reviewers?  CS-wide citation reporting and indexing  Travel reduction  Decoupling publication from presentation  How do we quantify the merit of a conference?  Do PCs tend to favor PC-authored papers?  How random are PC decisions?  How big is the rejected-paper tumbleweed?

13 Afternoon Discussion Topics  Is there a correlation between PC size and conference impact?  Does overlapping membership between PCs decrease diversity?  Is there a correlation between number of papers accepted and quality?  Do overall scores predict what gets accepted?  What do authors like and dislike about reviews?  How to handle suspected author misbehavior  How to handle suspected reviewer misbehavior  When, why, and how to shepherd  Reviews of review-management software  Proposals for new or improved review-management features