1 On Good and Fair Paper- Reviewer Assignment Cheng LONG, Raymond Chi-Wing Wong, Yu Peng, Liangliang Ye The Hong Kong University of Science and Technology.

Slides:



Advertisements
Similar presentations
How to write a review. Outline What is a review? Why should you review? How do you review a paper? What not to do? What are the dilemmas? Case study.
Advertisements

Algorithm Design Methods Spring 2007 CSE, POSTECH.
Min-Max Relations, Hall’s Theorem, and Matching-Algorithms Graphs & Algorithms Lecture 5 TexPoint fonts used in EMF. Read the TexPoint manual before you.
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
T h e G a s L a w s. T H E G A S L A W S z B o y l e ‘ s L a w z D a l t o n ‘ s L a w z C h a r l e s ‘ L a w z T h e C o m b i n e d G a s L a w z B.
Progressive Approach to Relational Entity Resolution Yasser Altowim, Dmitri Kalashnikov, Sharad Mehrotra Progressive Approach to Relational Entity Resolution.
EMIS 8373: Integer Programming Valid Inequalities updated 4April 2011.
A Probabilistic Framework for Semi-Supervised Clustering
CIKM’2008 Presentation Oct. 27, 2008 Napa, California
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
The Complexity of the Network Design Problem Networks, 1978 Classic Paper Reading
Research Topics of Potential Interest to Geography COMPUTER SCIENCE Research Away Day, 29 th April 2010 Thomas Erlebach.
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause Carlos Guestrin Joe Hellerstein.
Network Optimization Models: Maximum Flow Problems In this handout: The problem statement Solving by linear programming Augmenting path algorithm.
[1][1][1][1] Lecture 5-7: Cell Planning of Cellular Networks June 22 + July 6, Introduction to Algorithmic Wireless Communications David Amzallag.
1 Cell Planning of 4G Cellular Networks David Amzallag Computer Science Department, Technion Joint work with Roee Engelberg (Technion), Seffi Naor (Microsoft.
Graph-Cut Algorithm with Application to Computer Vision Presented by Yongsub Lim Applied Algorithm Laboratory.
CSE 550 Computer Network Design Dr. Mohammed H. Sqalli COE, KFUPM Spring 2007 (Term 062)
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
Approximation Algorithms: Bristol Summer School 2008 Seffi Naor Computer Science Dept. Technion Haifa, Israel TexPoint fonts used in EMF. Read the TexPoint.
CBLOCK: An Automatic Blocking Mechanism for Large-Scale Deduplication Tasks Ashwin Machanavajjhala Duke University with Anish Das Sarma, Ankur Jain, Philip.
1 Efficient Algorithms for Optimal Location Queries in Road Networks Zitong Chen (Sun Yat-Sen University) Yubao Liu (Sun Yat-Sen University) Raymond Chi-Wing.
CSC 177 Research Paper Review Chad Crowe. A Microeconomic Data Mining Problem: Customer-Oriented Catalog Segmentation Authors: Martin Ester, Rong Ge,
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
Graph Partitioning Donald Nguyen October 24, 2011.
Customized Tour Recommendations in Urban Areas Date : 2014/11/20 Author: Aristdes Gionis, Konstaninos Pelechrinis, Theodoros Lappas, Evimaria Terzi Source:
C&O 355 Mathematical Programming Fall 2010 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
Optimizing Plurality for Human Intelligence Tasks Luyi Mo University of Hong Kong Joint work with Reynold Cheng, Ben Kao, Xuan Yang, Chenghui Ren, Siyu.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Lecture 16 Maximum Matching. Incremental Method Transform from a feasible solution to another feasible solution to increase (or decrease) the value of.
1 Customer-Aware Task Allocation and Scheduling for Multi-Mode MPSoCs Lin Huang, Rong Ye and Qiang Xu CHhk REliable computing laboratory (CURE) The Chinese.
On Energy-Efficient Trap Coverage in Wireless Sensor Networks Junkun Li, Jiming Chen, Shibo He, Tian He, Yu Gu, Youxian Sun Zhejiang University, China.
December 7-10, 2013, Dallas, Texas
1 On Completing Latin Squares Iman Hajirasouliha Joint work with Hossein Jowhari, Ravi Kumar, and Ravi Sundaram.
Boolean Minimizer FC-Min: Coverage Finding Process Petr Fišer, Hana Kubátová Czech Technical University Department of Computer Science and Engineering.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Efficient k-Coverage Algorithms for Wireless Sensor Networks Mohamed Hefeeda.
Guided Learning for Role Discovery (GLRD) Presented by Rui Liu Gilpin, Sean, Tina Eliassi-Rad, and Ian Davidson. "Guided learning for role discovery (glrd):
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu Yu Kang, Yangfan Zhou, Zibin Zheng, and Michael R. Lyu {ykang,yfzhou,
On Concise Set of Relative Candidate Keys Shaoxu Song (Tsinghua), Lei Chen (HKUST), Hong Cheng (CUHK)
Tung-Wei Kuo, Kate Ching-Ju Lin, and Ming-Jer Tsai Academia Sinica, Taiwan National Tsing Hua University, Taiwan Maximizing Submodular Set Function with.
CS223 Advanced Data Structures and Algorithms 1 Maximum Flow Neil Tang 3/30/2010.
ANTs PI meeting, Dec. 17, 2001Washington University / DCMP1 Flexible Methods for Multi-agent Distributed Resource Allocation by Exploiting Phase Transitions.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
Client Assignment in Content Dissemination Networks for Dynamic Data Shetal Shah Krithi Ramamritham Indian Institute of Technology Bombay Chinya Ravishankar.
1 Finding Competitive Price Yu Peng (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)
CPS Computational problems, algorithms, runtime, hardness (a ridiculously brief introduction to theoretical computer science) Vincent Conitzer.
Trajectory Simplification: On Minimizing the Direction-based Error
Algorithm Design Methods 황승원 Fall 2011 CSE, POSTECH.
How to survive the review process HSE, Moscow November 2015.
Efficient Semi-supervised Spectral Co-clustering with Constraints
A Maximum Entropy Language Model Integrating N-grams and Topic Dependencies for Conversational Speech Recognition Sanjeev Khudanpur and Jun Wu Johns Hopkins.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Submodularity Reading Group Matroids, Submodular Functions M. Pawan Kumar
::Network Optimization:: Minimum Spanning Trees and Clustering Taufik Djatna, Dr.Eng. 1.
Submodularity Reading Group Matroids, Submodular Functions M. Pawan Kumar
Finding Dense and Connected Subgraphs in Dual Networks
Lecture 16 Bipartite Matching
Special Graphs: Modeling and Algorithms
Distributed Submodular Maximization in Massive Datasets
Lecture 16 Maximum Matching
Coverage and Distinguishability in Traffic Flow Monitoring
Conflict-Aware Event-Participant Arrangement
Problem Solving 4.
Algorithm Design Methods
Maximum Flow Neil Tang 4/8/2008
Presentation transcript:

1 On Good and Fair Paper- Reviewer Assignment Cheng LONG, Raymond Chi-Wing Wong, Yu Peng, Liangliang Ye The Hong Kong University of Science and Technology 10 Dec., 2013

Paper Reviewer Assignment Peer review Assign papers to reviewers (PCs) 756 submissions and 234 PCs [ICDM’12] Automatic paper-reviewer assignment 2

Existing solution (1) Retrieval-based method Retrieve expertise for each paper Matching-based method Phase 1: Bipartite graph construction Phase 2: Matching computation 3 Un-balanced Sensitive to the processing order Simply heuristic-based Example next …

Existing solution (2) 4 p1p1 t1t1 t2t2 t3t3 t4t4 t5t5 r4r4 r3r3 r2r2 r1r1 r4r4 r3r3 r2r2 r1r1 p1p Phase 1 task: Assign 2 reviewers to p 1 Phase 2 In the returned assignment, only 3 topics of p 1 are covered (t 1, t 2, t 3 ) A better solution: {(p 1, r 1 ), (p 1, r 4 )} 5 topics of p 1 are covered. r4r4 r3r3 r2r2 r1r1 p1p The paper being presented: topic 1: Assignment topic 2: COI topic 3: Submodularity Me as a reviewer: topic 1: social network topic 2: assignment topic 3: algorithmic design

A New Problem: MaxTC-PRA Maximum Topic Coverage Paper Reviewer Assignment (MaxTC-PRA) Three constraints: Paper Demand Constraint Reviewer Workload Constraint Conflict-of-interest Constraint (COI Constraint) One Objective: The total number of distinct topics of papers that are covered is maximized. 5

Solution MaxTC-PRA cannot be fit in the framework of matching-based method Weights are dependent! NP-hard! Greedy algorithm 1/3-factor approximation Sub-modularity P-system 6

COI study (1) Issue 1: What types of author-reviewer relationship should be considered as COI? Co-author relationship Colleague relationship Advisor-advisee relationship Competitor relationship 7 The reviewer would be biased against the paper submitted by the author Issue 1 Issue 2 Issue 3 p1p1 t1t1 t2t2 t3t3 t4t4 t5t5 r4r4 r3r3 r2r2 r1r1 p2p2 a2a2 a1a1 r1r1 The reviewer would be biased for the paper submitted by the author Competitor relationship: r 1 vs. a 2.

Empirical Study (1) Datasets Paper set: 496 papers published in KDD’06-10 Reviewer set: 550 PCs of ICDM’10 and those of KDD’10 Topic set: 49 subject areas of KDD’11 Algorithms Greedy and ILP 8

Empirical Study (2) Greedy improves the topic coverage usually by 10% - 15%. 9

Conclusion A new problem: MaxTC-PRA NP-hardness of MaxTC-PRA and a 1/3-factor approximation COI study: three issues Empirical study 10

Q & A 11

Solution (2): A Greedy Algorithm Greedy Process Start with an assignment with no matches Repeatedly augment the assignment with the match which has the greatest marginal gain in terms of our objective and satisfies each of the three constraints Stop when no matches could be included 12

COI study (2) Issue 2: Is it reliable to use the COI information specified by the authors and/or the reviewers only? Specifying COI manually is tedious (due to large program committees) Some COIs would be left un-specified Collect COIs automatically? 13 Issue 1 Issue 2 Issue 3

COI study (3) Issue 3: Is it always good to use as many COIs as possible? Fairer! But, side-effect on goodness Fairness vs. goodness 14 Issue 1 Issue 2 Issue 3

Empirical Study (2) COIs study: Effects on fairness Competitor > Co-author > Colleague > Advisor-advisee 15

Empirical Study (2) COIs study: Effects on goodness COIs have no significant effects on the goodness of the paper-reviewer assignment. 16