Four-Cut: An Approximate Sampling Procedure for Election Audits

Slides:



Advertisements
Similar presentations
Mark Wang John Sturm Sanjeev Kulkarni Paul Cuff.  Basic Background – What is the problem?  Condorcet = IIA  Survey Data  Pairwise Boundaries = No.
Advertisements

ThreeBallot, VAV, and Twin Ronald L. Rivest – MIT CSAIL Warren D. Smith - CRV Talk at EVT’07 (Boston) August 6, 2007 Ballot Box Ballot Mixer Receipt G.
1 Maximal Independent Set. 2 Independent Set (IS): In a graph G=(V,E), |V|=n, |E|=m, any set of nodes that are not adjacent.
Social Choice Topics to be covered:
Chapter 19 Confidence Intervals for Proportions.
1. Estimation ESTIMATION.
Test 2 Stock Option Pricing
© 2003 All Rights Reserved, Robi Polikar, Rowan University, Dept. of Electrical and Computer Engineering Lecture 8 Engineering Statistics Part II: Estimation.
This material in not in your text (except as exercises) Sequence Comparisons –Problems in molecular biology involve finding the minimum number of edit.
August 6, 2007Electronic Voting Technology 2007 On Estimating the Size and Confidence of a Statistical Audit Javed A. Aslam College of Computer and Information.
Chapter 14 Simulation. Monte Carlo Process Statistical Analysis of Simulation Results Verification of the Simulation Model Computer Simulation with Excel.
Audit Purpose of Audit Quality assurance procedure Check accuracy of machine tally of ballots Ballots for a contest are sampled, manually verified, and.
Calculating Staff Requirements for Hand Counts By Anthony Stevens Assistant Secretary of State December 19, 2007.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Revisiting Sampling Concepts. Population A population is all the possible members of a category Examples: the heights of every male or every female the.
CS270 Project Overview Maximum Planar Subgraph Danyel Fisher Jason Hong Greg Lawrence Jimmy Lin.
Univariate Gaussian Case (Cont.)
Chapter 9 Audit Sampling – Part a.
Copyright © 2009 Pearson Education, Inc. Chapter 11 Understanding Randomness.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Estimation and Confidence Intervals Chapter 9.
Election Day.
US Government and Politics
Unit 5: Ante Up Types of Elections.
Univariate Gaussian Case (Cont.)
Ronald L. Rivest MIT NASEM Future of Voting Meeting June 12, 2017
Chapter 8: Estimating with Confidence
Experiments, Simulations Confidence Intervals
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
ThreeBallot, VAV, and Twin
Sampling Distributions and Estimation
Chapter 10: The Manipulability of Voting Systems Lesson Plan
Elections in Canada are a giant race.
Audit Thoughts Ronald L. Rivest MIT CSAIL Audit Working Meeting
Maximal Independent Set
CHAPTER 12: Introducing Probability
Ronald L. Rivest MIT NASEM Future of Voting December 7, 2017
[ March 9, 2017] [ Bill Bowles, Audit Supervisor]
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
CS 4/527: Artificial Intelligence
Software Reliability Models.
Remember that our objective is for some density f(y|) for observations where y and  are vectors of data and parameters,  being sampled from a prior.
Quick Sort (11.2) CSE 2011 Winter November 2018.
Electoral College Simulation
Instructors: Fei Fang (This Lecture) and Dave Touretzky
Confidence Intervals: The Basics
Bayesian audits (by example)
≠ Particle-based Variational Inference for Continuous Systems
Chapter 8: Estimating with Confidence
Experimental Design: The Basic Building Blocks
TECHNIQUES OF INTEGRATION
LECTURE 09: BAYESIAN LEARNING
Chapter 8: Estimating with Confidence
Utility Billing Balancing the Accounts Receivable
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
How the Electoral College Works STEPS TO BECOMING A PRESIDENT
Chapter 8: Estimating with Confidence
Unsupervised Learning: Clustering
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Risk Limiting Audits Nuts, Bolts, and Paperclips
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Sampling Plans.
Presentation transcript:

Four-Cut: An Approximate Sampling Procedure for Election Audits Mayuri Sridhar Ronald L. Rivest

Overview We present a new way of picking a random sample for election audits This method avoids having to count ballots and, thus, is more efficient However, the sample is now only “approximately uniformly” random. We show how to mitigate for the approximations in RLAs and Bayesian audits

Goals

Ballot-Polling RLA Procedure Sample some ballots uniformly at random from the cast votes Produce a sample tally for the contest: Ex: 70 votes for Alice, 30 votes for Bob If the sample tally satisfies the risk limit, the audit is finished If not, sample more ballots -define sample tally

Goals Can we make the sampling process faster? Yes! However, the samples will only be approximately random

Assumptions We expect to sample 1-2 ballots per batch We expect the ballot manifest to be accurate, in terms of the number of ballots per batch All ballots are in a straight pile

Assumptions What is a “cut”? Remove some ballots from the top of the stack and place them on the bottom The person making a single cut chooses some ballots and places them at the bottom The person making the cut cannot see the vote on the ballot that will end up on top

k-Cut Overview k-Cut Given a pile of ballots from which to select sample Make k cuts Choose the ballot on top and add it to the sample Repeat until sample has desired size -make sure you define k

Typical Sampling Plan Ballot 25 from Batch 1 Ballots 50, 132 from Batch 3 Ballot 92 from Batch 4 … -make sure you define k

Single Ballot Speed Comparison Uniformly random audit plan Choose ballot 50 from batch 3 4-cut audit plan Get the set of ballots in batch 3 Make 4 cuts and choose the ballot on top -make sure you define k

Speed Comparison

Speed Comparison Counting: 3 ballots per second Cutting: 15 seconds per 4 cuts If we have to count more than 45 ballots, then Four-Cut is more efficient!

Properties

Is k-cut good enough? How close is choosing the ballot on top after k cuts to choosing a ballot at random? How much does “approximate sampling” affect the auditing procedure?  Can we compensate?

Distance to “truly” random Infinite-Time Convergence As the number of cuts increases, any card will be equally likely to be on top Finite-Time Convergence: Distance from the uniform distribution decreases exponentially with k, the number of cuts

Variational Distance to Uniform How close to uniform? Number Of Cuts Variational Distance to Uniform 2 0.1111 3 0.0089 4 0.0009 5 0.0008 6 0.0001

Approximate Sampling Effect After 4 cuts, the distance to the uniform distribution is small Implies that the change in margin in the sample is small In particular, we can analyze a 2-candidate race, with 100,000 ballots

Approximate Sampling Effect Sample Size Maximum Change in Margin (with 99% probability) 25 1 30 50 100 300 2 500 3

What can go wrong? The sample tally satisfies the risk limit, but, in reality, the election result is incorrect We stop the audit without realizing the election result is incorrect.

RLA Mitigation Procedure We know that the margin between any pair of candidates changes by at most 1 vote with 99% probability For any sample, we can move 1 ballot from the reported winner to the runner-up

What does this tell us? After the ballot adjustment, with 1% probability, the reported winner only wins because of the approximate sampling A risk limit of 0.05 in the original audit becomes a risk limit of 0.06

What does this tell us? We might have to sample more ballots due to the sample tally adjustments However, the sampling can be done much faster

Bayesian Audits

Bayesian Audit Overview Sample ballots uniformly at random For any given sample tally Run a “restore” simulation to model unsampled ballots Compute winner of sampled + simulated ballots

What happens to the risk? The mitigation procedure is also safe for Bayesian audits However, we can find a more efficient bound for Bayesian audits Most of the time, the sample tallies won’t need to be updated.

Acknowledgements and Contributions

Acknowledgements Thank you to participants in Indiana pilot audit May 30, 2018, which provided the photos and videos.

Use Cases Our approximate sampling procedure is primarily for use with ballot-polling audits, but can be extended for comparison audits The analysis shows how approximate sampling affects the statistics in RLAs and Bayesian audits

Open Problems Understanding the distribution of cuts, in practice Techniques for handling missing or extra ballots Generalizing to handle non-plurality elections

Contributions Designed an approximate sampling procedure to improve the speed of sampling for post-election audits Analyzed how approximate sampling affects risk for RLAs and Bayesian audits Showed how to adjust risk limit and sample tallies to correct for approximate sampling in both audits