Contributions of SCG to SDG Karl Lieberherr Northeastern University College of Computer and Information Science Boston, MA joint work with Ahmed Abdelmeged.

Presentation transcript:

Contributions of SCG to SDG. Karl Lieberherr, Northeastern University, College of Computer and Information Science, Boston, MA. Joint work with Ahmed Abdelmeged and Bryan Chadwick. Supported by Novartis. SCG = Scientific Community Game = Specker Challenge Game

Game Motto Want reliable software to solve a computational problem? Design a game where the winning team will create the software you want. (Want to teach a STEM domain? Design a game where the winning students demonstrate superior domain knowledge.) 12/17/2015Games for SD2

SCG: Make software development more scientific. Software developers – propose claims about their own software – oppose claims made by others about their software: refute claims, strengthen claims. A claim is defined by its refutation protocol.

Claims and Refutation Protocol. AliceClaim: I have a program that solves inputs in domain X with quality Q and resources R. – AliceClaim(X,Q,R). Bob is critical. He prepares a tough input in X and gives it to Alice, who applies her program. Bob refutes iff Alice fails to achieve quality Q within resources R. – Refutation protocol

Who are Alice and Bob? They are avatars developed by real Alice and real Bob. Alice and Bob compete with 10 other avatars in a full round-robin tournament. Who is the winner? The avatar with the highest reputation, i.e., the one with the strongest claims that were not successfully opposed (as in a real scientific community).

What we want: Engage software developers – let them produce software that models an organism that fends for itself in a real virtual world while producing the software we want. Have fun. Focus them – let them propose claims about the software they produce. Reward them when they defend their claims successfully or oppose the claims of others successfully.

Life with SCG(X)
With SCG: structured collaboration between software developers; frequent feedback; propose and oppose non-trivial claims to gain reputation (drive to win); knowledge accumulates in claims that have not been opposed successfully; management effort goes into X.
Without SCG: collaboration is unstructured and less effective; reputation gain is delayed; knowledge is scattered in emails, programs and minds; more management effort is required.

Opening the development approach. Problem to be solved: Develop the best practical algorithms for solving computational problems in domain X. Issue: There are probably hundreds of papers on the topic with isolated implementations. What are the best practical algorithms? Our solution: Use the scientific community game SCG(X) with a suitably designed claims language to compare the software. The winning avatar has the best practical algorithms/software.

Example: Independent Set. An independent set in a graph is a set of mutually nonadjacent vertices. The problem of finding a maximum independent set in a graph is one of the most fundamental NP-hard combinatorial problems.

Example: Independent Set claim IndSet(n, 0.9, t(n)): – Alice can construct graphs G with at most n vertices and she can construct a secret independent set I1 for G so that Bob, given G, size(I1) and t(n) minutes only, cannot find an independent set I2 with size(I2) >= size(I1)*0.9

Refutation Protocol. Alice constructs graph G and deposits her secret independent set I1. Alice gives G as well as the size of I1 to Bob. Bob has 10 minutes to construct his independent set I2, which he gives to Alice. Alice reveals her secret set I1. Bob wins iff size(I2) >= size(I1)*0.9
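
The winning condition of this refutation protocol can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `is_independent_set` and `bob_refutes` are hypothetical, the graph is assumed to be an undirected adjacency map, the time limit is not modelled, and treating an invalid secret set as an error is an assumption, not something the slides specify.

```python
from typing import Dict, Set

Graph = Dict[int, Set[int]]  # undirected graph as adjacency sets (assumed representation)


def is_independent_set(graph: Graph, vertices: Set[int]) -> bool:
    """True iff all vertices belong to the graph and no two of them are adjacent."""
    if not all(v in graph for v in vertices):
        return False
    return all(graph[v].isdisjoint(vertices) for v in vertices)


def bob_refutes(graph: Graph, secret_i1: Set[int], bob_i2: Set[int],
                ratio: float = 0.9) -> bool:
    """Bob wins iff I2 is a valid independent set with size(I2) >= ratio * size(I1)."""
    if not is_independent_set(graph, secret_i1):
        # Assumption: an invalid secret set is an administrator-level error.
        raise ValueError("Alice's secret set I1 is not an independent set of G")
    if not is_independent_set(graph, bob_i2):
        return False
    return len(bob_i2) >= ratio * len(secret_i1)


# Usage: a 5-cycle 0-1-2-3-4-0 with Alice's secret set {0, 2}.
cycle5: Graph = {0: {1, 4}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 0}}
print(bob_refutes(cycle5, {0, 2}, {1, 3}))  # Bob matches the secret size
print(bob_refutes(cycle5, {0, 2}, {1}))     # Bob's set is too small
```

Note that Bob only learns G and size(I1) before answering; the secret set itself is needed only for the final check.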

Benefits for IBM of using SCG(X): Teams perform know-how retrieval and integration and maybe some research. – Participating teams try to find the best knowledge in the area. – Claims language gives control! The non-opposed claims give hints about new X-specific knowledge. A well-tested solver for X-problems that integrates the current algorithmic knowledge in field X.

Benefits for IBM of using SCG(X): Also great for evaluating potential employees.

Avatars propose and oppose. [Figure: egoistic Alice and egoistic Bob, with the labels "reputation 1000" and "reputation 10"; proposed claims CA1, CA2, CA3, CA4 and CB1, CB2. Labelled steps: opposes; (1) provides problem; (2) solves problem not as well as she expected based on CA2; (3) WINS! / LOSES; transfer 200; social welfare.] Life of an avatar: (propose+ oppose+ provide* solve*)*

What is SCG(X)? [Figure: Team Alice and Team Bob each design a problem solver, develop software and deliver an avatar (Agent Alice, Agent Bob). Speech labels: "I am the best", "No!!", "Let's play constructively". Also shown: Administrator (SCG police).]

Competitive / collaborative. Avatar Alice: claim H; loses reputation r, wins knowledge k. Avatar Bob: opposes H, refutes: provides evidence for !H; wins reputation r, makes knowledge k public.

Disadvantages of SCG: The game is addictive. After spending 4 hours fixing his avatar and still losing against Alice, Bob really wants to know why! There is overhead in learning to define and participate in competitions. The administrator for SCG(X) must supervise the game perfectly, which includes checking the legality of X-problems. – If the admin does not, cheap play results. – Someone has to watch over the admin.

How to compensate for those disadvantages: Warn the scholars. Use a gentleman’s security policy: report administrator problems, don’t exploit them to win. Occasionally hold a non-counting “attack the administrator” competition to find vulnerabilities in the administrator – both generic and X-specific vulnerabilities.

Related Work: TopCoder

Conclusions: SCG has many applications of potential value to IBM – Training employees in constructive domains – Software development process – Hiring – Driving innovation in constructive domains

Thank you

Software Development Governance (SDG) is defined as: – Establishing chains of responsibility, authority and communication to empower people within a software development organization – Establishing measurement and control mechanisms to enable software developers, project managers and others within a software development organization to carry out their roles and responsibilities

Applications: Develop algorithms/software for a new computational domain X – Scientific Community Game Software Development: Describe a problem domain X so that SCG(X) provides the best algorithms and their implementations for problems in X (best within the participating scientific community).

Plan
Why is it relevant, useful? – Larger context: Open Innovation, Wikinomics – Applications: Netflix in the small, teaching
What is it? What is new? – Map problem domain to “second life”, find best solution there and map it back to real life.
What do we improve: benefits of SCG
How to use SCG
Disadvantages
Experience with current implementation
Related work
Detailed example
Conclusions

Introduction (2). Scientific Community Game(X) [SCG(X)] – Goal: Foster innovation and reliable software for solving optimization problems in some domain X. A virtual scientific community consists of virtual scholars that propose and oppose claims, maximizing their reputations.

Claim: Subdomain N (a subset of problems); Confidence in [0,1]; Valuation in [0,1]. [Figure: confidence on a 0-to-1 axis plotted against valuation (how well problems in N can be solved).]

Claim. [Figure: valuation axis with the labels "strengthening", "correct valuation" and "over strengthening".]

Hypothesis. Hypothesis by Alice: for all problems F in niche N there exists a solution J such that p(F,J). Bob opposes: he provides F’ to Alice; Alice cannot find J’ with p(F’,J’), therefore she loses reputation.

Full Round Robin Tournaments or Swiss-Style: Agents play the SCG(X). Repeat a few times, with feedback used to update the agents. Within the group of participating agents, the winning agent has the – best solver for X-problems – best supported knowledge about X
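
A full round-robin schedule with reputation bookkeeping can be sketched as follows. Everything here is a hypothetical illustration: the slides do not define the scoring rule, so the sketch assumes that every ordered pair (proposer, opposer) plays once and that each match transfers reputation zero-sum between the two agents. The names `full_round_robin`, `play_tournament` and `play_match` are made up for this example.

```python
from itertools import permutations
from typing import Callable, Dict, List, Tuple


def full_round_robin(agents: List[str]) -> List[Tuple[str, str]]:
    """Every ordered pair (proposer, opposer) of distinct agents, once."""
    return list(permutations(agents, 2))


def play_tournament(agents: List[str],
                    play_match: Callable[[str, str], float],
                    initial_reputation: float = 100.0) -> Dict[str, float]:
    """Run all matches; play_match returns the reputation moved to the proposer
    (negative if the opposer wins). Transfers are zero-sum by assumption."""
    reputation = {agent: initial_reputation for agent in agents}
    for proposer, opposer in full_round_robin(agents):
        delta = play_match(proposer, opposer)
        reputation[proposer] += delta
        reputation[opposer] -= delta
    return reputation


# Usage with a toy match rule: the agent with the higher (made-up) skill wins 10 points.
skill = {"alice": 3, "bob": 2, "carol": 1}
result = play_tournament(
    list(skill),
    lambda proposer, opposer: 10.0 if skill[proposer] > skill[opposer] else -10.0,
)
winner = max(result, key=result.get)
print(result, winner)
```

A Swiss-style tournament would replace `full_round_robin` with a pairing step that matches agents of similar current reputation in each round.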

What is the purpose of SCG? The purpose of playing an SCG(X) competition is to assess the "skills" of the agents in: – "approximating" optimization problems in domain X, – "figuring out" the wall-clock-time-bounded approximability of niches in domain X, – "figuring out" the hardest problems in a specific niche, and – "being aware" of the niches in which their own solution algorithm works best. This multi-faceted evaluation makes SCG(X) superior to competitions based on benchmarks, which only test the players' skills in approximating optimization problems. During SCG, players cross-test each other's skills.

How to use SCG: Company A provides a problem domain description X and submits it to the SCG server. The game SCG(X) runs on the web (with human algorithm/software developers involved) and company A receives good, tested software and knowledge about problem domain X.

Benefit: Improving Benchmark-Driven Algorithm Development. A number of autonomous teams. Each team develops an agent that embodies its own heuristics. Agents participate in a competition (various benchmarks are used). Teams develop their agents for the next competition. Examples: SAT, CSP, SMT, ASP (Answer Set Programming) competitions, etc.

From Benchmark-Driven to SCG-Driven Algorithm Development. With static benchmarks it is hard to measure and detect what is fraud. Instead: design a system that needs a much weaker “gentleman’s agreement” or none at all. The static benchmark problem is ONE problem that SCG solves, through dynamic benchmarks. Others: crowdsourcing management, and a new software development process that engages software developers and fosters ease of evolution (e.g., good separation of concerns, …).

Problems with Static Benchmarks. competition/index.shtml Policy against special purpose solutions: "The purpose of the competition is to be as informative as possible about strengths and weaknesses of … Submission of special purpose programs for solving certain benchmark problems falsifies the information that we get from the rankings and goes against the spirit of the competition. … the use of special purpose programs for certain benchmarks can rightfully be considered as scientific fraud. We appeal to participants …"

SCG-Driven Algorithm Development. Differences to Benchmark-Driven: – You don’t rank chess players by giving them a benchmark; you let them play. – We turn the algorithms into egoistic virtual scientists that fend for themselves. – Social welfare: constructive knowledge based on good algorithms.

What is SCG(X). [Figure: axis "degree of automation used by scholar" from 0 to 1, with the labels "no automation, human plays", "some automation, human plays", "full automation, agent plays" and "our focus". Further labels: "more applications: test constructive knowledge", "transfer to reliable, efficient software", "agent", "Bob", "Alice".]

Scholars and Agents: Same rules. They are encouraged to
1. offer results that are not easily improved.
2. offer results that they can successfully support.
3. strengthen results, if possible.
4. stay active and publish new results or oppose current results.
5. become famous!

More Applications: Special issue editors for problem domain X publish the top 15 submissions. A professor teaching a software development class: students develop fighting agents for a full round-robin tournament. Teaching constructive topics, etc.

Soundness. Theorem: SCG is sound: the agent with the best algorithms / knowledge wins (there is no way to cheat). – best: within the group of participating agents – issues: Does an agent win because she is good at solving? Or good at proposing, opposing and providing? Answer: proposing, opposing and providing all reduce to solving.

Justifying benefits (1). Benefit: competitive – collaborative. Game component: hypotheses (propose, oppose) and problems (provide, solve). How this game component brings the benefit: – Hypothesis by Alice: for all problems F in niche N there exists a solution J such that p(F,J). – Bob opposes: he provides F’ to Alice; Alice cannot find J’ with p(F’,J’), therefore she loses reputation. – Alice lost, but she now knows a problem F’ on which she cannot achieve what she claimed. F’ was harder than Alice expected.

Justifying benefits (2). Benefit: competitive – collaborative. Game component: hypotheses (propose, oppose) and problems (provide, solve). How this game component brings the benefit: – Hypothesis HA by Alice: for all problems F in niche N there exists a solution J such that p(F,J). – Bob opposes by non-trivially strengthening HA to HB, where HB => HA. Alice cannot discount HB, therefore she loses reputation. – Alice lost, but she now knows that her hypothesis HA might not be the strongest.

Benefits of SCG-driven: Focus on understanding the problem domain. – What are the niches where specialized algorithms perform well? – What are the hard problems in a niche? Knowledge maintenance system. Control of niches to be explored.

Reputation Gain. Challenging (C): gain for A (A supporting), loss for A (B discounting).

How to use SCG(X): ABB needs new ideas about how to solve optimization problems in domain X. Define a hypothesis language for X – X-problems – hypotheses, including the protocol. Submit the hypothesis language definition to the SCG server.

How to use SCG(X): Offer prize money for the winner with conditions, e.g., performance must be at least 10% higher than the performance of agent XY that ABB provides. 10 teams from 6 countries sign up, committing to 6 competitions. Player executables become known to other players after each competition. One team is from ABB. The SCG server sends them the basic agent and the administrator for testing.

How to use SCG(X) Game histories known to all. Data mining! First competition is at on day 1. Registration starts at on same day. The competition lasts 2.5 hours. Repeat on days 7, 14, … 42. The final winner is: Team Mumbai, winning Euro. Delivers source code and design document describing winning algorithm to ABB. 12/17/201549Games for SD

Benefits for ABB of using SCG(X): Teams perform know-how retrieval and integration and maybe some research. – Participating teams try to find the best knowledge in the area. – Hypothesis language gives control! The non-discounted hypotheses give hints about new X-specific knowledge. A well-tested solver for X-problems that integrates the current algorithmic knowledge in field X.

GIGO: Garbage in / Garbage out. If all agents are weak, no useful solver is created. WEAK against STRONG: – STRONG refutes a claim that is true but that WEAK cannot support. Correct knowledge might be discounted. – STRONG strengthens a hypothesis so much that it becomes discountable, but WEAK cannot discount it. Incorrect knowledge might be supported. – The game rules discourage STRONG from exploiting WEAK.

Experience: Used for 3 years in an undergraduate Software Development course. Prerequisites: 2 semesters of Introductory Programming, Object-Oriented Design, Discrete Structures, Theory of Computation. – Collect and integrate knowledge from prerequisite courses, lectures, and literature. – Teach it to the agent. 30% of the grade is allocated to agent performance in weekly competitions.

Mechanics of using current implementation: We define X = MAX-CSP. We produce the administrator and a baby agent for X at the beginning of the course. Game flow: – Agents register with the administrator. – After the deadline, the administrator tells each agent when it is its turn (1 minute), sending it all currently proposed hypotheses. – After 1 minute, the agent sends back its transactions.

Mechanics of using current implementation: 3 competitions per week, lasting about 12 hours each. 75% of competitions count towards the grade. 1 competition: attack the administrator.

Experience: MAX-CSP. MAX-CSP problem decompositions: T-Ball (one relation), Softball (several relations, one implication tree), Baseball (several relations). ALL, SECRET.

Stages for SECRET T-Ball: MAXCUT – R(x,y) = (x != y) – fair coin: 1/2 – maximally biased coin: 1/2 – semidefinite programming / eigenvalue minimization

Stages for SECRET T-Ball: One-in-three – R(x,y,z) = (x+y+z=1) – fair coin: 3/8 – optimally biased coin: 4/9

Stages for ALL Baseball: Propose/Oppose/Provide/Solve – based on fair coin – optimally biased coin: correctly optimize polynomials – correctly eliminate noise relations – correctly implement weights – …

Life with SCG
With SCG: structured collaboration between scholars; frequent feedback; motivation: propose and oppose non-trivial hypotheses to gain reputation (drive to win); knowledge accumulates in undiscounted hypotheses; target scholars on a topic.
Without SCG: collaboration is unstructured and less effective; motivation: reputation gain is delayed; knowledge is scattered in emails, programs and minds; more management effort is required.

How to model a hypothesis: A problem space. A discounting predicate on the problem space. A protocol to set the predicate through alternating “moves” (decisions) by Alice and Bob. If the predicate becomes true, Alice wins.

How to model a hypothesis: Proposing and challenging a hypothesis is risky: your opponent has much freedom to choose its decisions within the game rules. Alternating quantifiers. Replace “exists” by an agent algorithm kept by the administrator.

Hypothesis [Example]: 1in3 example.

X = Boolean MAX-CSP: Given a sequence of Boolean constraints formulated using a set R of Boolean relations, find an assignment that maximizes the fraction of satisfied constraints. A niche is defined by R.

1in3 niche: Only relation 1in3 is used. 1in3 problem F over variables v1 v2 v3 v4 v5: 1in3(v1 v2 v3), 1in3(v2 v4 v5), 1in3(v1 v3 v4), 1in3(v3 v4 v5). [Truth table of relation 1in3 and Alice's secret assignment not recoverable.] Secret quality SQ = 3/4
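
The quality of an assignment for this problem F can be checked with a short Python sketch. The function names and the dictionary encoding of assignments are assumptions made for illustration. The sample assignment below is just one that reaches quality 3/4; it is not claimed to be Alice's secret assignment, which is lost in this transcript.

```python
from fractions import Fraction
from itertools import product
from typing import Dict, List, Tuple

# The four 1in3 constraints of problem F, written as variable indices.
F: List[Tuple[int, int, int]] = [(1, 2, 3), (2, 4, 5), (1, 3, 4), (3, 4, 5)]
VARIABLES = [1, 2, 3, 4, 5]


def one_in_three(x: int, y: int, z: int) -> bool:
    """Relation 1in3: exactly one of the three Boolean values is 1."""
    return x + y + z == 1


def quality(constraints: List[Tuple[int, int, int]],
            assignment: Dict[int, int]) -> Fraction:
    """Fraction of constraints satisfied by the assignment."""
    satisfied = sum(one_in_three(*(assignment[v] for v in c)) for c in constraints)
    return Fraction(satisfied, len(constraints))


def best_quality(constraints: List[Tuple[int, int, int]],
                 variables: List[int]) -> Fraction:
    """Exhaustive search over all assignments; only viable for tiny problems."""
    return max(
        quality(constraints, dict(zip(variables, values)))
        for values in product((0, 1), repeat=len(variables))
    )


sample = {1: 0, 2: 0, 3: 1, 4: 0, 5: 0}  # only v3 is true
print(quality(F, sample))                # 3/4
print(best_quality(F, VARIABLES))        # 1
```

For this particular F an assignment satisfying all four constraints exists (v1 = v5 = 1, the rest 0), so a secret quality of 3/4 leaves room for an opponent who searches well.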

1in3 Hypothesis. 1in3 hypothesis H proposed by Alice: there exists F in the 1in3 niche so that for all S_Bob that opponent Bob finds by searching for t seconds (t a small constant): Quality(F, S_Bob) < 0.4 * Quality(F, S_Alice). H = (niche = (1in3), AR = 0.4, confidence = 0.8). Bob has clever knowledge that Alice does not have. He opposes the hypothesis H by challenging it using his randomized algorithm.

Bob’s clever knowledge: 4/9 for 1in3. For all F in the 1in3 niche, there exists S so that Quality(F,S) >= 4/9 * SQ. Proof: la(p) = 3*p*(1-p)^2 has the maximum 4/9 on [0,1]; argmax of la(p) for p in [0,1] is 1/3. Without search, in PTIME: derandomize. Bob successfully discounts. Alice gets a hint. – Was Bob just lucky?
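
The 4/9 argument can be made concrete with a small Python sketch. It evaluates la(p) exactly and then derandomizes the biased coin by the method of conditional expectations, which is one standard way to do this; the slides only say "Derandomize", so the choice of technique, the function names and the example instance are assumptions. The sketch also assumes that the three variables of each constraint are distinct.

```python
from fractions import Fraction
from itertools import product
from typing import Dict, List, Tuple

Constraint = Tuple[int, int, int]


def la(p: Fraction) -> Fraction:
    """Probability that a 1in3 constraint holds when each variable is 1 with probability p."""
    return 3 * p * (1 - p) ** 2


def prob_exactly_one(c: Constraint, partial: Dict[int, int], p: Fraction) -> Fraction:
    """P(exactly one variable of c is 1), with fixed variables taken from `partial`
    and every other variable independently 1 with probability p."""
    total = Fraction(0)
    for bits in product((0, 1), repeat=3):
        if sum(bits) != 1:
            continue
        weight = Fraction(1)
        for var, bit in zip(c, bits):
            if var in partial:
                weight *= 1 if partial[var] == bit else 0
            else:
                weight *= p if bit else 1 - p
        total += weight
    return total


def expected_satisfied(cs: List[Constraint], partial: Dict[int, int], p: Fraction) -> Fraction:
    """Expected number of satisfied constraints given the partial assignment."""
    return sum((prob_exactly_one(c, partial, p) for c in cs), Fraction(0))


def derandomize(cs: List[Constraint], variables: List[int],
                p: Fraction = Fraction(1, 3)) -> Dict[int, int]:
    """Fix variables one by one, never letting the conditional expectation drop."""
    partial: Dict[int, int] = {}
    for var in variables:
        scores = {bit: expected_satisfied(cs, {**partial, var: bit}, p) for bit in (0, 1)}
        partial[var] = max(scores, key=scores.get)
    return partial


F = [(1, 2, 3), (2, 4, 5), (1, 3, 4), (3, 4, 5)]
assignment = derandomize(F, [1, 2, 3, 4, 5])
satisfied = expected_satisfied(F, assignment, Fraction(1, 3))  # exact count once all variables are fixed
print(la(Fraction(1, 3)), satisfied / len(F))
```

Because the better of the two choices for each variable is at least the weighted average of both, the final assignment satisfies at least a 4/9 fraction of the constraints, with no search and in polynomial time.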

1in3 Hypothesis: Bob does not know whether 4/9 is best possible. He should check semidefinite programming. Bob only knows that deciding whether a 1in3 problem has a solution satisfying a fraction 4/9 + eps, eps > 0, is NP-complete.

Related Work: Renaissance mathematicians. Various benchmark-based competitions. What is new? – Software that has an ego – Holistic software with introspection – Evaluating software through a game – Scientific Community Game Software Development

Conclusions: To address a problem domain X: – “map it to second life”: define a scientific community game for X on the web: SCG(X) – let the game SCG(X) run a few times and choose the winner. Benefits: – Evaluates fairly, frequently, constructively and dynamically. Encourages retrieval of state-of-the-art know-how, integration and discovery. – Challenges humans, drives innovation, both competitive and collaborative. – Agents point humans to what needs attention in the problem solution / software.

Conclusions: SCG(X) provides a structured process for developing software for optimization problems. Benefits: – Social engineering: makes it fun through a game. – Fair: only hard work makes you win. – Engage a large community on one domain X. Tools