DP for Optimum Strategies in Games

Presentation transcript:

DP for Optimum Strategies in Games J.-S. Roger Jang (張智星) jang@mirlab.org http://mirlab.org/jang MIR Lab, CSIE Dept. National Taiwan University

Outline
Game of dice sum
Game of colored Jenga

Game of Dice Sum
Description: Toss a die 8 times and, right after each toss, place the value into one of the 8 digit positions of 4 two-digit numbers. Then find the total of these 4 numbers. If the total is greater than 150, your score is 0; otherwise your score is the total.
Your goal: Find the optimum strategy for playing the game, that is, the one that maximizes the expected score.
Credit: Peter Norvig at Google, CS283: AI Programming Techniques (1989 at UC Berkeley)

Three-step Formula of DP: Step 1
Optimum-value function: D(p, q, s) = expected maximum score when
p: number of tens positions left
q: number of ones positions left
s: current sum of the game
Example: game state (1, 2, 67)
Credit: 賀正翔 (Dept. of Electrical Engineering)

Three-step Formula of DP: Steps 2 and 3
Step 2: Recurrent formula for the optimum-value function
Step 3: Answer: D(4, 4, 0)
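The transcript does not preserve the recurrent formula itself. One reconstruction that is consistent with the game rules and with the definition of D(p, q, s) is given here as an assumption, not as the slide's exact formula. Each die value d is equally likely, and the player picks the better of the two placements:

```latex
D(p, q, s) = \frac{1}{6} \sum_{d=1}^{6}
  \max\bigl\{\, D(p-1,\, q,\, s + 10d),\; D(p,\, q-1,\, s + d) \,\bigr\}
\qquad
D(0, 0, s) =
\begin{cases}
  s, & s \le 150 \\
  0, & s > 150
\end{cases}
```

When p = 0 or q = 0, the corresponding option is unavailable and is dropped from the max.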

Strategy during the Game
Based on the recurrent formula for the optimum-value function
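A minimal Python sketch of how the optimum-value function could drive play, assuming the recurrence is "average over the six die values of the better placement" with a terminal score of s when s is at most 150 and 0 otherwise. The names `LIMIT`, `options`, and `best_move` are hypothetical and do not come from the slides.

```python
from functools import lru_cache

LIMIT = 150  # totals above this value score 0 (from the game description)


def options(p, q, s, d):
    """List the legal placements of die value d as (label, resulting value)."""
    opts = []
    if p > 0:  # put d in a tens position: the sum grows by 10 * d
        opts.append(("tens", D(p - 1, q, s + 10 * d)))
    if q > 0:  # put d in a ones position: the sum grows by d
        opts.append(("ones", D(p, q - 1, s + d)))
    return opts


@lru_cache(maxsize=None)
def D(p, q, s):
    """Expected score under optimal play with p tens positions and
    q ones positions left and current sum s (assumed recurrence)."""
    if p == 0 and q == 0:
        return float(s) if s <= LIMIT else 0.0
    # Each die value is equally likely; take the better placement for each.
    return sum(max(v for _, v in options(p, q, s, d)) for d in range(1, 7)) / 6


def best_move(p, q, s, d):
    """Placement ('tens' or 'ones') with the larger expected score.
    Ties go to 'tens' because it is listed first."""
    return max(options(p, q, s, d), key=lambda o: o[1])[0]


if __name__ == "__main__":
    print("Expected score of the full game:", D(4, 4, 0))
    # Game state (1, 2, 67) from the slides, with a hypothetical roll of 3.
    print("Move at (1, 2, 67) after rolling 3:", best_move(1, 2, 67, 3))
```

During a game, call `best_move` after every toss with the current state; the memoized table behind `D` is filled once and then reused.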

Game of Colored Jenga
Description: http://codeforces.com/problemset/problem/424/E
Techniques: dynamic programming, hash table