DP for Optimum Strategies in Games

Presentation transcript:

DP for Optimum Strategies in Games. J.-S. Roger Jang (張智星), jang@mirlab.org, http://mirlab.org/jang. MIR Lab, CSIE Dept., National Taiwan University.

Outline: (1) Game of dice sum. (2) Game of colored Jenga.

Game of Dice Sum. Description: Toss a die 8 times and, right after each toss, place the value into one of the digit positions of 4 two-digit numbers. Find the total of these 4 numbers. If the total is bigger than 150, your score is 0. Otherwise your score is the total. Your goal: Find the optimum strategy to play the game such that the expected score is maximized. Credit: Peter Norvig at Google, CS283: AI Programming Techniques (1989 at UC Berkeley).
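To make the scoring rule concrete, here is a minimal sketch of how one finished game could be scored. The function name `dice_sum_score` and the 'T'/'O' encoding of placements are assumptions made for illustration; they do not come from the slides.

```python
def dice_sum_score(rolls, placements):
    """Score one finished game of dice sum.

    rolls:      the 8 die values, in toss order.
    placements: 8 characters, 'T' if the toss went into a tens position,
                'O' if it went into a ones position (hypothetical encoding).
    """
    if len(rolls) != 8 or len(placements) != 8:
        raise ValueError("a game has exactly 8 tosses")
    if placements.count("T") != 4 or placements.count("O") != 4:
        raise ValueError("4 tens positions and 4 ones positions must be filled")
    # A die in a tens position contributes 10 times its value to the total.
    total = sum(10 * d if where == "T" else d
                for d, where in zip(rolls, placements))
    # A total above 150 scores 0; otherwise the score is the total.
    return total if total <= 150 else 0
```

For instance, putting four 1s in the tens positions and four 6s in the ones positions gives a total of 40 + 24 = 64, while the reverse placement gives 244 and therefore a score of 0.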

Three-step Formula of DP: Step 1. Optimum-value function: D(p, q, s) = expected maximum score, where p is the number of tens positions left, q is the number of ones positions left, and s is the current sum of the game. Example: game state (1, 2, 67). Credit: 賀正翔, Dept. of Electrical Engineering.
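The state (p, q, s) changes in a simple way each time a toss is placed. The sketch below shows one possible transition function; the name `place` and the "tens"/"ones" labels are hypothetical and only illustrate the state definition.

```python
def place(state, die, position):
    """Return the next (p, q, s) state after placing `die` at `position`.

    state:    tuple (p, q, s) as defined by the optimum-value function.
    position: "tens" or "ones" (hypothetical labels).
    """
    p, q, s = state
    if position == "tens":
        if p == 0:
            raise ValueError("no tens position left")
        return (p - 1, q, s + 10 * die)  # a tens digit adds 10 * die
    if position == "ones":
        if q == 0:
            raise ValueError("no ones position left")
        return (p, q - 1, s + die)       # a ones digit adds die
    raise ValueError("position must be 'tens' or 'ones'")
```

From the example state (1, 2, 67), a toss of 6 leads to (0, 2, 127) if placed in the tens position and to (1, 1, 73) if placed in a ones position.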

Three-step Formula of DP: Steps 2 and 3. Step 2: Recurrence formula for the optimum-value function. Step 3: Answer: D(4, 4, 0).
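The recurrence itself did not survive in this transcript. The sketch below uses a recurrence reconstructed from the game rules and the definition of D(p, q, s), so treat it as an assumption rather than the slide's exact formula: D(0, 0, s) = s if s ≤ 150 and 0 otherwise, and for other states D(p, q, s) is the average over die values d = 1..6 of max(D(p-1, q, s+10d), D(p, q-1, s+d)), using only the options that are still available. The constant `LIMIT` and the use of `functools.lru_cache` for memoization are choices made here for illustration.

```python
from functools import lru_cache

LIMIT = 150  # totals above this score 0 (from the game description)


@lru_cache(maxsize=None)
def D(p, q, s):
    """Expected score under optimal play from state (p, q, s).

    Assumed recurrence, reconstructed from the rules of the game.
    """
    if p == 0 and q == 0:                 # boundary: game over
        return float(s) if s <= LIMIT else 0.0
    total = 0.0
    for d in range(1, 7):                 # each die face has probability 1/6
        options = []
        if p > 0:                         # place d as a tens digit
            options.append(D(p - 1, q, s + 10 * d))
        if q > 0:                         # place d as a ones digit
            options.append(D(p, q - 1, s + d))
        total += max(options)             # the player picks the better placement
    return total / 6


if __name__ == "__main__":
    print(D(4, 4, 0))                     # Step 3: the answer to the original game
```

Memoization keeps the work small because many placement orders lead to the same (p, q, s) state.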

Strategy during the Game: After each toss, use the recurrence formula for the optimum-value function to choose where to place the value.
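One way to turn the optimum-value function into a playing strategy is to compare the expected scores of the two possible placements after each toss and pick the larger. The sketch below assumes the same reconstructed recurrence as for D(p, q, s) (the slide's own formula is not in this transcript); the names `expected_score` and `best_move` are hypothetical.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def expected_score(p, q, s):
    """D(p, q, s) under the assumed recurrence: expected score with optimal play."""
    if p == 0 and q == 0:
        return float(s) if s <= 150 else 0.0
    total = 0.0
    for d in range(1, 7):
        options = []
        if p > 0:
            options.append(expected_score(p - 1, q, s + 10 * d))
        if q > 0:
            options.append(expected_score(p, q - 1, s + d))
        total += max(options)
    return total / 6


def best_move(p, q, s, die):
    """Return "tens" or "ones": the placement of `die` with the higher expected score."""
    candidates = {}
    if p > 0:
        candidates["tens"] = expected_score(p - 1, q, s + 10 * die)
    if q > 0:
        candidates["ones"] = expected_score(p, q - 1, s + die)
    if not candidates:
        raise ValueError("no position left")
    return max(candidates, key=candidates.get)
```

Under these assumptions, in state (1, 2, 67) a toss of 6 is best placed in the tens position (expected score 134 versus 118.25), whereas a toss of 1 is best placed in a ones position (113.25 versus 84).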

Game of Colored Jenga. Description: http://codeforces.com/problemset/problem/424/E. Techniques: dynamic programming, hash table.