Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Arrows indicate strength between two problem states Start maze … Reinforcement learning example.

Slides:



Advertisements
Similar presentations
How to register to access the members only pages on our website 1. Scroll down to the bottom of the home page and click on the text (arrowed)
Advertisements

Permutations and Combinations
Chapter 5 Fundamental Algorithm Design Techniques.
You and four other people have been chosen to go to a haunted house. You have no clue who the other four people are. You have been asked to stay the whole.
Probability Distributions Discrete. Discrete data Discrete data can only take exact values Examples: The number of cars passing a checkpoint in 30 minutes.
The Stagecoach Problem
4. Convergence of random variables  Convergence in probability  Convergence in distribution  Convergence in quadratic mean  Properties  The law of.
Chapter 4: Probability (Cont.) In this handout: Total probability rule Bayes’ rule Random sampling from finite population Rule of combinations.
> >
Section 7.3 – Simple Random Samples
Writing Linear Equations Given Two Points On the Line Using the "Slope – Intercept" Form.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
STA Lecture 261 STA 291 Lecture 26 Two types of errors in testing hypothesis. Connection between testing hypothesis and confidence intervals.
Section 15.8 The Binomial Distribution. A binomial distribution is a discrete distribution defined by two parameters: The number of trials, n The probability.
Random Variables Section 3.1 A Random Variable: is a function on the outcomes of an experiment; i.e. a function on outcomes in S. For discrete random variables,
Learning Game Maker Studio:
Copyright © Cengage Learning. All rights reserved. CHAPTER 9 COUNTING AND PROBABILITY.
Section 7.1. Section Summary Finite Probability Probabilities of Complements and Unions of Events Probabilistic Reasoning.
Chapter 6 Lesson 6.2 Probability 6.2: Definition of Probability.
By: Claire Riddell.  A Broken Line Graph is a graph, where the data points are connected with line segments.
ObserveDevelop Theory/ Hypothesis TestCollect & Conclude Lesson 9 – Understand issues with sampling and generalisability. Sampling Techniques.
Chapter 2: Rational Numbers 2.7 Probability of Compound Events.
TEXTURE SYNTHESIS BY NON-PARAMETRIC SAMPLING VIVA-VITAL Nazia Tabassum 27 July 2015.
Discrete Probability Distribution Henry Mesa Use your keyboard’s arrow keys to move the slides forward (▬►) or backward (◄▬) Hit the Esc key to end show.
Other NN Models Reinforcement learning (RL)
Your 3rd quiz will cover sections a){HHH,HTT,THT,TTH,THH,HTH,HHT,TTT} {0,1,2,3} b) {1/8,3/8,3/8,1/8} d) P(x=2 or x=3)= P(x=2)+P(x=3)=3/8+1/8=1/2.
Complemental Probabilities.
Section 7.4 Use of Counting Techniques in Probability.
SPANNING TREES Lecture 20 CS2110 – Fall Spanning Trees  Definitions  Minimum spanning trees  3 greedy algorithms (incl. Kruskal’s & Prim’s)
Systematic Sampling Lecture 9 – Part 1 Sections 2.8 Wed, Jan 30, 2008.
Writing Equations by Completing the Square Or Using the Distance Formula.
1. Tree Diagram 2. Multiplication Principle 3. Generalized Multiplication Principle 1.
Chapter 16 Week 6, Monday. Random Variables “A numeric value that is based on the outcome of a random event” Example 1: Let the random variable X be defined.
1By ILTAF MEHDI, IT Lecturer, MIHE, Kabul CHAPTER_NO : 05 Random Variables.
President UniversityErwin SitompulPBST 4/1 Dr.-Ing. Erwin Sitompul President University Lecture 4 Probability and Statistics
Lesson 4-6 Probability of Compound Events Objectives: To find the probability of independent and dependent events.
Types of Samples. Samples Population – the main group or set from which observations or opinions can be made Sample – a small part of a population from.
SAMPLING. Voluntary Response Individuals choose to join the sample in response to an open invitation.
1 Press Ctrl-A ©G Dear2009 – Not to be sold/Free to use Types of Sample Stage 6 - Year 11 Applied Mathematic (Preliminary General 1)
K means ++ and K means Parallel Jun Wang. Review of K means Simple and fast Choose k centers randomly Class points to its nearest center Update centers.
Section 7.1. Probability of an Event We first define these key terms: An experiment is a procedure that yields one of a given set of possible outcomes.
Probability and Simulation The Study of Randomness.
Spring 2008The Greedy Method1. Spring 2008The Greedy Method2 Outline and Reading The Greedy Method Technique (§5.1) Fractional Knapsack Problem (§5.1.1)
Show Me THE Money!!! Car Budget It Depends… …on Your Semester 1 Team GPA!
Processing a Decision Tree Connecticut Electronics.
Final Exam Review Chapters 3 & 4.
Discrete Probability Distribution
Probability Imagine tossing two coins and observing whether 0, 1, or 2 heads are obtained. It would be natural to guess that each of these events occurs.
نتعارف لنتألف في التعارف تألف (( الأرواح جنود مجندة , ماتعارف منها أئتلف , وماتنافر منها اختلف )) نماذج من العبارات الايجابية.
Estimating
'. \s\s I. '.. '... · \ \ \,, I.
الأستاذ المساعد بقسم المناهج وطرق التدريس
اختر أي شخصية واجعلها تطير!
Decision Maths Dijkstra’s Algorithm.
10:00.
' '· \ ·' ,,,,
What number is the arrow pointing at?
Unit 10 Statistics Part 1.
Elapsed Time.
Selecting a Sample Notes
Simple Random Sampling
MATH 2311 Section 6.3.
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
½ of 6 = 3.
C.2.10 Sample Questions.
Power Problems.
C.2.8 Sample Questions.
Barlogik Mathsbee (final)
C.2.8 Sample Questions.
Presentation transcript:

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Arrows indicate strength between two problem states Start maze … Reinforcement learning example

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 The first response leads to S2 … The next state is chosen by randomly sampling from the possible next states weighted by their associative strength Associative strength = line width

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Suppose the randomly sampled response leads to S3 …

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 At S3, choices lead to either S2, S4, or S7. S7 was picked (randomly)

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 By chance, S3 was picked next…

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Next response is S4

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 And S5 was chosen next (randomly)

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 And the goal is reached …

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Goal is reached, strengthen the associative connection between goal state and last response Next time S5 is reached, part of the associative strength is passed back to S4...

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Start maze again…

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 Let’s suppose after a couple of moves, we end up at S5 again

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 S5 is likely to lead to GOAL through strenghtened route In reinforcement learning, strength is also passed back to the last state This paves the way for the next time going through maze

Start S2S2 S3S3 S4S4 S5S5 Goal S7S7 S8S8 The situation after lots of restarts …