Exploring Randomness: Delusions and Opportunities LS 812 Mathematics in Science and Civilization November 3, 2007.

Slides:



Advertisements
Similar presentations
Quality control tools
Advertisements

Costs and Benefits.
General Logic In order to decide what we ought to do to obtain some good or avoid some harm, it is necessary to consider not only the good or harm in itself,
6-1 Stats Unit 6 Sampling Distributions and Statistical Inference - 1 FPP Chapters 16-18, 20-21, 23 The Law of Averages (Ch 16) Box Models (Ch 16) Sampling.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
© 2003 Prentice-Hall, Inc.Chap 5-1 Business Statistics: A First Course (3 rd Edition) Chapter 5 Probability Distributions.
MM207 Statistics Welcome to the Unit 7 Seminar Prof. Charles Whiffen.
Chapter 5 Understanding Randomness
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
4. Project Investment Decision-Making
Linear Transformation and Statistical Estimation and the Law of Large Numbers Target Goal: I can describe the effects of transforming a random variable.
Engineering Economic Analysis Canadian Edition
Everyday Benefits of Understanding Variability Larry Weldon Statistics and Actuarial Science Simon Fraser University Canada.
Introduction to Probability and Risk in Financial Investment
Chapter 4 Discrete Random Variables and Probability Distributions
Uncertainty and Consumer Behavior
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Personal Financial Management Semester – 2009 Gareth Myles Paul Collier
Copyright © 2006 McGraw Hill Ryerson Limited10-1 prepared by: Sujata Madan McGill University Fundamentals of Corporate Finance Third Canadian Edition.
The Surprising Consequences of Randomness LS 829 Mathematics in Science and Civilization Feb 6, /25/20151LS
Probability and Expected Value 6.2 and 6.3. Expressing Probability The probability of an event is always between 0 and 1, inclusive. A fair coin is tossed.
Session 5 Topics to be Covered: –Market Efficiency –Random Walk –Fundamental Analysis –Speculation –Technical Analysis.
Chap 5-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 5-1 Chapter 5 Discrete Probability Distributions Basic Business Statistics.
Basic Tools of Finance Finance is the field that studies how people make decisions regarding the allocation of resources over time and the handling of.
Statistics for Managers Using Microsoft® Excel 5th Edition
Understanding Risk. 1.What is risk? 2.How can we measure risk?
L30: Methods of Handling Project Risk ECON 320 Engineering Economics Mahmut Ali GOKCE Industrial Systems Engineering Computer Sciences.
Chapter The Basic Tools of Finance 14. Present Value: Measuring the Time Value of Money Finance – Studies how people make decisions regarding Allocation.
PowerPoint Slides prepared by: Andreea CHIRITESCU Eastern Illinois University The Basic Tools of Finance 1 © 2011 Cengage Learning. All Rights Reserved.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 5 Section 1 – Slide 1 of 33 Chapter 5 Section 1 Probability Rules.
Exploring Randomness: Delusions and Opportunities 1.
Scientific Inquiry & Skills
Copyright © 2004 South-Western 27 The Basic Tools of Finance.
TOPIC THREE Chapter 4: Understanding Risk and Return By Diana Beal and Michelle Goyen.
What’s It Worth? - The Movies - CSX Business Explorer Post 333 December, 2010.
Engineering Economic Analysis Canadian Edition
Outline Random processes Random variables Probability histograms
Copyright © 2011 Pearson Education, Inc. Probability: Living with the Odds Discussion Paragraph 7B 1 web 59. Lottery Chances 60. HIV Probabilities 1 world.
Chapter 5 Uncertainty and Consumer Behavior. ©2005 Pearson Education, Inc.Chapter 52 Q: Value of Stock Investment in offshore drilling exploration: Two.
Stephen G. CECCHETTI Kermit L. SCHOENHOLTZ Understanding Risk Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
TEACHING STATISTICS CONCEPTS THROUGH STOCK MARKET CONTEXTS Larry Weldon Simon Fraser University.
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. Probability is a numerical measure of the likelihood Probability is a numerical measure of the.
Chapter 5 Choice Under Uncertainty. Chapter 5Slide 2 Topics to be Discussed Describing Risk Preferences Toward Risk Reducing Risk The Demand for Risky.
Some Uses of the Freeware, R, for Studying Financial Phenomena K.L. (Larry) Weldon Simon Fraser University R Development Core Team (2008). R: A language.
Introduction to Probability
QM Spring 2002 Business Statistics Probability Distributions.
Decision theory under uncertainty
Inference: Probabilities and Distributions Feb , 2012.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Chapter 11 – Introduction to Risk Analysis u Why do individuals, companies, and stockholders take risks?
On Investor Behavior Objective Define and discuss the concept of rational behavior.
Copyright © 2011 Pearson Education, Inc. Probability: Living with the Odds Discussion Paragraph 7B 1 web 59. Lottery Chances 60. HIV Probabilities 1 world.
Chapter The Basic Tools of Finance 27. Present Value: Measuring the Time Value of Money Finance – Studies how people make decisions regarding Allocation.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Money and Banking Lecture 11. Review of the Previous Lecture Application of Present Value Concept Internal Rate of Return Bond Pricing Real Vs Nominal.
Copyright © 2009 Pearson Education, Inc. Chapter 11 Understanding Randomness.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.
Chapter 5 Understanding Risk
Chapter 4 Probability Concepts
Chapter Five Understanding Risk.
Understanding Randomness
The Basic Tools of Finance
The Basic Tools of Finance
The Basic Tools of Finance
Everyday Benefits of Understanding Variability
Presentation transcript:

Exploring Randomness: Delusions and Opportunities LS 812 Mathematics in Science and Civilization November 3, 2007

Sources and Resources Nassim, Nicholas Taleb Fooled by Randomness, Second Edition, Random House, New York. Weldon,K.L. Everyday Benefits of Understanding Variability. Presented at Applied Statistics Conference, Ribno, Slovenia. September,

Introduction Randomness is about Uncertainty - e.g. Coin Is Mathematics about Certainty? - P(H) = 1/2 Mathematics can help to Describe “Unexplained Variability” Randomness concept is key for “Probability” Probability is key to exploring implications of “unexplained variability”

AbstractReal World MathematicsApplications of Mathematics Randomness Applied Statistics Surprising FindingsUseful Principles Ten Findings and Associated Principles

Example 1 - When is Success just Good Luck? An example from the world of Professional Sport

Sports League - Football Success = Quality or Luck?

Recent News Report “A crowd of 97,302 has witnessed Geelong break its 44-year premiership drought by crushing a hapless Port Adelaide by a record 119 points in Saturday's grand final at the MCG.” (2007 Season)

Sports League - Football Success = Quality or Luck?

Are there better teams? How much variation in the total points table would you expect IF every team had the same chance of winning every game? i.e. every game is Try the experiment with 5 teams. H=Win T=Loss (ignore Ties for now)

5 Team Coin Toss Experiment My experiment … T T H T T H H H H T TeamPoints But all teams Equal Quality (Equal Chance to win) Experiment Result -----> Win=4, Tie=2, Loss=0 but we ignore ties. P(W)=1/2 5 teams (1,2,3,4,5) so 10 games as follows 1-2,1-3,1-4,1-5,2-3,2-4,2-5,3-4,3-5,4-5

Implications? Points spread due to chance? Top team may be no better than the bottom team (in chance to win).

Simulation: 16 teams, equal chance to win, 22 games

Sports League - Football Success = Quality or Luck?

Does it Matter? Avoiding foolish predictions Managing competitors (of any kind) Understanding the business of sport Appreciating the impact of uncontrolled variation in everyday life

Point of this Example? Need to discount “chance” In making inferences from everyday observations.

Example 2 - Order from Apparent Chaos An example from some personal data collection

Gasoline Consumption Each Fill - record kms and litres of fuel used Smooth ---> Seasonal Pattern …. Why?

Pattern Explainable? Air temperature? Rain on roads? Seasonal Traffic Pattern? Tire Pressure? Info Extraction Useful for Exploration of Cause Smoothing was key technology in info extraction

Is Smoothing Objective? Data plotted ->>

How much to smooth?

Optimal Smoothing Parameter? Depends on Purpose of Display Choice Ultimately Subjective Subjectivity is a necessary part of good data analysis

Summary of this Example Surprising? Order from Chaos … Principle - Smoothing and Averaging reveal patterns encouraging investigation of cause

Example 3 - Utility of Averages Understanding them can contribute to your wealth! AVG =.38

Preliminary Proposal I offer you the following “investment opportunity” You give me $100. At the end of one year, I will return an amount determined by tossing a fair coins twice, as follows: $0 ………25% of time (TT) $50.……. 25% of the time (TH) $100.……25% of the time (HT) $400.……25% of the time. (HH) Would you be interested?

Stock Market Investment Risky Company - example in a known context Return in 1 year for 1 share costing $ % of the time % of the time % of the time % of the time i.e. Lose Money 50% of the time Only Profit 25% of the time “Risky” because high chance of loss

Independent Outcomes What if you have the chance to put $1 into each of 100 such companies, where the companies are all in very different markets? What sort of outcomes then? Use coin- tossing (by computer) to explore

Diversification Unrelated Companies Choose 100 unrelated companies, each one risky like this. Outcome is still uncertain but look at typical outcomes …. One-Year Returns to a $100 investment

Looking at Profit only Avg Profit approx 38%

Gamblers like Averages and Sums! The sum of 100 independent investments in risky companies is very predictable! Sums (and averages) are more stable than the things summed (or averaged). Square root law for variability of averages VAR -----> VAR/  n

Example 4 - Industrial Quality Control Filling Cereal Boxes, Oil Containers, Jam Jars Labeled amount should be minimum Save money if also maximum variability reduction contributes to profit Method: Management by exception …>

Management by exception QC = Quality Control <-- Nominal Amount

Japan a QC Innovator from 1950 Consumer Reports –Best Maintenance History Almost all Japanese Makes –Worst Maintenance History American and European Makes Key Technology was Variability Reduction Usually via Control Charts

Summary Example 4 Surprising that Simple Control Chart could have such influence Control Chart is just an implementation of the idea of Management by Exception

Example 5 - A Simple Law of Life Sometimes we see the same pattern in data from many different sources. Recognition of patterns aids description, and also helps to identify anomalies

Example: Zipf’s Law An empirical finding Frequency * rank = constant Example - frequency (i.e. population) of cities Largest city is rank 1 Second largest city is rank 2 ….

Canadian City Populations

Population*Rank = Constant? (Frequency * rank = constant)

Other Applications of Zipf Word Frequency in Natural or Programming Language Volume of messages at Internet Sites Number of Employees of Companies Academic Publishing Productivity Enrolment of Universities …… Google “Zipf’s Law” for more in-depth discussion

Summary for Zipf’s Law Surprising that processes involving many accidents of history and social chaos, should result in a predictable relationship Useful to describe an empirical relationship that has meaning in very different settings - a convenient descriptive tool.

Example 6 - Obtaining Confidential Information How can you ask an individual for data on Incomes Illegal Drug use Sex modes …..Etc in a way that will get an honest response? There is a need to protect confidentiality of answers.

Example: Marijuana Usage Randomized Response Technique Pose two Yes-No questions and have coin toss determine which is answered Head 1. Do you use Marijuana regularly? Tail 2. Is your coin toss outcome a tail?

Randomized Response Technique Suppose 60 of 100 answer Yes. Then about 50 are saying they have a tail. So 10 of the other 50 are users. 20%. It is a way of using randomization to protect Privacy. Public Data banks have used this.

Summary of Example 6 Surprising that people can be induced to provide sensitive information in public The key technique is to make use of the predictability of certain empirical probabilities.

Example 7 - Survival Assessment Personal Data is always hard to get. Need to make careful use of minimal data Here is an example ….

Traffic Accidents Accident-Free Survival Time - can you get it from …. Have you had an accident? How many months have you had your drivers license?

Accident Free Survival Time

Accident Next Month Can show that, for my 2002 class of 100 students, chance of accident next month was about 1%.

Summary of Example 7 Surprising that such minimal information is useful Again, key technique is to use empirical probabilities and smoothing

Example 8 - Lotteries: Expectation and Hope Cash flow –Ticket proceeds in (100%) –Prize money out (50%) –Good causes (35%) –Administration and Sales (15%) 50 % $1.00 ticket worth 50 cents, on average Typical lottery P(jackpot) =

How small is ? Buy 10 tickets every week for 60 years Cost is $31,200. Chance of winning jackpot is = …. 1/5 of 1 percent!

Summary Surprising that lottery tickets provide so little hope! Key technology is simple use of probabilities

Example 9 - Peer Review: Is it fair? Average referees accept 20% of average quality papers Referees vary in accepting 10%-50% of average papers Two referees accepting a paper -> publish. Two referees disagreeing -> third ref Two referees rejecting -> do not publish Analysis via simulation - assumptions are:

Ultimately published: *13 (approx) =9 papers out of others just as good!

Peer Review Fair? Does select good papers but Many equally good papers are rejected Similar property of school admission systems, competition review boards, etc.

Summary of Example 9 Surprising that peer review is so dependent on chance Key procedure is to use simulation to explore effect of randomness

Example 10 - Investment: Back-the-winner fallacy Mutual Funds - a way of diversifying a small investment Which mutual fund? Look at past performance? Experience from symmetric random walk …

Trends that do not persist

Implication from Random Walk …? Stock market trends may not persist Past might not be a good guide to future Some fund managers better than others? A small difference can result in a big difference over a long time …

A simulation experiment to determine the value of past performance data Simulate good and bad managers Pick the best ones based on 5 years data Simulate a future 5-yrs for these select managers

How to describe good and bad fund managers? Use TSX Index over past 50 years as a guide ---> annualized return is 10% Use a random walk with a slight upward trend to model each manager. Daily change positive with probability p Good managerROR = 13%pap=.56 Medium managerROR = 10%pap=.55 Poor managerROR = 8% paP=.54

Simulation to test “Back the Winner” 100 managers assigned various p parameters in.54 to.56 range Simulate for 5 years Pick the top-performing mangers (top 15%) Use the same 100 p-parameters to simulate a new 5 year experience Compare new outcome for “top” and “bottom” managers

START=100

Mutual Fund Advice? Don’t expect past relative performance to be a good indicator of future relative performance. Again - need to give due allowance for randomness (i.e. LUCK)

Summary of Example 10 Surprising that Past Perfomance is such a poor indicator of Future Performance Simulation is the key to exploring this issue

Ten Surprising Findings 1.Sports Leagues - Lack of Quality Differentials 2.Gasoline Mileage - Seasonal Patterns 3.Stock Market - Risky Stocks a Good Investment 4.Industrial QC - Variability Reduction Pays 5.Civilization - City Growth follows Zipf’s Law 6. Marijuana - Show of Hands shows 20% are regular users 7.Traffic Accidents - Simple class survey predicts 1% chance of accident in next month 8.Lotteries offer little hope 9.Peer Review is often unfair in judging submissions 10.Past Performance of Mutual Funds a poor indicator of future performance.

Ten Useful Concepts & Techniques? 1.Sports Leagues - Unexplained variation can cause illusions - simulation can inform 2.Gasoline Mileage - Averaging (and smoothing) amplifies signals 3.Stock Market - Averaging tames unexplained variation - diversification a key to reduce risk 4.Industrial QC - Management by Exception, Continuous Incremental Improvement 5.Population of Cities - Order can emerge from chaos

Ideas Marijuana - Randomness can protect privacy and preserve anonymity 7.Traffic Accidents - simple survey data can predict future risk, using probabilities 8.Lotteries - Not a reasonable “investment” 9.Peer Review - Role of “luck” underestimated 10.Mutual Funds - Role of “luck” underestimated!

Role of Math? Key background for –Graphs –Probabilities –Simulation models –Smoothing Methods Important for constructing theory of inference

The End Questions, Comments, Criticisms…..