1 Multi-criterion Ranking and Poset Prioritization G. P. Patil December 2004 – January 2005.

Presentation transcript:

1 Multi-criterion Ranking and Poset Prioritization G. P. Patil December 2004 – January 2005

2 Prioritization Innovation: Partial Order Set Ranking. We also present a prioritization innovation: the ability to prioritize and rank hotspots based on multiple indicator and stakeholder criteria without having to integrate the indicators into an index, using Hasse diagrams and partially ordered sets. This leads to early warning systems and to the selection of investigational areas.

3

4 Multiple Criteria Analysis, Multiple Indicators and Choices, Health Statistics, Disease Etiology, Health Policy, Resource Allocation
First stage screening
–Significant clusters by SaTScan and/or upper level sets
Second stage screening
–Multicriteria noteworthy clusters by partially ordered sets and Hasse diagrams
Final stage screening
–Follow-up clusters for etiology and intervention, based on multiple criteria using Hasse diagrams

5 Ranking Possible Disease Clusters in the State of New York Data Matrix

6 Hotspot Prioritization and Poset Ranking
Multiple hotspots with intensities significantly elevated relative to the rest of the region
Ranking based on likelihood values and additional attributes: raw intensity values, socio-economic and demographic factors, feasibility scores, excess cases, seasonal residence, atypical demographics, etc.
Multiple attributes, multiple indicators
Ranking without having to integrate the multiple indicators into a composite index

7 Regions of comparability and incomparability for the inherent importance ordering of hotspots. Hotspots form a scatterplot in indicator space and each hotspot partitions indicator space into four quadrants
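
The quadrant picture can be sketched in code. This is a minimal illustration, not the authors' implementation: it assumes each hotspot is a tuple of indicator values in which larger means higher priority, and the function name `compare` is hypothetical. Two hotspots are comparable only when one is at least as large as the other on every indicator; otherwise each lies in the other's region of incomparability.

```python
from typing import Sequence


def compare(a: Sequence[float], b: Sequence[float]) -> str:
    """Compare hotspot a with hotspot b under the componentwise order.

    Assumption: on every indicator, a larger value means higher priority.
    """
    a_ge = all(x >= y for x, y in zip(a, b))  # a at least as large everywhere
    b_ge = all(y >= x for x, y in zip(a, b))  # b at least as large everywhere
    if a_ge and b_ge:
        return "equal"
    if a_ge:
        return "above"         # b lies in the lower-left quadrant of a
    if b_ge:
        return "below"         # b lies in the upper-right quadrant of a
    return "incomparable"      # b lies in one of the two off-diagonal quadrants


# Hypothetical two-indicator hotspots
print(compare((3, 5), (2, 4)))  # above
print(compare((3, 1), (2, 4)))  # incomparable
```

With two indicators, the "above" and "below" outcomes correspond to the two diagonal quadrants and "incomparable" to the two off-diagonal ones.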

8 Figure 2. Contour of index H passing through object a. A linear index is shown on the left and a non-linear index on the right.

9 Figure 3. The top two diagrams depict valid contours while the bottom two diagrams depict invalid contours.

10 Figure 4. The tradeoff or substitutability between height and weight in assessing the size of a person. The tradeoff is constant with a linear index (left) but varies across indicator space with a nonlinear index (right).

11 HUMAN ENVIRONMENT INTERFACE: LAND, AIR, WATER INDICATORS
RANK COUNTRY LAND AIR WATER (indicator values missing from the transcript)
1 Sweden
2 Finland
3 Norway
5 Iceland
13 Austria
22 Switzerland
39 Spain
45 France
47 Germany
51 Portugal
52 Italy
59 Greece
61 Belgium
64 Netherlands
77 Denmark
78 United Kingdom
81 Ireland
for land - % of undomesticated land, i.e., total land area minus domesticated land (permanent crops and pastures, built-up areas, roads, etc.)
for air - % of renewable energy resources, i.e., hydro, solar, wind, geothermal
for water - % of population with access to safe drinking water

12 Hasse Diagram (all countries)

13 Hasse Diagram (Western Europe)

14

15 Figure 5. Hasse diagrams for four different posets. Poset D has a disconnected Hasse diagram with two connected components {a, c, e} and {b, d}.

16 Figure 6. Bottom-up Hasse diagrams for the posets of Figure 5. Hasse diagrams for Posets A and B are unchanged.

17 Figure 7. Hasse diagram for the four countries of Table 1. Note that it has the same structure as Poset A in Figure 5.

18 Figure 10. Hasse diagram for Latin America. There are four connected components. Three of these components are isolates; the remaining component contains 13 countries.

19 Figure 11. Hasse diagram for the 52 watersheds in the primary component. Labels are (arbitrary) row numbers in the data matrix.

20 Figure 12. Map of the Mid-Atlantic region showing the primary Hasse component (shaded). Geographically, there are three connected components of which two are small and located near the periphery of the region.

21 Figure 13: Hasse diagrams (right) of the two possible rankings for the poset on the left.

22 Figure 14. Rank-intervals for all 106 countries. The intervals (countries) are labeled by their midpoints as shown along the horizontal axis. For each interval, the lower endpoint and the upper endpoint are shown vertically. The length of each interval corresponds to the ambiguity inherent in attempting to rank that country among all 106 countries.
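
One common way to obtain such a rank-interval is sketched below; whether the slide's figure was computed exactly this way is an assumption. With rank 1 as best, an object can never rank better than one plus the number of objects strictly above it, and never worse than the total count minus the number strictly below it. The data and the names `dominates` and `rank_intervals` are hypothetical.

```python
def dominates(u, v):
    """True if u is at least as large as v on every indicator and differs from v."""
    return all(x >= y for x, y in zip(u, v)) and tuple(u) != tuple(v)


def rank_intervals(data):
    """Map each object to (best possible rank, worst possible rank); rank 1 is best.

    Assumption: larger indicator values are better. Objects with identical
    indicator vectors are treated as incomparable in this sketch.
    """
    n = len(data)
    intervals = {}
    for name, vec in data.items():
        above = sum(dominates(other, vec) for k, other in data.items() if k != name)
        below = sum(dominates(vec, other) for k, other in data.items() if k != name)
        intervals[name] = (1 + above, n - below)
    return intervals


# Hypothetical objects with two indicators each
toy = {"p": (3, 3), "q": (2, 1), "r": (1, 2), "s": (0, 0)}
for name, (lo, hi) in rank_intervals(toy).items():
    print(name, lo, hi, "length", hi - lo)
```

Here `q` and `r` are incomparable, so each gets the interval from 2 to 3, while `p` and `s` are comparable to everything and get intervals of length zero.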

23 Figure 15. Rank-intervals for all 106 countries, plotted against their HEI rank. The HEI rank appears as the 45-degree line. The HEI tends to be optimistic (closer to the lower endpoint) for better-ranked countries and pessimistic (closer to the upper endpoint) for poorer-ranked countries.

24 Figure 16. A ranking of a poset determines a linear Hasse diagram. The numerical rank assigned to each element is that element’s depth in the Hasse diagram.

25 Figure 17. Hasse diagram of Poset B (left) and a decision tree enumerating all possible linear extensions of the poset (right). Every downward path through the decision tree determines a linear extension. Dashed links in the decision tree are not implied by the partial order and are called jumps. If one tried to trace the linear extension in the original Hasse diagram, a “jump” would be required at each dashed link. Note that there is a pure-jump linear extension (path a, b, c, d, e, f) in which every link is a jump.
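
The decision-tree enumeration can be sketched as a recursive search. This is an illustration under stated assumptions, not the authors' code: the poset below is a small hypothetical one (not Poset B, whose structure is not given in the transcript), the relation is supplied as a transitively closed map `above`, and the names `linear_extensions` and `count_jumps` are made up for this example. At each level the search may place any remaining element that has nothing above it still unplaced; a link is counted as a jump when the two consecutive elements are incomparable.

```python
def linear_extensions(elements, above):
    """Yield every linear extension, best element first.

    above[x] is the set of elements strictly greater than x
    (assumed transitively closed).
    """
    def extend(prefix, remaining):
        if not remaining:
            yield tuple(prefix)
            return
        for x in sorted(remaining):
            # x can come next only if nothing above it is still unplaced
            if not (above.get(x, set()) & remaining):
                yield from extend(prefix + [x], remaining - {x})

    yield from extend([], frozenset(elements))


def count_jumps(extension, above):
    """Count consecutive pairs (x, y) where x is not above y in the poset."""
    return sum(1 for x, y in zip(extension, extension[1:])
               if x not in above.get(y, set()))


# Hypothetical poset: a > c, b > c, b > d
above = {"c": {"a", "b"}, "d": {"b"}}
extensions = list(linear_extensions("abcd", above))
for ext in extensions:
    print("".join(ext), "jumps:", count_jumps(ext, above))
```

Each printed path corresponds to one downward path through the decision tree for this hypothetical poset.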

26 Figure 18. Histograms of the rank-frequency distributions for Poset B.

27 Cumulative Rank Frequency Operator – 5: An Example of the Procedure. In the example from the preceding slide, there are a total of 16 linear extensions, giving the following cumulative frequency table. [Table: rows are the elements a, b, c, d, e, f; columns are ranks; entries not preserved in the transcript.] Each entry gives the number of linear extensions in which the element (row label) receives a rank equal to or better than the column heading.
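
A table of this kind can be built by brute force for a small poset. The sketch below uses a hypothetical three-element poset (a above b, with c incomparable to both), not the six-element example from the slide, and the function name `cumulative_rank_frequencies` is an assumption. It counts, for each element, the linear extensions in which that element receives each rank, then accumulates the counts across ranks.

```python
from itertools import accumulate, permutations


def cumulative_rank_frequencies(elements, greater_pairs):
    """Return {element: [count of extensions with rank <= 1, <= 2, ...]}.

    greater_pairs lists (higher, lower) pairs; rank 1 is best.
    Brute force over all permutations, so suitable only for small posets.
    """
    elements = list(elements)
    n = len(elements)
    freq = {e: [0] * n for e in elements}
    for perm in permutations(elements):
        pos = {e: i for i, e in enumerate(perm)}
        # keep the permutation only if it respects every order relation
        if all(pos[hi] < pos[lo] for hi, lo in greater_pairs):
            for e in elements:
                freq[e][pos[e]] += 1
    return {e: list(accumulate(row)) for e, row in freq.items()}


# Hypothetical poset: a > b, c incomparable to both (3 linear extensions)
table = cumulative_rank_frequencies("abc", [("a", "b")])
for element, row in table.items():
    print(element, row)
```

The last column always equals the total number of linear extensions, since every element has some rank in every extension.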

28 Cumulative Rank Frequency Operator – 6: An Example of the Procedure. The curves are stacked one above the other and the result is a linear ordering of the elements: a > b > c > d > e > f

29 Cumulative Rank Frequency Operator – 7: An example where F must be iterated. [Figure: Hasse diagram of the original poset on elements a through h, followed by the Hasse diagrams obtained after applying F and F^2.]

30 Cumulative Rank Frequency Operator – 8: An example where F results in ties. [Figure: Hasse diagram of the original poset with a at the top, b and c in the middle, and d at the bottom; applying F gives a, then b and c (tied), then d.]
Ties reflect symmetries among incomparable elements in the original Hasse diagram
Elements that are comparable in the original Hasse diagram will not become tied after applying the F operator
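
The tie can be reproduced with a short sketch. It assumes the slide's poset is the four-element "diamond" (a above b and c, both above d) and that F orders elements by comparing their cumulative rank-frequency curves pointwise; the names `crf_curves` and `crf_compare` are hypothetical.

```python
from itertools import permutations


def crf_curves(elements, greater_pairs):
    """Cumulative rank-frequency curve for each element (rank 1 is best)."""
    elements = list(elements)
    n = len(elements)
    curves = {e: [0] * n for e in elements}
    for perm in permutations(elements):
        pos = {e: i for i, e in enumerate(perm)}
        if all(pos[hi] < pos[lo] for hi, lo in greater_pairs):
            for e in elements:
                # an element of rank pos[e] + 1 counts toward that column
                # and every later one
                for r in range(pos[e], n):
                    curves[e][r] += 1
    return curves


def crf_compare(curves, x, y):
    """Compare x with y by pointwise comparison of their curves."""
    fx, fy = curves[x], curves[y]
    if fx == fy:
        return "tied"
    if all(p >= q for p, q in zip(fx, fy)):
        return "above"
    if all(p <= q for p, q in zip(fx, fy)):
        return "below"
    return "incomparable"


# Assumed diamond poset: a > b, a > c, b > d, c > d
diamond = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")]
curves = crf_curves("abcd", diamond)
print(curves)
print(crf_compare(curves, "b", "c"))
```

Because b and c play symmetric roles in the diamond, their curves coincide and the comparison reports a tie, while a stays above both and d stays below both.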

31 Incorporating Judgment: Poset Cumulative Rank Frequency Approach
Certain of the indicators may be deemed more important than the others
Such differential importance can be accommodated by the poset cumulative rank frequency approach
Instead of the uniform distribution on the set of linear extensions, we may use an appropriately weighted probability distribution, e.g.,

32

33

34 Logo for Statistics, Environment, Health, Ecology, and Society