Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Annual Digital Government Research Conference San Diego, CA Project Highlights G.P. Patil May 2006.

Similar presentations


Presentation on theme: "1 Annual Digital Government Research Conference San Diego, CA Project Highlights G.P. Patil May 2006."— Presentation transcript:

1 1 Annual Digital Government Research Conference San Diego, CA Project Highlights G.P. Patil May 2006

2 2 Agency Databases Thematic Databases Other Databases Homeland Security Disaster Management Public Health Ecosystem Health Other Case Studies Statistical Processing: Hotspot Detection, Prioritization, etc. Data Sharing, Interoperable Middleware Standard or De Facto Data Model, Data Format, Data Access Arbitrary Data Model, Data Format, Data Access Application Specific De Facto Data/Information Standard Agency Databases Thematic Databases Other Databases Homeland Security Disaster Management Public Health Ecosystem Health Other Case Studies Statistical Processing: Hotspot Detection, Prioritization, etc. Data Sharing, Interoperable Middleware Standard or De Facto Data Model, Data Format, Data Access Arbitrary Data Model, Data Format, Data Access Application Specific De Facto Data/Information Standard National Applications Biosurveillance Carbon Management Coastal Management Community Infrastructure Crop Surveillance Disaster Management Disease Surveillance Ecosystem Health Environmental Justice Environmental Management Environmental Policy Homeland Security Invasive Species Poverty Policy Public Health Public Health and Environment Robotic Networks Sensor Networks Social Networks Syndromic Surveillance Tsunami Inundation Urban Crime Water Management SurvellanceGeoinformaticsof Hotspot Detection, Prioritization and Early Warning NSF Digital Government Project #0307010 PI: G. P. Patil Federal Agency Partnership CDCDOD EPANASA NIHNOAA USFSUSGS gpp@stat.psu.edu Websites: http://www.stat.psu.edu/~gpp/ http://www.stat.psu.edu/hotspots/ http://www.digitalgovernment.org/news/stories/2004/1104/1104_hotspots_heyman.jsp NSF Digital Government surveillance geoinformatics project, federal agency partnership and national applications for digital governance.

3 3 Geographic and Network Surveillance for Arbitrarily Shaped Hotspots Overview Geospatial Surveillance Upper Level Set Scan Statistic System Spatial-Temporal Surveillance Typology of Space-Time Hotspots Hotspot Prioritization Ranking Without Having to Integrate Multiple Indicators Surveillance Geoinformatics for Hotspot Detection, Prioritization, Early Warning and Sustainable Management Upper Level Set Scan System Definition: A hotspot is that portion of the study region with an elevated risk of an adverse outcome Federal Agency Partnerships CDC DOD EPA NASA NIH NOAA USFS USGS Features of ULS Scan Statistic: Identifies arbitrarily shaped hotspots Applicable to data on a network Confidence sets and hotspot ratings Computationally efficient Generalizes to space-time scan Poset Prioritization System Objective: Prioritize or rank hotspots based on multiple indicator and stakeholder criteria without having to integrate indicators into an index, using Haase diagrams and partially ordered sets. Example: Prioritization of disease clusters with Multiple Indicators National Applications and Case Studies Biosurveillance Carbon Management Costal Management Community Infrastructure Crop Surveillance Disaster Management Disease Surveillance Ecosystem Health Environmental Justice Sensor Networks Robotic Networks Environmental Management Environmental Policy Homeland Security Invasive Species Poverty Policy Public Health Public Health and Environment Syndromic Surveillance Social Networks Stream Networks g Changing Connectivity of ULS as Level Drops G.P. Patil, R. Acharya, W.L. Myers, P. Patankar, Y. Cai, and S.L. Rathbun The Penn State University, University Park, PA 16802 R. Modarres George Washington University, Washington, D.C. Example: West Nile Virus First isolated in 1937, this mosquito born disease, indigenous to north Africa, the Middle East and west Asia was first introduced into the United States in 1999. Disease Count Quintiles Population Quintiles Disease Rate QuintilesLikelihood Quintiles Comparison of ULS Scan with Circular Scan ULS ScanCircular Scan Confidence set for ULS Hotspot Hotspot Membership Rating Example: Lyme Disease Infections from the bacterium Borelia burgdorfei vectored by ticks from the genus Ixodes. ULS Scan Cylindrical Scan Example: Human-environment indicator values for 16 European countries. There are a total of 3,764,448 admissible linear extensions. The cumulative rank function for Sweden exceeds that of all remaining countries. The crf’s of all countries dominate that of Ireland. The remaining countries cannot be uniquely ordered based on their crf’s. Belgium, Netherlands and United Kingdom have identical crf’s. Admissible linear extensions are comprised of rankings compatible with the rankings of all indicators. Treating each linear extension as a voter, the cumulative rank function is obtained from the frequencies at which each object receives each rank. Disease Rates Comparison of ULS Scan with Cylindrical Scan 1997 1998 1999 Year 2000 2001 2002 2003 Haase Diagram The crf’s also form a partially ordered set. There are only 182 admissible linear extensions for this poset, yielding the cumulative rank function: One more iteration yields the rankings in the data table. Center for Statistical Ecology and Environmental Statistics

4 4

5 5

6 6 Attractive Features Identifies arbitrarily shaped clusters Identifies arbitrarily shaped clusters Data-adaptive zonation of candidate hotspots Data-adaptive zonation of candidate hotspots Applicable to data on a network Applicable to data on a network Provides both a point estimate as well as a confidence set for the hotspot Provides both a point estimate as well as a confidence set for the hotspot Uses hotspot-membership rating to map hotspot boundary uncertainty Uses hotspot-membership rating to map hotspot boundary uncertainty Computationally efficient Computationally efficient Applicable to both discrete and continuous syndromic responses Applicable to both discrete and continuous syndromic responses Identifies arbitrarily shaped clusters in the spatial-temporal domain Identifies arbitrarily shaped clusters in the spatial-temporal domain Provides a typology of space-time hotspots with discriminatory surveillance potential Provides a typology of space-time hotspots with discriminatory surveillance potential Hotspot Detection Innovation Upper Level Set Scan Statistic

7 7 A small sample of the circles used

8 8 Issues

9 9 Spatial Temporal Surveillance

10 10 Network Analysis of Biological Integrity in Freshwater Streams

11 11 Syndromic Crisis-Index Surveillance

12 12 (left) The overall procedure, leading from admissions records to the crisis index for a hospital. The hotspot detection algorithm is then applied to the crisis index values defined over the hospital network. (right) The -machine procedure for converting an event stream into a parse tree and finally into a probabilistic finite state automaton (PFSA). Syndromic Surveillance

13 13 Ocean SAmpling MObile Network OSAMON

14 14 Scalable Wireless Geo-Telemetry with Miniature Smart Sensors Geo-telemetry enabled sensor nodes deployed by a UAV into a wireless ad hoc mesh network: Transmitting data and coordinates to TASS and GIS support systems

15 15 Upper Level Set (ULS) of Intensity Surface Hotspot zones at level g (Connected Components of upper level set)

16 16 Changing Connectivity of ULS as Level Drops g

17 17 ULS Connectivity Tree Schematic intensity “surface” N.B. Intensity surface is cellular (piece-wise constant), with only finitely many levels A, B, C are junction nodes where multiple zones coalesce into a single zone A B C

18 18 A confidence set of hotspots on the ULS tree. The different connected components correspond to different hotspot loci while the nodes within a connected component correspond to different delineations of that hotspot

19 19 Genetic Algorithm Scan

20 20

21 21

22 22

23 23 We also present a prioritization innovation. It lies in the ability for prioritization and ranking of hotspots based on multiple indicator and stakeholder criteria without having to integrate indicators into an index, using Hasse diagrams and partial order sets. This leads us to early warning systems, and also to the selection of investigational areas. Prioritization Innovation Partial Order Set Ranking

24 24 HUMAN ENVIRONMENT INTERFACE LAND, AIR, WATER INDICATORS RANK COUNTRY LANDAIRWATER 1 Sweden 2 Finland 3 Norway 5 Iceland 13 Austria 22 Switzerland 39 Spain 45 France 47 Germany 51 Portugal 52 Italy 59 Greece 61 Belgium 64 Netherlands 77 Denmark 78 United Kingdom 81 Ireland 69.0176.4627.381.7940.5730.1732.6328.3432.5634.6223.3521.5921.8419.439.8312.649.2535.2419.0563.9880.2529.8528.107.746.502.1014.296.893.200.001.075.041.131.99100981001001001001001001008210098100100100100100 for land - % of undomesticated land, i.e., total land area-domesticated (permanent crops and pastures, built up areas, roads, etc.) for air - % of renewable energy resources, i.e., hydro, solar, wind, geothermal for water - % of population with access to safe drinking water

25 25 Hasse Diagram (Western Europe)

26 26 Figure 17. Hasse diagram of Poset B (left) and a decision tree enumerating all possible linear extensions of the poset (right). Every downward path through the decision tree determines a linear extension. Dashed links in the decision tree are not implied by the partial order and are called jumps. If one tried to trace the linear extension in the original Hasse diagram, a “jump” would be required at each dashed link. Note that there is a pure-jump linear extension (path a, b, c, d, e, f) in which every link is a jump.

27 27 Figure 18. Histograms of the rank-frequency distributions for Poset B.

28 28 Cumulative Rank Frequency Operator – 6 An Example of the Procedure 16 The curves are stacked one above the other and the result is a linear ordering of the elements: a > b > c > d > e > f

29 29 Cumulative Rank Frequency Operator – 7 An example where F must be iterated Original Poset (Hasse Diagram) a f eb c g d h a f e b ad c h g a f e b ad c h g F F 2

30 30 Cumulative Rank Frequency Operator – 8 An example where F results in ties Original Poset (Hasse Diagram) a cb d a b, c (tied) d F Ties reflect symmetries among incomparable elements in the original Hasse diagram Elements that are comparable in the original Hasse diagram will not become tied after applying F operator

31 31 Incorporating Judgment Poset Cumulative Rank Frequency Approach Certain of the indicators may be deemed more important than the others Certain of the indicators may be deemed more important than the others Such differential importance can be accommodated by the poset cumulative rank frequency approach Such differential importance can be accommodated by the poset cumulative rank frequency approach Instead of the uniform distribution on the set of linear extensions, we may use an appropriately weighted probability distribution , e.g., Instead of the uniform distribution on the set of linear extensions, we may use an appropriately weighted probability distribution , e.g.,

32 32 15 European Monitoring Stations Evaluated by 5 Criteria Abb.NUMESTPRBM AUS12121 BEL12122 DEN22022 FIN11211 FRA12222 GER22221 GRE10200 IRE00011 ITA20211 LUX00010 NET11021 POR10201 SPA00110 SWE12010 UNK10212

33 33 Hasse Diagram for the EEC 15*5 Dataset Abbr.Rank of EEC5 FRA1 GER2 BEL3 UNK4 DEN5 ITA6 AUS7 FIN8 NET9 POR10.5 SWE10.5 SPA12 GRE13 IRE14 LUX15

34 34 Regions in the Coordinate Space of POSAC Y YpYp 0 P XpXp X (III) Region of points incomparable to P (II) Region of points greater than P (III) Region of points incomparable to P (I) Region of points smaller than P

35 35 Three profiles in a two-dimensional coordinate space Y 3 2 1 0 3210 2,2,0,2,2 2,2,2,2,1 2,0,2,1,1 X

36 36 POSAC Plot of Data Matrix 15*5

37 37 POSAC Output Proportion of profile pairs correctly represented: 0.926 Label DIM 1 DIM 2 Joint Lateral Fit 22222 1.000 1.000 1.000 0.500 0.000 GER 0.935 0.866 0.901 0.535 0.726 FRA 0.901 0.935 0.918 0.483 0.000 BEL 0.707 0.901 0.804 0.403 0.000 DEN 0.612 0.968 0.790 0.322 0.234 AUS 0.661 0.829 0.745 0.416 0.000 FIN 0.829 0.707 0.768 0.561 0.000 UNK 0.866 0.661 0.764 0.602 0.651 ITA 0.968 0.559 0.764 0.705 0.492 NET 0.433 0.791 0.612 0.321 0.213 POR 0.750 0.354 0.552 0.698 0.076 SWE 0.354 0.750 0.552 0.302 0.108 GRE 0.791 0.250 0.520 0.770 0.076 IRE 0.500 0.612 0.556 0.444 0.415 SPA 0.559 0.500 0.530 0.530 0.234 LUX 0.250 0.433 0.342 0.408 0.000 00000 0.000 0.000 0.000 0.500 0.000

38 38 Atlantic Slope Consortium Case Study: Watershed Indicators Evaluation

39 39

40 40 Level3Level2 1 WATERSHEDBIBIFIBINO3 BU FIRBAINVSHA STR MFP SWR_IND EX For estLDIIMP MP AT CORF ORLand-index Back River1.52.21.30.6 0.40.20.50.20.70.50.1 0.00.1 Cattail Creek3.83.34.30.40.80.60.7 0.40.30.60.30.7 0.30.20.4 Gwynn Falls1.92.41.40.30.80.60.7 0.30.20.50.2 0.00.20.10.2 Saint Mary's A2.93.90.20.80.70.60.80.70.60.7 0.80.30.6 Southeast Creek2.73.43.00.8 0.60.50.6 0.7 0.30.50.70.40.30.4 Upper Patuxent3.84.12.80.70.60.70.60.80.70.50.70.40.70.60.4 0.5 Ahoskie0.60.40.50.60.50.40.60.40.70.80.50.7 Buffalo Creek0.40.90.60.90.60.40.20.60.30.6 0.40.70.5 Chickahominy0.60.70.60.80.5 0.60.4 0.10.4 Christian Creek0.10.60.20.90.70.40.5 0.30.60.4 0.30.4 Clearfield Creek0.60.80.21.00.70.40.70.60.70.80.60.70.60.7 Conodoguinet A0.30.70.40.60.70.40.20.60.30.50.2 0.3 Grindle Creek0.50.30.50.60.50.40.60.40.60.70.60.50.70.6 Little Contentnea0.50.70.60.80.70.60.7 0.60.70.5 0.6 Mantua0.50.90.60.80.70.60.30.70.4 0.10.20.10.3 Middle Creek0.40.80.50.80.70.40.30.60.40.70.50.40.60.5 Middle River0.20.50.30.80.60.3 0.50.30.60.3 0.4 Pamunkey0.70.60.70.90.70.50.70.6 0.80.60.70.6 Repaupo0.50.80.60.7 0.80.40.70.40.6 0.40.30.4 White Deer Creek0.9 0.81.00.90.80.70.9 1.00.61.00.9 Wisconisco0.60.80.50.80.70.4 0.70.80.90.50.80.70.8

41 41

42 42 SHA Score Incision Ratio # Stream Stressors Stream-Wetland-Riparian (SWR) Index Buff0-300Basal AreaInvasives# FP-WL Stressors Floodplain-Wetland Condition Conceptual Model of Condition Used for SWR Index

43 43 Conceptual Model of Condition Used for Land Index MFORCORFOR Land Index IMPLDI Urbanization Fragmentation Forest

44 44 Analysis of Indicators for Stream Stability Assessment at Bridge Crossings Case Study

45 45 PROBLEM Bridges fail as a result of the dynamics of the stream channel that they cross.

46 46

47 47 INDICATORS The stream channels and bridges can show signs of warning before catastrophic failure of a bridge occurs.

48 48 Stream Stability Assessment Method Total Score Watershed and Regional Indicators Watershed and Floodplain Flow Habit Channel Pattern Channel Confinement Local Channel Indicators Bed Material Bar Development Obstructions Bank Stability Indicators Bank Soil Texture Bank Slope Angle Bank Protection Bank Cutting Bank Failure Channel Alignment with Bridge Johnson, J. of Hydraulic Eng. 2005

49 49 49 Bridges with 13 Indicators

50 50 POSAC plot for the 13-indicator data matrix

51 51 Concordance Measure

52 52

53 53 Concordance Correlations Correlations with Direct Ranking from Concordance Fractional Weights w/ Original Poset Rank: 0.920 w/ Equal Weights Index: 0.999 w/ Modeled Weight Index: 0.934 Correlations with Concordance Based Weighted Differential Poset Ranking w/ Original Poset Rank: 0.976 w/ Equal Weights Index: 0.935 w/ Modeled Weight Index: 0.928

54 54 METEOR Applied to Bridge Data: Hasse Diagram Progression

55 55

56 56

57 57 Rank range run sequence for the 13-indicator data set

58 58 Maximality level vs. consistency level for the 13-indicator data set

59 59 End-member Elimination results for the 13- indicator data set

60 60 Digital Governance and Hotspot GeoInformatics for Monitoring, Etiology, Early Warning, and Sustainable Managment A collection of white papers prepared for the workshop at dg.o2006 The 7 th Annual International Conference on Digital Government Research San Diego, California May 21, 2006

61 61 Digital Governance and Hotspot GeoInformatics for Monitoring, Etiology, Early Warning, and Management Around the World

62 62 The five year NSF DGP project has been instrumental to conceptualize hotspot geoinformatics partnership among several interested cross-disciplinary scientists in academia, agencies, and private sector around the world. A declared need is around for statistical geoinformatics and software infrastructure for spatial and spatiotemporal hotspot detection and prioritization. Our efforts are driven by a wide variety of case studies of potential interest to government agencies involving critical society issues, such as public health, ecosystem health, sensor networks, robotic networks, social networks, video mining, homeland security, early warning, and disaster management.

63 63 1.Siena, Italy (October, 2005) 2.Parma, Italy (March, 2006) 3.Hyderabad, India (December, 2005) 4.San Diego, USA (May, 2006) 5.Okayama, Japan (November, 2006, January, 2006) 6.Bangkok, Thailand (November, 2005) http://www.j-geoinfo.net/HealthGIS/Symposium.htm 7.Kuala Lumpar, Malaysia (December, 2005) http://iscm.math.um.edu.my/ 8.Jakarta, Indonesia (January, 2006, 2007) 9.Jalgaon, India (December, 2006) 10.Banglor, India (December, 2006) 11.Belo Horizonte, Brazil (April, 2007) 12.Nairobi, Kenya (March, 2007) Course Instructor and Workshop Leader: G.P. Patil Short Course and Case Studies Workshops Around the World

64 64 [1]Patil, G.P., Geoinformatic Hotspot Systems (GHS) for Detection, Prioritization, and Early Warning. Project hightlights, dg.o2005, Atlanta, Georgia. [2]Patil, G.P. and Taillie, C., Upper level set scan statistic for detecting arbitrarily shaped hotspots. Environmental and Ecological Statistics, 11 (2004), 183-197. [3]Patil, G.P. and Taillie, C. Multiple indicators, partially ordered sets, and linear extensions: Multi-criterion ranking and prioritization. Environmental and Ecological Statistics 11 (2004), 199-228. [4]Patil, et al., Upper level set scan statistic system for detecting arbitrarily shaped hotspots for digital governance. Poster presentation, dg.o2005, Atlanta, Georgia. [5]Patil, et al., Geoinformatic surveillance of hotspot detection, prioritization and early warning. Demo, dg.o2005, Atlanta, Georgia [6]Patil, et al., Hotspot geoinformatics for digital governance. Encyclopedia of Digital Government, Ari-Veikko Anttiroiko and Matti Malkia (eds.), (2006, to appear). [7]Patil, et al., Hotspot Detection and Prioritization GeoInformatics for Digital Governance. Digital Government Research Textbook, Larry Brandt, Valerie Gregg, et al. (eds.), (2006, to appear), Springer, New York. [8] Patil, et al., (2006) On PULSE: The Progressive upper level set scan statistic system for geospatial and spatiotemporal hotspot detection. In: The 7 th Annual International Conference on Digital Government Research, San Diego, CA, May 21-24, 2006. [9]Patil, et al., (2006) Multicriterian prioritization and differential weights, stepwise aggregration, Hasse diagrams, and Poset cumulative rank frequency operators. In: The 7 th Annual International Conference on Digital Government Research, San Diego, CA, May 21-24, 2006 References

65 65 http://www.stat.psu.edu/hotspots (A project website providing project activities, products, publications, and events.) http://www.dgrc.org/dgo2006/papers/workshops.jsp#h otspot (Article on the workshop program on hotspot geoinformatics.) http://www.satscan.org (Freeware for circular spatial scan program and information.) http://www.getsynapsed.de/ (Freeware for academia for Hasse program for Windows.) http://www.stat.psu.edu/~gpp (Website for the Penn State Center for Statistical Ecology and Environmental Statistics—Home base of the digital government research project for hotspot geoinformatics.) Online Resources

66 66 Digital Governance and Hotspot GeoInformatics Research Programs 1.Parma, Italy 2.Jalgaon, India 3.Bogar, Indonesia 4.Yokayama, Japan 5.Bello Horizonte, Brazil 6.University Park, PA, U.S.A.


Download ppt "1 Annual Digital Government Research Conference San Diego, CA Project Highlights G.P. Patil May 2006."

Similar presentations


Ads by Google