Heuristic Search and Information Visualization Methods for School Redistricting University of Maryland Baltimore County Marie desJardins, Blazej Bulka,

Slides:



Advertisements
Similar presentations
Local optimization technique G.Anuradha. Introduction The evaluation function defines a quality measure score landscape/response surface/fitness landscape.
Advertisements

Time-Aware Test Suite Prioritization Kristen R. Walcott, Mary Lou Soffa University of Virginia International Symposium on Software Testing and Analysis.
Data Mining Classification: Alternative Techniques
Uses and Abuses of the Efficient Frontier Michael Schilmoeller Thursday May 19, 2011 SAAC.
Multi-objective optimization multi-criteria decision-making.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Geo479/579: Geostatistics Ch14. Search Strategies.
Multiobjective Optimization Chapter 7 Luke, Essentials of Metaheuristics, 2011 Byung-Hyun Ha R1.
Hidden Markov Models Theory By Johan Walters (SR 2003)
- Sam Ganzfried - Ryan Sukauye - Aniket Ponkshe. Outline Effects of asymmetry and how to handle them Design Space Exploration for Core Architecture Accelerating.
Spring, 2013C.-S. Shieh, EC, KUAS, Taiwan1 Heuristic Optimization Methods Pareto Multiobjective Optimization Patrick N. Ngatchou, Anahita Zarei, Warren.
CPSC 322, Lecture 15Slide 1 Stochastic Local Search Computer Science cpsc322, Lecture 15 (Textbook Chpt 4.8) February, 6, 2009.
Estimation of Distribution Algorithms Ata Kaban School of Computer Science The University of Birmingham.
Iterative Optimization of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial Intelligence.
SocalBSI 2008: Clustering Microarray Datasets Sagar Damle, Ph.D. Candidate, Caltech  Distance Metrics: Measuring similarity using the Euclidean and Correlation.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
A Heuristic Bidding Strategy for Multiple Heterogeneous Auctions Patricia Anthony & Nicholas R. Jennings Dept. of Electronics and Computer Science University.
A New Evolutionary Algorithm for Multi-objective Optimization Problems Multi-objective Optimization Problems (MOP) –Definition –NP hard By Zhi Wei.
Optimization via Search CPSC 315 – Programming Studio Spring 2009 Project 2, Lecture 4 Adapted from slides of Yoonsuck Choe.
Reporter : Mac Date : Multi-Start Method Rafael Marti.
Iterative Improvement Algorithms
Switch to Top-down Top-down or move-to-nearest Partition documents into ‘k’ clusters Two variants “Hard” (0/1) assignment of documents to clusters “soft”
Intro to AI Genetic Algorithm Ruth Bergman Fall 2002.
Design of Curves and Surfaces by Multi Objective Optimization Rony Goldenthal Michel Bercovier School of Computer Science and Engineering The Hebrew University.
MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition
16.5 Introduction to Cost- based plan selection Amith KC Student Id: 109.
K-means Clustering. What is clustering? Why would we want to cluster? How would you determine clusters? How can you do this efficiently?
Clustering Ram Akella Lecture 6 February 23, & 280I University of California Berkeley Silicon Valley Center/SC.
Intro to AI Genetic Algorithm Ruth Bergman Fall 2004.
Metaheuristics The idea: search the solution space directly. No math models, only a set of algorithmic steps, iterative method. Find a feasible solution.
Quality Indicators (Binary ε-Indicator) Santosh Tiwari.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Efficient Model Selection for Support Vector Machines
Unsupervised Learning Reading: Chapter 8 from Introduction to Data Mining by Tan, Steinbach, and Kumar, pp , , (
Philosophy of IR Evaluation Ellen Voorhees. NIST Evaluation: How well does system meet information need? System evaluation: how good are document rankings?
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
1 Local search and optimization Local search= use single current state and move to neighboring states. Advantages: –Use very little memory –Find often.
A two-stage approach for multi- objective decision making with applications to system reliability optimization Zhaojun Li, Haitao Liao, David W. Coit Reliability.
A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.
DAVIS AQUILANO CHASE PowerPoint Presentation by Charlie Cook F O U R T H E D I T I O N Forecasting © The McGraw-Hill Companies, Inc., 2003 chapter 9.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Stratified K-means Clustering Over A Deep Web Data Source Tantan Liu, Gagan Agrawal Dept. of Computer Science & Engineering Ohio State University Aug.
Iterative Improvement Algorithm 2012/03/20. Outline Local Search Algorithms Hill-Climbing Search Simulated Annealing Search Local Beam Search Genetic.
Chapter 14 – Cluster Analysis © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
Kanpur Genetic Algorithms Laboratory IIT Kanpur 25, July 2006 (11:00 AM) Multi-Objective Dynamic Optimization using Evolutionary Algorithms by Udaya Bhaskara.
DIVERSITY PRESERVING EVOLUTIONARY MULTI-OBJECTIVE SEARCH Brian Piper1, Hana Chmielewski2, Ranji Ranjithan1,2 1Operations Research 2Civil Engineering.
Optimizing Pheromone Modification for Dynamic Ant Algorithms Ryan Ward TJHSST Computer Systems Lab 2006/2007 Testing To test the relative effectiveness.
For Wednesday Read chapter 6, sections 1-3 Homework: –Chapter 4, exercise 1.
For Wednesday Read chapter 5, sections 1-4 Homework: –Chapter 3, exercise 23. Then do the exercise again, but use greedy heuristic search instead of A*
Evolutionary Design (2) Boris Burdiliak. Topics Representation Representation Multiple objectives Multiple objectives.
Helsinki University of Technology Systems Analysis Laboratory 1DAS workshop Ahti A. Salo and Raimo P. Hämäläinen Systems Analysis Laboratory Helsinki.
WERST – Methodology Group
Heuristic Methods for the Single- Machine Problem Chapter 4 Elements of Sequencing and Scheduling by Kenneth R. Baker Byung-Hyun Ha R2.
Local Search. Systematic versus local search u Systematic search  Breadth-first, depth-first, IDDFS, A*, IDA*, etc  Keep one or more paths in memory.
Evolving RBF Networks via GP for Estimating Fitness Values using Surrogate Models Ahmed Kattan Edgar Galvan.
3.5 Informed (Heuristic) Searches This section show how an informed search strategy can find solution more efficiently than uninformed strategy. Best-first.
Evolutionary Computing Chapter 12. / 26 Chapter 12: Multiobjective Evolutionary Algorithms Multiobjective optimisation problems (MOP) -Pareto optimality.
Zhengli Huang and Wenliang (Kevin) Du
Selection and Recombination Temi avanzati di Intelligenza Artificiale - Lecture 4 Prof. Vincenzo Cutello Department of Mathematics and Computer Science.
Search Control.. Planning is really really hard –Theoretically, practically But people seem ok at it What to do…. –Abstraction –Find “easy” classes of.
CMPT 463. What will be covered A* search Local search Game tree Constraint satisfaction problems (CSP)
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
CEng 713, Evolutionary Computation, Lecture Notes parallel Evolutionary Computation.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Chapter 15 – Cluster Analysis
Multi-Objective Optimization
SASS: Self-Adaptation using Stochastic Search
MOEA Testing and Analysis
Area Coverage Problem Optimization by (local) Search
Presentation transcript:

Heuristic Search and Information Visualization Methods for School Redistricting University of Maryland Baltimore County Marie desJardins, Blazej Bulka, Ryan Carr, Andrew Hunt, Priyang Rathod, and Penny Rheingans This work was partially supported by NSF # IIS Thanks to David Drown and the Howard County Public School System for data and valuable inputs.

July 18, Overview  The Problem: School Redistricting  Searching for Good Plans  Results  Future Work and Conclusions

July 18, The Problem: School Redistricting

July 18, School Redistricting  Assign neighborhoods (or planning polygons) within a school district to schools while considering multiple factors, such as busing costs, test score distribution, and school utilization  Finding the best assignment (or plan) is a multiattribute optimization problem  Also want to generate qualitatively different plans that represent tradeoffs among the criteria, and help users visualize these tradeoffs  Search space is very large: O(s p ), where  s is the number of schools (12 high schools; 30 elementary)  p is the number of polygons (~263)  Currently in Howard County, Maryland, the process is almost entirely manual

July 18, Evaluation Criteria  Educational benefits for students  Frequency with which students are redistricted  Number and distance of students bused  Total busing cost  Demographics and academic performance of schools  Number of students redistricted  Maintenance of feeder patterns  Changes in school capacity  Impact on specialized programs  Functional and operational capacity of school infrastructure  Building utilization

July 18, Evaluation Criteria  Educational benefits for students  Frequency with which students are redistricted  Number and distance of students bused  Total busing cost  Demographics and academic performance of schools  Number of students redistricted  Maintenance of feeder patterns  Changes in school capacity  Impact on specialized programs  Functional and operational capacity of school infrastructure  Building utilization

July 18, Evaluation Criteria 1.Number of students bused  Students who can walk to a school should be assigned to that school 2.Busing cost  Estimated as a population-weighted sum of polygon-school distances 3.Demographics  FARM (Free and Reduced Meal) ratio at each school should ideally be the same as that of the county as a whole 4.Academic performance  MSA (Maryland State Assessment) scores at each school should ideally be the same as those of the county as a whole 5.Capacity  Each school should be between 90% and 110% of available capacity  Penalty functions are defined for each of the five criteria above  per-polygon, per-school, per-plan cost measure from 0 (good) to 1 (bad)

July 18, Selecting Multiple Plans: Diversity  One plan dominates another if it is better along all dimensions  Two plans are incomparable if each is better than the other along at least one dimension  A good set of plans should:  contain no dominated plans  consist of qualitatively different plans  We measure “qualitatively different” using Euclidean distance in the evaluation space:  Div(P) = 1/|P|  p i, p j  P Dist(p i, p j )

July 18, Closest-School Plan Marriottsville High (new)

July 18, Closest [outer] vs. Recommended [inner]

July 18, Searching for Good Plans

July 18, Multiattribute Optimization  Previous approaches to multiattribute optimization:  Weighted methods: Combine attributes into a single weighted sum  Priority-based methods: Optimize one attribute, then perform constrained optimization on the other attributes  MOA* (and variations): Find all nondominated solutions using heuristic search  Evolutionary methods: Use genetic search to explore the population space using recombination and fitness-based selection  Redistricting domain:  No single set of weights or prioritization scheme  Very large search space  can’t find all (or even most) nondominated solutions  Use local search

July 18, Basic Hill-Climbing  Baseline:  Choose an initial plan as a starting point (seed)  then hill-climb through “[weighted] sum of criteria” space  Seed options:  closest-school plan  current plan  random plan  “breadth-first” assignment  “minimum-spanning-tree” assignment

July 18, Biased Hill-Climbing  General approach:  Choose an initial plan as a seed  Hill-climb through “dominated plan” space  Save “incomparable plans” as they are encountered  Stop at a local maximum  Restart search starting from a plan in the incomparable list  Blind bias:  At restart, choose a plan from the incomparable list at random  Diversity bias:  At restart, choose the plan that is farthest in evaluation space from the solutions found so far

July 18, Results

July 18, Quality of Generated Plans  Quality of generated plans is better than manually generated plans ...with respect to the particular evaluation criteria we’ve defined  Original-plan seed does better than closest-plan seed  leads to the “wrong” local maximum  Compared to recommended plan, generated plans generally perform: ...better with respect to capacity ...comparably with respect to socioeconomic and academic measures ...better with respect to busing costs ...comparably with respect to walk utilization Outer: Recommended Inner: Best generated

July 18, Diversity of Generated Plans  Baselines:  Manual plans: closest, recommended, and alternative (diversity measure =  )  Unweighted hill-climbing: multiple runs of basic hill-climbing with different initial seeds (diversity measure =  )  Weighted hill-climbing: multiple runs of basic hill-climbing with different weight vectors (diversity measure =  )  Biased hill-climbing:  with blind bias: diversity measure =   with diversity bias: diversity measure =  Note: Biased hill-climbing yields a somewhat worse average unweighted sum, so the plans are not quite “as good” in a direct comparison 

July 18, Future Work and Conclusions

July 18, Future Work  Modeling additional evaluation criteria:  Feeder statistics  Redistricting frequency  Incorporating projected future demographic shifts into evaluation, search, and visualization  Extensions to search methods:  Other definitions of diversity (e.g., dispersion, similar to k-means mean-squared error)  Other multiattribute optimization methods (particularly genetic methods)  Visualization extensions:  Visualizing feeder patterns  Computing and visualizing gradients in search space  Deployment and user testing

July 18, Conclusions  School redistricting is an important and challenging problem  The multiattribute optimization framework is a good paradigm for this application  Novel search techniques and evaluation methods are needed  Diversity-biased hill climbing is a promising initial approach