Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing.

Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing Systems Lab Neuroevolution and Behavior Laboratory University of Nevada, Reno

Related Work

Maintain Station ● One or more boats follow a leader at specified offsets ● Allows groups of boats to move in formation ● Current work tests only the ability to maintain position ● Ultimate goal is to include following with obstacle avoidance

Maintain Station Continued ● Maintain station has two phases – First finding and moving toward the target point – Second staying on the target point by matching the heading and speed of the lead boat ● Boats have complex physics including friction and inertia ● Maintain behavior is difficult because of limited maneuverability ● Complex physics makes problem interesting

Lagoon ● Real-time 3D naval combat simulation game ● Provides complex physics and flexible control architecture ● Allows for easy integration of experimental control devices like neural networks

Approach ● Simple recurrent neural network (SRN) (add cite) ● Six inputs, two outputs, eight hidden ● ESP evolutionary algorithm (Gomez and Miikkulainen 1998) ● Neural network implicitly learns physics

Maintain Vector ● Method of representing the maintain station behavior ● All sensor and fitness values are based on the maintain vector ● Abstracts away from relative offset ● Following boat does not care about lead boat's position (is unaware if the relative vector position is changed to create new formations) ● Increases the generality of the behavior

Representation ● Three points evenly spaced around target point ● Egocentric vectors cast from boat to sensor points ● x and y components of normalized vectors provide sensor inputs ● Inputs are continuous ● Informs boat about direction to target and rotation relative to lead boat

Fitness ● vL – vector with the same heading as the maintain vector ● vH – vector with the same heading as the boat ● vP – vector cast from boat to target point ● dT – Euclidean distance from boat to target

Fitness continued ● Function 1: – Rewards moving towards target ● Function 2: – Rewards moving towards and away from target ● Function 3: – Rewards moving towards target while matching heading of lead boat ● Fitness values integrated over time

Experiment 1 ● Boats start in world with random positions and headings. ● Lead boat travels in a straight line at exactly half of its maximum speed ● Maintaining boats must acquire and then maintain position on their target points

Experiment 1 Results ● Results surprisingly similar for each fitness function ● Similar growth rates and maximum fitness values ● Results averaged over ten runs

Experiment 2 ● Boats start in world with random positions and headings. ● Lead boat travels in a straight line at constant speed chosen at random from between 25% and 75% ● Maintaining boats must acquire and then maintain position on their target points

Experiment 2 Results ● Again results are similar for each fitness function ● All functions learned the desired behavior ● Results averaged over ten runs

Qualitative Observations ● Boats trained with different fitness functions display subtle differences in behaviors ● Boats trained with Function 1 maintain well but can struggle in the approach ● Boats trained with Function 2 approach well but sometimes struggle to maintain ● Boats trained with Function 3 are better at balancing maintaining and approaching

Conclusions ● Neuroevolution is capable of producing the maintain station behavior ● All three fitness functions produced acceptable results ● A fitness function that considers both orientation relative to target point and orientation relative to target heading produces the most balanced results

Future Work ● Expand network to perform maintain station with obstacle avoidance ● Add new sensors to act as radar for detecting boats and land ● Alter fitness function to encourage avoidance behaviors

Acknowledgments This material is based upon work supported by the Office of Naval Research under contract number N00014-05-0709

Bibliography ● F. Gomez and R. Miikkulainen, “2-D pole-balancing with recurrent evolutionary networks,” ● in Proceedings of the International Conference on Artificial Neural Networks. ● Berlin; New York: Springer-Verlag, 1998, pp. 425–430. ● Available:http://nn.cs.utexas.edu/keyword?gomez:icann98

Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing.

Similar presentations

Presentation on theme: "Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing.

Similar presentations

Presentation on theme: "Neuro-evolving Maintain-Station Behavior for Realistically Simulated Boats Nathan A. Penrod David Carr Sushil J. Louis Bobby D. Bryant Evolutionary Computing."— Presentation transcript:

Similar presentations

About project

Feedback