Published by Frederick Norris. Modified over 9 years ago.
Eick: LCS-Review
Bull-Paper Review

1. Holland (1986): “Classifier systems … rule-based systems with general mechanisms to process rules in parallel, for the adaptive generation of rules, and for testing the effectiveness of existing rules.”
2. LCS = Reinforcement Learning + Evolutionary Computing + heuristics to produce adaptive systems.
3. Bull on EC: “the population of candidate solutions is seen to adapt to the problem.”
4. Reinforcement learning attempts to map state-action combinations to their utility, with the aim of maximizing future rewards.
5. EC is used to search the space of possible rules, while reinforcement learning techniques are used to assign utilities to existing rules, thereby guiding the search for better rules.
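The division of labor in points 4 and 5 can be sketched in a few lines of Python. This is only an illustrative toy, not the mechanism of any particular LCS: the `Rule` class, the ternary `{0, 1, #}` condition encoding, the learning rate `beta`, the mutation rate, and the function names are all assumptions made for this example.

```python
import random
from dataclasses import dataclass


@dataclass
class Rule:
    condition: str          # assumed ternary encoding: '0', '1', or '#' (don't care)
    action: int
    utility: float = 10.0   # assumed positive initial utility


def matches(rule, state):
    """A rule matches when every non-'#' position equals the state bit."""
    return all(c == '#' or c == s for c, s in zip(rule.condition, state))


def reinforce(rule, reward, beta=0.2):
    """Reinforcement step: move the rule's utility toward the received reward."""
    rule.utility += beta * (reward - rule.utility)


def breed(population, rng, mutation_rate=0.1):
    """EC step: utility-proportionate parent selection, one-point crossover,
    then per-position mutation. Assumes conditions of length >= 2 and
    strictly positive utilities."""
    weights = [r.utility for r in population]
    p1, p2 = rng.choices(population, weights=weights, k=2)
    cut = rng.randrange(1, len(p1.condition))
    crossed = p1.condition[:cut] + p2.condition[cut:]
    mutated = ''.join(
        rng.choice('01#') if rng.random() < mutation_rate else c
        for c in crossed
    )
    return Rule(mutated, p1.action, utility=(p1.utility + p2.utility) / 2)
```

The utilities assigned by `reinforce` are what bias `breed` toward useful rules, which is the sense in which reinforcement learning guides the evolutionary search.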
6. Holland’s LCS: bids of successful rules are placed in a bucket, and reinforcement learning redistributes these bids among subsequently chosen rules. For an example of how this redistribution is done in ZCS, read pages 20-21 of the textbook.
7. To understand ZCS, read the textbook rather than Bull’s paper, which fails to explain its mechanisms clearly.
8. Bull on the difference between ZCS and XCS: the most significant difference between XCS and other systems is its intention to form an accurate mapping of the problem space.
9. ZCS/Holland employ TD(0), whereas XCS employs Q-learning (to be explained next week).
10. The second paragraph on page 124 of the textbook assumes a more complex model of XCS in which a state space is explored (to be explained next week); for now, just assume that the payoff of an action is the reward it receives: P = R.
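The bucket idea in point 6 can be sketched roughly as follows. This is a simplified illustration in the spirit of a ZCS-style bucket brigade, not a transcription of the textbook's pages 20-21, which remain the authoritative description. The dictionary layout, the function name `zcs_credit_step`, and the parameter values `beta` and `gamma` are assumptions for this example, and the taxation of non-selected matching rules is deliberately left out.

```python
def zcs_credit_step(action_set, prev_action_set, reward, beta=0.2, gamma=0.71):
    """One simplified credit-assignment step.

    action_set / prev_action_set: lists of dicts with a 'strength' key
    (hypothetical layout). Returns the bucket total for inspection.
    """
    # 1. Every rule in the current action set pays a bid into a common bucket.
    bucket = 0.0
    for rule in action_set:
        bid = beta * rule["strength"]
        rule["strength"] -= bid
        bucket += bid

    # 2. Any external reward is shared equally among the current action set.
    if action_set:
        share = beta * reward / len(action_set)
        for rule in action_set:
            rule["strength"] += share

    # 3. The discounted bucket is passed back to the previously active rules.
    if prev_action_set:
        payback = gamma * bucket / len(prev_action_set)
        for rule in prev_action_set:
            rule["strength"] += payback

    return bucket
```

With two active rules of strength 20, one previous rule of strength 10, a reward of 100, `beta=0.2`, and `gamma=0.5`, each active rule bids 4 and receives 10 of the reward, while the previous rule receives half of the bucket of 8.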
11. Bull’s characterization of how deletion works in XCS is misleading in that it suggests that as is the only thing used; most implementations use as/F, as suggested earlier.
12. Variants of LCS differ in which sets of rules their operations are applied to:
    [N]: all rules (also sometimes called [P])
    [M]: rules that match the input message
    [A]: rule(s) selected to process the incoming message
13. Applications of LCS-style systems (mostly in optimization, control, and modeling; I am not convinced there are many applications in data mining except “mining data streams”):
    Stock price forecasting
    Fighter aircraft modeling
    Control of routing nodes in packet-switched networks
    Control in electrical power distribution
    Traffic signal control
14. XCS software (there likely are newer versions!): http://www2.cs.uh.edu/~ceick/6367/Butz-XCS.pdf
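The rule sets of item 12 and the as/F deletion weighting of item 11 can be illustrated with a small sketch. It assumes that "as" denotes a rule's action-set-size estimate and F its fitness, that conditions use a ternary `{0, 1, #}` encoding, and that deletion is done by roulette wheel over as/F; the dictionary keys and function names are hypothetical, and real XCS implementations include further details (numerosity, experience thresholds) that are omitted here.

```python
import random


def matches(condition, message):
    """'#' is a don't-care; all other positions must equal the message bit."""
    return all(c == '#' or c == m for c, m in zip(condition, message))


def match_set(population, message):
    """[M]: the rules in [N] whose condition matches the input message."""
    return [r for r in population if matches(r["condition"], message)]


def action_set(m_set, action):
    """[A]: the rules in [M] that advocate the selected action."""
    return [r for r in m_set if r["action"] == action]


def deletion_weights(population, eps=1e-9):
    """Deletion vote as/F: large action sets and low fitness raise the vote."""
    return [r["as"] / max(r["fitness"], eps) for r in population]


def delete_one(population, rng):
    """Remove one rule from [N], chosen in proportion to its as/F vote."""
    weights = deletion_weights(population)
    victim = rng.choices(range(len(population)), weights=weights, k=1)[0]
    return population.pop(victim)
```

Using as alone would only balance the sizes of the action sets; dividing by F additionally pushes low-fitness rules out of the population.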