1 L. M. Pereira Centro de Inteligência Artificial - CENTRIA Universidade Nova de Lisboa, Portugal P. Dell’Acqua, M. Engberg Dept. of Science and Technology - ITN Linköping University, Sweden December, 2003 EPIA’03, Beja, Portugal An Architecture for a Rational Reactive Agent
2 We have proposed before a type of agents that are able to: - reason and react to the environment (including other agents). - update their own knowledge, reactions and goals. - interact by updating the theory of another agent. - decide whether to accept an update depending on the requesting agent. - prefer among possible choices. - abduce hypotheses to explain observations. Agent capabilities
3 Applications Due to their deliberative, reactive and updating capabilities, our approach to agents is an enabling technology for sophisticated Web applications. Information integration How to integrate data from multiple heterogeneous sources and provide a uniform interface. Web site management: self-reconfigurable and adaptive Web sites. Declarative representation of Web sites allows: –to automatically reconstruct them, e.g. on usage patterns; –to make them adaptable wrt. user profiles; –to enforce integrity constraints on Web sites, e.g. no dangling pointers.
4 Applications For a discussion on the use of preference reasoning in Web applications cf. User preference information in query answering P. Dell'Acqua, L. M. Pereira, and A. Vitória 5th Int. Conf. on Flexible Query Answering Systems 2002, LNCS 2522 For a discussion on the use of Logic Programming for the Semantic Web cf. Semantic web logic programming tools J. J Alferes, C. V. Damásio, L. M. Pereira invited paper, Ws. on Principles and Practice of Semantic Web reasoning (PPSWR’03), at 19 th Int. Conf. On Logic Programming (ICLP’03), Dec. 2003
5 Language Atomic formulae: Aobjective atoms not Adefault atoms :Cprojects ÷Cupdates Formulae: L i is an atom, an update or a negated update active rule generalized rules Z j is a project integrity constraint false L 1 L n Z 1 Z m A L 1 L n not A L 1 L n L 1 L n Z
6 An updating program is a finite set of updates. Let S be a set of natural numbers. We call the elements s S states. An agent at state s, written s, is a pair (A,U) where: - A is a set of abducibles; - U={U 1,…, U s } is a sequence of updating programs. Abductive agents
7 Agent cycle
8 Agent behavioural issues The interaction between the rational/deliberative (D) and reactive (R) behaviour of the agent, and its updating mechanism (U) raises several issues: D-R Should the deliberative behaviour of the agent influence its reactive behaviour? and vice versa? D-U If has a goal to prove, and contemporarily some updates to consider, which of the two tasks shall perform first? What should the behaviour of be when it receives an update while proving a goal G (i.e. while is in deliberative mode). R-U If receives several updates, should consider each update and then trigger its active rules? or should consider all the updates and only then trigger the active rules?
9 Contribution of the paper We propose an agent architecture that allows: to declaratively interrupt/suspend the execution of a query; to declaratively self-modify its control parameters; to tune its behaviour to the environmental needs, and hence to address an agent’s behavioural needs. Aim: to design an architecture that empowers agents with self-monitoring and self-control capabilities.
10 Agent architecture It consists of 6 asynchronous components.
11 Central control controls the behaviour of the entire architecture via control parameters. Reactive and Rational processes characterize the reactive and rational behaviours of the agent. It is implemented by two XSB Prolog processes with identical knowledge bases. The Rational process can abduce. Update handler sorts out the information received from the reactive process and the environment. Action handler handles the tasks (actions) the agent wants to execute. External interface handles the communication between the agent and it’s environment. Agent architecture
12 If an agent is at state s and s ={A,U}, then the knowledge base of the rational process is the generalized logic program P = (s,U) obtained via the syntactic transformation . The ABDUAL procedure is employed wrt. P to compute well-founded abductive answers to goals. The Rational process receives the queries to be proved from central control. If the rational process proves ?-g with substitution and list of abduced hypotheses La, it returns ans(g,g ,La) to central control which in turn updates the KBs of the rational and reactive processes with ÷ ans(g,g ,La). Rational process
13 Similarly to the Rational process, the KB of the reactive process is P = (s,U). The task of the Reactive process is to trigger any active rule in P whose body is satisfied given the hypotheses list La abduced by the Rational process at state s. Triggering an active rule means executing the project occurring in its head. Distinguished projects: urgentRequest(G) : sendIterrupt(G, true)interrupt meeting doAct(displayMessage)action alarmState stateModify(reactiveDelayTime,100)self-tuning Reactive process
14 Handles all the components by means if its control parameters. Has the ability to interrupt/suspend the execution of a query. –It sends an interrupt sendInterrupt(G,F) to the rational process. –The interruption/suspension commands are implemented by XSB Prolog, and can originate externally. Comprises two queues: –a prioritized queue with updates containing interrupts; –a queue with normal updates. Central control
15 For any update of the form ÷ (?-g), central control: 1.launches the query ?-g on the Rational process; 2.collects its answer ans(g,g ,La) and it updates the KBs of the Rational and the Reactive process with it; 3.collects from the Reactive process all the executable projects, and sends them to the update handler. For any update of the form ÷ stateModify(controlPar,value), central control modifies the value of the specified control parameter. Central control
16 Example Theory of Peter: S1a b, not c b friend(mary) A÷ (?-G), friend(A), not ans(G,G1,La) peter : (?-G) A÷ (?-G), ans(G,G1,La) A: ans(G,G1,La)
17 Project flow Mary ?-a mary÷(?-a) peter:(?-a) peter÷ (?-a) ?-a ans(a,a,[ ]) peter÷ans(a,a,[ ]) mary:ans(a,a,[ ])
18 Future work Test the interaction of the components of the architecture: Web site management application. Add other modules to the architecture: learning and planning module. Develop a multi-agent platform: ongoing implementation of a platform supporting the interactions of our logic-based agents as well as some forms of agent structures. Provide a formal description of the behaviour of the architecture: to verify logical properties; to analyze and expand the architecture; to formally compare our architecture with existing hybrid architectures.
19 Future work To make our implementation distributed over the internet: To allow our agents to have parts spread on the Web. Achieved by RMI of Java and by letting the XSB agents maintain a mirror copy of the KB of the agent. Advantage: to offer high level Web services. E.g. if a deliberative agent is being developed at one place, several reactive agents may use its deliberations.