1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.

1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University

2 Overview l The BN Knowledge Engineering Process »focus on combining expert elicitation and automated methods l Case Study I: Seabreeze prediction l Case Study II: Intelligent Tutoring System for decimal misconceptions l Conclusions

3 Elicitation from experts l Variables »important variables? values/states? l Structure »causal relationships? »dependencies/independencies? l Parameters (probabilities) »quantify relationships and interactions? l Preferences (utilities) (for decision networks)

4 Expert Elicitation Process l These stages are done iteratively l Stops when further expert input is no longer cost effective l Process is difficult and time consuming.

5 Knowledge discovery l There is much interest in automated methods for learning BNs from data »parameters, structure (causal discovery) l Computationally complex problem, so current methods have practical limitations »e.g. limit number of states, require variable ordering constraints, do not specify all arc directions, don’t handle hidden variables l Evaluation methods

6 The knowledge engineering process 1. Building the BN »variables, structure, parameters, preferences »combination of expert elicitation and knowledge discovery 2. Validation/Evaluation »case-based, sensitivity analysis, accuracy testing 3. Field Testing »alpha/beta testing, acceptance testing 4. Industrial Use »collection of statistics 5. Refinement »Updating procedures, regression testing

7 Case Study: Seabreeze prediction l Joint project with Bureau of Meteorology »(Kennet, Korb & Nicholson, PAKDD’2001) l Goal: proof of concept; test ideas about integration of automated learners & elicitation What is a seabreeze? (separate picture)

8 Rule-based predictor and data l Bureau of Meteorology’s (BOM) system achieved about 67% predictive accuracy; currently in use. Ifwind component is offshore andwind component < 23 knots andthe forecast period is in the afternoon thena sea breeze is likely l Seabreeze Data: »30MB from October 1997 to October 1999 from Sydney, Australia. 7% had missing attribute values. »Three types of sensor site data: –Automatic weather stations: ground level data, time –Olympic sites (for sailing, etc): rain, temp, humidity, wind –Balloon data: gradient-level readings »Predicted variables: wind speed and direction

9 Methodology 1. Expert Elicitation. Using variables with available data, forecasters provided causal relations between them. 2. Tetrad II (Spirtes, et al., 1993) uses the Verma-Pearl algorithm (1991) with significance testing to recover causal structure. (NB: usability problems) 3. CaMML (Wallace and Korb, 1999) uses Minimum message Length (MML) to discover causal structure. l BNs for Seabreeze Predictions (see separate slide) »All parameterization was performed by Netica keeping different methods on an equal footing. »Uses simple counting over training data to estimate conditional probabilities (Spiegelhalter & Lauritzen, 1990)

10 Predictive accuracy l Instead of seabreeze existence prediction, we substituted more demanding task: prediction of wind direction at ground level l From this (and gradient-level wind direction), seabreezes can be inferred. l Training/testing regime »randomly select 80% of data for training »use remainder for testing accuracy l Results »See separate slide (comparison of airport site type network versions)

11 Predictive accuracy conclusions l Elicited and discovered nets (MML + Tetrad II) are systematically superior to BOM RB l Discovered networks are superior to elicited nets in first 3 hrs (conf intervals are ~10%) l Strong time component to accuracy

12 Adaptation: Incremental learning l Learn structure from first year’s data (using MML) l Reparameterise nets over second year’s data, while predicting seabreezes »greedy search yielded a time decay factor of e -t0.05 l Results (see separate slides) »comparison of incremental and normal training methods by BN type and by time of year »incremental performed better

13 Case Study II: Intelligent tutoring l Tutoring domain: primary and secondary school students’ misconceptions about decimals l Based on Decimal Comparison Test (DCT) »student asked to choose the larger of pairs of decimals »different types of pairs reveal different misconceptions l ITS System involves computer games involving decimals l This research also looks at a combination of expert elicitation and automated methods

14 Expert classification of Decimal Comparison Test (DCT) results

15 The ITS architecture Adaptive Bayesian Network Decimal comparison test (optional) Inputs Computer Games Generic BN model of student Information about student e.g. age (optional) Hidden number Flying photographer Decimaliens …. Number between Student Item Answer Item Answer Classroom diagnostic test results (optional) Classroom Teaching Activities Report on student Answer Item type New game  Diagnose misconception  Predict outcomes  Identify most useful information Sequencing tactics  Select next item type  Decide to present help  Decide change to new game  Identify when expertise gained Teacher System Controller Module Answers Help Feedback Help

16 Expert Elicitation l Variables »two classification nodes: fine and coarse »item types: (i) H/M/L (ii) 0-N l Structure »arcs from classification to item type »item types independent given classification l Parameters »careless mistake (3 different values) »expert ignorance: - in table (uniform distribution)

17 Expert Elicited BN

18 Evaluation process l Case-based evaluation »experts checked individual cases »sometimes, if prior was low, ‘true’ classification did not have highest posterior (but usually had biggest change in ratio) l Adaptiveness evaluation »priors changes after each set of evidence l Comparison evaluation »Differences in evaluation between BN and expert rule

19 Comparison: expert BN vs rule UndesirableDesirableSame

20 Results Undes. Desir. Same varying prob. of careless mistake varying granularity of item type: 0-N and H/M/L

21 Automated methods: Classification l Applied SNOB classification program, based on MML l Using data from 2437 students, 30 items, SNOB produced 14 classes »10 corresponded to expert classes »2 expert classes LRV and AU were not found »4 clases were mainly combinations of AU and UN »unable to classify 0.5% of students l Using pre-processed data (0-N or H/M/L) on 6 item types, SNOB found only 5 or 6 classes

22 Automated Methods l Parameters »Again, used Netica counting method l Structure »Applied CaMML to pre-processed data (0-N and H/-M/L) 1.constrained so that classification node was parent of item type nodes 2.unconstrained »Many different network structures found, all with arcs between item type nodes, of varying complexity

23 Results from automated methods

24 Conclusions l Automated methods yielded BNs which gave quantitative results comparable to or better than elicited BNs »validation of automated methods (?) l Undertaking both elicitation and automated KE resulted in additional domain analysis (e.g. 0-N vs H/M/L) l Hybrid of expert and automated approaches is feasible »methodology for combining is needed »evaluation measures and methods needed (may be domain specific)

1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.

Similar presentations

Presentation on theme: "1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University.

Similar presentations

Presentation on theme: "1 Knowledge Engineering for Bayesian Networks Ann Nicholson School of Computer Science and Software Engineering Monash University."— Presentation transcript:

Similar presentations

About project

Feedback