Using Bayesian Networks to Predict Plankton Production from Satellite Data By: Rob Curtis, Richard Fenn, Damon Oberholster Supervisors: Anet Potgieter,

Slides:



Advertisements
Similar presentations
Autonomic Scaling of Cloud Computing Resources
Advertisements

Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Information Modelling MOLES Metadata Objects for Linking Environmental Sciences S. Ventouras Rutherford Appleton Laboratory.
Bridgette Parsons Megan Tarter Eva Millan, Tomasz Loboda, Jose Luis Perez-de-la-Cruz Bayesian Networks for Student Model Engineering.
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
INTRODUCTION TO MODELING
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
Management Information Systems, Sixth Edition
INSY 7200 Slip Casting Neural Net / Fuzzy Logic Control System.
Introduction of Probabilistic Reasoning and Bayesian Networks
Modeling Human Reasoning About Meta-Information Presented By: Scott Langevin Jingsong Wang.
Case-based Reasoning System (CBR)
NPS Introduction to GIS: Lecture 1
Presented by Zeehasham Rasheed
1 MARG-DARSHAK: A Scrapbook on Web Search engines allow the users to enter keywords relating to a topic and retrieve information about internet sites (URLs)
Databases and Database Management Systems
Introduction to Machine Learning Approach Lecture 5.
Introduction to the course January 9, Points to Cover  What is GIS?  GIS and Geographic Information Science  Components of GIS Spatial data.
Page 1 ISMT E-120 Introduction to Microsoft Access & Relational Databases The Influence of Software and Hardware Technologies on Business Productivity.
Data Mining Techniques
Bayesian Network Student Model for Adapting Learning Activity Tasks in Adaptive Course Generation System Introduction Adaptive educational hypermedia system.
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
Chapter 14: Artificial Intelligence Invitation to Computer Science, C++ Version, Third Edition.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 8: Quantitative.
ITEC224 Database Programming
Managing Organizations Informed decision making as a prerequisite for success Action Vision Mission Organizational Context Policies, Goals, and Objectives.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
A Comparison Between Bayesian Networks and Generalized Linear Models in the Indoor/Outdoor Scene Classification Problem.
Chapter © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or.
UML based expert system generation Using Enterprise Architect to model and generate a web-based expert system.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
C/IL 102 Demographic Survey Fall 2005 Summary. Basic Word Processing (Create, store, spell check) Very Poor 0 Poor0 Average7 Good20 Excellent
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
ES Model development Dr. Ahmed Elfaig The ES attempts to predict results from available information, data and knowledge The model should be able to infer.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
I Robot.
Introduction to Physical Science. A-Science- A-Science- Is a way of learning about the universe and it’s natural laws (Gravity) 1- Skills of scientist.
AN INTELLIGENT AGENT is a software entity that senses its environment and then carries out some operations on behalf of a user, with a certain degree of.
Bayesian Networks Optimization of the Human-Computer Interaction process in a Big Data Scenario Candidate: Emanuele Charalambis University of Modena and.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
CRESST ONR/NETC Meetings, July 2003, v1 17 July, 2003 ONR Advanced Distributed Learning Greg Chung Bill Bewley UCLA/CRESST Ontologies and Bayesian.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Hybrid Intelligent Systems for Network Security Lane Thames Georgia Institute of Technology Savannah, GA
Cognitive Computer Vision Kingsley Sage and Hilary Buxton Prepared under ECVision Specific Action 8-3
Banaras Hindu University. A Course on Software Reuse by Design Patterns and Frameworks.
CSC400W Honors Project Proposal Understanding ocean surface features from satellite images Jared Tilanus Nemanja Spasic.
Welcome: To the fifth learning sequence “ Data Models “ Recap : In the previous learning sequence, we discussed The Database concepts. Present learning:
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
1 Chapter 1 Introduction to Accounting Information Systems Chapter 2 Intelligent Systems and Knowledge Management.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Managing Data Resources File Organization and databases for business information systems.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Information and Information Technology 1. Information and employment 2.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
- The most common types of data models.
THE SP SYSTEM AS AN AID TO CRIME PREVENTION AND DETECTION (CPD)
School of Computer Science & Engineering
Introduction to Physical Science
Calibration information in OSCAR/Space and other OSCAR developments
CS & CS Capstone Project & Software Development Project
Qualitative Observation
Causal Models Lecture 12.
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Presentation transcript:

Using Bayesian Networks to Predict Plankton Production from Satellite Data By: Rob Curtis, Richard Fenn, Damon Oberholster Supervisors: Anet Potgieter, John Field, Laurent Drapeau Department of Computer Science

Overview Introduction Introduction Work Detail Work Detail Knowledge Acquisition Knowledge Acquisition Knowledge Representation Knowledge Representation Bayesian Learning and Inference Bayesian Learning and Inference Topic Maps Topic Maps

Introduction Aim to predict plankton primary production using satellite data Aim to predict plankton primary production using satellite data Daily satellite data on surface temperature, chlorophyll, winds, currents Daily satellite data on surface temperature, chlorophyll, winds, currents Archive of ships’ sub-surface details Archive of ships’ sub-surface details Predict likely subsurface plankton profile from surface features Predict likely subsurface plankton profile from surface features

Current System Currently best solution uses Self Organising Maps (SOMs: A type of neural network) to classify data Currently best solution uses Self Organising Maps (SOMs: A type of neural network) to classify data –Resulting solution lacks accuracy –Difficult to interpret

Proposed System Propose a system that uses Bayesian Networks to predict plankton production Propose a system that uses Bayesian Networks to predict plankton production –Use ships’ sub surface profiles + satellite data to draw cause effect relationships –Will use Bayesian Inference and Learning Use Topic Maps to visualize network Use Topic Maps to visualize network

Work Detail Knowledge Acquisition Inference Engine Knowledge Representation Learning Engine Topic Map Requirements Elicitation Rob Curtis Richard Fenn Damon Oberholster

Knowledge Acquisition “The process of analyzing, transforming, classifying, organizing and integrating knowledge and representing that knowledge in a form that can be used in a computer system. Typically the knowledge is based on what a human expert does when solving problems” “The process of analyzing, transforming, classifying, organizing and integrating knowledge and representing that knowledge in a form that can be used in a computer system. Typically the knowledge is based on what a human expert does when solving problems” Relating to this project: Relating to this project: –Huge amounts of data –Data is poorly recorded in Excel spreadsheets –Gaps in current data

Knowledge Acquisition: Amount of Data 2500 ship sub surface readings 2500 ship sub surface readings –Recorded over 10 year period Bayesian Network requires satellite data for the same time period Bayesian Network requires satellite data for the same time period Need to represent data in a form that can be used by the Bayesian Network Need to represent data in a form that can be used by the Bayesian Network

Knowledge Acquisition: Current Data

Knowledge Acquisition: Gaps in Data Ships’ sub-surface readings (discrete) Satellite data (continuous)

Knowledge Acquisition: Gaps in Data

Knowledge Acquisition: Challenges Making sense of all the available data (consultations with Dr John Field and Laurent Drapeau) Making sense of all the available data (consultations with Dr John Field and Laurent Drapeau) Correlating the 2D continuous satellite data to 3D discrete ships’ sub-surface profile Correlating the 2D continuous satellite data to 3D discrete ships’ sub-surface profile Representing all the data in a form easily used by the Bayesian Network Representing all the data in a form easily used by the Bayesian Network Integration of disparate data Integration of disparate data

Knowledge Representation “ A search for formal ways to describe knowledge presented in informal terms (a prerequisite for its handling as computation)” “ A search for formal ways to describe knowledge presented in informal terms (a prerequisite for its handling as computation)”encyclopedia.laborlawtalk.com/Representation Relating to this project: Relating to this project: –Need to find causal relationships between environment variables –Represent those relationships in a Bayesian Network –Store the data in a database so that it will be easy for the Inference and Learning Engines of the Bayesian Network to Manipulate. –Need to consider the temporal aspects of the data

Knowledge Representation: Causal Relationships Primary Plankton Production Many variables that influence plankton production: Chlorophyll Surface Temp Wind Current Chlorophyll Surface Temp Wind

Knowledge Representation: Bayesian Network Directed graphical model Directed graphical model Each node represents influencing variable Each node represents influencing variable An edge from one node to another represents causal relationship between those nodes An edge from one node to another represents causal relationship between those nodes Create Bayesian network structure based on the most relevant relationships found between the variable Create Bayesian network structure based on the most relevant relationships found between the variable

Knowledge Representation: Temporal aspects Need to divide data up into time steps Each time step is dependant on previous step t + 1tt + 2

Learning Engine Each Node of the Bayesian network will have a Conditional Probability Table (CPT) Each Node of the Bayesian network will have a Conditional Probability Table (CPT) Learning engine will implement an algorithm to update the probabilities in each of these tables Learning engine will implement an algorithm to update the probabilities in each of these tables –nine years of satellite and ship data will be used in training the system

Inference Engine The inference engine will be responsible for calculating the probability of a certain sequence of observations given certain input parameters The inference engine will be responsible for calculating the probability of a certain sequence of observations given certain input parameters

Testing Nine years of sub-surface data will be used to train the system. Compare the predicted results for the tenth year against the recorded results for that year. The project will be a success if predictions are very similar to those that were recorded.

Representing Bayesian Networks using Topic Maps

Topic Maps: Overview Brief introduction to topic maps and hypergraphs Brief introduction to topic maps and hypergraphs Applying topic maps to the system Applying topic maps to the system Testing Testing Challenges Challenges

Topic Maps Topic maps provide means for indexing data Topic maps provide means for indexing data ISO standard for describing knowledge structures and associating them with information resources. ISO standard for describing knowledge structures and associating them with information resources.

Topic Map Structure Topic Topic –Anything, subject, entity, concept Occurrence Occurrence –Link to information about topic Association Association –Relationships between topics

Topic Map Structure Occurrence Topic Association

Representing Topic Maps Hypergraphs Hypergraphs  hypergraph is a graph that can have smaller graphs (subgraphs) imbedded within itself

Applying Topic Maps Bayesian Network Bayesian Network –Topics will represent nodes in the network –Associations represent relationships between nodes in the network –Occurrences will link to info about node Future System Future System –Web application linking topic maps for different regions of the ocean

Testing Qualitative approach Qualitative approach Low-Fi prototypes to test intuitiveness of proposed interface to Bayesian Network Low-Fi prototypes to test intuitiveness of proposed interface to Bayesian Network Test with the intended users of the system Test with the intended users of the system

Challenges Representing temporal information using topic maps Representing temporal information using topic maps Representing Bayesian Network relationships using topic maps Representing Bayesian Network relationships using topic maps

SUMMARY Represent data in a formal way using knowledge acquisition and representation Represent data in a formal way using knowledge acquisition and representation Research the viability of using Bayesian Networks as a prediction mechanism Research the viability of using Bayesian Networks as a prediction mechanism Research the viability of using topic maps for intuitively representing Bayesian Networks Research the viability of using topic maps for intuitively representing Bayesian Networks

References Pepper, S. (2002), ”The TAO of Topic Maps, Finding the Way in the Age of Infoglut”, retrieved 01/06/2005, URL: als/tao.html Pepper, S. (2002), ”The TAO of Topic Maps, Finding the Way in the Age of Infoglut”, retrieved 01/06/2005, URL: als/tao.html als/tao.html als/tao.html