VERA AULIA ( 813580 ).  Oil palm is one of the major edible oil traded in the global market.  Oil palm tree will start to produce fruits within three.

Slides:



Advertisements
Similar presentations
Database Planning, Design, and Administration
Advertisements

Semantic Matching of candidates’ profile with job data from Linkedln PRESENTED BY: TING XIAO SARABPREET KAUR DHILLON.
Semantic Web Services Peter Bartalos. 2 Dr. Jorge Cardoso and Dr. Amit Sheth
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Chapter 1: Data Models and DBMS Architecture Title: What Goes Around Comes Around Authors: M. Stonebraker, J. Hellerstein Pages: 2-40.
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
Article by: Farshad Hakimpour, Andreas Geppert Article Summary by Mark Vickers.
Mapping Techniques and Visualization of Statistical Indicators Haitham Zeidan Palestinian Central Bureau of Statistics IAOS 2014 Conference.
Building Knowledge-Driven DSS and Mining Data
Methodology Conceptual Database Design
Lecture Nine Database Planning, Design, and Administration
The information integration wizard (Iwiz) project Report on work in progress Joachim Hammer Presented by Muhammed Al-Muhammed.
Lecture Two Database Environment Based on Chapter Two of this book:
1 Information Integration and Source Wrapping Jose Luis Ambite, USC/ISI.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Carlos Lamsfus. ISWDS 2005 Galway, November 7th 2005 CENTRO DE TECNOLOGÍAS DE INTERACCIÓN VISUAL Y COMUNICACIONES VISUAL INTERACTION AND COMMUNICATIONS.
Bina Nusantara 2 C H A P T E R INFORMATION SYSTEM BUILDING BLOCKS.
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Overview of the Database Development Process
A Unified Framework for the Semantic Integration of XML Databases
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
IST 210 Database Design Process IST 210 Todd S. Bacastow January 2005.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
“Solving Data Inconsistencies and Data Integration with a Data Quality Manager” Presented by Maria del Pilar Angeles, Lachlan M.MacKinnon School of Mathematical.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Methodology - Conceptual Database Design
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Data Access and Security in Multiple Heterogeneous Databases Afroz Deepti.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Quality issues in Spatial Databases M. Mostafavi, G. Edwards, R. Jeansoulin CRG & GEOIDE & REVIGIS Victoria, May 2003.
DATABASE MANAGEMENT SYSTEM ARCHITECTURE
ReSeTrus Development of a digital library technology based on redundancy elimination and semantic elevation, with special emphasis on trust management.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
Chapter 4 Decision Support System & Artificial Intelligence.
SQL Based Knowledge Representation And Knowledge Editor UMAIR ABDULLAH AFTAB AHMED MOHAMMAD JAMIL SAWAR (Presented by Lei Jiang)
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Design Reuse Earlier we have covered the re-usable Architectural Styles as design patterns for High-Level Design. At mid-level and low-level, design patterns.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu,
ATU Decision Support System. Overview Decision Support System – what is it? Definition Main components Illustrative Scenario Ontology / Knowledge Base.
Object storage and object interoperability
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
1 Chapter 2 Database Environment Pearson Education © 2009.
1 Integration of data sources Patrick Lambrix Department of Computer and Information Science Linköpings universitet.
Database Environment Chapter 2. The Three-Level ANSI-SPARC Architecture External Level Conceptual Level Internal Level Physical Data.
ENHANCEMENT OF BIG DATA INTEGRATION METHOD MAISARAH BINTI ZORKEFLEE
Of 24 lecture 11: ontology – mediation, merging & aligning.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 6-1 Chapter 6 Decision Support System Development.
SZRZ6014 Research Methodology Prepared by: Aminat Adebola Adeyemo Study of high-dimensional data for data integration.
Introduction to Machine Learning, its potential usage in network area,
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
The Semantic Web By: Maulik Parikh.
Cloud based linked data platform for Structural Engineering Experiment
Quality Assessment in the framework of Map Generalization
Database management concepts
Metadata Construction in Collaborative Research Networks
Ontology-Based Approaches to Data Integration
Database management concepts
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presentation transcript:

VERA AULIA ( )

 Oil palm is one of the major edible oil traded in the global market.  Oil palm tree will start to produce fruits within three years after planting and has 25 years life span with fruit production around 13 fruit bunches every year.  The oil palm plantation is rich repository of biodiversity which can be found various flora and fauna in the oil palm environment.

 There are more than hundred common insects and mammalian pests which intrusion and damage oil palms, such as bagworm  Most plantation owners did not realize the disease struck because of slight management can incurred huge loses of production  Oil palm disease increase every year in new type of worm, weeds and pests, it will make the data being larger and complex

What is data integration? Where it has been applied? Why use data integration? Problem in data integration

 Data integration can be defined as combination of data from different sources and be presented to the users in unified form. (Calvanese & Giacomo, 2005)

 Websites, education, social networks, healthcare, location-based services, communication and astronomy. (Dong & Srivastava, 2013; Zhang, 2013)

 Provides convenience to the users that need fast, current and clean data. (Louie et al., 2007)

Inconsistencies data often find in larger datasets that can affect the knowledge content, data items, information, and meta-knowledge (Zhang, 2013).  Temporal inconsistencies ◦ when time interval of two inconsistent event in temporal attributes datasets overlapping (Bleiholder & Naumann, 2011; Zhang, 2013)

 Text inconsistencies ◦ when two text referring to same event or entity be co-associate (Zhang, 2013)  Spatial Inconsistencies ◦ when datasets keep changing spatial distance in time (Cali et. al., 2013; Zhang, 2013)

 To develop a method that could integrate heterogeneous data source ◦ To identify appropriate methods to solve inconsistencies data in data integration ◦ To implement suitable methods that can be proposed in oil palm data integration ◦ To analyze and evaluate the method in development system

Inconsistency Data Method Solution Implementation of The Method Use In Solving Inconsistencies Data Evaluate the method in the development system

 Inconsistency data is one of factors that contribute to data integration problem (Jeffery et al., 2013).  Methods used: Ontology based approach (Louie et al., 2007; Nemirovski et al., 2013). Schema mapping and matching based approach (Do, 2007). Fuzzy multi-attribute decision making based approach (Wang et al., 2011). Information quality criteria approach (Angeles & Mackinno, 2011).  Ontology based approach is appropriate (Wache et al., 2001; Buccella et al., 2005).

 Ontology based approach Petroleum Ontology (Nimmagadda & Dreher, 2013) TBox coding (Nemirovski et al., 2013). Mappings between ontology (Wache et al., 2001) Ontology for the generation of global schemas (Hakimpour & Geppert, 2001).  Advantages of using ontology: Steady idea of the interface (Buccella et al., 2001). Language is easily communicated (Buccella et al., 2005). Increases the computational efficiency (Nemirovski et al., 2013).

 Nemirovski et al. (2013) TBox Intelligibility Mappings compliance Computational efficiency  Buccella et al. (2005) Architecture Semantic heterogeneity Query resolution

 (Luo et al., 2008) Comparing ontology model  modeling goal  model purpose  common ground within same domain  The results of evaluation can presented in appropriate accuracy for decision-making (Buccella et al.,2005)

Research Phase Data Collection System Model Performance Evaluation

 There are three phases of research methodology that need to be completed to achieve the research objectives ◦ Data collection, ◦ Framework model, ◦ Performance evaluation.

 Data of pests, weeds and diseases for oil palm data will collect from reference book Malaysian Palm Oil Board (MPOB)  Federal Land Consolidation and Rehabilitation Authority (FELCRA) Kedah

 ONv = (C, R, t) ◦ ON = ontology name, ◦ v = version number, ◦ C = {c1, …, cm} for concepts, ◦ R = {r1, …,rn} for relationships ◦ t = timestamp  OBSERVER (Ontology Based System Enhanced with Relationship for Vocabulary Heterogeneity Resolution)

 Ontologies and data integration ◦ ontology have three main approach which is single approach, multiple approach and hybrid approach (Wache et al.,2001)

 Semantic support for DSS  ontology can be used to classify different diseases by using the rule base within an expert system  Semantic query enhancement and optimization ◦ Query enhancement allows the system to provide more targeted information ◦ Query optimization is semantically altering the basic query to find a more adequate execution path within the database

 DESMET framework approach (Buccella et al., 2005 and Mealy & Strooper, 2006)

 Architecture: ◦ Information Source: OBSERVER support decision dynamic information sources and support database and HTML pages ◦ Architecture type: A wrapper imitates users’ behavior.

 Semantic Heterogeneity ◦ Ontology use: OBSERVER deals with inclusive relationship such as synonym, hypernym, hyponym, overlap, disjoint and covering ◦ Language : allows user to use any language (Description Logics)

 Query ◦ User participation: Browser performance with users query ◦ Query Plan:  First step: query construction  Access  Underlying Data  Controlled Query Expansion  Second step: Query Processor  Mapping Information  Last step: original query translated into term of user ontology ◦ Optimization: provides basic estimate information of loss measure

Q&A