Computer Representation of Venn and Euler Diagrams Diunuge B. Wijesinghe, Surangika Ranathunga, Gihan Dias Department of Computer Science and Engineering,

Slides:



Advertisements
Similar presentations
STEM 2012 Automatically assessing free-form diagrams in E-assessment systems Pete Thomas Kevin Waugh and Neil Smith Open University, UK.
Advertisements

New Technologies Supporting Technical Intelligence Anthony Trippe, 221 st ACS National Meeting.
Presented by: GROUP 7 Gayathri Gandhamuneni & Yumeng Wang.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Multi-Phase Reasoning of temporal semantic knowledge Sakirulai O. Isiaq and Taha Osman School of Computer and Informatics Nottingham Trent University Nottingham.
Data Visualization STAT 890, STAT 442, CM 462
AN INTRODUCTION TO PRAAT Tina John M.A. Institute of Phonetics and digital Speech Processing - University Kiel Institute of Phonetics and Speech Processing.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Växjö University Joakim Nivre Växjö University. 2 Who? Växjö University (800) School of Mathematics and Systems Engineering (120) Computer Science division.
1 Info 1409 Systems Analysis & Design Module Lecture 8 – Modelling tools and techniques HND Year /9 De Montfort University.
Understanding SVG Diagrams Interim Presentation Vivek Ramakrishna Honours Program, Computer Science
Document Image Analysis CSE 717 An Introduction. Document Image Analysis  DIA is the theory and practice of recovering the symbol structures of digital.
Information Visualisation Praminda Caleb-Solly. Learning Objectives Gain an understanding of the benefits of information visualisation Explore ways of.
Instructor : Dr. K. R. Rao Presented by: Rajesh Radhakrishnan.
The Program Design Phases
Science and Engineering Practices
Software Product Lines Krishna Anusha, Eturi. Introduction: A software product line is a set of software systems developed by a company that share a common.
Programming by Example using Least General Generalizations Mohammad Raza, Sumit Gulwani & Natasa Milic-Frayling Microsoft Research.
4/20/2017.
OOSE 01/17 Institute of Computer Science and Information Engineering, National Cheng Kung University Member:Q 薛弘志 P 蔡文豪 F 周詩御.
Aniko T. Valko, Keymodule Ltd.
Clustering methods Course code: Pasi Fränti Speech & Image Processing Unit School of Computing University of Eastern Finland Joensuu,
Computers in Healthcare Jinbo Bi Department of Computer Science and Engineering Connecticut Institute for Clinical and Translational Research University.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Multimodal Alignment of Scholarly Documents and Their Presentations Bamdad Bahrani JCDL 2013 Submission Feb 2013.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
CSCE 548 Secure Software Development Final Exam – Review.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Abstract We present two Model Driven Engineering (MDE) tools, namely the Eclipse Modeling Framework (EMF) and Umple. We identify the structure and characteristic.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Visualizing Ontology Components through Self-Organizing.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
DataBase and Information System … on Web The term information system refers to a system of persons, data records and activities that process the data.
CS1 Assessment Using Memory Diagrams Mark A. Holliday David R. Luginbuhl Dept of Mathematics and Computer Science Western Carolina University SIGCSE Technical.
Semi-Automatic Quality Assessment of Linked Data without Requiring Ontology Saemi Jang, Megawati, Jiyeon Choi, and Mun Yong Yi KIRD, KAIST NLP&DBPEDIA.
Some questions -What is metadata? -Data about data.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Ontology based Information Extraction
Volgograd State Technical University Applied Computational Linguistic Society Undergraduate and post-graduate scientific researches under the direction.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Yu, et al.’s “A Model-Driven Development Framework for Enterprise Web Services” In proceedings of the 10 th IEEE Intl Enterprise Distributed Object Computing.
Advanced Science and Technology Letters Vol.28 (EEC 2013), pp Fuzzy Technique for Color Quality Transformation.
User Errors in Formulating Queries and IR Techniques to Overcome Them Birger Larsen Information Interaction and Information Architecture Royal School of.
COMPUTER SYSTEM FUNDAMENTAL Genetic Computer School INTRODUCTION TO ARTIFICIAL INTELLIGENCE LESSON 11.
Kadupitiya JCS Dr. Surangika Ranathunga Prof. Gihan Dias Department of Computer Science and Engineering University of Moratuwa Sri Lanka.
Table 1. Τhe grid analysis with conceptual categories and subcategories by NRC (2012) Tsetsos Stavros et al. The Scientific Practices on the Science’s.
Information Retrieval in Practice
Agenda Preliminaries Motivation and Research questions Exploring GLL
What Is Cluster Analysis?
A. M. R. R. Bandara & L. Ranathunga
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
How Students “Measure Up”: Creation of an Assessment Tool for CS1
Advantages of ICT over Manual Methods of Processing Data
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
© The Author(s) Published by Science and Education Publishing.
课程名 编译原理 Compiling Techniques
Lecture 2 Introduction to Programming
Potter’s Wheel: An Interactive Data Cleaning System
Social Knowledge Mining
Finding Clusters within a Class to Improve Classification Accuracy
Query Construct Interfaces of RDF Data an introduction
CSc4730/6730 Scientific Visualization
MATLAB – What Is It ? Name is from matrix laboratory Powerful tool for
MATLAB – What Is It ? Name is from matrix laboratory Powerful tool for
Aniko T. Valko, Keymodule Ltd.
How to publish in a format that enhances literature-based discovery?
Requirement Analysis using
George Bebis and Wenjing Li Computer Vision Laboratory
Definition of a Model.
Presentation transcript:

Computer Representation of Venn and Euler Diagrams Diunuge B. Wijesinghe, Surangika Ranathunga, Gihan Dias Department of Computer Science and Engineering, University of Moratuwa, Sri Lanka {diunuge.10, surangika,

Introduction  Diagrams are a very important communication medium  Mathematical diagram understanding is a complex challenge  Diagram understanding is an important pre-requisite of various fields such as image database systems, and educational diagram grading systems  There is no significant research done to interpret Venn and Euler diagrams  We address the problem of computer understanding of Venn and Euler diagrams that are available in vector format

Motivation  Venn & Euler diagrams are a significant part of Mathematics  Especially in secondary-level exams such as London O/L and SAT Mathematics  For the automatic assessment of diagrams, diagram interpretation is the important first step  For Venn & Euler diagrams, such an interpretation method is not available

Previous Work  Futrelle et al [1] presented a diagram understanding system to interpret diagrams based on constraint grammars. The system is capable of handling x-y graphs and gene diagrams in Biological domain [2].  Huang et al [3, 4] developed a system that can understand chart images.  Burton et al [5] introduced an abstract syntax for Euler diagram representation  Thomas et al [6, 7, and 8] developed a computer aided assessment system that can handle graph based diagrams such as Entity- Relationship diagrams and flow charts

Venn & Euler Diagram Parser  Venn & Euler diagrams with Jordan curves are handled.  Sets are represented with Rectangles, Circles & Ellipses  Each set is represented with only one curve  System accepts SVG vector images as the input diagram  Some diagrammatic systems use their own image structures  SVG diagram is parsed & domain related information is extracted  Interpreted diagram is given as XML output

Venn & Euler Diagram Parser – Text Association  Text association is the most difficult part of the research  Have to deal with text ambiguities  Have to deal with human mistakes  Close distance parameters are tuned with reference to minimal font size of an image  Text-area association depends on the centroid, size of the area and the text location

Diagram Representation: Mathematical Model Abstract Syntax (Extended the syntax introduced by Burton et al [5])  Labels = {A, B, C}  Curves = {c a, c b, c c }  Regions(Called Zones) = {{~c a.~c b.~c c }, {~c a.~c b.c c }, {~c a.c b.~c c }, {~c a.c b.c c }, {c a.~c b.~c c }, {c a.~c b.c c }, {c a.c b.~c c }, {c a.c b.c c }}  No of elements = {{c a =100}, {c b =80}, {c c =30}, {c a.c b =15}, {~c a.~c b.c c =5}, {~c a.c b.~c c =x}, {c a.~c b.c c =8}, {c a.c b.c c =10}}  Shaded Zones ={c a } Venn Diagram in SVG file format

Diagram Representation Contd. Venn Diagram in SVG file format Venn Diagram after parsing

Evaluation  Venn parser is tested and tuned using the Venn diagrams gathered from university undergraduates, which are drawn directly using an SVG editor  For the evaluation, collected hand-written answer scripts (includes GCE O/L questions) that collectively contained 3 Venn and 4 Euler diagrams from university undergraduates and grade 10 school students  Collected 77 diagrams are converted to SVG using a editor & given to the parser  Parser results are validated for each diagram manually

Results Diagram No. Diagram Type Correctly Parsed Incorrectly Parsed Accuracy 1 Venn % 2 Venn120100% 3 Euler110100% 4 Euler110100% 5 Euler3260.0% 6 Euler5362.5% 7 Venn10190%

Results  Diagram 1 and 2 contain two sets and remaining diagrams contains 3 sets.  Drawings for diagram 1 were collected from school student answer scripts and remainder were collected from university undergraduates.  Diagram 5 and 6 had fewer answers because of the higher complexity of the question.  All of the parsing errors are due to the ambiguity of text labels and some text labels being too far away from the arrows that are supposed to be arrow labels.  System showed an accuracy of 89.61%.

Future Work  Currently sets drawn using rectangles, circles, and ellipses are supported. This method can be extended to apply for sets drawn with any type of Jordan curves.  In some Venn and Euler representations, one set label can have more than one Jordan curve. This method can be extended to address those representations.  Machine learning approaches can be used to improve the label classification and clustering.  Need annotated data

Discussion  Diagram understanding is a complex problem in the computer research field.  Structured diagrams can be dealt with using domain ontology.  Unstructured drawing understanding is a very complex problem.  In particular, dealing with human drawn diagrams is difficult due to the ambiguity problems and human errors.

Acknowledgement  This research is funded by the 2015 University of Moratuwa Senate Research Grant (SRC), and DL4D 2016 research grant.

References 1. R. P. Futrelle, I. Kakadiaris, J. Alexander, C. M. Carriero, N. Nikolakis, and J. M. Futrelle, “ Understanding diagrams in technical documents,” in IEEE Computer, 1992, Vol. 25(7), pp R. P. Futrelle and N. Nikolakis, “ Efficient analysis of complex diagrams using constraint-based parsing. In Document Analysis and Recognition,” in Proceedings of the Third International Conference on IEEE, 1995, Vol. 2, pp W. Huang, C. Tan, and W. K. Leow, “ Associating text and graphics for scientific chart understanding. In Document Analysis and Recognition,” in Proceedings of the Eighth International Conference on IEEE, 2005, pp W. Huang, C. Tan, “ A System for Understanding Imaged Infographics and Its Applications,” in Proceedings of the 2007 ACM symposium on Document engineering - DocEng '07, 2007.

References 5. J. Burton, G.Stapleton, J. Howse, and P. Chapman, " Visualizing concepts with Euler diagrams," in International Conference on Theory and Application of Diagrams, Springer Berlin Heidelberg, N. Smith, P. Thomas, and K. Waugh, " Interpreting imprecise diagrams," Diagrammatic Representation and Inference. Springer Berlin Heidelberg, P.Thomas, K. Waugh, and N. Smith, " Experiments in the automatic marking of ER-diagrams," ACM SIGCSE Bulletin, Vol 37(3), pp , P. Thomas, K. Waugh, and N. Smith, " Using patterns in the automatic marking of ER-diagrams," ACM SIGCSE Bulletin. Vol. 38(3), 2006.

Thank You..!