Victor Eijkhout and Erika Fuentes, ICL, University of Tennessee SuperComputing 2003 A Proposed Standard for Numerical Metadata.

Slides:



Advertisements
Similar presentations
Integration of MBSE and Virtual Engineering for Detailed Design
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
PROGRAMMING LANGUAGE (JAVA) UNIT 42 BY ROBERT BUTTERFIELD TELEPHONE Data Structures and Algorithms.
Ch:8 Design Concepts S.W Design should have following quality attribute: Functionality Usability Reliability Performance Supportability (extensibility,
Architecture Representation
Towards a repository – independent implementation of Digital Object Prototypes K. Saidis 1, G. Pyrounakis 2 1 Department of Informatics And Telecommunications.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
Chapter 1: The Database Environment
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Simulation.
Introduction to Databases
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
Introduction to Databases Transparencies
Chapter 1: The Database Environment
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
Chapter 1 1 © Prentice Hall, 2002 Database Design Dr. Bijoy Bordoloi Introduction to Database Processing.
Chapter 1 1 © Prentice Hall, 2002 Database Design Dr. Bijoy Bordoloi Introduction to Database Processing.
Software Engineering Muhammad Fahad Khan
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
Cracow Grid Workshop 2003 Institute of Computer Science AGH A Concept of a Monitoring Infrastructure for Workflow-Based Grid Applications Bartosz Baliś,
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
Data Structures & AlgorithmsIT 0501 Algorithm Analysis I.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
XML in Development of Distributed Systems Tooling Programming Runtime.
©Ian Sommerville 2000, Mejia-Alvarez 2009 Slide 1 Software Processes l Coherent sets of activities for specifying, designing, implementing and testing.
Introduction: Databases and Database Users
Architecture for a Database System
Assessing Quality for Integration Based Data M. Denk, W. Grossmann Institute for Scientific Computing.
Computational Physics Introduction 3/30/11. Goals  Calculate solutions to physics problems  All physics problems can be formulated mathematically. 
Hyper/J and Concern Manipulation Environment. The need for AOSD tools and development environment AOSD requires a variety of tools Life cycle – support.
SaveUML System design. System overview Possible...
MIS 327 Database Management system 1 MIS 327: DBMS Dr. Monther Tarawneh Dr. Monther Tarawneh Week 2: Basic Concepts.
Andrew S. Budarevsky Adaptive Application Data Management Overview.
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY WITHIN THE (SEMANTIC) WEB Matthew Williams
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
ModelPedia Model Driven Engineering Graphical User Interfaces for Web 2.0 Sites Centro de Informática – CIn/UFPe ORCAS Group Eclipse GMF Fábio M. Pereira.
1 Introduction to Databases. 2 Examples of Database Applications u Purchases from the supermarket u Purchases using your credit card u Booking a holiday.
1 Chapter 1 Introduction to Databases Transparencies.
1 CSCD 326 Data Structures I Software Design. 2 The Software Life Cycle 1. Specification 2. Design 3. Risk Analysis 4. Verification 5. Coding 6. Testing.
Software Design Process
Volgograd State Technical University Applied Computational Linguistic Society Undergraduate and post-graduate scientific researches under the direction.
WIGOS Data model – standards introduction.
February 8, 2006copyright Thomas Pole , all rights reserved 1 Lecture 3: Reusable Software Packaging: Source Code and Text Chapter 2: Dealing.
WISE Working Group D September 2009, Brussels Jon Maidens.
Requirement engineering & Requirement tasks/Management. 1Prepared By:Jay A.Dave.
Copyright © 2007, Oracle. All rights reserved. Managing Items and Item Catalogs.
Mathematical Service Matching Using Description Logic and OWL Kamelia Asadzadeh Manjili
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Victoria Ibarra Mat:  Generally, Computer hardware is divided into four main functional areas. These are:  Input devices Input devices  Output.
Perspective: Information Processing and Computation Week 11 Lecture notes INF 380E: Perspectives on Information Spring
Laurea Triennale in Informatica – Corso di Ingegneria del Software I – A.A. 2006/2007 Andrea Polini VII. System Specification (I)
Constructing Knowledge Bases for E-Learning Using Protégé 2000 and Web Services Presented by: Fuhua Oscar Lin Authors: Mike Hogeboom, Fuhua Oscar Lin,
Introduction to Databases Transparencies
Physical Data Model – step-by-step instructions and template
Introduction to Databases
Introduction to Databases Connolly and Begg
Introduction to Databases
structures and their relationships." - Linus Torvalds
Chapter 2 Database Environment.
Introduction to Databases
Introduction to Databases
Metadata The metadata contains
Vocabulary Algorithm - A precise sequence of instructions for processes that can be executed by a computer Low level programming language: A programming.
Software Modelling and Design
Software Development Process Using UML Recap
structures and their relationships." - Linus Torvalds
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
Presentation transcript:

Victor Eijkhout and Erika Fuentes, ICL, University of Tennessee SuperComputing 2003 A Proposed Standard for Numerical Metadata

2003/11Eijkhout / Metadata / SC20032 Introduction Many numerical routines have parameters with settings that depend on the application context of the routine. Computing the parameter settings is now part of the numerical software, or is done by human intervention. We argue that this should be done by a separate software analysis component, and automatically. This, however, requires a higher level description of the application data. We formalize this by introducing our Numerical Metadata. Having analysis modules, and a formal almost-semantic description of numerical data, makes Component-based Programming Frameworks possible. We also show the feasibility of the automatic analysis approach.

2003/11Eijkhout / Metadata / SC20033 Traditional flow of control Physics application produces data Numerical app analysis data to find relevant characteristic, uses characteristic to decide on algorithm and set its parameters => only `data’ interface needed

2003/11Eijkhout / Metadata / SC20034 Improved scenario Physics as before Analysis module finds characteristics Numerical algorithm choice and setting of parameters => also interface needed for characteristics => metadata

2003/11Eijkhout / Metadata / SC20035 Usage scenario 1 Example: GMRES restart length as function of indefinitenes s

2003/11Eijkhout / Metadata / SC20036 Usage scenario 2 Example: estimate fill- in, use iterative method if data wouldn’t fit

2003/11Eijkhout / Metadata / SC20037 Usage scenario 3

2003/11Eijkhout / Metadata / SC20038 Numerical experimentation is held back by lack of available characteristics Separately available analysis modules should remedy that Numerical experimentation Many relevant matrix quantities are hard to compute and hard to implement: enclosing ellipse of the spectrum, departure from normality, &c. Availability of independent analysis modules should encourage further experimentation on the part of numerical analysists.

2003/11Eijkhout / Metadata / SC20039 Component-based Programming Frameworks  Applications: large, complex scientific applications (Composite Applications) that couple a variety of single-focus, scientific algorithms (Element Applications) along with other software support (e.g. visualization) Using behavioural metadata to assist in integrating single-focus algorithms into complex applications Metadata as semantic part of interface spec of numerical components (with Thomas Eidson)

2003/11Eijkhout / Metadata / SC200310

2003/11Eijkhout / Metadata / SC Practical access to metadata Store in XML format; use Schema for validation; XSL for display API for conversion XML internal data structure API for retrieval / insertion of metadata We need two-fold access to the metadata: inside a code and in more permanent form. Conversion between the two forms

2003/11Eijkhout / Metadata / SC API: creation routines

2003/11Eijkhout / Metadata / SC API: Access routines

2003/11Eijkhout / Metadata / SC API: Conversion routines

2003/11Eijkhout / Metadata / SC Proposed metadata category 1

2003/11Eijkhout / Metadata / SC Proposed metadata category 2

2003/11Eijkhout / Metadata / SC Proposed metadata category 3

2003/11Eijkhout / Metadata / SC Proposed metadata category 4

2003/11Eijkhout / Metadata / SC Further categories Custom categories Application properties: discretisation, mesh Even though we propose a core set of categories, our storage format, and the libraries implementing it, are general and open-ended. Thus we hope that people will propose categories that are inspired by other views of the same kind of data, or by different problem areas altogether. In particular, categories that describe the application-derived properties of numerical data would be very useful in the analysis modules we proposed.

2003/11Eijkhout / Metadata / SC Matrix metadata, issues Duplication of elements (e.g., simple->nnz == matrix_market->nnz) Relations between elements (e.g., if M-matrix then definite) Inheritance / derivation (e.g., dummy rows from bc, fictitious domain) It is clear that certain pieces of information will appear in more than one category, especially if third-parties will start proposing their own categories. We want to introduce mechanisms for resolving or enforcing such implied relations. Also, if one matrix is derived from another, there should be a linkage mechanism so that categories of metadata can be inherited where this is mathematically justified

2003/11Eijkhout / Metadata / SC Matrix metadata, more issues Extensions beyond matrices and linear systems Language interoperability The current proposal was clearly inspired by linear system solving, and the proposed categories are applicable to matrices, mostly in that context. However, the storage format is general enough to cover other numerical application areas and other types of data. The library we have written uses and targets C. This obviously needs to be extended to Fortran and Java. We will use Babel for this.

2003/11Eijkhout / Metadata / SC Proof of concept Predicting partitioning/distribution of linear solve Analysis modules for structural, scalar, spectral categories of metadata

2003/11Eijkhout / Metadata / SC Proof of concept Heuristic: choice of permutation & partitioning before preconditioning Statistical analysis (parametric model, Bayesian decision rule) Analysis modules for features: bandwidth, sparsity, field-of-values We ran exhaustive tests of a number of iterative methods on a collection of matrices. The results are used in a parametric model to classify the matrices. Dividing the test collection into a training and test set allows us to assess the predictive value of the model. Three different methods can be predicted with accuracies 30,90,30%. Average gain is approx factor of 5 (correct prediction over worst case). Misprediction penalty is only 60%, but still factor of 2 gain over worst method.

2003/11Eijkhout / Metadata / SC Software Metadata library based on libxml Library, XML schema, XSL style sheet Currently only C support See