A.Z. Fazliev, N.A.Lavrentiev, A.I.Privezentsev V.E. Zuev Institute of Atmospheric Optics SB RAS, Academician Zuev Square 1, Tomsk 634021, Russia E-mail:

Slides:



Advertisements
Similar presentations
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Advertisements

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Designing Multimedia with Fuzzy Logic Enrique Diaz de Leon * Rene V. Mayorga ** Paul D. Guild *** * ITESM, Guadalajara Campus, Mexico ** Faculty of Engineering,
An Introduction to Programming and Object Oriented Design using Java 2 nd Edition. May 2004 Jaime Niño Frederick Hosch Chapter 0 : Introduction to Object.
OASIS Reference Model for Service Oriented Architecture 1.0
Software Testing and Quality Assurance
7M701 1 Software Engineering Object-oriented Design Sommerville, Ian (2001) Software Engineering, 6 th edition: Chapter 12 )
Theoretical work on the water monomer and dimer Matt Barber Jonathan Tennyson University College London September 2009.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
S&MPO linelist of 16 O 3 in the range 6000 – 7000 cm -1. M.-R. De Backer-Barilly #, Semen N. Mikhailenko*, Yurii Babikov*, Alain Campargue §, Samir Kassi.
Some Details of the Upcoming HITRAN Updates for the New Edition of 2008 Laurence S. Rothman, Iouli E. Gordon Harvard-Smithsonian Center for Astrophysics,
A. Barbe, M.R. De Backer-Barilly, Vl.G. Tyuterev, A. Campargue 1, S.Kassi 1 Updated line-list of 16 O 3 in the range 5860 – 7000 cm -1 deduced from CRDS.
 ( ) 0+   ( ) 0–  4 1 Results at 2.5 microns 2 +( ) 1 II (
Information System to Access HITRAN via the Internet Yu. L. Babikov, S. N. Mikhailenko, S. A. Tashkun, V.E. Zuev Institute of Atmospheric Optics, Tomsk,
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Water spectroscopy with a Distributed Information System A.Z.Fazliev 1, A.G.Császár 2, J.Tennyson 3 1. Institute of Atmospheric Optics SB.
The Software Product Life Cycle. Views of the Software Product Life Cycle  Management  Software engineering  Engineering design  Architectural design.
CSC230 Software Design (Engineering)
SPECTRA, an Internet Accessible Information System for Spectroscopy of Atmospheric Gases Semen MIKHAILENKO, Yurii BABIKOV, Vladimir.
Domain-Specific Software Engineering Alex Adamec.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
What is Software Architecture?
Experimental Energy Levels of HD 18 O and D 2 18 O S.N. MIKHAILENKO, O.V. NAUMENKO, S.A. TASHKUN Laboratory of Theoretical Spectroscopy, V.E. Zuev Institute.
1 A Flexible and Secure Deployment Framework for Distributed Applications Alan Dearle, Graham Kirby, Andrew McCarthy and Juan Carlos Diaz y Carballo School.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Dr. Jūratė Kuprienė Director for innovations and infrastructure development Workshop: Information services for research process , Rīga Research.
Chapter 8 Architecture Analysis. 8 – Architecture Analysis 8.1 Analysis Techniques 8.2 Quantitative Analysis  Performance Views  Performance.
ITEC224 Database Programming
SAMANVITHA RAMAYANAM 18 TH FEBRUARY 2010 CPE 691 LAYERED APPLICATION.
BUSINESS INFORMATICS descriptors presentation Vladimir Radevski, PhD Associated Professor Faculty of Contemporary Sciences and Technologies (CST) Linkoping.
From GEANT to Grid empowered Research Infrastructures ANTONELLA KARLSON DG INFSO Research Infrastructures Grids Information Day 25 March 2003 From GEANT.
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
New H 2 16 O measurements of line intensities around 1300 cm -1 and 8800 cm - 1 Oudot Charlotte Groupe de Spectrométrie Moléculaire et Atmosphérique Reims,
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Fazliev A.Z. Institute of atmospheric optics SB RAS, Tomsk Project of distributed informational system on molecular spectroscopy INTAS Workshop “Complete.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
Hot summer of HITRAN2008 I. E. Gordon L. S. Rothman.
1 Introduction to Software Engineering Lecture 1.
Notes of Rational Related cyt. 2 Outline 3 Capturing business requirements using use cases Practical principles  Find the right boundaries for your.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
The IUPAC Critical Evaluation of the Ro-vibrational Spectra of Water Vapor: Results for H 2 18 O, H 2 17 O, and HD 16 O Jonathan Tennyson University College.
Xinchuan Huang, 1 David W. Schwenke, 2 Timothy J. Lee 2 1 SETI Institute, Mountain View, CA 94043, USA 2 NASA Ames Research Center, Moffett Field, CA 94035,
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Evaluation of the Experimental and Theoretical Intensities of Water- Vapor Lines in the 2 µm Region Using Spectra from the Solar- Pointing FTS Iouli Gordon,
Building Simulation Model In this lecture, we are interested in whether a simulation model is accurate representation of the real system. We are interested.
The Geographic Information System of the European Commission (GISCO) By Albrecht Wirthmann, GISCO, Eurostat ESPON.
MICROWAVE SPECTRUM OF 12 C 16 O S.A. TASHKUN and S.N. MIKHAILENKO, Laboratory of Theoretical Spectroscopy, V.E. Zuev Institute of Atmospheric Optics, Zuev.
Line list of HD 18 O rotation-vibration transitions for atmospheric applications Semen MIKHAILENKO, Olga NAUMENKO, and Sergei TASHKUN Laboratory of Theoretical.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Towards a Reference Quality Model for Digital Libraries Maristella Agosti Nicola Ferro Edward A. Fox Marcos André Gonçalves Bárbara Lagoeiro Moreira.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
Lecture 13.  Failure mode: when team understands requirements but is unable to meet them.  To ensure that you are building the right system Continually.
Preliminary modeling of CH 3 D from 4000 to 4550 cm -1 A.V. Nikitin 1, L. R. Brown 2, K. Sung 2, M. Rey 3, Vl. G. Tyuterev 3, M. A. H. Smith 4, and A.W.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Lecture №4 METHODS OF RESEARCH. Method (Greek. methodos) - way of knowledge, the study of natural phenomena and social life. It is also a set of methods.
From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.
Mathematical Service Matching Using Description Logic and OWL Kamelia Asadzadeh Manjili
From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
CCNT Lab of Zhejiang University
Chapter 0 : Introduction to Object Oriented Design
Presentation transcript:

A.Z. Fazliev, N.A.Lavrentiev, A.I.Privezentsev V.E. Zuev Institute of Atmospheric Optics SB RAS, Academician Zuev Square 1, Tomsk , Russia HITRAN Conference, Cambridge, June 2010

Introduction e-Science. Service oriented architecture e-Science. Three layers information systems The Data, Information and Knowledge Lifecycle Architecture of Model of Quantitative Molecular Spectroscopy Data Validity Formal constraints Selection rules. Primary data sources Publication constraints Non-formal constraints Current State of HITRAN Conference, Cambridge, June 2010

e-Science. Service oriented architecture De Roure D., Jennings N., Shadbolt N. A Future e-Science Infrastructure // Report commissioned for EPSRC/DTI Core e-Science Programme p. HITRAN Conference, Cambridge, June 2010

e-Science. Three layers information systems De Roure D., Jennings N., Shadbolt N. A Future e-Science Infrastructure // Report commissioned for EPSRC/DTI Core e-Science Programme p. The Data-Computation Layer “As soon as computers are interconnected and communicating we have a distributed system, and the issues in designing, building and deploying distributed computer systems have now been explored over many years. First it positions the Grid within the bigger picture of distributed computing, asking whether it is subsumed by current solutions. Then we look in more detail at the requirements and currently deployed technologies in order to identify issues for the next generation of the infrastructure. Since much of the grid computing development has addressed the data-computation layer, this section particularly draws upon the work of that community.” The Information Layer “This layer is focus firstly on the Web. The Web’s information handling capabilities are clearly an important component of the e-Science infrastructure, and the web infrastructure is itself of interest as an example of a distributed system that has achieved global deployment. The second aspect addressed is support for collaboration, something which is key to e- Science. The information layer aspects build on the idea of a ‘collaboratory’, defined as a “centre without walls, in which the nation’s researchers can perform their research without regard to geographical location - interacting with colleagues, accessing instrumentation, sharing data and computational resource, and accessing information in digital libraries.” The Knowledge Layer “The aim of the knowledge layer is to act as an infrastructure to support the management and application of scientific knowledge to achieve particular types of goal and objective. In order to achieve this, it builds upon the services offered by the data-computation and information layers. The first thing to reiterate at this layer is the problem of the sheer scale of content we are dealing with. We recognise that the amount of data that the data grid is managing will be huge. By the time that data is equipped with meaning and turned into information we can expect order of magnitude reductions in the amount. However the amount of information remaining will certainly be enough to present us with a problem – a problem recognised as infosmog – the condition of having too much information to be able to take effective action or apply it in an appropriate fashion to a specific problem. Once information is delivered that is destined for a particular purpose, we are in the realm of the knowledge grid that is fundamentally concerned with abstracted and annotated content, with the management of scientific knowledge.” HITRAN Conference, Cambridge, June 2010

The Data, Information and Knowledge Lifecycle De Roure D., Jennings N., Shadbolt N. A Future e-Science Infrastructure // Report commissioned for EPSRC/DTI Core e-Science Programme p. Acquire Modelling Retrieve Publish Maintain The challenge of knowledge publishing or disseminating can be described as getting the right data, information and knowledge, in the right form, to the right person or system, at the right time …. HITRAN Conference, Cambridge, June 2010

What is the hierarchy of the problems ? 1.Lifecycle. Implement Distributed Information System that allows simplify for the investigator aquire, retrieve and publish data and information 2.Publish. Create Publishing Tools 3.Key question. Guarantee Data Validity 4.Constraints Types. 1. Restrictions on physical entity values (for instance, selection rules). Verification of the restrictions is identical to verification of statement 2. Existence (Publication) Restrictions ( ) Getting the right data, ….. - existential quantifier- universal quantifier S – spectroscopy domain, X- physical entity characterized by quantum numbers, Y – published data set HITRAN Conference, Cambridge, June 2010

Web-service of publications data base synchronization Web-service for the formation of a homogeneous set of inverse and direct tasks solutions properties in a distributed system Web-service for the formation of an ontology of molecular spectroscopy tasks’ solutions properties Interfaces Protégé interface Data - computation layer Information layer Knowledge layer System of direct and inverse spectroscopy problems’ solutions input Spectral functions calculation Formation of composite problems’ solutions Inference engine Logical consistency check Decomposition of problems’ solutions according to publications Description of non-calculable properties of molecular spectroscopy inverse and direct problems’ solutions Computation of calculated properties of direct and inverse spectroscopy problems’ solutions Composite solutions of spectroscopy tasks Primary solutions of inverse spectroscopy tasks Primary solutions of direct spectroscopy tasks Publications DB Ontology of spectroscopy tasks’ solutions properties Molecular spectroscopy tasks’ solutions properties Data Node Applications InterfacesWeb-services Node Architecture of Semantic Web approach HITRAN Conference, Cambridge, June 2010

Model of Quantitative Molecular Spectroscopy Isolated molecules spectral line parameters (T2) Isolated molecule physical characteristics (T1) Spectral line profile parameters (T3) Spectral functions calculation (T4) Direct Problems Inverse Problems Computations Measurements Two chains of problems are selected for domain Isolated molecule energy levels (T7) Einstein coefficients (T6) Interacting molecule spectral line parameters (ET) Spectral functions measurement (E) Quantum numbers assignment to spectral lines (T5)

Elementary solution of spectroscopic problem Elementary source characteristics Elementary source characteristics molecule – H 2 O the list of physical quantities – energy levels E (cm -1 ), Quantum numbers (v 1 v 2 v 3 J K a K c ), ……. publication - Schwenke D.W., New H 2 O Rovibrational Line Assignments. // Journal of Molecular Spectroscopy, 1998, v. 190, no. 2, p data - ……………………………………………………………… HITRAN Conference, Cambridge, June 2010 IUPAC Data group approach (information aspect)

Data Validity Formal constraints Data type – Quantum Numbers – natural numbers, … Intensity, Halfwidth, Frequency, Energy Levels – positive real numbers, …. Variation interval – 0 < wavenumbers < cm -1, cm/mol < intensity < cm/mol Selection rules - normal modes - k a +k c =J or J+1, ….. Publication constraint Whether data are published or not Other constraints (transitivity and antisymmetry axioms) ………………………………………….. Non-formal constraints Experts’ opinion XML OWL DL HITRAN Conference, Cambridge, June 2010

Selection rules. Primary data sources Problem Т1, Problem Т7Problem Т2, Problem Т6Problem Т3, Proble Т5 Direct Inverse H2OH2O 9(2), 30 (24) 5(0), 91 (47)5 (0), 183 (167) H 2 17 O 4(0), 19 (15) 5(1), 40 (31)4 (0), 19 (16) H 2 18 O 4(0), 18 (18) 5(1), 59 (35)4 (0), 29 (17) HDO 1(0), 32 (28) 3(0), 83 (56)2 (0), 8 (3) HD 17 O -, 3 (3) 2(0), 3 (3)2 (0), 6 (6) HD 18 O -, 5 (4) 2(0), 6 (6)2 (0), 7 (7) D2OD2O 1(1), 18 (8) 3(0), 38 (26)3 (0), 10 (7) D 2 17 O 1(0), 3 (3)2 (0), 1 (1) D 2 18 O 2(0), 6 (6)2 (0), 1 (1) 15(3), 125 (100) 28(2), 318 (207) 26(0), 264 (225) 9(2) Total number of data sources(Number of correct data sources) Privesentsev A.I., Ontological knowledge base implementation and software for information resources description in molecular spectroscopy, Tomsk State University, PhD Dissertation, 2009, 238 Pages HITRAN Conference, Cambridge, June 2010

Validity Publication constraints Informatics Restriction. RFC 2396: A (information) resource can be anything that has identity. Decomposition. Mathematical basis. Axiom of reflexivity: For each a, a=a. Physical constraint a st type of measurements --- > A1 a nd type of measurements -- > A2 Experimental accuracy Criteria of identity |A1 – A2| < The complete validated data set are published by IUPAC data group (J. Tennyson, P.F. Bernath, L.R. Brown, et al., IUPAC Critical Evaluation of the Rotational-Vibrational Spectra of Water Vapor. Part I. Energy Levels and Transition Wavenumbers for H 2 17 O and H 2 18 O, Journal of Quantitative Spectroscopy and Radiative Transfer, July 2009, V.110, no.9-10, P ) HITRAN Conference, Cambridge, June 2010

Decomposition. Hitran-2008 (H 2 18 O) H 2 18 O (N = 9753) (cm -1 ) Data source (inverse problem direct problem) Residual N= cm _StBe 1970_PoJo 2006_GoMaGuKn 1972_LuHeCoGo _ZoShOvPo 2007_ZoOvShPo 2007_ScPaTa_a 2007_ScPaTa_b 8 N= cm _GoMaGuKn 1972_LuHeCoGo 1987_BeKoPoTr 1999_MaNaNAOd 1976_FlGi 1981_Partridg _ZoShOvPo 2007_ZoOvShPo 2007_ScPaTa_a 2007_ScPaTa_b 0 N= cm _MaNaNAOd 1985_Johns 1981_Partridg 1976_FlGi _ScPaTa_a 2007_ScPaTa_b 2007_ZoOvShPo 2008_ZoShOvPo 0 N= cm _MaNaNAOd 1985_Johns 1980_KaKy 1978_KaKaKy 1977_Winther _ScPaTa_a 2007_ScPaTa_b 2008_ZoShOvPo 2007_ZoOvShPo 2 N= cm _Johns 1978_KaKaKy 1980_KaKy 1977_Winther 2003_MiTyMe 1998_Toth 1992_Toth 2006_LiDuSoWa 1983_Guelachv 1993_Toth 1971_WiNaJo 1978_JoMc 1975_ToMa 1994_Toth_a 1969_FrNaJo 1983_PiCoCaFl 1973_CaFlGuAm 1983_ToBr 2007_JeDaReTy 2002_MiTyStAl 1985_ChMaFlCa 1977_ToCaFl_a 1986_ChMaFlCa 1989_UlZhSh 2006_LiNaSoVo 1986_ChMaCaFl_b 2004_MaRoMiNa 1994_Toth_b 2007_MiLeKaCa 1977_ToFlCa_b 2001_MoSaGiCi 2001_MoSaGiCi 2005_ToTe 2006_LiHuCaMa 1987_ChMaFlCa 2005_ToNaZoSh _ScPaTa_a 2007_ScPaTa_b 2008_ZoShOvPo 2007_ZoOvShPo 1983_PiCoCaFl 302 N= cm _ChMaFlCa 2005_ToNaZoSh 2007_MaToCa 1995_ByNaPeSc 2002_TaBrTe 2005_TaNaBrTe _ZoOvShPo 2007_ScPaTa_a 2007_ScPaTa_b 2008_ZoShOvPo HITRAN Conference, Cambridge, June 2010

Publication constraints 0. L.S. Rothman, R.R. Gamache, A. Goldman, L.R. Brown, R.A. Toth, H.M. Pickett, R.L. Poynter, J.-M. Flaud, C. Camy-Peyret, A. Barbe, N. Husson, C.P. Rinsland, and M.A.H. Smith, “The HITRAN database: 1986 Edition,” Appl.Opt. 26, (1987) 26. H. Partridge and D.W. Schwenke, “The determination of an accurate isotope dependent potential energy surface for water from extensive ab initio calculations and experimental data,” J.Chem.Phys. 106, (1997). 28. J.P. Chevillard, J.-Y. Mandin, J.-M. Flaud, and C. Camy-Peyret, “H 2 18 O: line positions and intensities between 9500 and cm -1. The (041), (220), (121), (201), (102), and (003) interacting states,” Can.J.Phys. 65, (1987). 30. R.A. Toth, “Linelist of water vapor parameters from 500 to 8000 cm -1,” see Calculation from K.V. Jucks, private communication (2000). Composite data source Unpublished data source Published data and data in HITRAN are not the same HITRAN Conference, Cambridge, June 2010

J. Tennyson, P.F. Bernath, L.R. Brown, et al., IUPAC Critical Evaluation of the Rotational-Vibrational Spectra of Water Vapor. Part I. Energy Levels and Transition Wavenumbers for H 2 17 O and H 2 18 O Journal of Quantitative Spectroscopy and Radiative Transfer, July 2009, V.110, no.9-10, P J. Tennyson, P.F. Bernath, L.R. Brown, et al., IUPAC Critical Evaluation of the Rotational-Vibrational Spectra of Water Vapor. Part II. Energy Levels and Transition Wavenumbers for HDO, HD 17 O and HD 18 O Journal of Quantitative Spectroscopy and Radiative Transfer, Non-formal constraints HITRAN Conference, Cambridge, June 2010

Current State of the H 2 O H 2 S SO 2 CO 2 N 2 O NH 3 CH 4 C 2 H 2 CO O 2 OCS HNCO ~ 2000 articles H 2 O H 2 S CO 2 CO CH 4 ~ 1200 data sets e-Library (Primary Data) Digitized Data H 2 O H 2 S SO 2 O 3 N 2 O OCS NH 3 C 2 H 2 CO HBrO CO 2 CH 4 In the end of August 2010 CO 2 H 2 O H 2 S now + NH 3 CO CH 4 In the end of 2010 Upload Systems Data Base & Knowledge Base of DIS HITRAN Conference, Cambridge, June 2010

Summary ► A node prototype of a Distributed Information System for acquire, retrieve, publish and maintain data, information and knowledge in quantitative molecular spectroscopy is developed and implemented ► A component of the publishing tools provides formal validation of data is implemented ► IS – HITRAN Conference, Cambridge, June 2010

Acknowledgements We thank Prof. J.Tennyson for the assistance providing the creation of all data collections and Dr. S.Tashkun for his contribution in CO 2 data collection. Fazliev A. thanks Prof. Tyuterev Vl.G. for fruitful discussion on the publications constraints. This work has received partial support from RFBR and 7-th Framework Programme