Published by Jarkko Niemi. Modified over 6 years ago.
1
Information and software architecture for statistical dissemination
2
Contents
  Background
  Information architecture (CoSSI)
  System architecture (XML-based publishing system)
  Future challenges
Harri Lehtinen 7/6/2011
3
Background
  200 sets of statistics on 26 topics, 700 data releases per year
  Every set of statistics has its own home page and must generate information for that page at least at each time of publishing
  -> Electronic publications in different languages on all statistics
  Need for an efficient content production system for wide-scale electronic publishing
4
Information architecture
CoSSI (Common Structure of Statistical Information)
  The starting point for CoSSI was an infological analysis of the information being considered.
  The analysis concluded that, although the definition of statistical information has in practice varied by situation and application, statistical information has a universal structure that can be simplified and generally accepted.
  CoSSI describes this general structure, which does not depend on the situation or on the format in which the statistical information is presented.
  => CoSSI defines the structures of statistical data, metadata and publications.
5
Information architecture
CoSSI
  A modular DTD system that defines the XML structure for statistical information
  Publications
  Data:
    Matrices (XDF)
    Tables (CALS)
    Sparse matrix (KEYS)
  Document metadata
  Statistical metadata
  Processing metadata
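To make the "Tables (CALS)" module more concrete, here is a minimal Python sketch that builds a table using the element names of the public CALS table model (table, tgroup, colspec, thead, tbody, row, entry). It is only an illustration: the function name build_cals_table and the figures are made up, and the slides do not show what the CoSSI DTDs add on top of plain CALS.

```python
import xml.etree.ElementTree as ET


def build_cals_table(header, rows):
    """Build a CALS <table> element from a header row and data rows.

    Hypothetical helper for illustration; not part of CoSSI itself.
    """
    table = ET.Element("table")
    # CALS stores the column count on <tgroup cols="...">.
    tgroup = ET.SubElement(table, "tgroup", cols=str(len(header)))
    for i in range(1, len(header) + 1):
        ET.SubElement(tgroup, "colspec", colname=f"c{i}")

    thead = ET.SubElement(tgroup, "thead")
    head_row = ET.SubElement(thead, "row")
    for label in header:
        ET.SubElement(head_row, "entry").text = label

    tbody = ET.SubElement(tgroup, "tbody")
    for values in rows:
        row = ET.SubElement(tbody, "row")
        for value in values:
            ET.SubElement(row, "entry").text = str(value)
    return table


# Made-up figures, for illustration only.
table = build_cals_table(
    ["Region", "2010", "2011"],
    [["North", 10, 12], ["South", 20, 21]],
)
xml_text = ET.tostring(table, encoding="unicode")
print(xml_text)
```

Keeping the table as a tree of row and entry elements is what lets the same source be rendered to several output formats later.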
6
System architecture: XML-based publishing system
  Based on information in XML format: CoSSI
  Standard XML techniques: DTD, XSLT, XSL-FO, XPath, XQuery
  XML tools: Arbortext editor, eXist XML database, Saxon, XEP
  Development phase
  Implementation project
  In use in all monthly, quarterly and yearly statistical publications
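The publishing step turns one XML source into several output formats. The system described here does this with XSLT and XSL-FO (Saxon, XEP). As a rough stand-in that runs with the Python standard library alone, the sketch below walks a small CALS table and emits an HTML table. The sample data and the function name cals_to_html are assumptions for illustration; this is not the production stylesheet.

```python
import xml.etree.ElementTree as ET

# Small made-up CALS table used as input.
CALS_SAMPLE = """<table>
  <tgroup cols="2">
    <thead><row><entry>Year</entry><entry>Value</entry></row></thead>
    <tbody>
      <row><entry>2010</entry><entry>10</entry></row>
      <row><entry>2011</entry><entry>12</entry></row>
    </tbody>
  </tgroup>
</table>"""


def cals_to_html(cals_xml):
    """Map CALS thead/tbody/row/entry onto HTML thead/tbody/tr/th|td."""
    source = ET.fromstring(cals_xml)
    html_table = ET.Element("table")
    for section, cell_tag in (("thead", "th"), ("tbody", "td")):
        cals_section = source.find(f"./tgroup/{section}")
        if cals_section is None:
            continue  # a CALS table may lack a header section
        html_section = ET.SubElement(html_table, section)
        for cals_row in cals_section.findall("row"):
            tr = ET.SubElement(html_section, "tr")
            for entry in cals_row.findall("entry"):
                ET.SubElement(tr, cell_tag).text = entry.text
    return ET.tostring(html_table, encoding="unicode")


html = cals_to_html(CALS_SAMPLE)
print(html)
```

An XSLT stylesheet would express the same element-to-element mapping declaratively, which is one reason it suits a system with many output formats.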
7
GSBPM (Generic Statistical Business Process Model)
8
System architecture today
[Architecture diagram; labels in extraction order]
  Database services
  Web-publisher: Preview and timing
  Database tables: PC-Axis (.px)
  Tabulation: PX-Edit, SAS, PX-Web, Base-SAS, .PX, SAS-EG
  Dissemination database
  Publication editor: Arbortext
  Publications (text, graphs, tables)
  Publication tables: CALS (.xml)
  Graphs: PNG
  PX-Edit
  eXist: Publications
  HTML
  SuperStar
  PDF
  Multilingual table?
  Metadata
  Printing house
  RSS, SDMX
  eXist: Variable descriptions
  PDF
  Multilingual table
  Variable editor: Variable descriptions
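The database tables in this architecture are PC-Axis (.px) files, which are plain text made of KEYWORD=value; statements. The following Python sketch reads a deliberately small subset of that format (TITLE, STUB, HEADING, VALUES, DATA) and assumes no semicolons inside quoted strings, one stub and one heading variable, and purely numeric data cells. The sample content and the function name parse_px are made up; real .px files carry many more keywords.

```python
import re

# Made-up, simplified PC-Axis content for illustration.
PX_SAMPLE = """TITLE="Population by region and year";
STUB="region";
HEADING="year";
VALUES("region")="North","South";
VALUES("year")="2010","2011";
DATA=
10 12
20 21;
"""


def parse_px(text):
    """Parse a simplified .px file into {keyword: list of values}."""
    entries = {}
    # Assumption: ';' only ever terminates a statement.
    for statement in text.split(";"):
        statement = statement.strip()
        if not statement:
            continue
        key, _, raw = statement.partition("=")
        key = key.strip()
        if key == "DATA":
            # Assumption: every cell is numeric (no missing-value symbols).
            entries[key] = [float(token) for token in raw.split()]
        else:
            entries[key] = re.findall(r'"([^"]*)"', raw)
    return entries


px = parse_px(PX_SAMPLE)
row_labels = px[f'VALUES("{px["STUB"][0]}")']
col_labels = px[f'VALUES("{px["HEADING"][0]}")']

# Data cells are read row by row: stub values as rows, heading values as columns.
n_cols = len(col_labels)
cells = {
    (row, col): px["DATA"][i * n_cols + j]
    for i, row in enumerate(row_labels)
    for j, col in enumerate(col_labels)
}
print(cells[("South", "2010")])
```

Once the cells are keyed by their row and column labels, converting them to another table format becomes a matter of re-serialising the same dictionary.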
9
10
Future challenges
  Metadata
    New metadata system based on CoSSI-XML
    Integrated variable editor
    Integrated classification editor
    Quality metadata production and publishing (national needs, ESMS, ESQR)
    Processing metadata
  Multilingual and metadata-rich tables
    Interface between metadata and tabulation systems
    SAS, SAS-EG (EG Add-In), PX-Edit, .Net-applications
    Under development, but slow progress…
11
System architecture: 2011->
[Architecture diagram; labels in extraction order]
  Tables and publications
  Tabulation: SAS
  Tables: PC-Axis (.px)
  Converter:
    PC-Axis -> XDF
    PC-Axis -> CALS
    XDF -> PC-Axis
    XDF -> CALS
  Metadata enrichment: XDF + Data description = Multilingual table and metadata for variables
  Base-SAS
  Published StatFin (PX-Web)
  Classifications
  Variable descriptions
  eXist XML-database: Publications, XDF-tables
  Web-publisher: Preview and timing
  SAS-EG
  Tables: XDF (.xml)
  PX-Edit
  SuperStar
  Publication tables: CALS (.xml)
  Arbortext: Publications (text, graphs, tables)
  .Net-applications
  Metadata
  Classification database
  eXist XML-database: Variable descriptions, Classifications, Statistical concepts
  Variable editor: Variable descriptions
  Classification editor: Classification maintenance
  Statistical concepts database
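The "Metadata enrichment" step above (XDF + Data description = multilingual table and metadata for variables) can be pictured as a lookup that replaces variable codes with labels from a shared description store. The Python sketch below shows the idea only. The VariableDescription class, the enrich function and the sample labels are all hypothetical; the slides do not show the actual XDF structure or the variable editor's data model.

```python
from dataclasses import dataclass, field


@dataclass
class VariableDescription:
    """Hypothetical variable description: one code, labels per language."""
    code: str
    labels: dict = field(default_factory=dict)  # language code -> label


def enrich(columns, descriptions, languages):
    """Return column headers per language, falling back to the raw code."""
    by_code = {d.code: d for d in descriptions}
    headers = {}
    for lang in languages:
        headers[lang] = [
            by_code[code].labels.get(lang, code) if code in by_code else code
            for code in columns
        ]
    return headers


# Made-up descriptions, for illustration only.
descriptions = [
    VariableDescription("region", {"fi": "Alue", "en": "Region"}),
    VariableDescription("year", {"fi": "Vuosi", "en": "Year"}),
]

headers = enrich(["region", "year", "value"], descriptions, ["fi", "en"])
print(headers["fi"])
print(headers["en"])
```

Storing the labels once and joining them at publication time means a corrected description reaches every table that uses the variable, in every language.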
12
Questions, comments?