OPERATIONAL METADATA FOR FEDERATING STATISTICAL REFERENCE SYSTEMS AT EUROSTAT G. Pongas, F. Vernadat EC Eurostat B2
Overview of the talk Introduction CVD (Cycle de Vie des Données) REFIN: Internal Reference Eurostat Dissemination Portal (Site 3) Conclusion
Introduction Metadata in statistical information: define some of the semantics of data needed for proper production and usage of data make data comparable ensure some level of data quality required for efficient search
CVD (Cycle de Vie des Données)
Current Situation at Eurostat
EUROSTAT INTERNAL REFERENCE The problem Two many different systems at EUROSTAT for handling data: –FAME –Oracle Express –Oracle RDBMS –SAM –SAS
REFIN: The problem (Cont’d) Some of them are general purpose (e.g. Oracle RDBMS) whereas others may include special features (for data validation or computation) but they all have their own access methods and user interfaces (Express Analyser, FAME...) Major drawbacks: -High complexity for users -Data comparison between different systems is not easy
What is REFIN ? The REFIN system specifically addresses these issues –Gives access to heterogeneous systems –Provides the users with a common interface Data location and source system is hidden Data not duplicated, access to the original data. –Uses a unique exchange format (PIVOT) –Implements specific security rules
REFIN architecture FAME DATA Bases +METADATA ORACLE DBMS DATA bases +METATDATA MICROSOFT ACCESS SAM+METADATA HLISNAPIOCI REFIN INTERNAL REFERENCE REFIN ADAPTOR SECURITY LAYER METADATA METADATA + LOCALISATION DATA + PROCEDURES RPC/DCE or XML DAO/ODA/ODBC ORACLE EXPRESS DATA Bases +METADATA
REFIN architecture SAM EXPRESS ORACLE FAME Metabase Builder REFIN Particular Metabases REFIN Common Metabase Converter 1) Generation of REFIN metadata 2) Mapping to Common Metadata
REFIN architecture SAM EXPRESS ORACLE FAME FAME Driver SAM Driver EXPRESS Driver ORACLE Driver HLI ODBC SNAPI OCI REFIN API
New possibilities provided by REFIN To build heterogeneous data sets by mixing data from different origins and systems
Eurostat Dissemination Portal (Site 3)
Site 3 Metadata
Conclusion Importance of linking data and metadata Importance of having an integrated metadata environment Clear distinction between –Statistical metadata –IT metadata –Dissemination metadata