XIS: XML Input System. Statistics Denmark, 12 October 2004
What is XIS? A generic system for validation, storage and publication of input data
Logic: Several sources for the same survey, stored in the same table structure; relational database tables (Oracle/SQL) as the interface to production units; XML validation; XML transformation; component based
Electronic Input Sources: Web questionnaires; EDI (file transfer); reporting by key telephone; OCR from paper scanning; FTP; diskettes; tape or CD from administrative registers
Questionnaires: Approximately 150 different questionnaires, of which approximately 70 are for private enterprises; annual, semi-annual, quarterly or monthly reporting; the large majority are simple questionnaires without routing; a few have complex routing and complex validation
Quantities Total number of reporting is ca per year Approx reporting from private enterprises Intrastat approx each month = approx per year
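Architecture (diagram; components shown: Virk.dk, XIS, private enterprise server, PU, scanner)
System Architecture (diagram; components shown: XML System, INPUT DB, OCR scanning, Virk.dk, diskettes/tape/CD, Blaise, administrative registers, key telephone; input arrives as XML or XML/CSV and passes steps labelled V and T, presumably validation and transformation, producing a control message or log)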
Design principles: Flexibility – needs are changing; Changeability – questionnaires change all the time; Clear and simple interfaces – simple integration; Components and standards – evolution step by step; Stability and correctness – it's production; Confidentiality – statistical office; Automation – resources; Transparency – user control
Overview (diagram; components shown: ADM. DB, Respondent DB, XML System, five input systems, in-house data editing, web service, INPUT DB, PRF DB)
4 Database Model (diagram; databases: INPUT DB, STAT. REG., SUM DB, STAT BANK; metadata: Input Metadata, TIMES, Macro Metadata)
Input Database Architecture (diagram; a metadata component alongside the survey tables Tælling 1 to Tælling 4, where "Tælling" is Danish for a count or survey)
Tracking: Administrative metadata / envelope data. Form, e.g. Intrastat (130501); Period, e.g. 2004M3; Respondent, the legal/obligated party; Reporter, the supplier of information; Date, e.g. :32:10
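The envelope fields listed on this slide can be pictured as a small record type. The following is only a sketch under stated assumptions: the class name `Envelope`, the field names, and the helper `parse_period` are hypothetical, and the period pattern is inferred from the single example `2004M3` (a four-digit year, a frequency letter, and a number). The actual XIS envelope layout is not given in the slides.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Envelope:
    """Hypothetical container for the administrative metadata of one report."""
    form: str        # form code, e.g. "130501" for Intrastat
    period: str      # reporting period, e.g. "2004M3"
    respondent: str  # legal/obligated party
    reporter: str    # supplier of the information
    received: str    # timestamp as delivered; its format is not given in the slides


def parse_period(period):
    """Split a period code such as '2004M3' into (year, frequency letter, number).

    The pattern is an assumption based on the one example in the slides.
    """
    match = re.fullmatch(r"(\d{4})([A-Z])(\d{1,2})", period)
    if match is None:
        raise ValueError(f"unrecognised period code: {period!r}")
    year, frequency, number = match.groups()
    return int(year), frequency, int(number)


# Placeholder identifiers, invented for illustration only.
envelope = Envelope("130501", "2004M3", "respondent-1", "reporter-1", "unknown")
year, frequency, number = parse_period(envelope.period)
```

Keeping the envelope separate from the form contents lets every report be tracked in the same way, whatever questionnaire it belongs to.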
Prefill: Central business register number; unit of reporting; period, deadline, status etc.; fields in form; questions; description of errors; notifications (diagram: units organised by CVR number, with SE-nr. 1 to 3 and departments Afdeling 1 to 4 beneath them; "Afdeling" is Danish for department)
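One way prefill data could be delivered is by serialising the rows of a database view as XML, in the spirit of the "generic XML creator upon SQL views" listed under Technicalities. The sketch below is an illustration only: whether XIS builds its prefill this way is an assumption, SQLite stands in for Oracle so the example is self-contained, and the table, view, column, and tag names are all hypothetical.

```python
import sqlite3
import xml.etree.ElementTree as ET


def view_to_xml(conn, view_name, root_tag="rows", row_tag="row"):
    """Serialise every row of a view as <row_tag>, with one child element per column."""
    cursor = conn.execute(f'SELECT * FROM "{view_name}"')
    columns = [description[0] for description in cursor.description]
    root = ET.Element(root_tag)
    for record in cursor:
        row = ET.SubElement(root, row_tag)
        for column, value in zip(columns, record):
            child = ET.SubElement(row, column)
            child.text = "" if value is None else str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical prefill data: one reporting unit with period and status.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE reporting_unit (cvr TEXT, period TEXT, status TEXT);
    INSERT INTO reporting_unit VALUES ('00000000', '2004M3', 'open');
    CREATE VIEW prefill_v AS SELECT cvr, period, status FROM reporting_unit;
""")
xml_text = view_to_xml(conn, "prefill_v", root_tag="prefill", row_tag="unit")
```

Because the function only reads column names from the cursor, the same code works for any view, which is what makes the approach generic.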
Communication: Publishing; reporting; error reporting; re-reporting; etc.
Technicalities: Oracle Database 9.2i; Software AG XML Mediator; generic database creator upon XSD (nesting -> new table; repeating field -> new table; unique tag names -> unique table names); generic XML loader; generic XML creator upon SQL views; Cryptomathic, S/MIME, digital signatures, X.509; POP3 and SMTP; Secure FTP; Web Services, SOAP
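The three mapping rules of the generic database creator (nesting gives a new table, a repeating field gives a new table, unique tag names give unique table names) can be sketched as follows. This is a minimal illustration, not the XIS implementation: the sample schema, the function name `tables_from_xsd`, and the `ID`, `<parent>_ID`, and `VALUE` column conventions are assumptions, and only inline `xs:sequence` declarations are handled.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical miniature schema, invented for illustration only.
SAMPLE_XSD = """
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="Report">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="Period" type="xs:string"/>
        <xs:element name="Respondent">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="CvrNumber" type="xs:string"/>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
        <xs:element name="Remark" type="xs:string" maxOccurs="unbounded"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
"""


def tables_from_xsd(xsd_text):
    """Return {table_name: [column, ...]} derived from the element declarations."""
    tables = {}

    def visit(element, parent_table):
        name = element.get("name")
        complex_type = element.find(XS + "complexType")
        repeating = element.get("maxOccurs", "1") != "1"
        if complex_type is not None or repeating or parent_table is None:
            # Nesting or repetition: the element becomes its own table.
            if name in tables:
                raise ValueError(f"tag name {name!r} is not unique")
            columns = ["ID"]
            if parent_table is not None:
                columns.append(parent_table + "_ID")  # link back to the parent row
            tables[name] = columns
            if complex_type is None:
                columns.append("VALUE")  # repeating simple field holds one value per row
                return
            for child in complex_type.findall(f"{XS}sequence/{XS}element"):
                visit(child, name)
        else:
            # Simple, non-repeating field: a column in the parent table.
            tables[parent_table].append(name)

    root = ET.fromstring(xsd_text)
    for top in root.findall(XS + "element"):
        visit(top, None)
    return tables


tables = tables_from_xsd(SAMPLE_XSD)
```

Requiring unique tag names means the table name can be taken directly from the tag, at the cost of rejecting schemas that reuse a name in two places.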
Status: Reception of data since June 2003; prefilling since April 2004
Plans: Forms administration; metadata; statistics; data from public administrative registers
Thanks