19.11.2004 Toni Räikkönen: Data Collection in Statistics Finland now and in the Future


Toni Räikkönen Data Collection in Statistics Finland now and in the Future

Topics
- General background of the data collection in Statistics Finland
- Internet-based data collection
- Self-made web data collection applications
- XCola (XML-based Collection Application)

Primary objectives in data collection
- reduce the data supply burden of respondents
- speed up data production
- lower data collection costs
- improve the quality of data
- remove overlapping collection and promote joint use of the collected data between different authorities

Background
- About 96 % of the data is collected from administrative registers
- About 4 % of the data is collected directly from respondents
  - paper forms, Excel sheets
  - web collection applications
  - interviews by CATI/CAPI systems, mainly using Blaise software
- Result agreement with the Ministry of Finance
  - All respondents (enterprises, communes, schools) should have the possibility to transmit their data electronically by the end of 2006.

Data collection in Statistics Finland by type and media used

Data flows
- Different types of data flows
  - data are needed only by Statistics Finland
  - the same data are needed by several administrative organizations
  - interviews made by CATI/CAPI system
- Different solutions
  - using an external teleoperator for distributing data to different data collectors (TYVI model)
  - self-made web-based system
  - Blaise solution for carrying out interviews

The TYVI model
- Data flows from enterprises to authorities
  - interfaces and transmission
  - data capture
  - data refining
  - management of user accounts
- Participants
  - The enterprises
  - The TYVI operators
  - The authorities
- The authority need not maintain a many-to-many relationship with the respondents

The TYVI-model (Vallaskangas 1998)

Toni Räikkönen: Internet-based collection of data. Case: Building Cost Index

General background
- Fall 2000
  - All existing electronic data collections were handled by third-party operators (TYVI model)
  - The production system of the Building Cost Index was under reconstruction and lacked web-based data collection
- About the Building Cost Index (Business Trends)
  - ~300 respondents (hardware stores, wholesale stores, plumbing stores etc.)
  - Price information on 1-15 products collected from each respondent every month
  - Paper forms are usually sent on the 15th day and expected back around the 25th day

The design goals of the web system
- Provide means of web-based collection of statistical data
- No extra burden (no installations, no JavaScript-based solutions etc.)
- “Live” feedback to the respondents (upon validations etc.)

Hardware architecture
- Running on Windows NT server
  - Web server: Microsoft Internet Information Server 4 (IIS4)
  - Component server: Microsoft Transaction Server 2.0
  - Anonymous access (no NT authentication)
- Database server
  - Windows 2000 server
  - Running Microsoft SQL Server 2000
- Deployed on DMZ, accessible only through firewall

Application architecture
- Built using Microsoft Windows DNA (Distributed interNet Applications Architecture)
- Standard 3-tier architecture that consists of
  - Presentation layer: HTML, ASP
  - Business layer: COM components
  - Database layer: relational database
- System consists of two separate modules (both self-made)
  - User authentication
  - Data collection

Experiences
- Beta phase
  - from 5/ /2001, 30 respondents
  - 9/ /2002, 70 users
- In 3/2002 the system was opened to all respondents
- 147 users at the moment (nearly 50%)

Toni Räikkönen: Internet-based collection of data. Case: Business Trends’ collection systems, technical aspects

Design goals
- Create a framework for similar systems
- Multi-language support
- LDAP-based user authentication with centralized administration
- Create a generic method for transferring data between collection and production databases
- Create a “mass e-mailer” for all kinds of collection systems

Software & hardware architecture
- Built using Microsoft .NET and ASP.NET
- Generic 3-tier architecture with presentation, business and database logic
- Collection database separated from the production database
- 128-bit encryption used for communication between respondents and Statistics Finland

Framework of the collection system
- The modular structure of the framework makes it possible to
  - Change menus, headers, footers and other styles
  - Add custom functionality (using ASP.NET user controls) on the pages
  - Add and load different languages for the pages
- The base use cases are more or less the same in different collection systems (login, questionnaire, feedback, instructions and contact information)

Multi-language support
- Most of the textual information on the web pages is stored in the database
- Texts are loaded into the server’s memory at system startup
- Only long descriptions are kept as files
- Page language can be changed “on the fly”
- Every element has a tag on the page template, and the relevant text is attached to the element upon page load
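
The text-loading scheme on this slide could be sketched roughly as follows. This is a hypothetical Python illustration, not the ASP.NET code of the actual system: the table name `ui_text`, the tag names, the sample texts and the fallback to a default language are all assumptions made for the example.

```python
import sqlite3

def load_texts(conn):
    """Read every (language, tag, text) row once, as would happen at system startup."""
    cache = {}
    for lang, tag, text in conn.execute("SELECT lang, tag, text FROM ui_text"):
        cache.setdefault(lang, {})[tag] = text
    return cache

def render(template_tags, cache, lang, fallback="en"):
    """Attach a text to each tagged element of a page template.

    Falling back to another language (and finally to the tag itself)
    is an assumption of this sketch, not something the slides describe.
    """
    texts = cache.get(lang, {})
    default = cache.get(fallback, {})
    return {tag: texts.get(tag, default.get(tag, tag)) for tag in template_tags}

# Hypothetical text table standing in for the collection database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ui_text (lang TEXT, tag TEXT, text TEXT)")
conn.executemany("INSERT INTO ui_text VALUES (?, ?, ?)", [
    ("en", "login.title", "Log in"),
    ("fi", "login.title", "Kirjaudu sisään"),
    ("en", "login.submit", "Send"),
])

TEXTS = load_texts(conn)  # kept in memory after startup
page_fi = render(["login.title", "login.submit"], TEXTS, "fi")
```

Because the texts sit in an in-memory dictionary, switching the page language only means calling `render` again with another language code, which is one way to read the “on the fly” remark.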

User authentication
- The objective was to use LDAP (Lightweight Directory Access Protocol) for user authentication
- Development of this did not proceed on schedule, so it was temporarily replaced with database-based user authentication and administration
- Authentication through LDAP has been tested and seems to be an ideal solution
- At the moment we’re building a simple web administration application to finish the LDAP part

Data transfers
- Data transfers between the collection and production databases are handled with an external Win32 application
  - Built with PowerBuilder using its pipeline feature (data flow)
- Data from the collection database is transferred to temporary tables in the production database and then synchronized with the actual tables
- The solution is quite customizable, allowing new functionality by adding new pipelines
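
The two-step transfer (copy into temporary tables, then synchronize with the actual tables) might look roughly like the sketch below. It uses Python's `sqlite3` module in place of PowerBuilder pipelines; the table names `answers`, `tmp_prices` and `prices`, their columns and the sample rows are assumptions chosen for illustration.

```python
import sqlite3

def transfer(collection, production):
    """Copy collected rows into a staging table, then synchronize the real table."""
    rows = collection.execute(
        "SELECT respondent_id, product_id, period, price FROM answers"
    ).fetchall()
    with production:  # one transaction: stage, synchronize, clean up
        production.execute("DELETE FROM tmp_prices")
        production.executemany("INSERT INTO tmp_prices VALUES (?, ?, ?, ?)", rows)
        # Rows with an existing primary key are overwritten, new ones are added.
        production.execute("INSERT OR REPLACE INTO prices SELECT * FROM tmp_prices")
        production.execute("DELETE FROM tmp_prices")
    return len(rows)

# Hypothetical collection database with two submitted prices.
collection = sqlite3.connect(":memory:")
collection.execute(
    "CREATE TABLE answers (respondent_id INT, product_id INT, period TEXT, price REAL)")
collection.executemany("INSERT INTO answers VALUES (?, ?, ?, ?)", [
    (1, 10, "2004-10", 9.9),
    (2, 10, "2004-10", 12.0),
])

# Hypothetical production database that already holds an older value.
production = sqlite3.connect(":memory:")
columns = "(respondent_id INT, product_id INT, period TEXT, price REAL"
production.execute(
    "CREATE TABLE prices " + columns + ", PRIMARY KEY (respondent_id, product_id, period))")
production.execute("CREATE TABLE tmp_prices " + columns + ")")
production.execute("INSERT INTO prices VALUES (1, 10, '2004-10', 9.5)")

moved = transfer(collection, production)
```

Staging the rows first keeps the production tables untouched until the whole batch has arrived, so a failed transfer rolls back without leaving partial data.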

Mass e-mailer
- An external application was built with Visual Basic 6 to send e-mails to the respondents
- Modular approach
  - New systems can be added using textual configuration files
  - Reply requests can be added by writing SQL statements to the configuration files
- Supports attachments
- Replaces traditional letters
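
A configuration-driven mailer of this kind could be sketched as below. The original was written in Visual Basic 6; this Python version only illustrates the idea of one configuration section per collection system with an SQL statement that selects the recipients. The section name, the keys (`subject`, `sender`, `body`, `recipients_sql`), the `respondents` table and the addresses are all hypothetical.

```python
import configparser
import sqlite3
from email.message import EmailMessage

# Hypothetical configuration text; in practice it would be read from a file.
CONFIG = """
[building_cost_index]
subject = Reminder: Building Cost Index
sender = collection@example.org
body = Please submit your prices for this month.
recipients_sql = SELECT email FROM respondents WHERE answered = 0
"""

def build_messages(config_text, system, conn):
    """Build one message per address returned by the configured SQL statement."""
    cfg = configparser.ConfigParser(interpolation=None)
    cfg.read_string(config_text)
    section = cfg[system]
    messages = []
    for (address,) in conn.execute(section["recipients_sql"]):
        msg = EmailMessage()
        msg["From"] = section["sender"]
        msg["To"] = address
        msg["Subject"] = section["subject"]
        msg.set_content(section["body"])
        messages.append(msg)
    return messages

# Hypothetical respondent table: only the second respondent has not answered.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE respondents (email TEXT, answered INTEGER)")
conn.executemany("INSERT INTO respondents VALUES (?, ?)", [
    ("a@example.org", 1),
    ("b@example.org", 0),
])

messages = build_messages(CONFIG, "building_cost_index", conn)
```

Adding a new collection system then amounts to adding a new section to the configuration. Delivery itself (for example through `smtplib`) is left out of the sketch.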

Development experiences
- Microsoft .NET had just been released when the development began
- The development environment wasn’t always stable, and the developers experienced quite a lot of unexpected behavior
- Despite this, ASP.NET is quite an improvement compared to other web application methods (ASP, PHP, Perl etc.)
- Inter-browser compatibility, however, is still quite poor

Effects of the electronic data supply system on the data collection process (before → after)
- Printing the questionnaires → Transferring data to collection database
- Mailing → E-mail informing (mass e-mailer)
- Receiving the questionnaires (mail, fax, e-mail, TYVI) → (Electronic data supply)
- Validating and entering the data → Mass validation
- Printing and mailing the reminders → E-mail reminder (mass e-mailer)
- Phone inquiry → Phone inquiry
- Non-individual delayed feedback → Individual direct feedback
- Limited access to previous own data → Previous own data available
- Manual exclusive treatment → Electronic mass treatment

Results (1): Sale inquiry
Electronic data supply system users of all respondents:
- after 1st month: 48%
- after 2nd month: 59%
- after 3rd month: 61%
- since 4th month: 70%
- Today: %

Results (2): Sale inquiry
Reminders sent:
- before electronic data supply system: ~1000
- after 1st month: ~800
- after 2nd month: ~700
- after 3rd month: ~600
- since 4th month: ~500

Experiences (1)
- Feedback from respondents has been very positive:
  - Response burden has been reduced remarkably
- Enthusiasm of persons involved in data collection
- Manual data treatment has been reduced (at least by 50%)
- Quality of data has improved:
  - Validation, additional information if data is not comparable etc.

Experiences (2)
- Number of enquiries made by respondents concerning the electronic data supply system:
  - first two months: ~100 / month (mainly questions concerning base settings)
  - since third month: ~30 / month (mainly forgotten passwords)

Development ideas
- Although the framework is quite good, some ideas have arisen
- Use of XML to
  - Define the concepts of the questionnaires
  - Define the presentation (XSLT)
  - Define the validations
- Replace the user authentication with LDAP
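
To make the XML idea concrete, a questionnaire definition that carries both the concepts and their validations might be processed as in the sketch below. The element and attribute names (`questionnaire`, `question`, `min`, `max`) are invented for this example and are not the XCola format; the XSLT presentation step is not covered.

```python
import xml.etree.ElementTree as ET

# Hypothetical questionnaire definition; not the actual XCola schema.
DEFINITION = """
<questionnaire id="accommodation">
  <question id="rooms" label="Rooms available" min="0" max="5000"/>
  <question id="nights" label="Nights spent" min="0"/>
</questionnaire>
"""

def load_rules(xml_text):
    """Return a mapping from question id to the attributes of that question."""
    root = ET.fromstring(xml_text)
    return {q.get("id"): dict(q.attrib) for q in root.iter("question")}

def validate(rules, answers):
    """Check each answer against the rules and collect readable error messages."""
    errors = []
    for qid, rule in rules.items():
        raw = answers.get(qid, "")
        try:
            value = int(raw)
        except ValueError:
            errors.append(f"{rule['label']}: a whole number is required")
            continue
        if "min" in rule and value < int(rule["min"]):
            errors.append(f"{rule['label']}: must be at least {rule['min']}")
        if "max" in rule and value > int(rule["max"]):
            errors.append(f"{rule['label']}: must be at most {rule['max']}")
    return errors

rules = load_rules(DEFINITION)
ok = validate(rules, {"rooms": "12", "nights": "30"})
bad = validate(rules, {"rooms": "-1", "nights": "many"})
```

With the rules kept in the XML definition, a new questionnaire needs a new definition file rather than new validation code, and the error messages can be returned to the respondent as immediate feedback.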

Benefits
- Enables
  - Complex validations of the data
  - Dynamic creation of presentation layer logic
  - Displaying of pre-fetched data to individual respondents
  - Live feedback to the respondents (validation errors etc.)

Drawbacks
- Requires user/customer administration for
  - Maintaining user profiles
  - Helpdesk/Support services

Toni Räikkönen: Internet-based collection of data. Case: Accommodation statistics, XML-based form