Download presentation
Presentation is loading. Please wait.
Published byBuck Nash Modified over 9 years ago
1
From Data Access to Data Integration IAOS, Shanghai 14-16 October 2008 Annegrete Wulff, Statistics Denmark awu@dst.dk
2
Accessible products Publications (ca. 1850 - ) Municipality statistics databank 1986-98 StatBank Denmark (www.statbank.dk) 1998 –www.statbank.dk Homepage www.dst.dk 1996 -www.dst.dk
3
Dissemination principles Electronic first StatBank is the place for all official statistics StatBank is the source for all publications StatBank is online available & free-of-charge for everyone Simultaneous releases in all media 9:30:00 am Dissemination should address well-defined –target groups –types of usage …jet still use the same source (data and metadata)
4
The Statistical Information System Person id: Person Number Enterprise id: CBR-No Dwelling id: Address Health Tax Employ- ment Educa- tion Social etc CPR BDRBDR CBRCBR Question- naire Inter- view Cadastre
5
Access Population Labour market Income Finance Agriculture etc. Databank www.statbank.dk
6
The Public - www.dst.dk SumDatabase Cleaned micro data Statistical registers Print Binding dst.d k Charged statistics and analysis Annonymos micro data for Researchers Subject matter division Dissemination, IT-Centre Aggregation to macro data Publication pdf International organisations
7
Users needs More users Variety of users Different – and increasing user needs User satisfaction surveys
9
Shift of focus …..1980’s Electronic on-line access Content: more details than on paper Design of tables on the fly Calculations Download possibilities Output formats
10
Statistics Denmark’s first databank 1986 200 users – and we new them all
11
Shift of focus …..1990’s Internet access - but also off-line products: CD-ROM Functionality (calculations) Interactive aggregations Contact possibilities – who to ask for more
12
Internet databank, ver. 1.0 1998 Access “restricted” by a fee
13
Shift of focus …..2000’s Presentation, layout Documentation Linking, coherence Long time span Search
14
Homepage 2004 Documentation, quality declaration, graphics
15
Shift of focus …..2008’s Response time Definitions Search and browse Visualisation, maps and graphics Self service Integration with own systems
16
StatBank Denmark More than 2,000 tables, several billions of data Links to documentation (declaration of contents) Links to publications Saved queries Data shooting Excel web queries Output formats: Excel, PC-AXIS, xml, SAS, comma separated, time series,… Maps Graphs
17
Satisfied users Satisfied and very satisfiedN=755 The StatBank in general94 % Response time94 % Content93 % Download possibilities, formats92 % Presentation in charts89 %
18
Unsatisfied users Unsatisfied or very unsatisfied N = 755 Finding the right table30 % On-line help29 % Documentation related to a table23 % Degree of detail18 % Presentation on maps15 %
19
2,500 matrices in Danish and English 2 million retrievals HTML table on screen Downloads of a file 77 % only on screen 6 % in maps, 17 % graphs 23% downloads. Of these: 86 % in Excel 9 % in PC-AXIS 5% in other formats
20
Subject matter division Technical sources Statisticians Statistics Denmark publications Metadataproject Quality declarations Statistical Yearbook & Ten year review Statistical Abstracts Annual publications, Nomenclatures External documents Legal documents etc TIMES, microda ta docume ntation Many sources of metadata
21
One source for all metadata Subject matter division Technical sources Statisticians Statistics Denmark publications Metadata project Quality declarations Statistical Yearbook & Ten year review Statistical Abstracts Annual publications, Nomenclatures External documents Legal documents etc TIMES, microda ta docume ntation
22
Metadata International organisations Annonymos micro data for researchers Charged statistics and analysis dst.d k - www.dst.dk code explanations source quality accessability concepts, definitions methodologies contacts release info
23
The process Select the concepts Define the concept Making the definition accessible
24
Concepts in the StatBank 850 dimensions 173,000 dimension members 8,000 will be defined
25
Examples: table titles Unemployed by region, ancestry and sex (monthly) Immigrated by region, country of origin, age and sex (continous years)
26
Example of concepts Concept measured Dimension/ variable Dimension member/ value Frequency UnemployedAncestryDescen- dants Continous years ImmigratedCounrty of origin Western countries
27
Born in Denmark and neither of the parents is born in Denmark and has Danish citizenship as well
28
At least one parent is born in Denmark and has Danish citizenship
29
Glossary
30
Integration and linking Integration across –Media –Level of detail –Topic
31
Thank you
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.