Download presentation
Presentation is loading. Please wait.
Published byStephen Holmes Modified over 9 years ago
1
UK Data Warehouse Work 23 rd May 2012 Paul Tutton, Sarah Ravenhill
2
Outline 1.Background 2.Approach 3.Warehouse Concepts 4.Prototyping & Modelling 5.Data Harmonisation 6.Recommendations and Next Steps
3
1. Background Other Services Data Sources Data Repository Staging Operational Data Store Data Consumers
4
2. Approach What do we want? How do we want to work? Does that work? Build it and see What can we put in there? How would we implement one? What are the costs and benefits?
5
3. What and How Define Interrogate Store Data And Metadata Validate Derive Aggregate Input & Update Extract Find Gaps
6
4. Build It… Integrate data from multiple sources Make extracts to support current and new statistics Define a method for describing extracts Identify gaps in extracts Automate choice between or combination of sources
10
Source Level Indicators
11
Variable Level Indicators Rate my data – what are we consistently suspicious of?
12
4. …and See Warehouses work Statistical processes must change Shared Information Models are important Think about the minimum acceptable amount of data
13
5. Assess Potential Conceptual Overlap Meaning of the Data Conceptual Overlap Meaning of the Data Dataset Shape Shape of the population Dataset Shape Shape of the population Statistical Activity Process surrounding the data Statistical Activity Process surrounding the data Harmonisation Analysis
14
5. Analysis Steps List your sources Describe variables Pool the list Find the concepts Classify variables Assess results
15
5. Overlap findings Exact Replication Conceptually Close Otherwise Derivable General Feasibility Combinations Small numbers found
16
Employee Count Employment Foreign Investment Hours/ Pay Pension Schemes Employee Count Employment Foreign Investment Hours/ Pay Pension Schemes 5. Example Concepts Acquisitions/ expenditure Business Operation Business Structure Comments/ Narrative Disposals/ Income Acquisitions/ expenditure Business Operation Business Structure Comments/ Narrative Disposals/ Income Profit/ Loss Statistical Units Stock Taxes/ National Insurance Turnover
17
5. Interview Findings Pooling data: May assist imputation Enables consist stories across outputs Allows congruence checking at unit level Is more useful if it exposes timelier sources to output managers Is of more benefit for some subjects than others (e.g. employment over finance)
18
6. Recommendations and Next Steps Continue development of CIM Analyse extent of process change due to movement away from survey silos Implement a warehouse in stages: Integrate storage first De-duplicate and harmonise once integration is complete Consider the addition of statistical processing facilities to reap further benefits
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.