Two Paradigms for Official Statistics Production Boris Lorenc, Jakob Engdahl and Klas Blomqvist Statistics Sweden.

Slides:



Advertisements
Similar presentations
An Adaptive System for User Information needs based on the observed meta- Knowledge AKERELE Olubunmi Doctorate student, University of Ibadan, Ibadan, Nigeria;
Advertisements

Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
An Introduction to Artificial Intelligence. Introduction Getting machines to “think”. Imitation game and the Turing test. Chinese room test. Key processes.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Knowledge and Systems Research Group, University of Huddersfield B vs OCL: Comparing Specification Languages for Planning Domains Diane Kitchin, Lee McCluskey,
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
Statistical Relational Learning for Link Prediction Alexandrin Popescul and Lyle H. Unger Presented by Ron Bjarnason 11 November 2003.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining Chun-Hung Chou
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Changing the perspective: users’involvement in prioritising needs in the Italian National Statistical Programme Monica Attias, ISTAT Q2014 Vienna, June.
Knowledge representation
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
What You Need before You Deploy Master Data Management Presented by Malcolm Chisholm Ph.D. Telephone – Fax
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Assessing Quality for Integration Based Data M. Denk, W. Grossmann Institute for Scientific Computing.
Artificial Intelligence And Machine learning. Drag picture to placeholder or click icon to add What is AI?
Quality Reporting at SORS – Experiences and Future Perspectives Rudi Seljak, Tina Ostrež Statistical Office of the Republic of Slovenia.
The Data Ring: Community Content Sharing Serge Abiteboul (INRIA) Alkis Polyzotis (UC Santa Cruz)
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
Measuring the Quality of Decisionmaking and Planning Framed in the Context of IBC Experimentation February 9, 2007 Evidence Based Research, Inc.
1 A Conceptual Framework of Data Mining Y.Y. Yao Department of Computer Science, University of Regina Regina, Sask., Canada S4S 0A2
Data Mining By Dave Maung.
Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Information commitments, evaluative standards and information searching strategies in web-based learning evnironments Ying-Tien Wu & Chin-Chung Tsai Institute.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
 Dr. Syed Noman Hasany 1.  Review of known methodologies  Analysis of software requirements  Real-time software  Software cost, quality, testing.
1 Discovering Robust Knowledge from Databases that Change Chun-Nan HsuCraig A. Knoblock Arizona State UniversityUniversity of Southern California Journal.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
27-18 września Data Mining dr Iwona Schab. 2 Semester timetable ORGANIZATIONAL ISSUES, INDTRODUCTION TO DATA MINING 1 Sources of data in business,
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 13 of 41 Monday, 20 September.
Paper III Qualitative research methodology. Objective 1.3 To what extent can findings be generalized from qualitative studies.
Q Rome Italy, July The Mysterious 36% Difference: Studies on a Measurement Error Anette Björnram, Boris Lorenc, Andreas Persson, Klas Wibell.
ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.
Estimation of preliminary unemployment rates by means of multiple imputation UN/ECE-Work Session on Data Editing Vienna, April 2008 Thomas Burg, Statistics.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
CONCLUSION The conclusion of this work is that it is possible to develop a problem-solving method providing evolutionary computational support to general.
MetaPlus Klas Blomqvist Statistics Sweden Research and Development – Central Methods
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.
Web-based Data Mining for Quenching Data Analysis Aparna S. Varde, Makiko Takahashi, Mohammed Maniruzzaman, Richard D. Sisson Jr. Center for Heat Treating.
Leacock, Warrican & Rose (2009) Reviewing Literature Presentation 4.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Metadata Driven Aspect Specification Ricardo Ferreira, Ricardo Raminhos Uninova, Portugal Ana Moreira Universidade Nova de Lisboa, Portugal 7th International.
CHAPTER 2 LITERATION REVIEW 1-1. LEARNING OUTCOMES 1.The reasons for a literature review being an essential part of every project. 2.The purpose of a.
©Ian Sommerville 2000Software Engineering, 6th edition. Chapter 19Slide 1 Verification and Validation l Assuring that a software system meets a user's.
Copyright © Springer Publishing Company, LLC. All Rights Reserved. EVIDENCE-BASED TEACHING IN NURSING – Chapter 15 –
CSPA & Digital Transformation
School of Computer Science & Engineering
Intelligent Agent Solution
CH. 1: Introduction 1.1 What is Machine Learning Example:
Christian Ansorge Arona, 09/04/2014
ece 720 intelligent web: ontology and beyond
CSc4730/6730 Scientific Visualization
KDD Reviews 周天烁 2018年5月9日.
2. An overview of SDMX (What is SDMX? Part I)
Web Mining Research: A Survey
Managerial Decision Making and Evaluating Research
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Presentation transcript:

Two Paradigms for Official Statistics Production Boris Lorenc, Jakob Engdahl and Klas Blomqvist Statistics Sweden

Preliminaries The talk concerns data and knowledge about external world – not data and knowledge about producing statistics (but might have consequences for the latter) Inspired by the different discussions on ongoing developments and initiatives within (official) statistics May have certain relevance for editing Naturally, the views presented herein are those of the authors, not necessarily reflecting policies of Statistics Sweden

Preliminaries (cont’d) Transition from (many) Stovepipes to (few) Integrated System(s) Among intended goals i.better integration of administrative data and survey data, ii.better/faster response to new or changing user needs How an integrated system should look like so as to satisfy these requirements answer sought in the field of knowledge systems/cognitive systems

Agenda I.Preliminaries II.On some distinctions and results regarding knowledge/cognitive systems III.Consequences for representing data in Integrated systems for statistics production IV.Further considerations for statistics methodology, including some thoughts regarding editing

Knowledge/Cognitive Systems Computational symbolic first-order predicate logic other formal logic etc subsymbolic artificial neural networks (ANNs) etc Other (noncomputational) embodied cognition situated cognition socially distributed cognition etc Good for restricted domains with clear rules (e.g. chess), less good for open-world problems

Database developments Relational Model RDBMS (Relational Database Management System) implements first-order predicate logic database schema: theory in predicate calculus NoSQL schema-less (theory-less) examples Google‘s BigTable solutions underlying some functions on Amazon, Twitter, and Facebook Perhaps related: Semantic Web how to structure documents into a “web of data” “a web of data that can be processed directly and indirectly by machines” uses Resource Description Framework (rather than RDBMS)

Consequences Paradigm I: Stovepipe + RDBMS ‘manual’ management of a fairly restricted domain single-purpose use likely requires expert assistance to users in search and requirements specification Paradigm II: Integrated system + noSQL automatic building of world knowledge pertaining to the domain multi-purpose use likely empowers users to themselves explore available data and consider merits of requiring new data  likely requires expert assistance to users in search and requirements specification  likely empowers users to themselves explore available data and consider merits of requiring new data

Sampling theory considerations In the context of Paradigm II: use of weights what should they then reflect: inclusion probabilities (if known)? nonresponse information (including an assumed model)? auxiliary information pertaining to specific variables to be estimated? use of models memorylessness vs. Bayesian statistics

Editing Editing for a purpose vs. editing “without a purpose” adherence to general specifications (‘concept validity’) self-learning (unsupervised) tools from computer science/ANN model congruence (especially building automatic models using methods from the KDD (Knowledge Discovery and Data Mining) field more?

Conclusions The distinction likely not as clear-cut as presented here, however the trend discernible: transition from “manual” to automatic processing potential increased need to use models In building representations of “world knowledge”, in addition to RDBMS, pay attention to developments in NoSQL, Big Data, and similar Perhaps strengthen work on general-purpose data editing automated data editing model use... (as already advanced in several contributions to the workshop)

Thank you