Re-thinking Modelling: a Call for the Use of Data Mining in Data-driven Social Simulation. Samer Hassan, Javier Arroyo, Celia Gutiérrez. Universidad Complutense de Madrid.

Presentation transcript:

Re-thinking Modelling: a Call for the Use of Data Mining in Data-driven Social Simulation. Samer Hassan, Javier Arroyo, Celia Gutiérrez. Universidad Complutense de Madrid.

Contents: Data-driven ABM; DM-assisted Methodology; Case Study: Mentat; Application; Conclusions.

Research Aim. Data-driven: non-KISS, empirical validation, specific (case study), expressive. Theoretical: KISS, structural validation, abstract, general.

Classical Logic of Simulation

Data-Driven Logic

Data-driven Approach. Complexity. Large amounts of data. Auxiliary AI: fuzzy logic, ontologies, evolutionary computation, data mining.

Data Mining: extracting patterns and relevant information from large amounts of data. Pre-processing of empirical data: cluster finding; discovery of hidden patterns; locates redundancies. Post-processing of simulation output: clustering (discovery of hidden patterns; validation of clusters; locates inconsistencies); classification (cluster matching).

Methodology for DM-assisted ABM

Methodology for DM-assisted ABM: Data Collection. Initial point. Validation points: necessarily ≠ initial. Type: explicit; externalised; empirical distributions; secondary sources. Methods: quantitative (e.g. surveys); qualitative (e.g. interviews).

Methodology for DM-assisted ABM: Analysis. Preprocessing of empirical data. Roles: domain expert (guide DM exploration; interpretation); DM expert. Confirm or refine theories.

Methodology for DM-assisted ABM: Selection of Relevant Data. Filtering. Adaptation of data: normalisation; discretisation. Domain expert: theory. DM: redundancies; overlooked independent variables.
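The data adaptation and redundancy checks named on this slide can be sketched in a few lines. This is a minimal illustration only: the slide does not say which normalisation, discretisation or redundancy measure was used, so min-max scaling, equal-width bins and Pearson correlation are assumptions, and the sample values are hypothetical.

```python
from math import sqrt

def min_max_normalise(values):
    """Rescale numeric values linearly to the range [0, 1] (assumed method)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def discretise(values, bins):
    """Assign each value to one of `bins` equal-width intervals (0-based index)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / bins
    # The maximum value falls on the upper edge, so clamp it into the last bin.
    return [min(int((v - lo) / width), bins - 1) for v in values]

def pearson(xs, ys):
    """Pearson correlation; values near +1 or -1 hint that one variable is redundant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical toy values, not taken from any survey.
ages = [18, 30, 42, 54, 66]
print(min_max_normalise(ages))
print(discretise(ages, bins=3))
print(pearson([1, 2, 3], [2, 4, 6]))
```

A high absolute correlation only flags a candidate redundancy; in the methodology the domain expert would still decide whether to drop the variable.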

Methodology for DM-assisted ABM: Data Analysis. Large data collections. Guided by theory. Types: cluster analysis; Principal Component Analysis; time series methods; association rules.

Methodology for DM-assisted ABM: Interpretation of Results. Theory expert: relate results to theory. New findings are added to the findings base.

Methodology for DM-assisted ABM: ABM Building. Based on findings. Modeller. Steps: formalisation; data-driven design; implementation; initialisation.

Methodology for DM-assisted ABM: Simulation. Fine tuning the ABM: sensitivity analysis; intensive testing. Output: record agent trace.

Methodology for DM-assisted ABM: Validation. Analysis of the results: empirical validation; theoretical consistency. Roles: DM expert (analyse the data); domain expert (extract conclusions). Iterative cycle.

The Problem. Aim: simulate the process of change in social values over a period in a society. Plenty of factors involved. Inertia of generational change: to what extent do the demographic dynamics explain the mental change? Inter-generational: agent characteristics remain constant; macro aggregation evolves.

Mentat: architecture. Agent: mental state attributes; life cycle patterns; demographic micro-evolution (couples, reproduction, inheritance).

Mentat: architecture. World: 3000 agents; grid 100x100; demographic model; 8 indep. parameters. Social network: communication with Moore neighbourhood; friends network; family network.
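As an illustration of the world described on this slide, the sketch below places 3000 agents on a 100x100 grid and looks up the agents in a cell's Moore neighbourhood. It is not Mentat's implementation: one agent per cell, non-wrapping borders, a neighbourhood radius of 1 and all function names are assumptions made for the example.

```python
import random

GRID_SIZE = 100    # grid side, from the slide (100x100)
NUM_AGENTS = 3000  # population size, from the slide

def place_agents(num_agents, size, seed=0):
    """Place agents on distinct random cells (assumption: one agent per cell)."""
    rng = random.Random(seed)
    cells = [(x, y) for x in range(size) for y in range(size)]
    chosen = rng.sample(cells, num_agents)
    return {agent_id: cell for agent_id, cell in enumerate(chosen)}

def moore_neighbours(cell, size, radius=1):
    """Cells within `radius` of `cell` in every direction, clipped at the borders."""
    x, y = cell
    return [
        (nx, ny)
        for nx in range(max(0, x - radius), min(size, x + radius + 1))
        for ny in range(max(0, y - radius), min(size, y + radius + 1))
        if (nx, ny) != (x, y)
    ]

def neighbouring_agents(agent_id, positions, size, radius=1):
    """IDs of the agents located in the Moore neighbourhood of `agent_id`."""
    occupied = {cell: other for other, cell in positions.items()}
    return [
        occupied[cell]
        for cell in moore_neighbours(positions[agent_id], size, radius)
        if cell in occupied
    ]

positions = place_agents(NUM_AGENTS, GRID_SIZE)
print(len(moore_neighbours((50, 50), GRID_SIZE)))  # interior cell
print(len(moore_neighbours((0, 0), GRID_SIZE)))    # corner cell
print(neighbouring_agents(0, positions, GRID_SIZE))
```

With 3000 agents on 10000 cells, roughly 30% of cells are occupied, so an interior agent would have between two and three spatial neighbours on average under these assumptions.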

Data Collection in Mentat. Initial data: EVS-1980 (representative sample of Spain); qualitative info; empirically-grounded demographic equations. Validation data: EVS-1990; EVS-1999.

Analysis in Mentat: Selection of relevant data. EVS-1980, 1990, 1999. Options: 1. Algorithm for the best subset of variables. 2. Rely on domain expert. Tested domain knowledge: (2) chosen. Variables adaptation: normalisation.

Name            Type          Range
gender          categorical
age             numeric       ≥18
studies         numeric       ≥5
civil state     categorical
economy         numeric       real
ideology        ordinal       1-10
conf. church    ordinal       1-4
church att.     ordinal       1-7
relig. person   categorical

Analysis in Mentat: Data Analysis. Algorithm selection: wrapped k-means; explore different k (# of clusters). Discarded variables: gender and age provoke the appearance of irrelevant clusters (e.g. widowed women); economy is redundant (high correlation with education).
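Exploring different values of k can be sketched as follows. This is plain k-means (Lloyd's algorithm) run once per k, not the wrapped variant named on the slide, whose details the slide does not give. The data points, the seed and the use of the within-cluster sum of squares as the comparison measure are assumptions for illustration.

```python
import random

def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda c: squared_distance(p, centroids[c]))
        for p in points
    ]

def k_means(points, k, iterations=50, seed=0):
    """Plain k-means; returns (centroids, labels, within-cluster sum of squares)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids: k distinct data points
    for _ in range(iterations):
        labels = assign(points, centroids)
        new_centroids = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                mean = tuple(sum(dim) / len(members) for dim in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(centroids[c])  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    labels = assign(points, centroids)
    wcss = sum(squared_distance(p, centroids[l]) for p, l in zip(points, labels))
    return centroids, labels, wcss

# Hypothetical toy data: two well-separated groups, not survey records.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
for k in range(1, 5):
    _, labels, wcss = k_means(points, k)
    print(k, labels, round(wcss, 3))
```

A sharp drop in the within-cluster sum of squares followed by a plateau is one common hint for choosing k; the slides indicate that the final choice in Mentat also rested on the sociological meaning of the clusters.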

Analysis in Mentat: Interpretation. Sociological research: religious typology (RLGTYPE); based on 3 variables; ecclesiastical, low-intensity, alternatives & non-religious. Clusters found (1980, 1999): based on the 9-3=6 variables; 5 clusters with sociological meaning; consistent with RLGTYPE. Theoretical observations of the pattern evolution: religiosity strength falls; ideological spectrum twists to the left (education & economy); newest type of religiosity, “alternatives”, rises (youngsters).

Analysis in Mentat

Validation in Mentat. Mentat re-building & simulation explored. Mentat output clusterised: same 5 clusters found; similar evolution trends; 3 theoretical observations shown. Inconsistencies detected: liberal cluster percentages do not match, although aggregated they do; graphics show fewer youngsters; liberal clusters deeply affected. Guide to re-design.
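The kind of inconsistency reported on this slide, where individual cluster percentages differ but their aggregate matches, can be checked with a simple comparison of cluster shares. The sketch below is an assumption about how such a check might look; the cluster names and label lists are hypothetical and are not Mentat or EVS results.

```python
from collections import Counter

def cluster_shares(labels):
    """Fraction of individuals assigned to each cluster."""
    counts = Counter(labels)
    total = len(labels)
    return {cluster: n / total for cluster, n in counts.items()}

def share_gaps(empirical, simulated):
    """Per-cluster difference: simulated share minus empirical share."""
    clusters = sorted(set(empirical) | set(simulated))
    return {c: simulated.get(c, 0.0) - empirical.get(c, 0.0) for c in clusters}

def merge(shares, groups):
    """Sum the shares of clusters that map to the same group name."""
    merged = {}
    for cluster, share in shares.items():
        name = groups.get(cluster, cluster)
        merged[name] = merged.get(name, 0.0) + share
    return merged

# Hypothetical cluster labels for ten survey respondents and ten simulated agents.
empirical = cluster_shares(["A"] * 4 + ["B"] * 2 + ["C"] * 4)
simulated = cluster_shares(["A"] * 4 + ["B"] * 4 + ["C"] * 2)

print(share_gaps(empirical, simulated))  # B and C differ individually

groups = {"B": "B+C", "C": "B+C"}        # hypothetical aggregation of two clusters
print(share_gaps(merge(empirical, groups), merge(simulated, groups)))
```

In this toy case the gaps for B and C cancel once the two clusters are merged, which mirrors the slide's observation that a match at the aggregate level can hide a mismatch between individual clusters.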

Conclusions. DM-assisted ABM methodology. Suitable for DDABM: complexity; large amounts of data. Limitations: KISS; qualitative sources. Uses: build new ABM; re-thinking existing DDABM; revealing hidden facts; detect inconsistencies.

Thanks for your attention! Samer Hassan, Universidad Complutense de Madrid

License. This presentation is licensed under a Creative Commons Attribution license. You are free to copy, modify and distribute it as long as the original work and author are cited.