NCeSS e-Stat quantitative node Prof. William Browne & Prof. Jon Rasbash University of Bristol.

Slides:



Advertisements
Similar presentations
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Advertisements

Introduction to Monte Carlo Markov chain (MCMC) methods
Other MCMC features in MLwiN and the MLwiN->WinBUGS interface
Non response and missing data in longitudinal surveys.
Contrastive Divergence Learning
On-line Learning Environment for Multilevel Modelling Fiona Steele and Sacha Brostoff Centre for Multilevel Modelling University of Bristol.
For the e-Stat meeting of 6-7 April 2011 Paul Lambert / DAMES Node inputs 1)Updates on DAMES 2)Bringing DAMES inputs to e-Stat 3)Misc. feedback - Stat-JR.
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
METHODS FOR HAPLOTYPE RECONSTRUCTION
Bayesian Estimation in MARK
Inspiral Parameter Estimation via Markov Chain Monte Carlo (MCMC) Methods Nelson Christensen Carleton College LIGO-G Z.
By Addison Euhus, Guidance by Edward Phillips An Introduction To Uncertainty Quantification.
Gibbs Sampling Qianji Zheng Oct. 5th, 2010.
Markov-Chain Monte Carlo
The use of electronic books (eBooks) in social science research Richard Parker* Danius Michaelides† Huanji Yang† Alex Frazer† Luc Moreau† Camille Szmaragd*
Reporter: Hsu Hsiang-Jung Modelling stochastic fish stock dynamics using Markov Chain Monte Carlo.
Edge Detection Enhancement Using Gibbs Sampler
GENESIS Web 2.0 Agent City Simulation: Establishing a user community and enabling collaborators to manipulate simulations and develop models Andy Turner.
Two main requirements: 1. Implementation Inspection policies (scheduling algorithms) that will extand the current AutoSched software : Taking to account.
A pilot application 12/9/2008Microsoft eScience Workshop 2008 Robert Bukowski and Jarek Pillardy Computational Biology Service Unit Cornell University.
Computational statistics, course introduction Course contents  Monte Carlo Methods  Random number generation  Simulation methodology  Bootstrap  Markov.
Architecture and Real Time Systems Lab University of Massachusetts, Amherst An Application Driven Reliability Measures and Evaluation Tool for Fault Tolerant.
Robert M. Saltzman © DS 851: 4 Main Components 1.Applications The more you see, the better 2.Probability & Statistics Computer does most of the work.
Robin McDougall, Ed Waller and Scott Nokleby Faculties of Engineering & Applied Science and Energy Systems & Nuclear Science 1.
A simulation study of the effect of sample size and level of interpenetration on inference from cross-classified multilevel logistic regression models.
Supercomputing, Visualization & eScience1 e-Social Science Grid technologies for Social Science: the Seamless Access to Multiple Datasets (SAMD) project.
E-Social Science What is e-Science? E-Science and e-Social Science E-Social Science and Longitudinal Data Examples of the Computational Problems we Currently.
RUP Implementation and Testing
CRESCENDO Full virtuality in design and product development within the extended enterprise Naples, 28 Nov
Bayesian parameter estimation in cosmology with Population Monte Carlo By Darell Moodley (UKZN) Supervisor: Prof. K Moodley (UKZN) SKA Postgraduate conference,
Introduction to MCMC and BUGS. Computational problems More parameters -> even more parameter combinations Exact computation and grid approximation become.
SPSS Presented by Chabalala Chabalala Lebohang Kompi Balone Ndaba.
R2WinBUGS: Using R for Bayesian Analysis Matthew Russell Rongxia Li 2 November Northeastern Mensurationists Meeting.
NREL is a national laboratory of the U.S. Department of Energy Office of Energy Efficiency and Renewable Energy operated by the Alliance for Sustainable.
Examples of Computing Uses for Statisticians Data management : data entry, data extraction, data cleaning, data storage, data manipulation, data distribution.
1 Software Reliability Assurance for Real-time Systems Joel Henry, Ph.D. University of Montana NASA Software Assurance Symposium September 4, 2002.
Nick Draper 05/11/2008 Mantid Manipulation and Analysis Toolkit for ISIS data.
Some comments on using research data in the social sciences Paul Lambert, School of Applied Social Science, University of Stirling, 25 March 2013.
Delivering Integrated, Sustainable, Water Resources Solutions Monte Carlo Simulation Robert C. Patev North Atlantic Division – Regional Technical Specialist.
A Comparison of Two MCMC Algorithms for Hierarchical Mixture Models Russell Almond Florida State University College of Education Educational Psychology.
Multilevel Modeling Software Wayne Osgood Crime, Law & Justice Program Department of Sociology.
Computer Science Project Criteria. Computer Science Project The project is intended to simulate the analysis, design, progamming and documentation stages.
Automatization of air traffic control sector capacity indicators determination process (Автоматизація процесу визначення показників пропускної спроможності.
Monte Carlo Process Risk Analysis for Water Resources Planning and Management Institute for Water Resources 2008.
Project Management Cross lifecycle Activity
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Talk by William Browne Slides by Kelvyn Jones In memory of Jon Rasbash all University of Bristol Monday 5th July 2010, Session 2 WHAT IS: multilevel modelling?
UNCLASSIFIED Approved for Public Release 07-MDA-2965 (26 OCT 07) Load Bearing Walls: Early Sizing Estimation In The NOSTROMO Tool (U) Dan Strickland Dynetics.
 Programming - the process of creating computer programs.
C HU H AI C OLLEGE O F H IGHER E DUCATION D EPARTMENT O F C OMPUTER S CIENCE Preparation of Final Year Project Report Bachelor of Science in Computer Science.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
MAT 4830 Mathematical Modeling 04 Monte Carlo Integrations
1 Getting started with WinBUGS Mei LU Graduate Research Assistant Dept. of Epidemiology, MD Anderson Cancer Center Some material was taken from James and.
HW7: Evolutionarily conserved segments ENCODE region 009 (beta-globin locus) Multiple alignment of human, dog, and mouse 2 states: neutral (fast-evolving),
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
10 October, 2007 University of Glasgow 1 EM Algorithm with Markov Chain Monte Carlo Method for Bayesian Image Analysis Kazuyuki Tanaka Graduate School.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Markov Chain Monte Carlo in R
Overview Modern chip designs have multiple IP components with different process, voltage, temperature sensitivities Optimizing mix to different customer.
MCMC Output & Metropolis-Hastings Algorithm Part I
System Design.
Centre for Multilevel Modelling, University of Bristol
the goal of Bayesian divergence time estimation
Ch13 Empirical Methods.
An Introduction to StatJR
Slides for Sampling from Posterior of Shape
Wellington Cabrera Advisor: Carlos Ordonez
Scientific Workflows Lecture 15
Presentation transcript:

NCeSS e-Stat quantitative node Prof. William Browne & Prof. Jon Rasbash University of Bristol

e-Stat quantitative node Linking the ESRC Research Methods and e-Social Science programmes. Statistical methodology and software development researchers in Bristol. Computer Science (E-Science) researchers at Southampton. Other social science researchers at Institute of Education, Manchester and Stirling.

e-Stat quantitative node Node started in September 2009 and is funded for 3 years. We aim to produce software tools to cater for three types of user: novice practitioners, advanced practitioners and statistical algorithm developers Capacity building and methodology development require iterative interaction between all three groups We are currently in the planning stage merging the skills of the various contributors. We are however building on many strengths as detailed in the following slides.

MLwiN software package Statistics package for multilevel modelling. Development supported by ESRC for many years. Over 7,000 copies sold worldwide (free to UK academics) Cited in 1,500+ ISI journal articles

MCMC in MLwiN Computationally – intensive Markov Chain Monte Carlo Simulation based methods Statistical model estimates obtained by generated lots of random number draws from the posterior distribution. Can benefit from parallelisation in many ways. MCMC algorithm converges on a distribution. Parameter estimates and intervals are then calculated from the simulation chains.

Sample Size calculations via simulation Original work funded in a recently finished ESRC grant. Is a computationally intensive method as involves running same model on many datasets and assessing how often a confidence interval contains 0 Ideal candidate for parallelisation.

My experiment project (Universities of Southampton and Manchester) Social Networking tool for sharing scientific artifacts

Component wise approach Also interested in interoperability between statistical software packages so components may be existing packages

Prototype for Algebraic Processing Component

Software Component for code generation Algebraic processing component outputs statistical distributions (posteriors) in MATHML MATHML files taken as input into Python. Python code creates C++ code for running the model for 1 iteration. C++ code compiled to a Python function. Function can be called within Python. C++ code can be viewed and edited by expert users.

Specific areas of application in the social sciences Measuring segregation – proposal for complex modelling ESDS feasibility study project: Changing circumstances during childhood Social Networks in Multilevel structures Handling missing data via multiple imputation Sample size calculations

Integration into workbooks Model Specification and Estimation software components – freestanding Python application(s) operated by own GUI or script. These components along with tables, graphs, equations and diagrams can be incorporated in SAGE executable books. myExperiment will then act as a searchable repository for locating and sharing executable books