What are Statistical E-Books? Professor William Browne Centre for Multilevel Modelling, University of Bristol.



Acknowledgements Jon Rasbash for his vision on which Stat-JR was built. All the team behind programming the Stat-JR software – Chris, Danius, Luc, ZZ, Richard, Camille, Bruce, Huanjia and Alex and collaborators on the e-Stat project. Harvey Goldstein, Fiona Steele, George Leckie, Chris Charlton, Kelvyn Jones and all my other CMM colleagues past and present. ESRC for much funding over the years and the British Academy for funding the current Teaching eBooks grant.

What will we cover? eBooks and Statistical eBooks STAT-JR eBooks Workflows Example eBooks and workflows Other software – iPython/Jupyter and R-Shiny

eBooks An electronic book is a book publication in digital form. In the US more books are published online than distributed in hard copy in book shops.

Advantages of the eBook format For younger members of the audience it will seem strange that, a few decades ago, academics had to go to the library to access journal articles, often photocopying them or even filling in inter-library loan forms! Nowadays one can (subject to subscription) have thousands of articles and books available at one’s fingertips, often as PDF files. Accessing documents within a web browser allows for enhanced documents with additional features, e.g. sound files and videos embedded within the document. The medium is no longer simply paper!

Statistical (and Mathematical) eBooks The question is: can we incorporate statistical content into an eBook? Of course, when it comes to creating a PDF file, a statistical textbook is no different from any other paper document (aside from maybe having more equations!). The difference is in what ‘enhancements’ we can add, and so the idea here is to combine the textbook with the statistics package, i.e. interactive examples, allowing the user to include their own dataset, etc.

Mathematica and SAGEMath Early adopters of eBook technology in their work – much work on mathematical examples but some coverage of statistics. Idea of an interactive log book. Use of sliders as shown below. Influenced our first grant application for eBook work.

Stat-JR A statistical package developed by the team at the Centre for Multilevel Modelling with colleagues at Southampton. Contains its own (MCMC-based) estimation engine. The system is based on the idea of a suite of templates, where each template performs a specific operation. It also allows interoperability with other software packages, so, for example, one might have a regression template that fits regressions using various software packages. The initial TREE interface runs in a web browser. There are also newer eBook and workflow interfaces. Several ESRC grants have enabled Stat-JR to be written.

An example of STAT-JR – setting up a model

Example of STAT-JR – setting up a model

Example TREE interface screen shot 2 All objects created available from one pull down and can be popped out to separate tabs in browser.

Output from the E-STAT engine Estimates and the DIC diagnostic can be viewed for the model fitted.
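
For readers unfamiliar with the DIC, the following is a minimal sketch of how it is conventionally computed from MCMC output. It assumes the standard definition (DIC = mean deviance + pD, where pD, the effective number of parameters, is the mean deviance minus the deviance evaluated at the posterior mean of the parameters). The function name and the toy numbers are illustrative assumptions; this is not Stat-JR's own code.

```python
from statistics import mean


def dic(deviance_samples, deviance_at_posterior_mean):
    """Return (DIC, pD) from per-iteration deviances of an MCMC run.

    deviance_samples: deviance evaluated at each stored iteration.
    deviance_at_posterior_mean: deviance evaluated at the posterior
        mean of the parameters.
    """
    d_bar = mean(deviance_samples)            # posterior mean deviance
    p_d = d_bar - deviance_at_posterior_mean  # effective number of parameters
    return d_bar + p_d, p_d


# Toy values, invented purely for illustration.
dic_value, p_d = dic([102.0, 98.0, 100.0], 97.0)
```

When comparing models fitted to the same data, the model with the smaller DIC is usually preferred.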

Using the eBook interface in Stat-JR

Different forms of STAT-JR and E-books TREE (Template Reading and Execution Environment) - the format we have demonstrated up to now. Allows user to investigate 1 template and 1 dataset. A dataset can be output from 1 template and then used by the next. DEEP (Documents with Embedded Execution and Provenance) – mixing up templates with textboxes to make executable books. LEAF (Logging and Execution of Analysis Flows) – another workflow based interface to allow the joining together of sets of templates (see later).
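
The idea that a dataset output by one template can be used by the next can be sketched in a few lines of Python. This is only an illustration of the chaining principle: the two "templates" below, their names and the dictionary-of-columns dataset format are hypothetical and do not reflect Stat-JR's actual template API.

```python
def centre_template(dataset, inputs):
    """Hypothetical template: add a mean-centred copy of one column."""
    col = inputs["column"]
    values = dataset[col]
    col_mean = sum(values) / len(values)
    out = dict(dataset)  # leave the incoming dataset untouched
    out[col + "_centred"] = [v - col_mean for v in values]
    return out


def range_template(dataset, inputs):
    """Hypothetical template: output a small dataset of summary values."""
    values = dataset[inputs["column"]]
    return {"statistic": ["min", "max"], "value": [min(values), max(values)]}


def run_chain(dataset, steps):
    """Run templates in order, feeding each output dataset to the next."""
    for template, inputs in steps:
        dataset = template(dataset, inputs)
    return dataset


data = {"age": [10.0, 12.0, 14.0]}
result = run_chain(
    data,
    [
        (centre_template, {"column": "age"}),
        (range_template, {"column": "age_centred"}),
    ],
)
```

Here the second step only works because the first step has added the `age_centred` column, which is the dependency a workflow interface has to manage.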

[Diagram: how Stat-JR works] Inputs: Template and Dataset. Steps: Stat-JR prompts user for input; Stat-JR writes commands, etc., to perform requested function; function performed ((if applicable) external software opened, run, then closed, with results returned to Stat-JR); results of function produced ((if applicable) results outputted as dataset to be fed back in…). Generated material: Equations; Macros; Scripts, e.g. myModel<- glm(normexam~ / Summary(myModel) / plot(myModel,1); Point & click instructions, e.g. Select Open Worksheet / Select datafile.dta / Select Equations from Fi. Results: Model, DIC, Parameters (Beta1), Charts, Results tables.

…so we first import one… No eBooks loaded yet…

Navigate through pages of eBook Hierarchical table of contents (can be expanded / collapsed at each node)

Behind the scenes… The eBook author (Richard) has specified which Stat-JR template to associate with this region of the eBook… …and has chosen one which creates plots via R (“PlotsViaR”). Templates require input, from a user, before they can go ahead & perform the function appropriately… …the eBook author can pre-specify inputs (by writing them into the eBook code); any that are not pre-specified are then left to the eBook reader to complete.
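
The split between inputs fixed by the eBook author and inputs left to the reader can be sketched as follows. This is a hedged illustration only: the function `resolve_inputs`, the input names, and the variable names other than `normexam` are assumptions made for the example, not the eBook code format used by Stat-JR.

```python
def resolve_inputs(required, prespecified, reader_answers):
    """Combine author and reader inputs; list those still unanswered."""
    resolved = {}
    missing = []
    for name in required:
        if name in prespecified:
            # Fixed by the eBook author; the reader is never asked.
            resolved[name] = prespecified[name]
        elif name in reader_answers:
            resolved[name] = reader_answers[name]
        else:
            # Left for the eBook reader to complete.
            missing.append(name)
    return resolved, missing


required = ["response", "explanatory", "distribution"]
author_inputs = {"response": "normexam", "distribution": "Normal"}

# Before the reader answers anything, one input is still outstanding.
_, still_needed = resolve_inputs(required, author_inputs, {})

# Once the reader supplies it, the template could go ahead.
resolved, missing = resolve_inputs(
    required, author_inputs, {"explanatory": ["cons", "standlrt"]}
)
```

The template would only be executed once the list of missing inputs is empty.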

Stat-JR: to re-cap… [Diagram repeated: Template and Dataset in; Stat-JR prompts user for input; Stat-JR writes commands, etc., to perform requested function; function performed, with any external software opened, run, then closed and its results returned to Stat-JR; results of function produced (equations, macros, scripts, point & click instructions, model, DIC, parameters, charts, results tables) and, if applicable, outputted as a dataset to be fed back in.]

Behind the scenes… …the eBook author has associated relevant model-fitting Stat-JR templates with this region of the eBook… …and has pre-specified all of the inputs, bar the explanatory variables, which are therefore the only ones left to the eBook reader to specify. The author has also specified what / where / when the output resulting from a template’s execution will be presented in the eBook…

Stat-JR: to re-cap… [Diagram repeated: Template and Dataset in; Stat-JR prompts user for input; Stat-JR writes commands, etc., to perform requested function; function performed, with any external software opened, run, then closed and its results returned to Stat-JR; results of function produced (equations, macros, scripts, point & click instructions, model, DIC, parameters, charts, results tables) and, if applicable, outputted as a dataset to be fed back in.]

Content of text returned is conditional on value of results
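
As a rough illustration of text whose content depends on the results, the sketch below chooses a sentence according to whether an approximate 95% interval for an estimate excludes zero. The function name, the use of estimate ± 1.96 standard errors, and the wording are all assumptions for the example; the rules an eBook author actually writes may be quite different.

```python
def describe_effect(name, estimate, std_error):
    """Return a sentence whose wording depends on the fitted result."""
    # Approximate 95% interval, assumed here purely for illustration.
    lower = estimate - 1.96 * std_error
    upper = estimate + 1.96 * std_error
    if lower > 0:
        verdict = "a positive association"
    elif upper < 0:
        verdict = "a negative association"
    else:
        verdict = "no clear association (the interval includes zero)"
    return f"The estimate for {name} is {estimate:.2f}, suggesting {verdict}."


positive_text = describe_effect("x", 0.5, 0.1)
unclear_text = describe_effect("x", 0.1, 0.2)
```

The same model-fitting region of an eBook can therefore show different commentary to readers who choose different explanatory variables.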

Stat-JR’s DEEP system: Summary of features Built on Stat-JR’s powerful & flexible data- analytical engine. Embeds inputs and outputs of Stat-JR’s executable statistical functions within contextual information. Tailoring & specificity: e.g. associating carefully- chosen templates; pre-specifying inputs. Log / recording tool: behind-the-scenes, a comprehensive record is kept of each execution.

Stat-JR’s DEEP system: Benefits for the researcher? Teaching of quantitative research methods (including inter-operating software). Communicating principles / theories / inviting exploration of quantitative research topics. Reports: transparency (e.g. access to embedded dataset / analytical methods, etc); facilitates multi-authored preparation. Tailored analytical techniques: pre-specifications allow user to ‘cut to the chase’ and/or circumvent software-specific learning curve.

Current ESRC grant The grant funds Richard Parker in Bristol and Danius Michaelides in Southampton for 3 years. Contains 5 work packages: 1. Capturing discipline-specific research in eBooks 2. Capturing methodological decisions in eBooks 3. The statistical analysis assistant (SAA) 4. Reproducible Research and the enhanced journal article 5. The use of eBooks in research training and an online eBook repository

Work Package 3 – The SAA We will adapt our eBook system to allow workflows to be constructed that describe how the steps in a statistical analysis fit together. There may be many SAAs adapted to different researchers’ approaches; for example, one might want to answer a research question or analyse a dataset as a specific expert might do it. Opinion is divided on how far one can take the idea: from nowhere to complete automation, i.e. pour in the dataset at the top and let the computer sort it out. The probable end point will be somewhere in between, or in fact a series of SAAs that lie on this continuum.

A statistical analysis assistant we are all happy with!

One Step further

‘The Warlock of Firetop Mountain’ approach The first of a genre of interactive books, published in 1982 and lapped up by 10-year-old boys like myself! A combination of book and flowchart. Worked something like: ‘The goblin advances towards you, shouting words that you can’t understand, do you try to make conversation (turn to page 231), run past the goblin (turn to page 176) or draw your sword and fight (turn to page 134)’. Underpinning the book was effectively a flowchart, disguised by random page movements, with a variety of endings (99% of them involved you dying), possible loops etc.

The use of Flowcharts in Statistics The equivalent exists in (at least) basic statistical analysis, and a variety of books have flowcharts to guide the uninitiated to the appropriate test. The branching rules are usually things like: how many variables do you have? What type are they? Is a normality assumption appropriate? The example flowcharts usually then say you need a t test / Mann-Whitney test / ANOVA etc. One could expand this idea to include branches where we haven’t written material, i.e. the equivalent of ending up dead would be the default ‘go and ask a statistician’ end point, possibly taking your answers to the flowchart with you.
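
A toy version of such a flowchart, with the default ‘go and ask a statistician’ end point, might look like the sketch below. The particular questions (number of groups, outcome type, normality) and the branches are simplified assumptions chosen for illustration; they are not a recommended decision procedure, and the function name is hypothetical.

```python
def suggest_test(n_groups, outcome_type, normality_ok):
    """Walk a tiny test-selection flowchart.

    Returns the suggested test together with the answers given, so that
    they can be taken along to a statistician if no branch applies.
    """
    answers = {
        "n_groups": n_groups,
        "outcome_type": outcome_type,
        "normality_ok": normality_ok,
    }
    if outcome_type == "continuous" and n_groups == 2:
        test = "t test" if normality_ok else "Mann-Whitney test"
    elif outcome_type == "continuous" and n_groups > 2 and normality_ok:
        test = "ANOVA"
    else:
        # Branches nobody has written yet: the 'ending up dead' outcome.
        test = "go and ask a statistician"
    return test, answers


test_a, _ = suggest_test(2, "continuous", True)
test_b, _ = suggest_test(2, "continuous", False)
test_c, answers_c = suggest_test(3, "binary", True)
```

Returning the answers alongside the recommendation mirrors the idea of taking your path through the flowchart to the consultation.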

Where might this go? The flowchart idea is appealing as it may to some degree mimic a statistical consultation. If the system is flexible enough then each statistician can tune the SAA to their own approach to analysis and to how much they feel can be comfortably automated. Where there is uncertainty, or there are options in what one should do, this could be incorporated. E-books can contain hyperlinks so that further background on proposed statistical methods or examples can be easily found.

Stat-JR’s LEAF workflow system Based around a new front end written using the Blockly system. Allows users to link up templates themselves in a user-friendly visual way. Workflows can be included in eBooks. We will give a few examples of simple workflows.

Blockly

Histogram workflow (hist.xml) Here is a log style workflow.

Histogram workflow (hist.xml) Removing one of the inputs would then make this a question that the user needs to input.

Histogram Output

More complex example

Output

Asking for inputs Here red blocks are variables. Here the blocks have been collapsed.

Conditional operations To the left is part of a workflow showing some conditional operations. Depending on the distribution, the prediction we wish to plot will be converted to the probability scale for Binomial responses, to the counts scale for Poisson responses, or left as is for Normal responses.
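
The conditional step can be written out as a short Python sketch. It assumes the usual default links, i.e. a logit link for Binomial responses and a log link for Poisson responses; the slide does not state which links the workflow uses, and the function name is hypothetical.

```python
import math


def to_response_scale(linear_prediction, distribution):
    """Convert a prediction on the linear scale to the response scale."""
    if distribution == "Binomial":
        # Assumed logit link: the inverse logit gives a probability.
        return 1.0 / (1.0 + math.exp(-linear_prediction))
    if distribution == "Poisson":
        # Assumed log link: exponentiating gives an expected count.
        return math.exp(linear_prediction)
    if distribution == "Normal":
        # Identity link: the prediction is left as is.
        return linear_prediction
    raise ValueError(f"Unsupported distribution: {distribution}")


probability = to_response_scale(0.0, "Binomial")
count = to_response_scale(0.0, "Poisson")
unchanged = to_response_scale(1.5, "Normal")
```

A linear prediction of 0 therefore corresponds to a probability of 0.5 under the logit link and an expected count of 1 under the log link.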

Output Here you can see prediction lines for a model on the probability scale. There is 1 line for each district and the probability (of using contraceptive) changes with age.

Other Systems - Shiny by RStudio

Communicating Uncertainty work Project webpage by colleagues using R-Shiny – see

Communicating Uncertainty work Note the use of the sliders embedded in the webpage

Other systems 2 – iPython Jupyter notebooks

New release of Stat-JR The latest release of the Stat-JR software with a first beta release of the LEAF workflow interface is available at ownloads/ Any questions?