11/16/2018 SimpleR: Taking on the “Evil Empire” to Build Simple R Applications for Non-Statistical Users Nicholas Lewin-Koh Bert Gunter Genentech Nonclinical.

Slides:



Advertisements
Similar presentations
Pennsylvania BANNER Users Group 2006 Integrate Your Decision Support with Cognos 8.
Advertisements

Financial Data Calculator© Produced by: Mathematical Investment Decisions, Inc. 95 West Gate Drive – 2 nd Floor Cherry Hill, NJ Web site:
Fundamentals of Information Systems, Second Edition
SDLC. Information Systems Development Terms SDLC - the development method used by most organizations today for large, complex systems Systems Analysts.
JumpStart the Regulatory Review: Applying the Right Tools at the Right Time to the Right Audience Lilliam Rosario, Ph.D. Director Office of Computational.
1 Presenters: Lucretia Parham Janice Zeigler Armstrong Atlantic State University May 14 10:15 a.m. - 11:15 a.m. Presenters: Lucretia Parham Janice Zeigler.
Software Development Stephenson College. Classic Life Cycle.
Objectives Understand what MATLAB is and why it is widely used in engineering and science Start the MATLAB program and solve simple problems in the command.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Fundamentals of Information Systems, Second Edition 1 Systems Development.
Introduction to the Statistical Analysis of the Clinical Trials
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
PROC-1 1. Software Development Process. PROC-2 A Process Software Development Process User’s Requirements Software System Unified Process: Component Based.
Chapter 6 CASE Tools Software Engineering Chapter 6-- CASE TOOLS
Engineering Self-adaptive Service Mashups Mahdi Bashari LS3 Colloquium May 7 th 2014.
Smart Home Technologies
Photoshop CS6 – Nelson Unit 3: Photoshop CS6. Objectives Define photo editing software Start Photoshop and view the workspace Use the Zoom tool and the.
1 CASE Computer Aided Software Engineering. 2 What is CASE ? A good workshop for any craftsperson has three primary characteristics 1.A collection of.
Introduction to Software Engineering
A quick guide to other statistical software
Introduction to Visual Basic. NET,. NET Framework and Visual Studio
Chapter 25 Process Improvement.
Introduction.
CST 1101 Problem Solving Using Computers
Chapter 2: The Visual Studio .NET Development Environment
Getting Started with Adobe Photoshop CS6
PC-DMIS Introduction to GD&T Selection
Intelligent Systems Development
Classifications of Software Requirements
Release Numbers MATLAB is updated regularly
IBM Rational Rhapsody Advanced Systems Training v7.5
Careers in Statistical Software
Overview Power Quality Dashboard Fred Elmendorf August 2014.
Midwest Biopharmaceutical Statistics Workshop
Chapter 2 Looking at Windows
Section 1 INTRODUCTION.
SECTION 1 INTRODUCTION.
And On To Design: Why in this particular sequence?
How to automatise the grid production - using model builder in ArcGIS
CHAPTER 2 Computer Software.
Software What Is Software?
Building Information Systems
Unit 9 – Spreadsheet Development
IBM Start Now Host Integration Solutions
Lecture 13 RPM and its advantages.
Presentation Graphics
5 SYSTEM SOFTWARE CHAPTER
HCI in the software process
Office 2010 and Windows 7: Essential Concepts and Skills
Information Systems Development
Introduction to MATLAB
VISUAL BASIC.
Essentials of Systems Analysis and Design Fourth Edition
Lesson Objectives Aims
Vitalnet quickly makes publication-ready output
Overview: Software and Software Engineering
Chapter 13: Systems Analysis and Design
Modern Systems Analysis and Design Third Edition
HCI in the software process
An Introduction to Software Architecture
BUSINESS COMMUNICATION SKILLS PRESENTATION SKILLS OF THESIS & PROJECT
CSE 403, Winter 2003 Software Engineering
European Computer Driving Licence
Modern Systems Analysis and Design Third Edition
Share What You Have Learned EP Session
Simulation And Modeling
Human Computer Interaction Lecture 14 HCI in Software Process
Information Systems Development (ISD) Systems Development Life Cycle
Modern Systems Analysis and Design Third Edition
Software Re-engineering and Reverse Engineering
Presentation transcript:

11/16/2018 SimpleR: Taking on the “Evil Empire” to Build Simple R Applications for Non-Statistical Users Nicholas Lewin-Koh Bert Gunter Genentech Nonclinical Statistics

Outline: Background and Context: The working environment and needs 11/16/2018 Outline: Background and Context: The working environment and needs Strategy: The Approach Example: Tumor Xenograft Study Analysis

Context: Pharmaceutical industry, but regulation is not an issue 11/16/2018 Context: Pharmaceutical industry, but regulation is not an issue We collaborate on many projects that investigate drug efficacy, toxicity, biomarkers, dose determination, manufacturing methods, assay methods, etc. Data may be complex, so analyses can be tricky. We need to provide consistent, clear, interpretable analyses to aid scientific assessment Complex statistical analyses are unsuitable

Example: Tumor Xenograft Studies 11/16/2018 Example: Tumor Xenograft Studies Implant special tumor cell lines in mice, then compare tumor growth under different treatment regimens. Measure Tumor Volume

Example: Tumor Xenograft Studies 11/16/2018 Example: Tumor Xenograft Studies Xenograft studies help determine which drugs to work on in which cancers, dosing in human studies, biomarkers that can identify subgroups who may or may not benefit, … Data are challenging, consists of repeated measures of tumor volume over time per animal. Nonlinear growth/stasis/shrinkage dropouts due to toxicity or animal care requirements left censoring when tumors shrink below LOD

Ad hoc analyses and plots using Excel are most widely used approaches 11/16/2018 Ad hoc analyses and plots using Excel are most widely used approaches DRUG 3 DRUG 1 DRUG 2 Poor analyses compromise scientific decision making and our ability to find and develop good drugs. Realities: Scientists/engineers usually have neither the background nor time to learn and use sophisticated statistical methods Wider audience of decision makers cannot consume fancy statistical results anyway Not nearly enough of us (statisticians) to handle all of this for them (scientists and engineers)

11/16/2018 Context for Solutions Rapid change – in technologies, needs, methods, computer hardware and software… Need safe and robust methods: reasonable answers quickly in a variety of real circumstances, alert or failure otherwise. Searching for statistical “optimality” is waste of time. Communicate all results via graphs and tables. Users will treat software as “black box” yielding answers. User interface, not software documentation is key Developers need to meet rapidly evolving user needs Rapid prototyping, development, ease of modification, and feature addition are important factors

R provides a way to meet these challenges 11/16/2018 R provides a way to meet these challenges Many built-in procedures and packages  rapid prototyping Graphics packages (lattice, ggplot, …) ,provide framework for informative, flexible graphical displays Changes the paradigm ! Close collaboration with customers during development: Review/ Test Try Modify

11/16/2018 Strategy Initially, Windows desktop application on only very few (1 or 2) desktops Simple menu interface automatically starts up when user clicks on R icon. e.g. Use startup options to read in .RData file with all functions and execute code that sets up menus, etc. We do it with .Rprofile file, but many alternatives are available Once customers are satisfied and code has stabilized, port to Web-based interface to ease maintenance for larger user base So far, we haven’t found the extra overhead for converting to packages worthwhile, but this may change. Remember, for users it’s a black box that provides solutions, not a tool.

11/16/2018 Demo

11/16/2018 Menu Interface

11/16/2018 Output: Model fit XXXXXXXX

Output: Views derived from the model. 11/16/2018 Output: Views derived from the model. X X

11/16/2018 Web Interface XXXXX

11/16/2018 Summary: Excel is ubiquitous data analysis software, so opportunities for major improvements abound. To replace it, we need: rapid development of flexible, robust solutions “intelligent” graphs and tables to communicate results Workable user interfaces that shield users from technical details A way to scale solutions, that does not require a large ongoing effort to support R and its supporting packages meet these needs.

Thanks: Translational Oncology Bruno Alicke Steven Gould 11/16/2018 Thanks: Translational Oncology Bruno Alicke Steven Gould Bioinformatics Dana Caulder Vivek Ramaswamy Kathryn Woods