Dimensions Characterizing Programming Feature Usage by Information Workers Christopher Scaffidi, Andrew Ko, Brad Myers, Mary Shaw Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
Notes from WEUSE IV Discussions. Notes Carolyn Seaman, UMBC. Bioinformatics, big spectrum, part of it is EUP. Interested in EUSE. Malea Umarji, UMBC,
Advertisements

Database Ed Milne. Theme An introduction to databases Using the Base component of LibreOffice LibreOffice.
Chapter 1 Business Driven Technology
RoMEO, JULIET and OpenDOAR: A Tale with a Happy Ending!
Microsoft Access  Access is a database software program.  Databases are basically large lists of information of a certain sort for example  Telephone.
Office of Technical Services Tracking & Reviewing Research Space Usage Tony Woods Deputy Director, OTS.
PRACTICAL PHP AND MYSQL WALKTHROUGH USING SAMPLE CODES – MAX NG.
Challenges, Motivations, and Success Factors in the Creation of Hurricane Katrina "Person Locator" Web Sites Christopher Scaffidi, Brad Myers, Mary Shaw.
Writing a Research Paper
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
Who Are the “End Users”? Mary Shaw Carnegie Mellon University.
Carving up the Space of End User Programming EUSES, Lincoln, NE, Oct ‘05.
Estimating the Numbers of End Users and End User Programmers Christopher Scaffidi Brad Myers Mary Shaw Carnegie Mellon University EUSES Consortium VL/HCC.
Introduction to the EUSES Web Macro Scenario Corpus Allen Cypher, Sebastian Elbaum, Andhy Koesnandar, Brad Myers, Christopher Scaffidi.
Graphic Designers Who Program as Informal CS Learners Brian Dorn and Mark Guzdial College of Computing, Georgia Tech.
Scenario-Based Requirements for Web Macro Tools Christopher Scaffidi, Allen Cypher, Sebastian Elbaum, Andhy Koesnandar, Brad Myers.
Tool Support for Data Validation by End-User Programmers Christopher Scaffidi Brad Myers, Mary Shaw Carnegie Mellon University.
Usable Privacy and Security Carnegie Mellon University Spring 2008 Lorrie Cranor 1 Designing user studies February.
ForecastingOMS 335 Welcome to Forecasting Summer Semester 2002 Introduction.
Marketing Research and Information Systems
Schedule Viewer A Scheduling Tool for UBC Okanagan Administration Jacob Orr Dr. Ramon Lawrence Bachelor of Science Honours Project.
The Premier Software Usage Analysis and Reporting Toolset Maximizing Value for Software Users.
ICT TEACHERS` COMPETENCIES FOR THE KNOWLEDGE SOCIETY
Chapter 5 Application Software.
Computer Programming My Home Page My Paper Job Description Computer programmers write, test, and maintain the detailed instructions, called programs,
WEBINAR SERIES: ACCESSIBLE INTERACTIVE DOCUMENTS Week 3: Accessible Web Forms Norman Coombs
Chapter 3 Application Software.
A Multi-institutional Study of the Relationship Between High School Mathematics Curricula and College Mathematics Achievement and Course- Taking Patterns.
Copyright © 2005, SAS Institute Inc. All rights reserved. SAS ABM Survey Kit A Sample Usage Scenario: Collecting Group Assignment Values.
MICS Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Interpreting Field Check Tables.
© Paradigm Publishing, Inc. 5-1 Chapter 5 Application Software Chapter 5 Application Software.
Computers & Employment By Andrew Attard and Stephen Calleja.
Word Processors, Databases, Spreadsheets, and Data Problems.
© Thomson/South-WesternSlideCHAPTER 141 CAREER INFORMATION The World of Work Exploring Occupations Chapter 14.
MySQL + PHP.  Introduction Before you actually start building your database scripts, you must have a database to place information into and read it from.
WJEC Applied ICT Spreadsheet Skills 1.Introduction to Financial Modelling Definition A model is a program which has been developed to copy the way.
Managed by UT-Battelle for the Department of Energy 1 Integrated Catalogue (ICAT) Auto Update System Presented by Jessica Feng Research Alliance in Math.
Chapter Thirteen Validation & Editing Coding Machine Cleaning of Data Tabulation & Statistical Analysis Data Entry Overview of the Data Analysis.
Chapter foundations of Chapter M A R K E T I N G Obtaining Data for Marketing Decisions 7.
Oliver, 2010 July.  Define target audience  Decide on stastistics  Make Data Requests  Chase data requests  Collate Data  Check the data  Run analysis.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
Discovering Computers 2009 Chapter 13 Programming Languages and Program Development.
11 3 / 12 CHAPTER Databases MIS105 Lec15 Irfan Ahmed Ilyas.
THOMAS RANDOLPH KYLE SMITH STUART FELDT NICK PARKER What: Restaurant Management System. Why: Improve customer experience. Makes us better: Ours is personal.
© Paradigm Publishing Inc. 5-1 Chapter 5 Application Software.
Understanding Use: Predicting Action on a message Laura A. Dabbish Jianwei Wang CSCI6800 Spring 2005.
CIT 383: Administrative ScriptingSlide #1 CIT 383: Administrative Scripting Introduction.
Programming Domains 1.Scientific Applications Typically, scientific applications have simple data structures but require large numbers of floating-point.
Online Surveys Jacqui James Malcolm Roberts School of Education.
Technical Writing: An Editor’s Perspective Michael K. Lindell Hazard Reduction & Recovery Center Texas A&M University.
A code-centric cluster-based approach for searching online support forums for programmers Christopher Scaffidi, Christopher Chambers, Sheela Surisetty.
Research Methodology II Term review. Theoretical framework  What is meant by a theory? It is a set of interrelated constructs, definitions and propositions.
WHY IS THIS HAPPENING IN THE PROGRAM? Session 5 Options for Further Investigation & Information Flow.
Communities of Practice & L ESSONS L EARNED Budget, Finance, and Award Management Large Facilities Office May 2016 Large Facilities Workshop 2016 S. Dillon.
Get “Hooked” On A Wiki Becky Klein Matt Smith Dave Sierkowski.
Component D: Activity D.3: Surveys Department EU Twinning Project.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
Using Templates and Mail Merge
Section 10.1 Define scripting
Objectives Overview Identify the four categories of application software Describe characteristics of a user interface Identify the key features of widely.
Writing for Academic Journals
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
A Data Model to Help End Users Shape Effective Software
Data Mining: Concepts and Techniques Course Outline
Andrew J. Ko & Brad A. Myers Carnegie Mellon University
Chapter 12: Automated data collection methods
VALIDATION BEST PRACTICES
COMP 208/214/215/216 – Lecture 7 Documenting Design.
Client-Server Model: Requesting a Web Page
Presentation transcript:

Dimensions Characterizing Programming Feature Usage by Information Workers Christopher Scaffidi, Andrew Ko, Brad Myers, Mary Shaw Carnegie Mellon University VL/HCC 2006

2 Motivation Understanding how people use features is essential to improving those features. Researchers have studied various populations’ use of programming features: –Web developers –Users in the home –Children We have surveyed another important population, information workers, who are… –Typically in an office environment –One of the largest end user groups

3 Talk outline Sample Respondent characteristics Factor analysis Interpretation Primary result: We found 3 distinct clusters of programming features based on feature usage.

4 Sample We surveyed over 800 readers of Information Week. Information Week ed 125,000 readers who opted in 831 completed the survey (816 retained for analysis) We asked if respondents or their subordinates had in the past 3 months used 23 programming features in 5 tools: –Web server scripts –Web pages –Databases –Spreadsheets –Slide editors / word processing

5 Respondent characteristics Respondents were predominantly end users. Most were managers: Information Week advertises to “information managers”. 76% of respondents had at least one subordinate. Most were in non-programming industries or jobs: Only 10% came from IT vendors; others were education, business services, government, manufacturing, finance Only 23% were IT or networking staff Most majored in business, science or engineering: Only 22% majored in comp sci or computer-centric fields

6 Respondent characteristics They were end user programmers. Most were familiar with programming constructs: 79% were familiar with all four programming terms (variables, subroutines, conditionals, and loops) 35% actually created all four constructs in the past year Most reported usage of each programming tool: Web server scripts: 52% Web pages: 69% Databases: 79% Spreadsheets: 93% Slide editors / word processing: 96%

7 Factor analysis We prepared the data for factor analysis (FA). We performed several main cleaning steps: –Filtering out people with missing feature variables: 168 left (we checked generalizability of factors, discussed later) –Scaling each feature variable to mean 0.0, std 1.0 (as not every case of feature usage is equally “significant”) –Subtracting a per-user, per-tool mean from each feature variable with that tool (to remove tool “bundling” effects) –Removing 4 features with low communality (< 0.1) We now had 168 people with 19 features showing interrelated usage tendency.

8 Factor analysis Our 3 factors (“clusters”) were Macros, Linked Structure, and Imperative A tendency to use each feature typically co-occurred with a tendency to use other features in the same factor. Macros Factor: –Recording & editing of spreadsheet & document macros Linked Structure Factor: –Creating & linking database tables –Creating web forms; creating page static includes –Creating spreadsheets & linking cells with functions Imperative Factor: –Using “new MyObj()” in Perl, PHP, and JavaScript –Creating JavaScript functions –Creating database triggers

9 Factor analysis We checked our factors several ways. We repeated factor analysis… –with different extraction algorithms –with only data from respondents with 0 subordinates … and consistently found the same qualitative structure. We checked generalizability over all 816 respondents 1.Constructed a scale from each factor 2.Checked Cronbach’s alpha for each (0.82, 0.62, 0.64)

10 Implications What does the existence of factors (feature groups) tell us? If users vary widely, can we create broadly useful features? Fortunately, 3 dimensions seem to characterize usage. What programming features are most widely used? Features in Linked Structure factor; see paper for details. Why do these feature groups exist? Possible hypotheses: Common work patterns require using features in concert. Learning a feature makes it easy to learn certain others. Mastering a feature disinclines people to use others.

11 Thank You To VL/HCC for this opportunity to present. To Rusty Weston and Lisa Smith at Information Week for their partnership. To James Herbsleb, Irina Shklovski, and Sara Kiesler for design and analysis assistance. To our paper reviewers for helpful suggestions. To NSF, Sloan, and NASA for funding.