Three Software Development Gestalts Kirby McMaster Brian Rague Steven Hadfield Nicole Anderson.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

Advancing Assessment Literacy Data Analysis I: Summarizing, Representing, and Sharing Data.
COMPSCI 105 S Principles of Computer Science 12 Abstract Data Type.
Item Analysis: A Crash Course Lou Ann Cooper, PhD Master Educator Fellowship Program January 10, 2008.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Bivariate Data & Scatter Plots Learn to take bivariate data to create a scatter plot for the purpose of deriving meaning from the data.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 14 The Second Component: The Database.
1 i247: Information Visualization and Presentation Marti Hearst Graphing and Basic Statistics.
Application of Graph Theory to OO Software Engineering Alexander Chatzigeorgiou, Nikolaos Tsantalis, George Stephanides Department of Applied Informatics.
Let's zoom in on one corner of the coordinate plane
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Sometimes, Tables can be confusing
Measures of Central Tendency
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Joint Distributions AND CORRELATION Coefficients (Part 3)
Two Frameworks for Discrete Mathematics Kirby McMaster Brian Rague Steven Hadfield.
©2013 Cengage Learning. All Rights Reserved. Business Management, 13e Data Analysis and Decision Making Mathematics and Management Basic.
Chapter 2 Describing Data with Numerical Measurements
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Research and Statistics AP Psychology. Questions: ► Why do scientists conduct research?  answer answer.
What is Gestalt? Wiki: Gestalt refers to a structure, configuration, or pattern of physical, biological, or psychological phenomena so integrated as to.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Database System Concepts and Architecture
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Two Approaches to Calculating Correlated Reserve Indications Across Multiple Lines of Business Gerald Kirschner Classic Solutions Casualty Loss Reserve.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 2, Slide 1 Chapter 2 Displaying and Describing Categorical Data.
?v=cqj5Qvxd5MO Linear and Quadratic Functions and Modeling.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Unit 4 Statistical Analysis Data Representations.
Attainment Peter Gorrie, QIO September 2014.
Random Graph Generator University of CS 8910 – Final Research Project Presentation Professor: Dr. Zhu Presented: December 8, 2010 By: Hanh Tran.
27/3/2008 1/16 A FRAMEWORK FOR REQUIREMENTS ENGINEERING PROCESS DEVELOPMENT (FRERE) Dr. Li Jiang School of Computer Science The.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Logical (Information Level) Design of Relational Databases.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Why do we analyze data?  It is important to analyze data because you need to determine the extent to which the hypothesized relationship does or does.
Why do we analyze data?  To determine the extent to which the hypothesized relationship does or does not exist.  You need to find both the central tendency.
Principal Component Analysis
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
SMART NOTE TAKER Presented By M.SIRISHA.  Smart note taker is a very useful product that could satisfy the needs of people in today's technological and.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Module: Software Engineering of Web Applications Dr. Samer Odeh Hanna 1.
Chapter 12 Understanding Research Results: Description and Correlation
Statistical analysis.
Warm Up Scatter Plot Activity.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Module: Software Engineering of Web Applications
Statistical analysis.
Understanding Research Results: Description and Correlation
Module: Software Engineering of Web Applications
Introduction Previous lessons have demonstrated that the normal distribution provides a useful model for many situations in business and industry, as.
Principal Component Analysis
Module: Software Engineering of Web Applications
Suitability Test Wednesday, 22 May 2019.
Presentation transcript:

Three Software Development Gestalts Kirby McMaster Brian Rague Steven Hadfield Nicole Anderson

Three Software Development Gestalts Schemas, Paradigms, and Gestalts We use the term gestalt to refer to a mental framework that allows students to organize course topics into a unified whole (greater than the sum of the parts).  Donald: A schema... is a data structure of generic concepts stored in memory and containing the network of relationships among the constituent parts.... If we are to understand the relationships between concepts, we need to know in what order and how closely concepts are linked and the character of the linkage.  Bain: Students bring paradigms to the class that shape how they construct meaning.... Even if they know nothing about our subjects, they still use an existing mental model of something to build their knowledge of what we tell them. 2

Three Software Development Gestalts Gestalts for Software Development Software development is a broad and diverse field. Our scope in this research is limited to the development of Information Systems, consisting of:  a set of application programs  one or more databases. In CS and IS curricula, software development principles are taught in the following types of courses:  Programming, with an emphasis on object-oriented concepts.  Databases and database management systems.  Software Engineering, including systems analysis and design. 3

Three Software Development Gestalts Purpose of this Study Our primary goal in this research was to construct scales to measure gestalts for Programming, Database, and Software Engineering.  Our methodology involved analyzing words in books.  Our assumption is that words used frequently in a book reflect the gestalt of the author.  Our findings have relevance in designing ways to teach software development courses.  The gestalt scales can also help instructors choose suitable textbooks for those courses. 4

Three Software Development Gestalts Gestalt Scale Development The construction of measurement scales for our three software development gestalts involved the following steps: 1.Sampling: Select a diverse sample of Programming (OOP), Database (DB), and Software Engineering (SE) books having an Amazon concordance. 2. Measurement: For each concordance word, record the book, word, and frequency. 3. Conversion: Change nouns, verbs, adjectives, and adverbs to a consistent form. 4. Transformation: Rescale word frequencies (StdFreq) within a book to give the average concordance word a score of Grouping: Combine relevant synonyms into word/groups, summing the StdFreq scores for each group. 5

Three Software Development Gestalts Gestalt Scale Development 6. Scale Construction: Build the Programming (PGestalt), Database (DGestalt), and Software Engineering (SGestalt) scales using an iterative process:  Look for words that are used frequently within each book and consistently across similar books.  Build a tentative scale, and calculate scores for every book.  Remove books with low scale scores, and repeat the process.  Stop when the scales and list of remaining books stabilize. Starting with 37 OOP books, 37 DB books, and 36 SE books, we obtained (after several iterations) PGestalt, DGestalt, and SGestalt scales constructed from 27 OOP books, 27 DB books, and 26 SE books, respectively. 6

Three Software Development Gestalts Programming Gestalt The PGestalt scale consists of 14 word/groups and weights.  The most frequent word/groups are:  class/subclass  method/algorithm  object.  The weights for each word/group are used to calculate an overall weighted-average PGestalt score for a book.  Sample calculations are illustrated for McMillan’s Object-Oriented Programming with Visual Basic.NET (PGestalt score = 394.1). 7

Three Software Development Gestalts Word/GroupBooks Avg StdFreq Weight class/subclass method/algorithm object code/program function/procedure value/variable integer public/private type/datatype string statement/line data/information new file TOTAL PGestalt Scale

Three Software Development Gestalts Word/GroupWeight StdFreqScale class/subclass20.87* method/algorithm13.78* object10.29* code/program10.14* function/procedure value/variable integer public/private4.81* type/datatype string statement/line data/information3.04* new file TOTAL PGestalt Calculations McMillan (2004), OOP With VB.NET

Three Software Development Gestalts Database Gestalt The DGestalt scale consists of 14 word/groups and weights.  The most frequent word/groups are:  data/information  table/relation  database.  The weights for each word/group are used to calculate an overall weighted-average DGestalt score for a book.  Sample calculations are illustrated for Watson’s Data Management (DGestalt score = 438.7). 10

Three Software Development Gestalts Word/GroupBooks Avg StdFreq Weight data/information table/relation database query/sql entity/relationship attribute/column/field key/primary/foreign system/subsystem object model/modeling user/client/customer record/row/tuple value/variable type/datatype TOTAL DGestalt Scale

Three Software Development Gestalts Word/GroupWeightStdFreqScale data/information19.55* table/relation15.56* database10.83* query/sql entity/relationship attribute/column/field key/primary/foreign system/subsystem object model/modeling user/client/customer record/row/tuple value/variable type/datatype TOTAL DGestalt Calculations Watson (2005), Data Management

Three Software Development Gestalts Software Engineering Gestalt The SGestalt scale consists of 12 word/groups and weights.  The most frequent word/groups are:  software  system/subsystem  process.  The weights for each word/group are used to calculate an overall weighted-average SGestalt score for a book.  Sample calculations are illustrated for Thayer’s Software Engineering, Vol. 1 (SGestalt score = 394.7). 13

Three Software Development Gestalts Word/GroupBooks Avg StdFreq Weight software system/subsystem process data/information code/program requirement/specification test/testing user/client/customer develop/development project design/designer model/modeling TOTAL SGestalt Scale

Three Software Development Gestalts Word/GroupWeightStdFreqScale software18.13* system/subsystem15.85* process data/information code/program6.93* requirement/specification6.81* test/testing user/client/customer develop/development project design/designer3.98* model/modeling TOTAL SGestalt Calculations Thayer (2002), Software Engineering, Vol. 1

Three Software Development Gestalts

Gestalt Scale Distributions Gestalt scale scores varied widely across the books in the sample.  PGestalt scores for the 37 OOP books ranged from to  DGestalt scores for the 37 DB books ranged from to  SGestalt scores for the 36 SE books ranged from to A graph of the three gestalt distributions is shown on the next slide. Each distribution is shown only for books in the relevant category  PGestalt for OOP books  DGestalt for DB books  SGestalt for SE books. Averages for the three scales for all books are presented in a table on the following slide. 17

Three Software Development Gestalts Gestalt Scale Distributions 37 OOP, 37 DB, and 36 SE Books 18

Three Software Development Gestalts Gestalt Scale Distributions Averages by Book Category and Scale Type 19 CategoryPGestaltDGestaltSGestalt OOP (37 books) DB (37 books) SE (36 books) ALL (110 books)

Three Software Development Gestalts Comparing Gestalt Scales The relationship between PGestalt and DGestalt scores for all sample books is displayed on the next screen as a scatter plot.  The correlation between the PGestalt and DGestalt scales is slightly negative (= ), when all 110 books are included.  The correlation is larger (= ) when only OOP and DB books are considered.  Note that 3 DB books and 1 SE book have PGestalt scores above 200 (high on the “wrong” scale).  Similar patterns exist for the PGestalt-Gestalt and DGestalt-SGestalt pairs of scales. 20

Three Software Development Gestalts PGestalt vs. DGestalt Scatter Plot 37 OOP, 37 DB, and 36 SE Books 21

Three Software Development Gestalts Gestalt Mixtures The mixture of gestalts within a book can be obtained by expressing the gestalt scores as percentages.  The next slide is a triangular scatter plot, with PGestalt and DGestalt percentages on the axes.  The SGestalt percentage is implied, since the sum of all 3 is 100%.  A gestalt is dominant if its percentage score exceeds 50%.  Ninety of the 110 books have a dominant gestalt.  Only one (DB) book was dominant on the "wrong" (PGestalt) scale. 22

Three Software Development Gestalts Gestalt Scale Mixture Plot 37 OOP, 37 DB, and 36 SE Books 23

Three Software Development Gestalts Choosing a Textbook Question: Would McConnell’s Code Complete be a suitable textbook for a Software Engineering course?  The SGestalt score for this book is (see next slide).  The most frequent word/groups are:  code/program  data/information  test/testing.  The Gestalt mixture is: PGestalt = 42.7%, SGestalt = 38.1%, and DGestalt = 19.2%.  We would not recommend Code Complete as a primary SE textbook, but it would provide worthwhile supplemental reading. 24

Three Software Development Gestalts Word/GroupWeightStdFreqScale software system/subsystem process data/information8.69* code/program6.93* requirement/specification test/testing user/client/customer develop/development project design/designer model/modeling TOTAL SGestalt Calculations McConnell (2004), Code Complete, 2 nd ed

Three Software Development Gestalts Summary and Conclusions In this paper, we have described three scales for measuring gestalts in software development: 1. Programming – for writing object-oriented programs. 2. Database – for designing and implementing databases. 3. Software Engineering – for building complex software systems. We calculated three gestalt scores for each sample book and analyzed the distributions of these scores.  The scales were able to discriminate between the types of books used to construct the scales.  A few books had high gestalt scores on the "wrong" scale.  When viewed as a mixture of gestalts, 90 of the sample books had a dominant gestalt. 26