Minnesota Contextual Content Analysis

MCCAlite: Minnesota Contextual Content Analysis. Jeff Spicer & Matthew Egizii.

MCCAlite. Created by Donald McTavish & Kenneth Litkowski, Department of Sociology, University of Minnesota. The full version only operates on a Control Data Cyber 174 computer at the University of Minnesota.

MCCAlite. Used to analyze: scripts, transcripts, screenplays, open-ended items, and potentially Likert-type scales. Conceptual Dictionary: 116 categories used to organize word meaning. Examines patterns of emphasized ideas in text, as well as the social context or underlying perspective reflected in the text.

Preparation/Formatting. Choose an input file: as usual, .txt is preferred. If you use the full version, it can also take .sav (its own file type). {MCCAlite cannot save output!} [Correction: it has been updated to give output, but our lab does not have this version!] Frequent Words File: the default is FREQWDS.TXT. This is used to exclude certain words from analysis. Not recommended by the programmers, because it throws off the weightings.

Preparation/Formatting cont’d. Text Separator: $ marks the end of a body of text. This is really important: MCCAlite can’t process E-/C-scores without it! Text Title: marks the word/text as a unit of analysis. Bracket the word/text with “=”. For example, = Horatio = would track every time Horatio speaks. ALSO VERY IMPORTANT. Can be time consuming…*sigh*
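To make the two markers concrete, here is a minimal Python sketch of how a file prepared this way could be split back into text groups. It is not MCCAlite's own parser: the function name `split_texts`, the regular expression, and the "UNTITLED" fallback are assumptions for illustration, and the only syntax taken from the slide is "$" as the text separator and "= Title =" as the text title.

```python
import re
from collections import defaultdict

# Assumed syntax: a title is a word or phrase bracketed by "=" at the start of a body.
TITLE_RE = re.compile(r"=\s*([^=\n]+?)\s*=")

def split_texts(raw):
    """Split raw input on '$' and group each body under its '= Title =' marker."""
    groups = defaultdict(list)
    for body in raw.split("$"):
        body = body.strip()
        if not body:
            continue  # skip the empty piece after the final '$'
        match = TITLE_RE.match(body)
        if match:
            title = match.group(1)
            text = body[match.end():].strip()
        else:
            # Hypothetical fallback for bodies that carry no title marker.
            title, text = "UNTITLED", body
        groups[title].append(text)
    return dict(groups)

sample = (
    "= Horatio = Hail to your lordship! $ "
    "= Hamlet = I am glad to see you well. $ "
    "= Horatio = The same, my lord. $"
)
groups = split_texts(sample)
for title, bodies in groups.items():
    print(title, len(bodies), bodies)
```

Each title becomes one unit of analysis, so both of Horatio's speeches end up in the same group even though they are separate bodies of text.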

Word Analysis. Word Accounting: total number of words; total words categorized; % of unique words; % of categorized unique words. Word Length: mean; standard deviation; low/high.
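The word-accounting figures listed here are simple to reproduce outside the program. The sketch below is one plausible reading of them, not MCCAlite's actual computation: the tokenizer, the function name `word_accounting`, and the tiny `categorized` set standing in for the conceptual dictionary are all assumptions. It also assumes the text has at least two words, since a standard deviation needs two values.

```python
import re
import statistics

def word_accounting(text, categorized):
    """Return word counts and word-length statistics for one text.

    `categorized` is a hypothetical set of words that the dictionary
    assigns to some category.
    """
    words = re.findall(r"[a-z']+", text.lower())
    unique = set(words)
    lengths = [len(w) for w in words]
    return {
        "total_words": len(words),
        "total_categorized": sum(w in categorized for w in words),
        # Assumed reading: unique words as a share of all words.
        "pct_unique": 100 * len(unique) / len(words),
        # Assumed reading: categorized unique words as a share of unique words.
        "pct_categorized_unique": 100 * len(unique & categorized) / len(unique),
        "mean_length": statistics.mean(lengths),
        "sd_length": statistics.stdev(lengths),
        "min_length": min(lengths),
        "max_length": max(lengths),
    }

stats = word_accounting("The man rode the horse home", {"man", "horse", "home"})
for name, value in stats.items():
    print(name, value)
```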

Words by Category/Frequency. Select a text group from the drop-down to examine its word use. This shows ONLY that text group. Shown at right is “SLUG LINE”. A minimum frequency can be set in the drop-down box above. Shown at right, it is set to {5}.
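The same kind of table can be sketched in a few lines of Python: count the words of one text group, drop anything below the minimum frequency, and list what is left under its category. The function name `words_by_category`, the `category_of` mapping, and the category labels are invented for the example and are not taken from the MCCA dictionary.

```python
from collections import Counter, defaultdict

def words_by_category(words, category_of, min_freq=5):
    """Group words that occur at least `min_freq` times under their category."""
    counts = Counter(words)
    table = defaultdict(list)
    for word, n in counts.most_common():
        if n >= min_freq:
            table[category_of.get(word, "uncategorized")].append((word, n))
    return dict(table)

# Hypothetical words from one text group and a hypothetical category mapping.
words = ["ride"] * 6 + ["door"] * 5 + ["sun"] * 2
category_of = {"ride": "movement", "door": "structure"}
table = words_by_category(words, category_of, min_freq=5)
print(table)
```

With the minimum set to 5, "sun" (2 occurrences) drops out of the table, which mirrors what the drop-down does in the program.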

KWIC: Key Words In Context. Lines up each occurrence of a key word with its surrounding text (a concordance) so the word can be observed in context.
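For readers who have not seen a KWIC display before, the idea can be sketched as follows. This is a generic concordance routine, not MCCAlite's implementation; the function names `kwic` and `format_kwic`, the window width, and the sample sentence are assumptions.

```python
def kwic(text, keyword, width=3):
    """Return (left context, keyword, right context) for every match."""
    tokens = text.split()
    lines = []
    for i, tok in enumerate(tokens):
        # Compare without surrounding punctuation and without regard to case.
        if tok.strip(".,;:!?\"'").lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines

def format_kwic(lines):
    """Right-align the left context so the key words line up in one column."""
    pad = max((len(left) for left, _, _ in lines), default=0)
    return [f"{left.rjust(pad)}  {kw}  {right}" for left, kw, right in lines]

text = "Ethan rides toward the house. Martin watches Ethan ride away."
hits = kwic(text, "ethan", width=2)
for line in format_kwic(hits):
    print(line)
```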

C-Scores. Measure social context across 4 main categories. Traditional: focus on norms and expectations. Practical: focus on successful (efficient) goal accomplishment. Emotional: focus on personal involvement, comfort, enjoyment, or leisure. Analytic: focus on objectivity, curiosity, or interest.

C-Scores cont’d. Scores (Weighted): scaled for each text, with values from –25 to +25.

C-Scores cont’d. Plots (Weighted): help compare the profiles visually.

C-Scores cont’d. Scores (Raw): un-normalized scores; they sum to 0 but aren’t scaled. Plots (Raw): as with weighted, these help visualize the scores. {Not shown, we figure you get the picture.}

C-Scores cont’d. Distance Matrix: Euclidean distance between the C-scores of each pair of text groups in the input file. Texts that are similar have smaller values. E-scores have this table too!
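The distance matrix itself is ordinary Euclidean distance over the four context scores, which can be sketched like this. The score values below are made up for illustration (they are not output from our run), the order of the four numbers is assumed to be traditional, practical, emotional, analytic, and `distance_matrix` is a hypothetical helper name.

```python
import math

def distance_matrix(scores):
    """Euclidean distance between the score vectors of every pair of text groups."""
    names = list(scores)
    return {a: {b: math.dist(scores[a], scores[b]) for b in names} for a in names}

# Invented C-scores: (traditional, practical, emotional, analytic).
c_scores = {
    "ETHAN": (3.0, 4.0, 0.0, 0.0),
    "MARTIN": (0.0, 0.0, 0.0, 0.0),
    "SLUG LINE": (3.0, 4.0, 0.0, 0.0),
}
matrix = distance_matrix(c_scores)
for name, row in matrix.items():
    print(name, row)
```

Identical profiles give a distance of 0, and the further apart two profiles are, the larger the value.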

E-Scores. Show emphasis on groups of idea categories formed from the 116 individual categories. Scores are “normed” against expected usage, which demonstrates over-emphasis and under-emphasis. It is a computed score based on probability.
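The slides do not give the E-score formula, so the following is only a rough illustration of what "normed against expected usage" can mean: take the observed share of each category in a text and subtract the share you would expect. This is a stand-in, not the MCCA computation, and the function name `emphasis_scores`, the category labels, and the expected proportions are all invented.

```python
from collections import Counter

def emphasis_scores(category_tokens, expected):
    """Observed proportion minus expected proportion for each category.

    Positive values suggest over-emphasis, negative values under-emphasis.
    This is an illustrative stand-in, not the actual E-score formula.
    """
    counts = Counter(category_tokens)
    total = sum(counts.values())
    return {cat: counts.get(cat, 0) / total - p for cat, p in expected.items()}

# Invented data: one category label per categorized word in the text.
tokens = ["movement"] * 6 + ["family"] * 2 + ["law"] * 2
# Invented expected proportions standing in for the norms.
expected = {"movement": 0.4, "family": 0.4, "law": 0.2}
e_like = emphasis_scores(tokens, expected)
for category, score in e_like.items():
    print(category, round(score, 3))
```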

E-Scores cont’d. High-Categories: 23 super-categories grouped together from the initial 116.

E-Scores cont’d. Selected Plots: allows you to choose a specific category’s emphasis to plot, by text group. This is not the plot of the High-Categories, but of the individual ones.

E-Scores cont’d. Difference Analysis: indicates how different the selected text group is from all other text groups (or characters, etc.), listed by category.

E-Scores cont’d. Diagnostic Groups: 43 combinations of categories, set up as scales for further analysis {pretty sure about that}. This is the interesting bit.

The Searchers. We decided to demonstrate this software using this film. Characters: Ethan Edwards (Wayne) & 5 others. We also created “SLUG LINE” for scene headings, actions, transitions, etc. Plot: “As a Civil War veteran spends years searching for a young niece captured by Indians, his motivation becomes increasingly questionable.” Length: approximately 8 pages of screenplay. Reason for excerpt: it contained dialog and more than one character. This was entirely exploratory {remember, data can’t be saved!}