+ CATPAC & WordStat Anne D. Sito & Erin Sonenstein COM 633: FA 09.


+ CATPAC

+ Overview of CATPAC
• Designed to recognize frequently used words in text
• Identifies and groups patterns of similar words
• Provides output of clustering algorithms, perceptual maps, and interactive clustering
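The first point in the overview, finding frequently used words, can be illustrated with a minimal Python sketch. This is not CATPAC's own code; the function name `word_frequencies`, the tokenizing rule, and the sample sentence are assumptions made purely for illustration.

```python
import re
from collections import Counter

def word_frequencies(text, top_n=5):
    """Return the top_n most frequent lowercase word tokens in text."""
    # Assumption: a "word" is a run of letters or straight apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens).most_common(top_n)

# Hypothetical sample text, not taken from the presentation's data.
sample = "You will love it, and you will read it again and again."
top_words = word_frequencies(sample, top_n=7)
print(top_words)
```

Note that a raw count like this ranks function words such as "you" and "and" highly, which is the behavior the deck discusses later.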

+ Data Preparation: Text

+ 1. Convert Document into a .txt File

+ 2. Inputting Data

+ 3. Select Text File You Want to Analyze

+ 4. Select “Make Dendrogram”

+ 5. Initial Output Screen

+ 6. Output Data Screen

+ 7. Output: Dendrogram

+ 8. Data Presented in ThoughtView 2D

+ 9. Data Presented in ThoughtView 3D

+ 10. ThoughtView 3D (Rotated)

+ Discussion and Limitations
+’s
• Found words like “you”, “you’ll”, & “and” to be the most used in this text.
• Examines relationships between words based on proximity in the text.
-’s
• Words are measured based on frequency, not importance.
• Focuses less on what words “mean” or how they fit together based on dictionaries.
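To make "relationships between words based on proximity" concrete, here is a rough Python sketch that counts how often two words fall inside the same sliding window. It is only an illustration under stated assumptions: the window size of 3, the function name `cooccurrence_counts`, and the sample sentence are hypothetical, and CATPAC's actual procedure may differ.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_counts(tokens, window=3):
    """Count unordered word pairs that appear together in a sliding window."""
    counts = Counter()
    for start in range(len(tokens) - window + 1):
        # Use a sorted set so each pair is counted once per window,
        # always in the same (alphabetical) order.
        window_words = sorted(set(tokens[start:start + window]))
        for pair in combinations(window_words, 2):
            counts[pair] += 1
    return counts

# Hypothetical sample text, not taken from the presentation's data.
tokens = "the book was great the plot was slow".split()
pairs = cooccurrence_counts(tokens, window=3)
print(pairs.most_common(3))
```

Pairs with high counts are the ones a clustering step would tend to group together. Nothing here reflects what the words mean, which matches the limitation listed above.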

+ WordStat

+ Overview of WordStat
• Content analysis module for SIMSTAT
• Designed to process textual information, particularly open-ended data such as journal articles, speeches, electronic communication, and interviews
• Has an existing dictionary library and can also run analyses from new dictionaries built by the user
• Can perform statistical analyses (e.g., factor analysis, word frequencies, multiple regression)
• KWIC (Key Word In Context) tables are available for any word or word pattern, whether included or excluded
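A Key Word In Context table lists each occurrence of a keyword alongside the words around it. The Python sketch below shows the general idea only. It is not WordStat's implementation, and the function name `kwic`, the `width` parameter, and the sample review are hypothetical.

```python
import re

def kwic(text, keyword, width=3):
    """Return (left context, keyword, right context) for each keyword hit."""
    # Assumption: a "word" is a run of letters or straight apostrophes.
    tokens = re.findall(r"[A-Za-z']+", text)
    rows = []
    for i, token in enumerate(tokens):
        if token.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            rows.append((left, token, right))
    return rows

# Hypothetical review text, not taken from the presentation's data.
review = "A terrific read. The ending was terrific, though the middle dragged."
rows = kwic(review, "terrific", width=2)
for left, word, right in rows:
    print(f"{left:>12} | {word} | {right}")
```

Reading the keyword in context is one way to check whether a dictionary category fits how a word is actually used.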

+ Data: Comparing Reviews of the Book on Amazon.com Between Men and Women

+ 1. Create a Text File

+ 2. Input Text File to WordStat

+ 3. Define Your Variables

+ 4. Running the Analysis

+ 5. Existing Dictionary Was Not Relevant for Our Data

+ 6. New Dictionary Available Online!

+ 7. (Free) New Dictionary Download

+ 8. Import New Dictionary; Maintain Exclusion List
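As a rough illustration of how a categorization dictionary and an exclusion list work together, the Python sketch below counts category hits while skipping excluded words. The category names, word lists, and function name `categorize` are all hypothetical; they are not the dictionary used in this presentation, and WordStat's own matching rules may differ.

```python
import re
from collections import Counter

def categorize(text, dictionary, exclusions):
    """Count dictionary-category hits in text, ignoring excluded words."""
    # Invert {category: words} into {word: category} for direct lookup.
    word_to_category = {
        word: category
        for category, words in dictionary.items()
        for word in words
    }
    counts = Counter()
    for token in re.findall(r"[a-z']+", text.lower()):
        if token in exclusions:
            continue  # Excluded words are never counted.
        category = word_to_category.get(token)
        if category is not None:
            counts[category] += 1
    return counts

# Hypothetical dictionary and exclusion list, for illustration only.
dictionary = {"POSITIVE": {"great", "love"}, "ANXIETY": {"afraid", "terrific"}}
exclusions = {"the", "and", "a"}
counts = categorize("I love the plot and a terrific ending", dictionary, exclusions)
print(counts)
```

In this made-up dictionary "terrific" sits under ANXIETY, so a positive review still registers an anxiety hit. A lookup of this kind cannot tell which sense of a word the writer intended.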

+ 9. Level 1 Analysis

+ 10. Level 2 Analysis

+ 11. Overall Frequencies

+ 12. Gender Differences

+ 13. Dendrogram

+ 14. Clustering

+ 15. 3D Figure of Output

+ 16. Co-occurrence Matrix

+ 17. KWIC by Gender

+ 18. Words by each Text Case

+ 19. Word Count Category Frequency

+ 20. Aggression Example

+ 21. Limitations: Terrific=Anxiety?

+ Discussion & Limitations
• Allows multiple independent variables
• Dictionaries may not always be complete
• Words in the .txt file must be spelled correctly
• Could not distinguish between quotes from the book and the reviewers' original thoughts
• May not account for different usages of certain words (e.g., combating, terrific)

+ Any Questions? Thank You!