How Facebook Talk Informs Us About Current Word Use

Slides:



Advertisements
Similar presentations
Unit 6 Predicates, Referring Expressions, and Universe of Discourse Part 1: Practices 1-7.
Advertisements

The Google Similarity Distance  We’ve been talking about Natural Language parsing  Understanding the meaning in a sentence requires knowing relationships.
Data Analysis in Excel. Importance of Data Analysis Tracking and analyzing data are increasingly important in business, medicine, sports, politics, and.
The Internet Do you really know what is out there?
Jargon Busters Presented by Katie Munton and Natalie Dawson.
Facebook for Beginners One Session Class. What will you learn today? What can you do on Facebook? Creating a profile Privacy Connecting with friends Sending.
Presented By: Nick Koziol ISC110.  Had 1.19 billion members as of October  Largest social networking site in the world  Mark Zuckerberg  Many databases.
VISUAL WORD RECOGNITION. What is Word Recognition? Features, letters & word interactions Interactive Activation Model Lexical and Sublexical Approach.
Taking Notes First thing first- As soon as the teacher says, I need you to take notes, get out your paper. What else could they say?
JavaScript Part 1 Introduction to scripting The ‘alert’ function.
Facebook privacy policy
Genevieve’s Fundraising
Edward De Valle Exciting Internet Marketing Ideas That Anyone Can Use
Tracking Students Throughout the Scholarship Season
FROM MONOMODAL TO MULTIMODAL METAPHORS
E303 Part II The Context of Language Research
Vocabulary Module 2 Activity 5.
1.4 Wired and Wireless Networks
presentational speaking
CORPUS LINGUISTICS Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. An approach to derive at a set of.
Please review these important Webinar Etiquette guidelines
THE NEED FOR DNS DOMAIN NAME SYSTEM
Arrays: Checkboxes and Textareas
Capitalizing on Social Media

Research is Dope and other non-drug related things…
Writing Tasks and Prompts
Language is Psychology Too
Open, Manage, and Reconcile
Dr. Michael Zimmer Jenna Willoughby
Ads facebook Wall Photos Flair Boxes Logout Chloroplast
ADO.NET Entity Framework Marcus Tillett
Kronos Manager Tips and Tricks
Job Search: Networking
Search Techniques & Strategies
Are You Ready for the Future?
GCSE Computing Databases.
Can’t Block the Rock n’ Roll: Early Associative Memory Access
Introduction to High School Grammar
Welcome to Kindergarten
Communicative competence
What is Google+? Google+ is a social network and social layer for google services Some of its tools and features come from existing services and platforms,
Chapter 1 Accounting I When we think about accounting, it is the study of how money flows through a business. When you buy something from the store what.
SOCIAL MEDIA IN MANUFACTURING
Written Communication Styles
Social Media-The Game Changer
Statistical n-gram David ling.
Call Session Power Hour
Methods we use in Mathematics
Tonga Institute of Higher Education IT 141: Information Systems
Social Media Best Practices
FCE (FIRST CERTIFICATE IN ENGLISH) General information.
Networking Workshop (2)
Queries.
SCARAB.
How To Fill Out The FAFSA 101
Advertising, Branding, and social media
Word embeddings (continued)
Call Session Power Hour
eHarmony for Recruiters
Humans of smithson valley
FM2 Section A Planning Workshop
Grammar Preposition Simple Sentence Types of sentences: AS, NS and IS
ASL Grammar Basic but Supersized
Tonga Institute of Higher Education IT 141: Information Systems
Wrt 105: practices of academic writing
REAL CLASSROOM ACTIVITIES
Developing SMART Professional Development Plans
Presentation transcript:

How Facebook Talk Informs Us About Current Word Use Updating your Status How Facebook Talk Informs Us About Current Word Use

the Facebook 600 million users Started at Harvard 2004 as a company 2005 bought facebook.com domain

the Facebook 3rd largest web company After google.com and amazon.com In March 2010, had more hits than google.com

the Facebook

the Facebook Word Use Facebooking, Facebook – 2008 Unfriend – 2009 The Social Network - 2010

the Facebook The only “web” program that has more users? Social connectedness

Psycholinguistics Psycholinguistics is a field of psychology that studies all the facets of language and how the interacts with the individual. Dr. B studies word frequency and relationships between words – as well how our ability to remember that difference.

Previous Research Frequency Simple word counts – how often words appear in written text. HAL – Hyperspace Analogue to Language (Burgess & Lund, 1997) Brown Corpus (Kucera & Francis, 1967) Google ngrams (Google.com, 2011) Many more….

Previous Research Semantics versus Association Semantics = word meaning, the dictionary definition of a word Association = word use, how often words are used together in context Often these are the same thing, but they interact with frequency. What do you think of when I say BANK?

Current Study Currently collecting Facebook walls Number of Walls = 771 Donated for PSY 121 credit

Current Study Dates of Status Updates 2004 to present Over 2 Gigs of text information

Current Study Each Facebook wall is reduced to posts and statuses Then each timestamp is tagged with the status. just loves being at work 3 hours after the store closes. Timestamp: Monday, March 29, 2010 at 1:31am VEGAS BABY!!!!! Timestamp: Monday, February 1, 2010 at 12:47am Flu shot: shot. Hannah Montana band-aid: stuck. Bam. Ready to roll. :) Timestamp: Wednesday, October 27, 2010 at 3:09pm Pshhhhh. Timestamp: Saturday, July 11, 2009 at 12:01pm

Current Study Word Use Nearly 50,000-100,000 unique words were collected Frequency was counted over all words Time values to be added later for comparison Only words with information in at least one frequency database are used here. For example, lol omg wtf are all excluded because they were not in the databases for text word frequency.

Research Questions How much of social word use overlaps with word use in texts? Comparing to frequency norms from previous psycholinguistic research. How do word frequencies change over time? Will be able to compare to Google ngrams How often do words appear together over time? Will be able to compare to association norms

Most Common Words http://www.sporcle.com/games/common_english_words.php

Most Common Words Brown Corpus HAL Facebook 1 a the I 2 in to you 3 he of 4 be and 5 6 have 7 one is 8 there that it 9 who 10 will for my

Results - Overlap Facebook HAL 49% Facebook Brown 40%

Results – Overlap There appears to only be a 40-50% overlap between our social word use (hey!) and written word use (Hello.) What is the rest? Most common word uses are still similar Pronouns Verbs Determinants Prepositions

Results – Common Uses There are several social implications from the top 100 words: Happy Birthday = 11 and 15 Love = 26 Miss = 34 Hey = 39 Class = 51 Haha = 53

Results – So what? What does this mean for everyday language? Language is always changing and evolving – words are deleted and added to the dictionary yearly. Over time languages tend to condense – we use less words to emphasize the same meaning.

Results – So what? Social word use is an important phenomenon to understand. Word use is more context (person to person) based and individualized. For instance, you could post on someone’s wall I DID IT!! and you would know what they were talking about with no context as to what you “did”. Written word use is more defined and user-separated. When writing a newspaper article, the journalist has to write so that everyone will understand.

Results – So what? Communication or facts? What exactly is the purpose of all this speak? Are we trying to communicate with people (without having to be in person)? Or share facts and knowledge? Obviously, social connection is key.

Results – What’s Next? Put this information into a large database for other researchers to use (www.wordnorms.com). Look at word frequency over time – are their reliable relationships that match probability relationships of word pairs? For example, do cat and dog occur together as frequently as we associate them? The probability is much lower than free association (50% versus 11%) Other cool stuff?