Statistical analysis of Skype conversations: recognizing individuals by their chatting style. Candidate: Cristina Segalin. Supervisor: Dr. Marco Cristani.



Abstract
- Goal of the thesis: recognize the identity of a person from the way she/he chats
- Facts:
  - Each person has a particular style of writing
  - There are features that capture the style of writing
- Assumption:
  - A chat is similar to a spoken conversation
- Our contribution:
  - New stylistic features for analyzing chats, which capture the multimedia nature (written text + oral conversation) of a chat

Contents
- Introduction
- Our approach
- Experiments
- Conclusions

Intro
- SSP (Social Signal Processing) couples social psychology and pattern recognition, with the aim of modeling different behavioral aspects of a person.
- Conversational analysis (CA) describes the style of oral conversations. Conversational analysis is a subfield.

Intro
- Stylometry is defined as the statistical analysis of writing style. It is used to identify the author of a literary work, and it has also been applied to music and fine-art paintings.
- Authorship attribution (AA) is the process of examining the characteristics of a piece of writing (an ancient text, program code, or comments on a website) to draw conclusions about its authorship.
- AA is an old problem: traditionally applied to books, then to the Web, now to chats.
- AA is based on the concept of style, which is captured by a set of stylometric features.

Taxonomy

Feature extraction
- AA standard: consider the text as a whole
- Our ideas:
  - consider the turn as the atomic entity
  - characterize turn-taking via novel features
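To illustrate what treating the turn as the atomic entity could look like in practice, here is a minimal sketch. It assumes a turn is a run of consecutive messages typed by the same person before the interlocutor replies; the slides do not define the turn, so that definition, as well as the `Message` and `Turn` types and their fields, is hypothetical.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Message:
    author: str
    timestamp: float  # seconds from the start of the chat (hypothetical field)
    text: str


@dataclass
class Turn:
    author: str
    start: float    # timestamp of the first message in the turn
    end: float      # timestamp of the last message in the turn
    messages: list  # message texts, in order


def group_into_turns(messages):
    """Merge consecutive messages by the same author into one turn (assumed definition)."""
    turns = []
    for author, run in groupby(messages, key=lambda m: m.author):
        run = list(run)
        turns.append(
            Turn(author, run[0].timestamp, run[-1].timestamp, [m.text for m in run])
        )
    return turns


# Toy chat, made up for illustration only.
chat = [
    Message("A", 0.0, "hi!"),
    Message("A", 2.5, "are you there?"),
    Message("B", 6.0, "yes :)"),
    Message("A", 9.0, "great"),
]
turns = group_into_turns(chat)
for t in turns:
    print(t.author, t.messages)
```

Features are then computed per turn rather than on the whole text of a participant.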

Feature extraction: conversational features
- Turn duration
- Writing speed
- # return characters
- Mimicry
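The four conversational features can be sketched per turn as follows. The slides only name the features, so every definition below is an assumption: turn duration is taken as end time minus start time, writing speed as characters (or words) per second, the number of return characters as the number of messages sent within the turn, and mimicry as the ratio between the length of the current turn and the length of the interlocutor's previous turn. The function `turn_features` and its parameters are hypothetical.

```python
def turn_features(lines, start, end, prev_other_lines=None):
    """Compute assumed conversational features for one turn.

    lines: message texts typed in this turn (one entry per press of return)
    start, end: turn start and end times in seconds
    prev_other_lines: message texts of the interlocutor's previous turn, if any
    """
    n_chars = sum(len(line) for line in lines)
    n_words = len(" ".join(lines).split())
    duration = end - start

    feats = {
        "turn_duration": duration,
        "n_return_chars": len(lines),  # assumption: one return per message
        "chars_per_second": n_chars / duration if duration > 0 else 0.0,
        "words_per_second": n_words / duration if duration > 0 else 0.0,
    }

    # Assumed mimicry: how long this turn is relative to the turn it answers.
    prev_chars = sum(len(line) for line in prev_other_lines or [])
    feats["mimicry"] = n_chars / prev_chars if prev_chars else None
    return feats


# Toy values, made up for illustration only.
feats = turn_features(
    ["hi!", "are you there?"], start=0.0, end=8.0, prev_other_lines=["hello"]
)
print(feats)
```

Collecting these values over all turns of a person gives per-feature distributions that can serve as that person's signature.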

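Signature matching
[Diagram: probe feature signatures are matched against gallery signatures using the Bhattacharyya and Euclidean distances, giving an NxN distance matrix. For each probe, the N distances are ranked (rank 1 to rank N) to build the CMC curve (probability of identification vs. rank), which is summarized by the nAUC.]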
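A minimal sketch of this matching pipeline, under stated assumptions: each signature is a normalized histogram of one feature, probe i and gallery entry i belong to the same person, the Bhattacharyya distance is taken as the negative log of the Bhattacharyya coefficient, and the nAUC is the mean of the CMC values over all ranks. None of these details are given on the slide, and the histograms below are made-up toy data, not results from the thesis.

```python
import math


def bhattacharyya(p, q):
    """Bhattacharyya distance between two normalized histograms."""
    bc = sum(math.sqrt(a * b) for a, b in zip(p, q))
    return -math.log(max(bc, 1e-12))  # clamp to avoid log(0)


def euclidean(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def distance_matrix(probes, gallery, dist):
    """Entry [i][j] is the distance between probe i and gallery signature j."""
    return [[dist(p, g) for g in gallery] for p in probes]


def cmc_curve(dmat):
    """CMC(k): fraction of probes whose true match is among the k nearest."""
    n = len(dmat)
    # Rank of the true match = 1 + number of gallery entries strictly closer.
    ranks = [1 + sum(d < row[i] for d in row) for i, row in enumerate(dmat)]
    return [sum(r <= k for r in ranks) / n for k in range(1, n + 1)]


def nauc(cmc):
    """Normalized area under the CMC curve (mean over ranks; assumed definition)."""
    return sum(cmc) / len(cmc)


# Toy signatures for three people, made up for illustration only.
gallery = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.2, 0.2, 0.6]]
probes = [[0.6, 0.3, 0.1], [0.3, 0.3, 0.4], [0.25, 0.15, 0.6]]

dmat = distance_matrix(probes, gallery, bhattacharyya)
cmc = cmc_curve(dmat)
print("CMC:", cmc)
print("nAUC:", nauc(cmc))
```

Swapping `bhattacharyya` for `euclidean` in the `distance_matrix` call gives the second distance named on the slide without changing the rest of the pipeline.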

Experiments Single features performance

Experiments
Feature selection: 12 features, nAUC = 90.53 (nAUC: normalized Area Under Curve)
Selected features:
- # exclamation marks
- # emoticons
- # three points (ellipses)
- # uppercase letters
- turn duration
- # return chars
- average word length
- # chars per second
- # question marks
- # characters
- # words per second
- mimicry degree
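The slide does not say which selection procedure produced the 12 features. As one possible illustration, the sketch below uses greedy forward selection: starting from an empty set, it repeatedly adds the feature that most improves a score (for example the nAUC of the combined features) and stops when no feature helps. The function `forward_selection`, the `score` callback, and the toy weights are all hypothetical and do not reproduce the thesis results.

```python
def forward_selection(features, score, max_features=None):
    """Greedy forward selection: add the best feature while the score improves."""
    selected = []
    remaining = list(features)
    best = float("-inf")
    while remaining and (max_features is None or len(selected) < max_features):
        # Score every candidate subset obtained by adding one more feature.
        cand_score, cand = max((score(selected + [f]), f) for f in remaining)
        if cand_score <= best:
            break  # no remaining feature improves the score
        selected.append(cand)
        remaining.remove(cand)
        best = cand_score
    return selected, best


# Toy score: arbitrary additive weights standing in for a real nAUC evaluation.
toy_weights = {"n_emoticons": 0.30, "turn_duration": 0.25, "n_chars": -0.05}


def toy_score(subset):
    return sum(toy_weights[f] for f in subset)


selected, best = forward_selection(list(toy_weights), toy_score)
print(selected, best)
```

In a real experiment, `score` would run the signature matching on the chosen feature subset and return its nAUC.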

Experiments Final CMC curve

Conclusions
- Introduction of new features that account for turn-taking and mirror the features typically applied in the automatic understanding of spoken conversations
- Use of turns as the basic unit of analysis for chat data and for the identification of chat participants
- New dataset based on Skype conversations

Future work

Thanks for your attention!