Wikipedia Network Analysis: Commonality detection among Wikipedia authors Deepthi Sajja.

Slides:



Advertisements
Similar presentations
Open repositories: value added services The Socionet example Sergey Parinov, CEMI RAS and euroCRIS.
Advertisements

Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
Global database on the Implementation of Nutrition Action 1 Who is doing what, where, when, why and how An interactive platform and mapping tool on nutrition.
Creating Custom Forms. 2 Design and create a custom form You can create a custom form by modifying an existing form or creating a new form. Either way,
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
Wikis This work is licensed under a Creative Commons Attribution-Noncommercial- Share Alike 3.0 License. Skills (application development): wiki editing.
Using Wikispaces This work is licensed under a Creative Commons Attribution-Noncommercial- Share Alike 3.0 License. Skills: Wikispaces: editing and management.
Tutorial 8 Sharing, Integrating and Analyzing Data
Tutorial 11: Connecting to External Data
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Access Tutorial 8 Sharing, Integrating, and Analyzing Data
Web 2.0: Concepts and Applications 2 Publishing Online.
1 MySQL and phpMyAdmin. 2 Navigate to and log on (username: pmadmin)
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
Tags Pages 63 to 114 in your workbook. Tag Browser Review of the communication chain Polling Driver concepts Tag Browser in detail – Filtering – The tag.
Prepared by: Steve Teo Contributors: Tong Huu Khiem.
Execute Workflow. Home page To execute a workflow navigate to My Workflows Page.
Reorientation for Moodle 2 Staff Guide. File Repositories With Moodle 2’s file repository system: Duplicate files are only stored once, saving disk space.
--Presented by Tianyi Zhang Building Community Wikipedias: A Machine-Human Partnership Approach.
Tajik Wikipedia Free Encyclopedia Ibrahim Rustamov Note: To view pages on the Internet properly with all Tajik letters, please.
Prepared by: Steve Teo Contributors: Tong Huu Khiem.
Test Automation For Web-Based Applications Portnov Computer School Presenter: Ellie Skobel.
Using a wiki This work is licensed under a Creative Commons Attribution-Noncommercial- Share Alike 3.0 License. Skills (application development): wiki.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 6 1 Microsoft Office Access 2003 Tutorial 6 – Creating Custom Forms.
LE:NOTRE Thematic Network Project. Overview Glossary Database 2004 current english definition current english word alternative english definition associated.
Open Wikipedia at any page in the language you want to contribute (English or Swedish). Press Create account (Skapa konto). Enter username and password.
INTRODUCTION TO MAPNET WIKI Anar Khan on behalf of AgResearch IS Bioinformatics, Mathematics and Statistics 10/10/2006.
CAA Database Overview Sinéad McCaffrey. Metadata ObservatoryExperiment Instrument Mission Dataset File.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
C# Programming: From Problem Analysis to Program Design1 Visual Studio Configuration C# Programming: From Problem Analysis to Program Design 4th Edition.
Forms. Forms provide a more convenient user interface for such things as adding new records or editing or deleting existing records in a table. They can.
AdisInsight User Guide July 2015
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Business System Development
Exploring Excel Chapter 5 List and Data Management: Converting Data to
Microsoft Office Illustrated Fundamentals
Lesson 9 Sharing Documents
Wikipedia Network Analysis: Commonality detection among Wikipedia authors Deepthi Sajja.
Microsoft FrontPage 2003 Illustrated Complete
The ways of wikis 4/7/11.
For basic Internet searches for news articles or interviews with the person you are researching, try Bing &/or Google. News search will help you find where.
Lesson 9 Sharing Documents
Tutorial 8 Objectives Continue presenting methods to import data into Access, export data from Access, link applications with data stored in Access, and.
Exam Braindumps
Business Intelligence: A Managerial Approach (2nd Edition)
Lesson 14 Sharing Documents
Search Techniques and Advanced tools for Researchers
Microsoft Office Access 2003
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
Microsoft Office Access 2003
NWSI Neuroimaging Web Services Interface
Indistar Plan Management
Access Tutorial 8 Sharing, Integrating, and Analyzing Data
Database Systems Instructor Name: Lecture-3.
Bob Friedman, Xybion; Anthony Fata, SNBL
ICT Word Processing Lesson 5: Revising and Collaborating on Documents
Manipulating and Sharing Data in a Database
Lesson 14 Sharing Documents
Lecture 8 Information Retrieval Introduction
To view, enable editing, select Slide Show, select From Beginning
Tutorial 7 – Integrating Access With the Web and With Other Programs
Grauer and Barber Series Microsoft Access Chapter One
Grauer and Barber Series Microsoft Access Chapter Two
Databases and Information Management
Have an interactive website with your students By Ellen Dill
Tutorial 8 Sharing, Integrating, and Analyzing Data
Wikis Skills (application development): wiki editing and management
JTLS 6.0 View Data Files In Excel
Integrated Statistical Production System WITH GSBPM
Presentation transcript:

Wikipedia Network Analysis: Commonality detection among Wikipedia authors Deepthi Sajja

Objective Number of contributors editing a Wikipedia article. Key authors of the article

Objetive Key authors of the article Compare growth rate with other language Wikipedia.

Data Extraction Wikimedia database dumps:All pages with complete edit history. (several terabytes) for english alone. Another way: Small set of pages using Wiki's Export page.

Computing Edit Network WikiEvent tool WikiEvent is used to extract the revisions in chunks. WikiEvent input: history files of pages in xml format.

Edit types: add,delete,restore,undelete Target:page or user Consider an example of three revisions on one page where (in Revision 1) user Alice adds some new text to the page; subsequently (in Revision 2), user Bob deletes this text; then (in Revision 3), user Charlie reverts Bob's edit - setting back the page text to the one submitted in Revision 1

WikiEvent Output PageTitle;RevisionID;Time(calendar);Time(milliseconds);InteractionType;WordCount;ActiveUser;Target "Social network analysis";1711088;2003-09-23T21:08:52Z;1064344132000;added;196;"142.177.104.40";"Social network analysis" "Social network analysis";2002109;2003-11-11T06:13:44Z;1068527624000;added;10;"63.228.105.175";"Social network analysis" "Social network analysis";2002109;2003-11-11T06:13:44Z;1068527624000;deleted;192;"63.228.105.175";"142.177.104.40" "Social network analysis";2036847;2003-12-19T22:42:43Z;1071870163000;added;54;"Davodd";"Social network analysis" "Social network analysis";2036847;2003-12-19T22:42:43Z;1071870163000;deleted;7;"Davodd";"63.228.105.175" "Social network analysis";2210638;2003-12-24T13:29:11Z;1072268951000;added;1;"210.49.82.219";"Social network analysis" "Social network analysis";2210638;2003-12-24T13:29:11Z;1072268951000;deleted;1;"210.49.82.219";"Davodd

From this output we can calculate number of edits performed by contributors on a single article. We can filter out data by choosing specific type we need in the csv file.

Importing network to Visone Visone: Used for analyzing and visualization of complex networks. CSV file with the computed edit events can be imported in visone.

Data Filtering

Event iterator

Event network Specify the link attributes How events of various types add to these attributes, and how they change over time. Required for the evolution of event network.

Attributes: Added,delted,restored,undeleted Specify halftime - defining how fast attributes decay over time. Useful for network snapshots over the timeslot.

A halftime equal to zero or negative indicates that the respective attribute does not decay over time. Attributes:added, deleted has no decay. They just adds up the weight. Attribute: recently added may have decay.

We need to specify Identity of weights in weight function table. To establish the identity: Attribute: deleted Event type: deleted Here weights of events of type deleted are added. For attribute : interacted Event type: added,deleted...so on

Network Visualization Bi-Partite: Nodes: pages Nodes:Users cotributed The link attributes encode (in our case) the number of words added.

Questions?