Asking Questions on the Internet

Slides:



Advertisements
Similar presentations
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Advertisements

Clustering and Load Balancing Optimization for Redundant Content Removal Shanzhong Zhu (Ask.com) Alexandra Potapova, Maha Alabduljalil (Univ. of California.
Analysis and Modeling of Social Networks Foudalis Ilias.
Comparison of Social Networks by Likhitha Ravi. Outline What is a social network? Elements of social network Previous studies What is missing in previous.
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Knowledge Sharing Platform Empowering Communities through regional Content and Services C. Kathiresan C-DAC, Hyderabad, India Session V : e-Content & ICT.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Searching the Web II. The Web Why is it important: –“Free” ubiquitous information resource –Broad coverage of topics and perspectives –Becoming dominant.
Measurement and Analysis of Online Social Networks Alan Mislove,Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Presented.
GAYATRI SWAMYNATHAN, CHRISTO WILSON, BRYCE BOE, KEVIN ALMEROTH AND BEN Y. ZHAO UC SANTA BARBARA Do Social Networks Improve e-Commerce? A Study on Social.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
What’s New in Search? How destinations can leverage new search trends.
Towards Modeling Legitimate and Unsolicited Traffic Using Social Network Properties 1 Towards Modeling Legitimate and Unsolicited Traffic Using.
DETECTING NEAR-DUPLICATES FOR WEB CRAWLING Authors: Gurmeet Singh Manku, Arvind Jain, and Anish Das Sarma Presentation By: Fernando Arreola.
Abstract Introduction Results and Discussions James Kasson  (Dr. Bruce W.N. Lo)  Information Systems  University of Wisconsin-Eau Claire In a world.
University of California at Santa Barbara Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, and Ben Zhao.
Search Engines and Information Retrieval Chapter 1.
What’s New in Search? How destinations can leverage new search trends.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
BLOG. WHAT IS A BLOG ? We have a lot of definition of blog.. A blog is a personal diary. A daily pulpit. A collaborative space. A political soapbox. A.
Addressing Incompleteness and Noise in Evolving Web Snapshots KJDB2007 Masashi Toyoda IIS, University of Tokyo.
Advanced Software Engineering PROJECT. 1. MapReduce Join (2 students)  Focused on performance analysis on different implementation of join processors.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Abstract Objective Selected Results To determine whether globalization influence of the internet results in similarity in web browsing behavior. To determine.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Crowdsourcing & Social Networks Shrenik Sadalgi Spring 2010 COMS E6125 Web-enHanced Information Management Columbia University.
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Uncovering Social Network Sybils in the Wild Zhi YangChristo WilsonXiao Wang Peking UniversityUC Santa BarbaraPeking University Tingting GaoBen Y. ZhaoYafei.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Understanding Crowds’ Migration on the Web Yong Wang Komal Pal Aleksandar Kuzmanovic Northwestern University
03/19/02Scalab Seminar Series1 Mapping the Gnutella Network Macroscopic Properties of Large Scale P2P Systems Ramaswamy N.Vadivelu Scalab, ASU.
Search Engine Architecture
Human Interaction with Data “Meaningful Interpretations” “The Power of Crowdsourcing” &
Finding high-Quality contents in Social media BY : APARNA TODWAL GUIDED BY : PROF. M. WANJARI.
Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.
User Interactions in Social Networks and their Implications Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, Ben Y. Zhao (UC Santa.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
RDFPath: Path Query Processing on Large RDF Graph with MapReduce Martin Przyjaciel-Zablocki et al. University of Freiburg ESWC May 2013 SNU IDB.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
11 Introduction to Neo4j. 2 We all have our own graphs...
Challenge Problem: Link Mining Lise Getoor University of Maryland, College Park.
Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design Authors: Matei Ripeanu Ian Foster Adriana.
1 What Makes a Query Difficult? David Carmel, Elad YomTov, Adam Darlow, Dan Pelleg IBM Haifa Research Labs SIGIR 2006.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Social Turing Tests: Crowdsourcing Sybil Detection Gang Wang, Manish Mohanlal, Christo Wilson, Xiao Wang Miriam Metzger, Haitao Zheng and Ben Y. Zhao Computer.
Outline  Introduction  Subgraph Pattern Matching  Types of Subgraph Pattern Matching  Models of Computation  Distributed Algorithms  Performance.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
Gang Wang, Sarita Y. Schoenebeck †, Haitao Zheng, Ben Y. Zhao UC Santa Barbara, † University of Michigan Understanding Bias and Misbehavior on Location-based.
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
Data mining in web applications
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
WSRec: A Collaborative Filtering Based Web Service Recommender System
Comparison of Social Networks by Likhitha Ravi
Associative Query Answering via Query Feature Similarity
A Comparative Study of Link Analysis Algorithms
Network Science: A Short Introduction i3 Workshop
Introduction of KNS55 Platform
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Presentation transcript:

Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa Barbara gangw@cs.ucsb.edu

Asking Questions on the Internet Systems to answer user questions on the Internet Google - general information Wikipedia - factual knowledge But we often have questions that require… Domain-specific knowledge First-hand life experiences Q: What is the population of Rio? Q: What is the most interesting souvenir you can buy in Rio?

Online Q&A Services Today Question and Answer (Q&A) sites Web services where people ask and answer questions A crowd-sourced way to search information Large online knowledge repositories As the Q&A systems grow to massive scales… More difficult for users to locate useful answers or interesting questions Low-value questions (spam) overwhelm the system 300+ Million questions 1+ Billion answers 3.5+ Million questions 6.8+ Million answers

How does Quora’s internal structures contribute to its success? Quora - Social Q&A “Hottest” (most successful) today First social network based Q&A 350% traffic growth in 2012 Many answers are returned as top answers to Google queries Quora’s advantages High-quality questions and answers True domain experts participation politicians, actors, startup founders, etc. How does Quora’s internal structures contribute to its success?

A Measurement Study of Quora Limited understanding of Quora Size of site (questions, users), growth rate Mechanisms for content discovery, quality control Questions we asked in our study How does Quora grow over time? What’s the impact of social graph on Q&A activities? How does Quora direct users to the valuable content? Match experts w/ questions, and seekers w/ answers

Outline Introduction Characterizing Quora Analyzing Graph Structures Implications

A Typical Question Page Topics Related Questions Question Votes Answer

Graphs, Graphs, More Graphs User-topic graph: user following topics Social graph: user following other users Related question graph: connecting related questions Topics Q Q A

Data Collection Crawling Quora Website Data Since Total Questions Topics Users Answers Question Coverage Quora Oct. 2009 437K 56K 264K 979K 58% StackOverflow Jun. 2008 3.45M 22K 1.3M 6.86M 100% Crawling Quora Snowball-crawled related question graph (August 2012) Obtained the largest connected component Slow speed, minor impact to the site Using the dataset of StackOverflow as a comparison

Growth Over Time Number of Questions 761K Total # of questions estimated by Qid Number of Questions 437K (58%) Similar growth trend with StackOverflow 2008/7 2009/5 2010/3 2011/1 2011/11 2012/7

Details on User-Topic Graph in the paper! Outline Introduction Characterizing Quora Analyzing Graph Structures Social Graph Related Question Graph Implications Details on User-Topic Graph in the paper!

How do social ties impact Q&A activities?

Social Graph Structure Users can follow other users to build social connections Asymmetric social graph Users receive items in their newsfeed from people they follow Social degree has power-law distribution

Is the Social Graph Meaningful? More answers or high-quality answers == more followers Social structure could indicate content quality Correlation between user’s # of followers and # of total answers the user wrote # of votes the user ever received

Using Social Ties to Attract Answers Would social ties help to attract answers? Defining “super-users” Top 5% users sorted by # of followers Social ties have no effect on attracting answers

How does Quora direct users to “interesting” questions?

Related Question Graph Related question feature Allows users to browse a series of related questions Related question graph Questions as nodes, edges indicates “related” relationships Graph properties Power-law structure A small set of “core” questions inside each topic

Impact of Question Degree Strong correlation between question degree and user’s attention on the question Question graph drives users to “core” questions

User Attention on Similar Questions Similar questions in Quora Questions around very close (same) subjects Redundant questions asked by different users Do users pay equal attention to similar questions? Locating similar questions by partitioning question graph METIS, produce clusters, each contains similar questions Q

Equal Attention on Similar Questions? Is user attention evenly distributed in each cluster? Gini coefficient (G): evaluate the uniformity of distribution G=0: perfect equality G~1: extremely skewed distribution User attention is highly skewed in each cluster Excellent! users are not distracted by similar questions G=0 G=0.4 G=0.9

Implication and Conclusion Implication for crowdsourcing content sites Q&A sites Users attention is “skewed” to top questions Avoid distraction, encourage contribution Other sites such as Yelp, TripAdvisor Drive enough reviews to key venues Ensure reliable rating The first large-scale measurement study on Quora Graph structures contribute to effective content discovery Social graph indicates content quality Question graph focuses user attention

Thank you! Questions?