1 How Could We All Get Along on the Web 2.0? The Power of Structured Data on the Web Sihem Amer Yahia Yahoo! Research.

Slides:



Advertisements
Similar presentations
Social Media.
Advertisements

What means for you Alisa Leonard Vice President, Marketing Strategy iCrossing.
WEB 2.0 What is a web 2.0? Why is it important? How can it work for teachers, librarians and information professionals?
Search Engines and Information Retrieval
Expertise Networks in Online Communities: Structure and Algorithms Jun Zhang Mark S. Ackerman Lada Adamic University of Michigan WWW 2007, May 8–12, 2007,
Social Media 101. What is Social Media? Social Media is simply this:  A website that doesn’t just provide information but that allows you to interact.
Computing Trust in Social Networks
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Social Media Motion: How to Get Started & Keep Going With Facebook, Twitter & More Presented by Eli Lilly and Company Hosted by Rob Robinson McNeely Pigott.
Web Logs and Question Answering Richard Sutcliffe 1, Udo Kruschwitz 2, Thomas Mandl University of Limerick, Ireland 2 - University of Essex, UK 3.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Collaborative Filtering Shaun Kaasten CPSC CSCW.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Search Engines
Sales: (866) PAGE 1 Jim Jabaay - VP of Sales Presenters November 2014 | Sales: (866)
Social Networking – The Ways and Means Rosey Broderick May 2011.
Plugging In Connecting with your Membership On Line T HE A MERICAN L EGION.
Tag-based Social Interest Discovery
Social Media Strategy Non-profit Organization. Platforms to Establish Facebook Blog Twitter YouTube Linked-In.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
1 The BT Digital Library A case study in intelligent content management Paul Warren
AVI/Psych 358/IE 340: Human Factors Web 2.0 November
Christine Laham, Fahed Abdu, David Dezano,Shelly Kim.
Personalized Interaction with Web Resources First Sino-German Symposium on KNOWLEDGE HANDLING: REPRESENTATION, MANAGEMENT AND PERSONALIZED APPLICATION.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Raising Awareness in Library 2.0 way: The UJ Sciences Librarian Virtual Experience SANLiC Workshop, 28 May 2009.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Use of Electronic and Internet advertising options Standard 3.4.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Personalized Search Xiao Liu
LISTSERV LISTSERV is a registered trademark (™) licensed exclusively to L-Soft international, Inc., as the name of its mailing list processor product.
OT Connections is AOTA’s new online community which allows occupational therapists, occupational therapy assistants and students to connect with each.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Virtual Platform for Adult Learning Hindi Portal In Brief PRIA, DVV and ASPBAE.
Search Engine Architecture
Information Retrieval Effectiveness of Folksonomies on the World Wide Web P. Jason Morrison.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
This work is by Georgia Koutrika, published on CIDR'09 All the figures & tables in these slides are from that paper.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Page  1 Social Media in Business Session 8 Agenda Guest Speaker: Cathy Reilly, Agros.org Facebook and LinkedIn Student Presentations Class Project: Create.
Center for E-Business Technology Seoul National University Seoul, Korea Social Ranking: Uncovering Relevant Content Using Tag-based Recommender Systems.
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
FaceBook…. Really?? FaceBook…. Really?? Social networking & Facebook Pam Krambeck, ESU#3
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
OCLC Online Computer Library Center 1 Social Media and Advocacy.
Community provides a private, manageable, and secure online space for organization's to communicate. It puts the collective knowledge of your entire organization.
NHS Education for Scotland The Knowledge Network: Shared Space April 2010.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Optimizing today's websites using tomorrow's technologies.
ASSIST: Adaptive Social Support for Information Space Traversal Jill Freyne and Rosta Farzan.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Kendra Hunter & Charde Johnson EDUC Dr. M. Kariuki.
KMS & Collaborative Filtering Why CF in KMS? CF is the first type of application to leverage tacit knowledge People-centric view of data Preferences matter.
Social Information Processing March 26-28, 2008 AAAI Spring Symposium Stanford University
Get Connected through the My Oracle Support Community Lynn Pionkowski Sr Regional Support Advocate
Harmonization and Integration of Semi- Structured Data Through Wikis and Controlled Tagging E. M. Robinson, R. B. Husar Washington University, St. Louis,
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Groupware What are the goals of a groupware system? - Facilitation - Coordination - Cooperation - Augmented, supported production Is efficiency the goal?
Page  1 Social Media in Business Session 8 Agenda Guest Speaker: Cathy Reilly, Agros.org Facebook and LinkedIn Student Presentations Class Project: Create.
 Smartphones – iPhone, Android, Blackberries, etc  Tablets – iPad, Android, Windows, Google, etc.  Computers Basically anything that can connect to.
How Chapters Can use Social Media Mark Storace Sacramento Chapter March 2013.
NASBLA Social Media: What is it for? NASBLA is involved in numerous Social Media that all serve a distinct purpose. So, what are they all for?
HFMA Chapter Toolkit Aligned with the Early Careerist Task Force Related Toolkits Social Media Overview Social Media LinkedIn Groups Social Media Twitter.
Google Display Network. Targeting options.
Neighborhood - based Tag Prediction
Associative Query Answering via Query Feature Similarity
Information Retrieval and Web Design
Presentation transcript:

1 How Could We All Get Along on the Web 2.0? The Power of Structured Data on the Web Sihem Amer Yahia Yahoo! Research

2 Outline Web search and web 2.0 search Why should we all get along? How could we all get along? Related work Conclusion

Yahoo! Research 3 Web search Access to “heterogeneous”, distributed information –Heterogeneous in creation –Heterogeneous in motives Keyword search very effective in connecting people to information Search web pages

Yahoo! Research 4 Web search vs web 2.0 search? Content creators Content aggregators Feeds Content consumers Anonymous Subscribers Web 2.0 search Web search

Yahoo! Research 5 Web 2.0 a generation of internet-based services that – let people form online communities – in order to collaborate – and share information in previously unavailable ways

Yahoo! Research 6 Online communities Subscribers join communities where they –exchange content: s, comments, tags –rate content from other subscribers –exhibit common behavior About 500M unique Y! visitors per month, about 200M subscribers (login visitors) to more than 130 Y! services

Yahoo! Research 7 Web 2.0 search Web 2.0 Connecting people to people Flickr Y!Answers YouTube Y!Groups

Yahoo! Research 8 Web 2.0 search examples Mary is a professional photographer and is looking for aerial photos of the Hoggar desert She is also an amateur Jazz dancer and wants to ask about dance schools w/flexible schedules in SF She is also looking for the latest video on bird migration in Central Park, NY She has heart problems but loves biking and is interested in finding about discussions on biking trails in northern California

Yahoo! Research 9 Outline Web search and web 2.0 search Why should we all get along? How could we all get along? Related work Conclusion

Yahoo! Research 10 Improving users’ experience Keyword search should be maintained: simple and intuitive Keyword queries usually short –only express a small fraction of the user's true intent Users's interactions within community-based systems can be used to infer a lot more about intent and return better answers

Yahoo! Research 11 Why should we all get along? Contributed content is structured –This is what DB community knows how to do best Relevance to query keywords is key –This is what IR community knows how to do best

Yahoo! Research 12 Searching online communities idauthordate 001s21/1/06 002s41/8/06 003s43/9/06 sub trust s1s3c13 s1s4c14 s2s3c23 s4s6c46 … data table community relationship table idsu b annotation 001s21/1/06 002s41/8/06 003s43/9/06 Tags, ratings, Reviews table

Yahoo! Research 13 Searching online communities Search for most relevant data on some topic –Querying data: selection over data table –Querying annotations: selection over annotation table + join w/data table –Personalizing answers: join w/subscribers table Relevance: use data relevance + annotation table

Yahoo! Research 14 Why should we all get along? Query interpretation depends on subscriber’s interest at the time of querying Data annotations are dynamic –Precompute all (sub,sub,trust) for each topic? Need for dynamic query generation

Yahoo! Research 15 DB and IR Shared interactions help focus search –User-input, community-input, extraction –Personalizing answers with community information Ranking as a combination of –Relevance –Relationship strengths between people in the same community

Yahoo! Research 16 Outline Web search and web 2.0 search Why should we all get along? How could we all get along? – Applications – Technical challenges Related work Conclusion

Yahoo! Research 17 Applications Flickr enables sharing and tagging photos Y! Answers enables asking and answering questions in natural language YouTube enables sharing videos, rating videos, commenting on videos and subscribing to new videos from favorite users Y! Groups enables creating groups, joining existing groups, posting in a group

Yahoo! Research 18 Flickr Acquired by Y! in 2005 Tag search Photos grouped into categories. Set privacy levels on each photo

Yahoo! Research 19

Yahoo! Research 20

Yahoo! Research 21 The new inputs to Flickr search Users tag and rate photos Users tagging same photos with similar tags form a community of interest Combine tag-based search with community knowledge Combine photo rating with relationship strength Search SubscriberQueryCommunities

Yahoo! Research 22 Y! Answers Launched in second half of 2005 Incentive system based on points and voting for best answers Questions grouped by category Some statistics: –over 60 million users –over 120 million answers, available in 18 countries and in 6 languages

Yahoo! Research 23

Yahoo! Research 24 Y! Answers

Yahoo! Research 25 Y! Answers

Yahoo! Research 26 The new inputs to Y!Answers search Users provide Questions/Answers Voting information reflects communities of interest Combine community information with answer rating Search SubscriberQueryCommunities

Yahoo! Research 27 YouTube Founded in February 2005 Tag search Videos grouped by category Some statistics: –100 million views/day –65,000 new videos/day

Yahoo! Research 28

Yahoo! Research 29

Yahoo! Research 30 The new inputs to YouTube search Users provide videos, tags, ratings, comments Similar tags on same videos imply communities of interest Combine community information with video rating Search SubscriberQueryCommunities

Yahoo! Research 31 Yahoo! Groups Yahoo! acquired eGroups in 2000 Group moderators Groups belong to categories Public and private groups Some statistics: –over 7M groups –over 190M subscribers –over 100K new subscribers/day –over 12M s/day

Yahoo! Research 32

Yahoo! Research 33

Yahoo! Research 34

Yahoo! Research 35 Alternative query interpretations Return all group postings relevant to a query. Return only posting by subscribers sharing the same interests: women with heart disease interested in steep slopes

Yahoo! Research 36 The new inputs to Group search Users participate in many groups Group membership and postings imply communities of interest Combine community information with postings relevance Search SubscriberQueryCommunities

Yahoo! Research 37 Outline Web search and web 2.0 search Why should we all get along? How could we all get along? – Applications – Technical challenges Related work Conclusion

Yahoo! Research 38 So, how can we all get along? Augment keyword query with conditions on structure to focus and personalize search (DB) – Flickr: tags – Answers: points – YouTube: reviews and ratings – Groups: s Combine it with relevance (IR)

Yahoo! Research 39 Search architecture Subscriber Query evaluation search terms Query tightening Find relevant community of interest Ranking content relevance + relationship structured query

Yahoo! Research 40 Example “biking trails northern california” Query tightening message contains “…” and from = “s1” or “s2” From: To: Date: Subject: Content: message structure S1 S2 S3 S4 S5 S6 S7 ( si, sj, cij ) Many such relationships depending on subscriber’s interests

Yahoo! Research 41 Can we really all get along? IR may think that user weights are enough to target communities of interest and personalize queries DB thinks expressiveness of query languages cannot all be captured by ranking functions

Yahoo! Research 42 Query rewriting Content-Only Content in context Loose interpretation of context

Yahoo! Research 43 Query relaxation Primitive operations for dropping query predicates Answers to relaxed query contain answers to exact one Scores relaxed answer no higher than score of exact one

Yahoo! Research 44 Query tightening Primitive operations for adding query predicates Tighter answers are found but looser answers should be maintained Scores tighter answers no lower than scores of other answers

Yahoo! Research 45 More technical challenges Query tightening primitives to focus search Subscriber has a different profile/community of interest Topk processing needs to enforce user profiles

Yahoo! Research 46 Outline Web search and web 2.0 search Why should we all get along? How could we all get along? – Applications – Technical challenges Related work Conclusion

Yahoo! Research 47 Related Work Language models: Ask Bruce Croft Web search personalization – Search behavior – HARD track at TREC Building relationship graphs: – Collaborative filtering –Clustering – Unsupervised learning

Yahoo! Research 48 Tempting conclusion Little information could be gathered on users to greatly improve new-generation search IR and DB views both needed

Yahoo! Research 49 More technical challenges Subscriber belongs to different communities of interest Should subscriber turn off personalization? How is efficiency affected? (revisiting topk processing) Back from community search to web search?

Yahoo! Research 50 Beyond search in online communities Are online communities a way to build more accurate user profiles or more? –display relevant groups when user is asking a question on Y! Answers: mashups?

Yahoo! Research 51 Danger of online communities Are we discouraging diversity?

52 Thank you. Questions?