Download presentation
Presentation is loading. Please wait.
Published byBrice Palmer Modified over 9 years ago
1
Pete Bohman Adam Kunk
2
What is real-time search? What do you think as a class?
3
Real-Time Search Definition: A search mechanism capable of finding information in an online fashion as it is produced.
4
Real-Time Search In terms of real-time search, what does “online” mean? Online means that a constant flow of input data is handled, contrary to batch processing Bing Social Search Bing Social Search
5
What kind of content is fed into real-time search? 1. Microblogging data (new type of data) 1. Short temporal life span 2. Little to no context 3. Simple ideas, fast reporting of events 4. Metadata: time, location, relationships 5. Less factual, more opinionated 6. Static posts (vs. dynamic web pages) 7. Furious input rate 8. Often no hyperlink structure, few traditional ranking factors
6
Real-Time Search Input Data Example of what kind of input data is considered for these real-time search systems: twittervision twittervision
7
Web search engines index webpages perdiodically. Expand on this from content on Wiki? Web search is not concerned with social links between data/users What makes real-time search systems different?
8
Web search (Google) – primarily concerned with relevance Real-time search – concerned with relevance, popularity, temporal immediacy
9
What makes real-time search systems different? Web search engines crawl the web periodically and update indexes Need to since data/pages are dynamic Real-time search engines have a constant stream of input data. No need to poll since the posts are static What can we do with real-time search engines?
10
What are the applications of real- time search? How are people using it? Different types of queries Extended Queries
11
What are the applications of real- time search? Real-time news reports Example System: TwitterStand Twitter is one of the first avenues in which news is reported Example: Michael Jackson’s death ordering of media events ○ 9-11 call, twitter update, LA times, Mainstream news Crowdsourcing of first hand reports, CNN does real-time search to see what people are reporting
12
What are the applications of real- time search? Real-time alert systems Leverages tweet metadata (time, location) to raise alerts Earthquake localization based on tweets
13
Twitter Real-Time Alerts USGS Twitter Earthquake Detector
14
Value of real-time search The estimated value of real-time search is around $33,000,000 This is based on queries posted to the collecta system Value derived from types of queries entered in real-time search systems, then valuing these with adwords auctions
15
Related Work (belong here?) Partial Indexes View Materialization Google and Twitter microblog real-time search engines
16
Difficulties of Real-Time Search The problem can be split into two factors: Efficient indexing in order to provide for fast results Effective ranking in order to return relevant results
17
Real-Time Search Indexing How does indexing differ from traditional web indexing? The large volumes of input in microblogging applications such as Twitter ○ Not feasible to index every incoming tweet immediately Need to use selective indexing based on results that are most likely to appear
18
Real-Time Search Indexing RDBMS indexing versus real-time Real-time indexing is not looking at structured data Indexes in RDBMS are built on columns, and they are not necessary to return results ○ They just speed up the search Indexes in real-time applications determine which results are candidate to be returned
19
TI indexing (insert data)
20
Real-Time Search Ranking How does ranking differ from traditional web ranking? There are no social relationships in traditional web pages Typical web search engines rank based on links to a site, and links from a site Website links are not the same as social networking links
21
Real-Time Search Ranking Ranking is not necessary in RDBMS systems RDBMS systems do not favor certain data over others based on select criteria RDBMS systems rank all data contained in the database the same essentially
22
TI Ranking (insert data)
23
What are others doing?
24
Implications/Conclusion (insert data)
25
References (insert data)
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.