Web Exploration and Search Technology Lab Department of Computer and Information Science Polytechnic University Brooklyn, NY 11201 Faculty: Torsten Suel.

Slides:



Advertisements
Similar presentations
For more information please send to or EFFICIENT QUERY SUBSCRIPTION PROCESSING.
Advertisements

Interactive Wrapper Generation with Minimal User Effort Utku Irmak and Torsten Suel CIS Department Polytechnic University Brooklyn, NY 11201
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Geographic Web Information Retrieval Alexander Markowetz, University of Marburg Thomas Brinkhoff, FH Oldenburg Bernhard Seeger, University of Marburg.
Embedded Web Hyung-min Koo. 2 Table of Contents Introduction of Embedded Web Introduction of Embedded Web Advantages of Embedded Web Advantages of Embedded.
September 21, Broadband Wireless Network Applications and Performance Carey Williamson Professor/iCORE Senior Research Fellow Department of Computer.
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Company Confidential 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Towards a mobile content delivery network with a P2P architecture Carlos Quiroz.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Scaling Content Based Image Retrieval Systems Christine Lo, Sushant Shankar, Arun Vijayvergiya CS 267.
1 Internet Protocols and Network Performance Issues Carey Williamson iCORE Professor Department of Computer Science University of Calgary.
Efficient Search in Large Textual Collections with Redundancy Jiangong Zhang and Torsten Suel Review by Newton Alex
ODISSEA: a Peer-to-Peer Architecture for Scalable Web Search and IR Torsten Suel with C. Mathur, J. Wu, J. Zhang, A. Delis, M. Kharrazi, X. Long, K. Shanmugasunderam.
A Mobile World Wide Web Search Engine Wen-Chen Hu Department of Computer Science University of North Dakota Grand Forks, ND
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Navigating and Sharing in a Decentralized World Francisco Matias Cuenca-Acuna
Introduction Web Development II 5 th February. Introduction to Web Development Search engines Discussion boards, bulletin boards, other online collaboration.
Web Search – Summer Term 2006 V. Web Search - Page Repository (c) Wolfgang Hürst, Albert-Ludwigs-University.
Overview of Search Engines
Design and Implementation of a Geographic Search Engine Alexander Markowetz Yen-Yu Chen Torsten Suel Xiaohui Long Bernhard Seeger.
Search Engines and their Public Interfaces: Which APIs are the Most Synchronized? Frank McCown and Michael L. Nelson Department of Computer Science, Old.
Search engines Christian Rennerskog, Jonas Rosling, Mattias Olsson.
Web Search Engines and Information Retrieval on the World-Wide Web Torsten Suel CIS Department Overview: introduction.
Deduplication CSCI 572: Information Retrieval and Search Engines Summer 2010.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
New Protocols for Remote File Synchronization Based on Erasure Codes Utku Irmak Svilen Mihaylov Torsten Suel Polytechnic University.
Search Engines and Information Retrieval Chapter 1.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Parallel Processing CS453 Lecture 2.  The role of parallelism in accelerating computing speeds has been recognized for several decades.  Its role in.
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Graph-based Algorithms in Large Scale Information Retrieval Fatemeh Kaveh-Yazdy Computer Engineering Department School of Electrical and Computer Engineering.
Overview What is a Web search engine History Popular Web search engines How Web search engines work Problems.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
Objective Understand concepts used to web-based digital media. Course Weight : 5%
ACM NOSSDAV 2007, June 5, 2007 IPTV Experiments and Lessons Learned Panelist: Klara Nahrstedt Panel: Large Scale Peer-to-Peer Streaming & IPTV Technologies.
Crawling and Aligning Scholarly Presentations and Documents from the Web By SARAVANAN.S 09/09/2011 Under the guidance of A/P Min-Yen Kan 10/23/
Qingqing Gan Torsten Suel CSE Department Polytechnic Institute of NYU Improved Techniques for Result Caching in Web Search Engines.
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Internet Real-Time Laboratory Arezu Moghadam and Suman Srinivasan Columbia University in the city of New York 7DS System Design 7DS system is an architecture.
Search Engine Architecture
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Internet Architecture and Governance
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
Search Engine-Crawler Symbiosis: Adapting to Community Interests
Chittampally Vasanth Raja 10IT05F vasanthexperiments.wordpress.com.
Uncovering the Invisible Web. Back in the day… Students used to research using resources hand-picked by librarians and teachers. These materials were.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Brass: A Queueing Manager for Warrick Frank McCown, Amine Benjelloun, and Michael L. Nelson Old Dominion University Computer Science Department Norfolk,
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Types Pros & cons.  A program for the retrieval of data, files, or documents from a database or network, esp. the Internet.  Search engines usually.
Search Engine and Optimization 1. Introduction to Web Search Engines 2.
Distributed Systems Architecure. Architectures Architectural Styles Software Architectures Architectures versus Middleware Self-management in distributed.
Design and Implementation of a High- Performance Distributed Web Crawler Vladislav Shkapenyuk, Torsten Suel 실시간 연구실 문인철
Chapter Five Web Search Engines
Search Engine Architecture
中国计算机学会学科前沿讲习班:信息检索 Course Overview
CHAPTER 3 Architectures for Distributed Systems
Course Summary (Lecture for CS410 Intro Text Info Systems)
Prepared by Rao Umar Anwar For Detail information Visit my blog:
The Internet An Overview.
جستجو در وب عميق ارائه‌دهنده: حسين شريفي‌پناه
Panagiotis G. Ipeirotis Luis Gravano
Search Engine Architecture
Presentation transcript:

Web Exploration and Search Technology Lab Department of Computer and Information Science Polytechnic University Brooklyn, NY Faculty: Torsten Suel PhD Students: Qingqing Gan Hao Yan Jiangong Zhang PhD Graduates: Yen-Yu Chen (2006) -> Yahoo Utku Irmak (2006) -> Yahoo Xiaohui Long (2006) –> MSN Search Looking for additional PhD students …

“Brooklyn Poly”, founded in 1854 in downtown Brooklyn Engineering, CS, Management 1500 ugrads, 1400 grad students CS: 16 tenure/t faculty, 40 PhD studs. Algorithms, Networks, Security, Software Eng., Image/Vision/Graphics Polytechnic University: WHERE? WHAT? Databases ? Information Retrieval ? Web Search !! - core web search - related work in algorithms, systems, databases - emerging applications: social networks, blogs, local search, …

WHAT EXACTLY? Systems/Architectures/Scalability: - efficient crawling, data distribution, indexing, query execution, link analysis Emerging Applications: - geographic/mobile search, deep web search, blog/RSS search, P2P search core search image & video blogsmobiledesktop low level stuff: “search engine guts”

Some Research Projects Scalability of Large Search Engines Future Search Architectures - peer-to-peer as Google killer? - desktop/client based search - blogs/social networks/new media Geo / Local Search Engines - can we do with less? - scale to larger data? - storage/indexing/mining of web archives Search Engine Research Cluster at Poly ODISSEA System Architecture Example: Google Local Search Geo Search Research at Poly Web Spam - automatic - interactive

Search Engine Query Processing: Three-Level Caching for Efficient Query Processing in Large Web Search Engines. X. Long, T. Suel. 14th WWW Conf., Optimized Query Execution in Large Search Engines with Global Page Ordering. X. Long, T. Suel. VLDB, Geographic Web Search: Efficient Query Processing in Geographic Web Search Engines. Y. Chen, T. Suel, A. Markowetz. ACM SIGMOD, Design and Implementation of a Geographic Search Engine. A. Markowetz, Y. Chen, et al. WebDB 2005 Miscellaneous: Efficient Query Subscription Processing for Prospective Search Engines. U. Irmak, S. Mihaylov et al. USENIX, Interactive Wrapper Generation with Minimal User Effort. U. Irmak, T. Suel. 15th WWW Conf., Efficient Query Evaluation on Large Textual Collections in a P2P Environment. J. Zhang, T. Suel. IEEE Conf. on P2P, Improved Single-Round Protocols for Remote File Synchron. U. Irmak, S. Mihaylov, T. Suel. IEEE Infocom, Hierarchical Substring Caching for Efficient Content Distr. to Low-Bandwidth Clients. U. Irmak, T. Suel. 14th WWW Conf., Some Recent Group Publications: