Content Analysis Techniques to Ease Browsing with Handhelds Jalal Mahmud Yevgen Borodin I.V. Ramakrishnan Department of Computer Science State University.

Slides:



Advertisements
Similar presentations
1. XP 2 * The Web is a collection of files that reside on computers, called Web servers. * Web servers are connected to each other through the Internet.
Advertisements

XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
CMo: When Less Is More Yevgen Borodin Jalal Mahmud I.V. Ramakrishnan Context-Directed Browsing for Mobiles.
Interception of User’s Interests on the Web Michal Barla Supervisor: prof. Mária Bieliková.
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
Internet Sellouts Final Presentation Enterprise Architecture Group.
1 Slicing*-Tree Based Web Page Transformation for Small Displays Xiangye Xiao, Qiong Luo, Dan Hong, Hongbo Fu Contact: Department of Computer.
XP Browser and Basics1. XP Browser and Basics2 Learn about Web browser software and Web pages The Web is a collection of files that reside.
Building an Intelligent Web: Theory and Practice Pawan Lingras Saint Mary’s University Rajendra Akerkar American University of Armenia and SIBER, India.
Interactive Visual System By Arthur Evans, John Sikorski, and Patricia Thomas.
Web Mining Research: A Survey
By Intellext Presented By: Neha Bhatt. What is Watson? Watson is an information access assistant that automatically retrieves useful information in the.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Software Testing and Quality Assurance Lecture 32 – SWE 205 Course Objective: Basics of Programming Languages & Software Construction Techniques.
Browser and Basics Tutorial 1. Learn about Web browser software and Web pages The Web is a collection of files that reside on computers, called.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Where Do I Start REFERENCE: LEARNING WEB DESIGN (4 TH EDITION) BY ROBBINS 2012 – CHAPTER 1 (PP. 3 – 14)
Discovering Computers Chapter 1 Discovering Computers & Microsoft Office 2010.
Semantic Browsing Alexander Faaborg Research Assistant MIT Media Lab Carl Lagoze Senior Research Associate Cornell University Information Science ECDL.
By: Bihu Malhotra 10DD.   A global network which is able to connect to the millions of computers around the world.  Their connectivity makes it easier.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
HTML Comprehensive Concepts and Techniques Intro Project Introduction to HTML.
Mining and Summarizing Customer Reviews
Slides by Yevgen Borodin (slides adapted for Psych 384, 3/3/09) Department of Computer Science, Stony Brook University A Vision for a Universally Accessible.
Internet Standard Grade Computing. Internet a wide area network spanning the globe. consists of many smaller networks linked together. Service a way of.
Computer Concepts 2014 Chapter 7 The Web and .
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Annotating Search Results from Web Databases. Abstract An increasing number of databases have become web accessible through HTML form-based search interfaces.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
NAVIL GONZALEZ ANDREA CANTU MAGALY LUNA Heuristic Evaluation.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Jeremy Seppi ENGL / 11 /  This presentation will teach you how to comparison shop using Google Product Search, a utility provided by the.
The Internet Industry Week Four. RISE OF THE INTERNET THE INTERNET – a global system of interconnected private, public, academic, business, and government.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
Chapter 8 Browsing and Searching the Web. Browsing and Searching the Web FAQs: – What’s a Web page? – What’s a URL? – How does a browser work? – How do.
The Internet and World Wide Web
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
ITEC 1001 Tutorial 1 Browser and Basics. Web browser software & Web pages The Web is a collection of files that reside on computers, called Web.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
McLean HIGHER COMPUTER NETWORKING Lesson 6 Types of Browsers & WAP Explanation of browser functions Wireless access to the Internet Description of.
XP Browser and Basics COM111 Introduction to Computer Applications.
Framework for Virtual Web Laboratory I. Petković M. Rajković.
Computer Science 160 Group 5 Scott Carter, Chuck Moidel, Leila Takayama, Kevin Wang Tuesday, December 4, 2001.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
1 CS 501 Spring 2003 CS 501: Software Engineering Lecture 13 Usability 1.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Living in a Digital World Discovering Computers Fundamentals, 2011 Edition.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
Premier, multi-disciplinary engineering content that complements course material 750 interactive tables and graphs to.
The Internet Salihu Ibrahim Dasuki (PhD) CSC102 INTRODUCTION TO COMPUTER SCIENCE.
World Wide Web 16 World Wide Web 16. World Wide Web 16 Everyone also talks about the Web But people don’t really understand how it works You need to know.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Data mining in web applications
Fundamentals of Information Systems, Sixth Edition
What this activity will show you
The Internet Industry Week Two.
And On To Design: Why in this particular sequence?
Over 1,000 books, journals, videos and reference material
Integration of ICT in teaching and learning
Objectives Overview Explain why computer literacy is vital to success in today's world Describe the five components of a computer Discuss the advantages.
Objectives Overview Explain why computer literacy is vital to success in today’s world Define the term, computer, and describe the relationship between.
Efficient and Transparent Dynamic Content Updates for Mobile Clients
An Empirical Study of Web Interface Design on Small Display Devices
Create Pitches Faster by Utilizing a Collection of Information in Office 365 That Caters to Clients Partner Logo “Well-trained professionals know that.
IA for Shopping & Shopping Baskets
Web Mining Department of Computer Science and Engg.
Software Agent.
Y. Borodin, F. Ahmed, M. A. Islam, Y. Puzis, V. Melnyk and I. V
Lesson 2: Gathering and Organizing Information Using ICT KEY QUESTION: HOW DO YOU GATHER AND ORGANIZE INFORMATION USING THE COMPUTER AND INTERNET?
Presentation transcript:

Content Analysis Techniques to Ease Browsing with Handhelds Jalal Mahmud Yevgen Borodin I.V. Ramakrishnan Department of Computer Science State University of New York at Stony Brook Stony Brook, NY 11794

Outline Browsing with Handhelds: Content Analysis Techniques: - Model-directed Web Transaction - Merchant-Side Web Transaction - Context Browsing with Mobile - Context-directed Web Transaction Evaluation: Future Work:

Browsing with Handheld User needs to do a lot of scrolling to get to the relevant content Using PDA Relevant Content

Problems Small Screens Offer Narrow Interaction Bandwidth. Unable to convey the Richness of the Web content. Involves a Lot of Horizontal and Vertical Scrolling. Tedious to Get to the Pertinent Content in a Page. This is worse when one is interested in Web transactions (e.g. buying books, paying utility bills).

Our Approach Relevant content Irrelevant content Filter Away Irrelevant Content and Only Present Relevant Content First Present the Relevant Content.

Model-directed Web Transaction Web Transaction Examples: - Buying a CD Player from Bestbuy - Paying Utility Bills Online Web Transaction Characteristics: - A Sequence of Steps - Each Step is Based on User-Selected Operation Two aspects of a Web transaction: - Semantic Concept - Process Model

Semantic Concepts Search Results Taxonomy Add to Cart Product Details

item_select submit_searchform Process Model TAXONOMY CONCEPT SEARCH FORM CONCEPT 1

select_item_category item_select submit_searchform Process Model 1

2 submit_searchform item_select Process Model SEARCH FORM CONCEPT SEARCH RESULT CONCEPT

item_select select_item_category item_select submit_searchform 2 add_to_cart submit_searchform Process Model 1

show_item_detail add_to_cart check_out continue_shopping item_select select_item_category submit_searchform item_select view_shoppingcart view_shoppingcart, update_shoppingcart submit_searchform 1 - START STATE 6 - FINAL STATE Model-driven transaction item_select Submit_searchform

Process Model show_item_detail add_to_cart check_out continue_shopping item_select select_item_category submit_searchform item_select view_shoppingcart view_shoppingcart, update_shoppingcart submit_searchform 1 - START STATE 6 - FINAL STATE Model-driven transaction item_select Submit_searchform

Evaluation Results Built using Automata Learning Techniques Training Data Over 200 Transaction Sequences Collected from over 30 Sites Recall / Precision 90% / 96% for Books domain 86% / 88% for Consumer Electronics domain 84% / 92% for Office Supplies domain Process Model

Concept Extraction LOGICAL TREE Sort Results By Select Box Image Insignia Image Browse Image Case Logic Best Matches Brand Sony Browse Camera Software Electronics Case Logic Taxonomy Camera Software Electronics Image Insignia Image Browse Image Sony Browse Search Result Electronics Search Phrase Search Form Select Box Go Button Entire Site CONCEPT TREE

Developed a Statistical Model for Each Concept using Machine Learning Techniques Training Data Used Labeled Concepts from Over 100 Pages Collected from Two Dozen Sites Evaluation Results Concept Extraction

Evaluation Results Recall for Concept Extraction

Model-directed Web Transaction on Handheld: Guide-O-Mobile Guide-O Mobile Guide-O-Mobile

Outline Browsing with Handhelds: Content Analysis Techniques: - Model-directed Web transaction - Merchant-Side Process Modeling - Context-Browsing with Mobile - Context-Directed Web Transaction Evaluation: Future Work:

Client-Side Process Modeling: Problems Client-Side Process Modeling in Guide-O-Mobile. Process Model is Stored in Client Side. Separate Process Model Needed for Each Domain. Performance Largely Depends on Concept Extraction.

Merchant-Side Process Modeling Labeled Web Content with Semantic Annotations. Content Providers will Label their Web Content. XHTML will be Used to Label Relevant Content in the Web Sites Describe Process Models Specific to the Sites. Mobile Users will Use the System to Easily Identify Relevant Information. Perform On-Line Transactions.

Prototype Implementation XHTML tags:,,,,,,,,,,, and.

Outline Browsing with Handhelds: Content Analysis Techniques: - Model-directed Web Transaction - Merchant-side Web Transaction - Context-Browsing with Mobile - Context-Directed Web Transaction Evaluation: Future Work:

Context Browsing with Mobile On Following a Link Collect Context of the Link Identify the Relevant Section on the Next Page Using the Context Present the Relevant Section. Context Browsing Reduces Information Overload Makes Mobile Browsing Faster.

Context-directed Browsing

How Do We Find Relevant Content? Finding What is Important on a Web Page: Is Subjective on Any Distinct Page Can be Inferred in a Sequence of Pages

Click on the “MP3 Players" Link Collect Context of the Link

Find Relevant Section Using Context Collect Context of the Link Click the Link – Collect Context

Find Relevant Section Using Context Click the Link – Collect Context

Context Browsing with Mobile: CMo Prototype

Product Search Using CMo

Outline Browsing with Handhelds: Content Analysis Techniques: - Model-directed Web transaction - Merchant-side Web transaction - Context-Browsing with Mobile - Context-directed Web Transaction Evaluation: Future Work:

No Process Model Contextual Browsing with a Domain-Dependent Knowledge-Base Relevant Segment Identification Using Contextual Browsing Concept Segment Identification Using Knowledge- Base and Heuristics Algorithms Context-directed Web Transaction

Context-directed Web Transaction: Prototype System The Online Shopping Knowledge-Base Consists of the Following Few Concepts: SearchForm, AddToCart, Taxonomy, ShoppingCart, Checkout, etc. Implementing the Prototype is a Work in Progress.

Evaluation: Guide-O-Mobile Experimental Set-Up Guide-O-Mobile 1.2 GHz desktop with 256 MB RAM Client-Server Model Client: 400 MHz iPaq with 64 MB RAM Server: Core Guide-O System Evaluation Over two dozen CS graduate students Over 30 web sites spanning Books, Consumer Electronics and Office Supplies domains

Evaluation: Guide-O Mobile Guide-O-Mobile: Overall Time Performance

Evaluation: Guide-O Mobile Guide-O-Mobile Overall Time Performance– with standard deviation Standard Deviation

Evaluation: Guide-O Mobile Guide-O-Mobile: Interaction Time

Evaluation: Guide-O Mobile Guide-O-Mobile Interaction Time Performance– with standard deviation Standard Deviation

Evaluation:CMo Experimental Set-Up Client-Server Model Client: IPAQ Pocket PC equipped with Microsoft Pocket PC operating system with wireless Internet connectivity. Server: Core CMo System Evaluation 8 CS graduate students completing 8 tasks (8 times each) on 8 Web sites from News and Shopping Domain.

Evaluation:CMo Performance of Context Identification

Evaluation: CMo Relevant Information Identification

Browsing Efficiency with CMo

Conclusion and Future Work Port all the Server Steps to the Handheld. Extend the Mozilla's Minimo Mobile Browser with CMo Functionalities. Mining Transactional Models from Contextual Information.

Questions?