Subproject 4: HTML-WML Transcoding System Jia-Shung Wang Computer Science Department National Tsing Hua University March 27, 2001.

Slides:



Advertisements
Similar presentations
Composite Device Computing Environment: A Framework for Situated Interaction Using Small Screen Devices Thai-Lai Pham, Georg Schneider, Stuart Goose and.
Advertisements

WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Panasonic Singapore Labs – Network Team QoS and Delivery Context in Rule-Based Edge Services Prepared for IWCW2002 By Ng Chan Wah
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
LYU0101 Wireless Digital Library on PDA Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu First semester FYP Presentation 2001~2002.
Project 1 Introduction to HTML.
Technical Architectures
WMC  “Web standards” can refer to the actual specification of how a language or technology works.  An industry standards body, such as the.
MultiPlatform Applications Module. Agneda for today Areas to be covered Assessment for the module Agenda for the module timetable Procatical sessions.
Web Server Hardware and Software
Xiaobin Zheng April 13 th, Outline Mobile search Mobile Web Types of services Case Study: Google Search for mobile Yahoo! Search for mobile Conclusion.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
J2ME Prepared by: - Gaery- Ronny - Alan- Andy. Why Technology for Mobile Devices? The nature of wireless devices is changing Old devices: All the software.
World Wide Web1 Applications World Wide Web. 2 Introduction What is hypertext model? Use of hypertext in World Wide Web (WWW) – HTML. WWW client-server.
Web Clipping Presentation By: Alex Jacobs, Philip Kim, Nathan Po Web Clipping.
What is adaptive web technology?  There is an increasingly large demand for software systems which are able to operate effectively in dynamic environments.
Chapter 1 Understanding the Web Design Environment
1st Project Introduction to HTML.
Overview of Search Engines
COMPUTER TERMS PART 1. COOKIE A cookie is a small amount of data generated by a website and saved by your web browser. Its purpose is to remember information.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Chapter 1 Variables in the Web Design Environment.
Chapter 1 Variables in the Web Design Environment
An Introduction to WAP/WML. What is WAP? WAP stands for Wireless Application Protocol. WAP is for handheld devices such as mobile phones. WAP is designed.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Chapter 1 Introduction to HTML, XHTML, and CSS
1 Web Server Concepts Dr. Awad Khalil Computer Science Department AUC.
Introducing Dreamweaver MX 2004
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
© 2002 by Prentice Hall 1 David M. Kroenke Database Processing Eighth Edition Chapter 14 Networks, Multi-Tier Architectures, and XML.
10 Adding Interactivity to a Web Site Section 10.1 Define scripting Summarize interactivity design guidelines Identify scripting languages Compare common.
Week 1 Understanding the Web Design Environment. 1-2 HTML: Then and Now HTML is an application of the Standard Generalized Markup Language Intended to.
World Wide Web Hypertext model Use of hypertext in World Wide Web (WWW) WWW client-server model Use of TCP/IP protocols in WWW.
Lector: Aliyev H.U. Lecture №15: Telecommun ication network software design multimedia services. TASHKENT UNIVERSITY OF INFORMATION TECHNOLOGIES THE DEPARTMENT.
WWW Programming Model. WWW Model The Internet WWW architecture provides a flexible and powerful programming model. Applications and content are presented.
1 Analysis of Push Initiator Tool used for Wireless Application Protocol Taotao Huang Helsinki University of Technology Department of Electrical and Communication.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Chapter 13-Tools for the World Wide Web. Overview Web servers. Web browsers. Web page makers and site builders. Plug-ins and delivery vehicles. Beyond.
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
Ericsson Competence Solutions Rev A16/11/011 Mobile Learning Course for R380 and R520 Presented by Michelle Almeida Course Structure Design Guidelines.
CHAPTER TEN AUTHORING.
Web Design and Development for E-Business By Jensen J. Zhao Copyright 2003 Prentice Hall, Inc. Web Design and Development for E-Business Jensen J. Zhao.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
1 FAQ’S ABOUT WAP Presented By Abhilash Pillai CSCI 5939-Independent Study.
FYP: LYU0001 Wireless-based Mobile E-Commerce on the Web Supervisor: Prof. Michael R. Lyu By: Tony, Wat Hong Fai Harris, Yan Wai Keung.
McLean HIGHER COMPUTER NETWORKING Lesson 6 Types of Browsers & WAP Explanation of browser functions Wireless access to the Internet Description of.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Web Server.
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Internet Applications (Cont’d) Basic Internet Applications – World Wide Web (WWW) Browser Architecture Static Documents Dynamic Documents Active Documents.
WWW Programming Model. WWW Model The Internet WWW architecture provides a flexible and powerful programming model. Applications and content are presented.
Web Services Using Visual.NET By Kevin Tse. Agenda What are Web Services and Why are they Useful ? SOAP vs CORBA Goals of the Web Service Project Proposed.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Introduction TO Network Administration
Copyright © 2002 Pearson Education, Inc. Slide 3-1 Internet II A consortium of more than 180 universities, government agencies, and private businesses.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
1 The World Wide Web Architectural Overview Static Web Documents Dynamic Web Documents HTTP – The HyperText Transfer Protocol Performance Enhancements.
Tutorial 1 Getting Started with Adobe Dreamweaver CS5.
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 13 Computer Programs and Programming Languages.
Chapter 1 Introduction to HTML.
Project 1 Introduction to HTML.
M-Commerce Wireless Markup Language (WML) By Prof T.R. Vaidyanathan.
Mobile Internet and WAP
Presentation transcript:

Subproject 4: HTML-WML Transcoding System Jia-Shung Wang Computer Science Department National Tsing Hua University March 27, 2001

Outline Motivation and Issues Examples of Transcoding System Overview and Translation Flow Some HTML to WML Conversion Strategies

Information Appliances Different design constraints based on intended use, enhances ease of use –Desktop PC –Mobile PC –Desktop “ Smart ” Phone –Mobile Telephone –Personal Digital Assistant –Set-top Box –Digital VCR –… Implications: –Shift from computer design to consumer design –Heterogeneous “ standards, ” hybrid networking –Interactive networking, access on demand, QoS

Motivation  Rapidly growing diversity of wireless communication devices  The incredible growing of the amount of available HTML web pages on the Internet  Solutions for mobile devices with WML browsers to access the existing HTML or WML pages on the Internet.

Issues  Device-enabled service for WML mobile devices with different types of screen  Bandwidth-driven transmission for rapid response and fast delivery speed  The usage of browsing behavior  The resizing of images /icons  The compression of the resulting WML data

Demos of Transcoding Contents from enYES 鉅亨網 USAtoday CS, NTHU NTHU VOD

Discussions  enYES provides two versions: regular HTML and WAP to serve PC users and mobile device users separately.  USAtoday also provides content (simplified version) for users with Palm.  NTHU, CS-NTHU homepages : If we keep the original figure for saving the link information, then the page layout becomes old. (using HTML browser with:Browse-It).  VOD homepage, one-column text: no significant difference after transcoding.

Usage of Browsing Behavior  The automatic translation seems complicated because of the diversity of content posted on an HTML page.  It is unlikely to have a universal conversion strategy to translate every HTML page to sequences of WML decks effectively.  However, it seems a good idea to categorize the browsing behavior to classify the HTML page to be translated first.

Usage of Browsing Behavior (cont’d)  After doing that we may realize what the client requires. Then we can have a corresponding conversion to extract the acquired content step-by-step and translate them into some predictable and small sized WML documents.  We believe that there would be some adequate conversions for some kinds of web pages after classification.

Related Works Transcoding Proxy of IBM alphaWorks It has a goal to manager different version of contents with different fidelities and modalities in order to adapt the delivery to different client device.

Related Works Intel Quick Web Technology New software capability that helps Internet providers and digital distribution companies increase the delivery speed of Web pages containing photos, drawings and other graphics. It uses two key techniques, “Compresses” and “Caches”.

Related Works Spyglass Prism Spyglass Prism dynamically adapts Web content to match various non-PC devices. It functions as a proxy server, caches the converted content, and dynamically converting standard HTML to WML.

Related Works Proxy Architecture for Efficient Web Browsing over Cellular Networks Decreases the access time of browsing WWW in narrow-band wireless environment. It adopts persistent connection and pipelining technique based on proxy architecture to improve the HTTP process between the client and the proxy server.

Comparisons between HTML and WML Both make use of tags and attributes. Similar character set, syntax and data types. Two special elements of WML structure –Deck and Card Different design goal –HTML: To Publish hypertext on the World Wide Web –WML: For narrow network bandwidth devices with small displays, limited memory and fewer computational resources.

Examples of HTML and WML WML This is the first card... This is the second card. HTML Example page. This is a headline. This is a paragraph.

System Overview Web Server Multimedia Content CGI Scripts etc. Translation Server WML Generator Client WML Browser Etc. HTTP HTML Parser WAP HTML-WML Translator HTML, WML Documents HTTP

Features An HTML-WML Translator on the Translation Server Both HTTP and WAP requests are acceptable. Java Servlet API compatible Server- and platform-independent

Translation Server: Components and Flow Network Protocol Proxy HTML Parser Filter Document Analyzer Decks & Cards WML Generator Link Builder Request Response

Components Gateway –Accept requests from clients –Return appropriate responses Proxy Servlet –Get the requested remote documents –Determine to pass or convert –Cache the converted results

Components (cont’d) HTML Parser –Parse the HTML document as a parse tree Document Analyzer –Analyze the parse tree Filter –Filter any objects unnecessary or not supported by the client device –Image/icon resizing

Components (cont’d) Content Divider –Split a document into multiple, small- size documents Link Maker –Insert extra links to make small documents reach one another WML Generator –Produce well-formed WML documents and return them to Proxy Servlet

HTML to WML Conversion Tools Semi-automatic: –Used for rich HTML documents –The conversion form is designated manually with the help of analysis and editing tools. –The resulting forms are distributed to the gateway servers. Automatic: –Used for simple documents, such as News and BBS, …

HTML to WML Conversion Strategies Strategy I: Tables to Lists –Simply removing all layout elements such as table –Let all the contents arrange into only one column with a fixed width Strategy II: One Table One Deck –Extracting each table to form a deck

HTML to WML Conversion Strategies (cont’d) Strategy III: Preview First a. One Table One Deck b. Collect all the first card of every deck as preview cards c. Arrange these preview cards to form an preview deck, which will be transmitted first, every preview card will have a link to its corresponding deck

Original Document

Tables to Lists

One Table One Deck

Preview First

Strategy Evaluation Assuming we have S sections in a document and the document is translated to N WML cards. Every deck contains at most C cards. Assuming that the contents in the same tables are similar.

Evaluation of Searching After Translation Preview First One Table One Deck Tables to Lists GoodBestWorstUser Friendly S/2CS/2N/2 Average Deck Access Time

Performance Evaluation 5.4%57.2%16, %46.7%11, %22.0%7, ,7278,32521, ,7406,13717, ,3619,47124,359Experiment #1 Experiment #2 Experiment #3 HeadersText Source (bytes) Images (bytes) With Images Without Images ReductionHTML Pages WML Decks (bytes) 25.2%40.3%12,06217,96620,3639,568Experiment #4

Performance Evaluation (Experiment #1: What’s WAP ) Preview Deck 1 Deck 3.2 Deck 3.1 What’s WAP Preview Deck 3Deck 2Deck 1 WAP Forum

Performance Evaluation (Experiment #2: NTHU Web Page) Preview NTHU Preview Deck 1 Preview Deck 1Deck 2.1 Deck 2.2 Current Status Preview Deck 1Deck 2.1 Deck 2.2 History Deck 3.1 Deck 3.2 About NTHU

Performance Evaluation (Experiment #3, NTHU CS Web Page) Preview Deck 1 Deck 3.2 Deck 3.1 Faculty Preview Deck 1 NTHU CS Deck 3.4 Deck 3.3 Deck 3.6 Deck 3.5

Performance Evaluation (Experiment #4, IETF Web Page) Preview Deck 1 IETF Preview Deck 1Deck 2.1 Deck 2.2 Internet-Drafts Preview Deck 1 Deck 2.2 Deck 2.1 Internet-Drafts Index Deck 2.4 Deck 2.3Deck 2.5 Preview Deck 1 Deck 2.2 Deck 2.1 DNSOP Deck 2.4 Deck 2.3Deck 2.5

Implementation  Goal: Portability, reusability, and crash protection.  Translation server: under Java environment with Java Servlet, Java HTML Tidy, and XML Parser for Java.  Servlet-enable server: Avenida Web Server and Nokia WAP Server  Microsoft Windows NT Workstation 4.0 with Service Pack 5

Summary Design an HTML to WML transcoding system with 1.Analyzing and filtering HTML contents 2.Image/icon resizing 3.WML browsing mode design and WML conversion tool 4.compression and decompression modules of the WML data. 5.WML transmission control