1 Semantic Annotation for Web Content Adaptation Unit 14 of Spinning the Semantic Web.

Presentation transcript:

1 Semantic Annotation for Web Content Adaptation Unit 14 of Spinning the Semantic Web

2 Introduction Necessary for Web contents to be adapted for transparent access from a variety of client agents (cellular phones, PDA) –A large, full-color image may be reduced with regard to size and color depth, removing unimportant portions of the content, when accessed by certain devices Better presentation and faster delivery to client devices Transcoding: transformation of information from one form to another –Web content transcoding –Crucial for universal Web access under varying conditions that may depend on client capabilities, network connectivity, or user preferences

3 Composite Capabilities/Preferences Profiles (CC/PP)

4 Introduction CC/PP stands for Composite Capabilities/Preferences Profile, and is a system for expressing device capabilities and user preferences. The goal of the CC/PP framework is to specify how client devices express their capabilities and preferences (the user agent profile) to the server that originates content (the origin server). The origin server uses the "user agent profile" to produce and deliver content appropriate to the client device. In addition to computer-based client devices, particular attention is being paid to other kinds of devices such as mobile phones.

5 Devices The web is accessed by various devices: –PC, NoteBook, PDA, Mobile Phone… Each one having different capabilities –Hardware : screen size/color, audio, bandwidth… –Software : mpeg, mp3, 3GPP, AMR…

6 CC/PP & RDF The CC/PP framework starts with RDF and then overlays a CC/PP-defined set of semantics that describe profiles. A CC/PP profile, expressed in RDF, is a collection of information about the capabilities of the hardware platform and system software, and the preferences of the user.

7 Advantages of CC/PP By only sending required content, no time or bandwidth is wasted sending unwanted content. This can also lead to faster page loading times. A server can provide information to a more diverse range of browsers. This can be beneficial not only in economic terms, but also in terms of site accessibility. You give the users what they want, not what you think they want. So many…

8 Deployment (Client & Server Proxies)

9 Deployment (Server Proxy only)

10 Deployment (Client Proxy only)

11 Deployment (Ideal Approach)

12 CC/PP Query

13 Content adaptation

14 Two ways to use CC/PP profiles Selection If the web server has a set of pre-written web pages, suitable for a number of different devices, then the profile can be used to decide which of these pre-written pages is most suitable for the web browser. Transformation Web page content can be kept in a neutral format (e.g. XML). This can then be transformed into an appropriate format, using the profile to decide what that format is.
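
The two uses described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the profile is reduced to a plain dictionary (a real CC/PP profile is an RDF document), and the page names, the `screen_width` and `markup` keys, and the functions `select_page` and `transform` are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified profiles; real CC/PP profiles are RDF documents.
PROFILE_PHONE = {"screen_width": 176, "markup": "wml"}
PROFILE_DESKTOP = {"screen_width": 1280, "markup": "html"}

# Selection: pre-written pages, each with the minimum screen width it needs.
# The list is ordered from the plainest page to the richest one.
PREWRITTEN_PAGES = [
    (0, "news_small.wml"),
    (480, "news_medium.html"),
    (1024, "news_full.html"),
]


def select_page(profile):
    """Return the richest pre-written page that fits the device screen."""
    chosen = PREWRITTEN_PAGES[0][1]
    for min_width, page in PREWRITTEN_PAGES:
        if profile["screen_width"] >= min_width:
            chosen = page
    return chosen


# Transformation: content is kept in a neutral XML format.
NEUTRAL_XML = "<article><title>Launch</title><body>Details here.</body></article>"


def transform(profile, xml_text):
    """Render the neutral XML in the markup named by the profile."""
    root = ET.fromstring(xml_text)
    title = root.findtext("title")
    body = root.findtext("body")
    if profile["markup"] == "wml":
        return f'<wml><card title="{title}"><p>{body}</p></card></wml>'
    return (
        f"<html><head><title>{title}</title></head>"
        f"<body><p>{body}</p></body></html>"
    )
```

Selection trades storage and authoring effort for simplicity on the server; transformation keeps a single source but needs one rendering rule per target format.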

15 CC/PP Implementations DICE Hewlett Packard DELI Intel Inria Keio University - Portal UMBC JIGSAW X-Smiles Browser So many…

16 Demonstrations An example of an RDF file and graph A demo page presenting the functionality of the CC/PP protocol

17 Reference ojects/pda_doc_layout/seminar-html/

18 External Annotation Framework

19 Annotation Schemes Inline annotation: embed annotations in a Web document –Created as extra attributes of document elements HTML browsers ignore unknown attributes in an HTML document –Ease of annotation maintenance, eliminating the bookkeeping task of associating annotations with their target documents –Require annotators to have document ownership External annotation: separate annotation from the original document –Raise no issues related to document ownership –Facilitate the sharing and reuse of annotations across documents –Avoid the mixing of contents and metadata

20 Applications of Web Content Annotation Discovery –Accurate searches of Web resources Qualification –Descriptions of users’ preferences regarding privacy Adaptation – the focus of this unit

21 Overview of An Annotation-Based Transcoding

22 External Annotation Files Contain metadata that address a part of a document to be annotated –XPath and XPointer are used to associate annotated portions of a document with annotating descriptions A reference may point to a single element or a range of elements If a target element has an ID attribute, the attribute can be used for direct addressing without the need for a long path expression Use RDF as the fundamental syntax of annotation files –User preferences and device capability: Composite Capability/Preference Profiles (CC/PP) –Document profiles (
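
As a rough illustration of the addressing described on this slide, the sketch below resolves two annotation targets against a small page: one by a path expression and one directly by ID. It uses Python's `xml.etree.ElementTree`, which supports only a subset of XPath and no XPointer. The page, the `ANNOTATIONS` list, and the `resolve` helper are hypothetical, and the descriptions are plain dictionaries rather than RDF.

```python
import xml.etree.ElementTree as ET

# A small well-formed page standing in for the annotated document.
PAGE = """<html><body>
<table><tr><td>header menu</td></tr></table>
<table><tr><td id="main">news story</td></tr></table>
</body></html>"""

# Hypothetical external annotations: (target expression, description).
# The first target uses a path expression; the second addresses the
# element directly through its ID attribute.
ANNOTATIONS = [
    ("./body/table[1]", {"role": "header"}),
    (".//*[@id='main']", {"role": "proper content"}),
]


def resolve(root, annotations):
    """Pair every element matched by a target with its description."""
    resolved = []
    for path, description in annotations:
        for element in root.findall(path):
            resolved.append((element, description))
    return resolved


root = ET.fromstring(PAGE)
targets = resolve(root, ANNOTATIONS)
```

The ID-based target keeps working if the tables are reordered, whereas the positional path would then point at a different table; this is one reason direct addressing is attractive when an ID is available.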

23 Framework of External Annotation

24 Association How to select an annotation file for a Web document –Implicitly → by means of a structural analysis of the subject document –Explicitly → by means of a tag (such as a link element in the HTML header) An annotation file can be associated with a single document file, but the relation is not limited to one-to-one –Many annotation files for one Web document –One annotation file for multiple Web documents Useful when it is necessary to annotate common parts of Web documents, such as page headers, company logo images, and sidebar menus

25 Annotation-Based Transcoding System

26 Overview Content can be adapted on a content server, a proxy, or a client terminal –An adaptation engine should not be forced to reside in any particular location Use a proxy-based approach for content adaptation

27 Transcoding Architecture Intermediary –Computational entities that reside along the Web transaction path –Facilitate an approach to making ordinary information streams into smart streams that enhance the quality of communication An intermediary processor or a transcoding proxy can operate on a document to be delivered and transform the contents with reference to associated annotation files

28 Authoring-Time Transcoding Requirements for authoring-time transcoding –WYSIWYG editor –Let the annotator navigate from an existing annotation to the portion of an annotated document designated by XPath / XPointer –Verify the results of content adaptation through a previewer Authoring-time transcoding is crucial when annotations are employed for content adaptation, rather than discovery or qualification of contents –Content adaptation often changes the structure of original documents as the result of transcoding

29 Authoring-Time Transcoding

30 WYSIWYG Annotation Tool

31 HTML Page Splitting for Small-Screen Devices

32 Annotation Vocabulary An annotation vocabulary for HTML page splitting needs to be specified to constrain the possibilities for decomposition, combination, and partial replacement of contents Annotation of Web Content for Transcoding Alternatives –Provide alternative representations of a document or any set of its elements –Color image → grayscale image –A transcoding proxy selects the alternative that best suits the capabilities of the requesting client device Elements in the annotated document can then be altered either by replacement or by on-demand conversion
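
The choice among alternatives can be pictured as a small filter-and-rank step. The sketch below is an assumption-laden illustration, not the vocabulary's actual selection rule: the `Alternative` fields, the file names, and the ranking (deepest usable color depth, then largest size) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Alternative:
    resource: str      # hypothetical file name of the alternative
    color_depth: int   # bits per pixel the alternative requires
    size_bytes: int    # payload size of the alternative


def choose_alternative(alternatives, device_color_depth, max_bytes):
    """Return the richest alternative the device can display, or None."""
    usable = [
        a for a in alternatives
        if a.color_depth <= device_color_depth and a.size_bytes <= max_bytes
    ]
    if not usable:
        return None
    # Prefer the deepest color the device supports, then the larger payload.
    return max(usable, key=lambda a: (a.color_depth, a.size_bytes))


LOGO_ALTERNATIVES = [
    Alternative("logo_color.png", 24, 40_000),
    Alternative("logo_gray.png", 8, 12_000),
    Alternative("logo_mono.png", 1, 2_000),
]
```

For example, a device with an 8-bit display and a 20,000-byte budget would receive `logo_gray.png` under this ranking, while a device that can use none of the listed alternatives gets `None` and the proxy would have to fall back to on-demand conversion or omission.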

33 Annotation Vocabulary (Cont.) Splitting Hints –An HTML file that can be shown as a single page on a normal desktop PC may be divided into multiple pages on clients with smaller display screens –pcd:Group: specifies a set of elements to be considered as a logical unit and provides hints for determining appropriate page break points Selection Criteria –Help a transcoding module select, from alternatives, the one that best suits the client device –pcd:role → value attribute (proper content, side menu, decoration…) –pcd:importance → priority (low-importance content may be ignored or displayed in a smaller font)
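
One way to picture how group, role, and importance hints could drive page splitting is the sketch below. It is a minimal illustration under assumptions: the `Group` fields, the importance scale, the size units, and the greedy packing rule in `split_page` are hypothetical and are not taken from the annotation vocabulary itself.

```python
from dataclasses import dataclass


@dataclass
class Group:
    name: str
    role: str          # e.g. "proper content", "side menu", "decoration"
    importance: float  # hypothetical scale; higher means more important
    size: int          # hypothetical display cost of the group


def split_page(groups, page_capacity, min_importance=0.0):
    """Drop low-importance groups, then pack the rest into small pages.

    A group is treated as a logical unit and is never split, so page
    breaks fall only between groups. The most important groups come first.
    """
    kept = [g for g in groups if g.importance >= min_importance]
    kept.sort(key=lambda g: -g.importance)  # stable: ties keep source order

    pages, current, used = [], [], 0
    for group in kept:
        if current and used + group.size > page_capacity:
            pages.append(current)
            current, used = [], 0
        current.append(group.name)
        used += group.size
    if current:
        pages.append(current)
    return pages


NEWS_PAGE = [
    Group("header", "header", 0.2, 300),
    Group("search", "auxiliary", -0.5, 200),
    Group("story", "proper content", 1.0, 900),
    Group("sidebar", "side menu", 0.1, 400),
]
```

With a capacity of 1000 and the default threshold, the search form is dropped, the story gets a page of its own, and the header and sidebar share the following page.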

34 Annotation Descriptions

35 Adaptation Engine Runs on an intermediary server called WBI Flow chart –Upon receipt of a request from a client browser, the original page is retrieved for the first time from a content server. –The editor component of the plugin tries to find the location of the annotation file: If it is specified in a link element in the HTML header section, retrieve the designated annotation file. Otherwise, look up the table that maps the URL of the original page to that of an annotation; if an entry is found, retrieve the designated annotation file. Otherwise, the original page is returned as it is and the session is terminated.
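
The lookup order in this flow chart can be sketched as follows. This is a hedged illustration only, not WBI's plugin API: the `rel="annotation"` value, the `URL_TABLE` mapping, the example URLs, and the `find_annotation` function are assumptions made for the example.

```python
from html.parser import HTMLParser


class LinkFinder(HTMLParser):
    """Record the href of the first link element marked as an annotation."""

    def __init__(self):
        super().__init__()
        self.annotation_url = None

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        # rel="annotation" is a hypothetical convention for this sketch.
        if (
            tag == "link"
            and attributes.get("rel") == "annotation"
            and self.annotation_url is None
        ):
            self.annotation_url = attributes.get("href")


# Hypothetical table mapping page URLs to annotation URLs.
URL_TABLE = {
    "http://example.com/news.html": "http://example.com/annot/news.rdf",
}


def find_annotation(page_url, page_html, table=URL_TABLE):
    """Return the annotation URL for a page, or None if there is none.

    Step 1: look for a link element in the page itself.
    Step 2: otherwise consult the URL mapping table.
    None means the original page should be returned unchanged.
    """
    finder = LinkFinder()
    finder.feed(page_html)
    if finder.annotation_url:
        return finder.annotation_url
    return table.get(page_url)


LINKED_PAGE = (
    '<html><head><link rel="annotation" href="http://example.com/a.rdf">'
    "</head><body>news</body></html>"
)
PLAIN_PAGE = "<html><head></head><body>news</body></html>"
```

Checking the page before the table lets a document author override a site-wide mapping, which matches the order given on the slide.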

36 Adaptation Engine (Cont.) Flow Chart (Cont.) –The generator component of the plugin generates the current page to be returned. Taking account of the client capabilities included in the HTTP request header, the generator extracts a portion of the document object tree and returns the sub-tree to the client

37 Adaptation Engine – System Flow

38 Application to Real-Life HTML Pages The Web page used as an example is a news page from a corporate Web site The news page consists of three tables stacked from top to bottom. –The top and middle tables correspond respectively to a header menu and a search form. –The bottom table is used for layout.

39 Layout of A Real-Life News Page

40 Annotations for Splitting the News Page

41 Annotation for fragmentation of an actual news page

42 Screen copy of a small display preview on an authoring tool

43 Comparison of display contents on a small-screen device

44 Splitting Result The page splitting not only reduces the content to be delivered, but also places the primary content near the top of the fragmented page, which is provided with navigational features –Placing navigational features (menu bars etc.) near the top of pages –Placing key information at the top of pages –Reducing the amount of information on the page Page fragmentation based on semantic annotation will be more appropriate than page transformation based solely on syntactic information (removing white space, shrinking or removing images…) –The inability to rearrange content semantically is one of the critical limitations of the syntactic transformation approach. –The navigational features achieved by this semantic annotation are noteworthy from the perspective of Web content accessibility.

45 Issues Consistency between an Original Document and Its Annotation –Necessary to provide a way of keeping them synchronized Extensibility –Custom-tailored transcoding module that runs without any external meta-information. –Using a general-purpose transformation engine, such as XSLT, which employs externally provided transformation rules –Task-specific → semantics Roles such as header, auxiliary, and layouter supplement semantics that cannot be fully prescribed in the definitions of Web documents

46 Comparison of transcoding approaches in terms of extensibility