The Design Of A Web Document Snapshots Delivery System David Chao College of Business San Francisco State University.

Slides:



Advertisements
Similar presentations
Using the Self Service BMC Helpdesk
Advertisements

WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
“The Honeywell Web-based Corrective Action Solution”
Idaho National Engineering and Environmental Laboratory What is a Framework? Web Service? Why do you need them? Wayne Simpson November.
EndNote. What is EndNote:  EndNote is referencing software that enables you to create a database of references from your readings. Your database of references.
Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
Data Structures: A Pseudocode Approach with C
Tracking a Web Site’s Historical Links with Temporal URLs David Chao College of Business San Francisco State University.
Units can enter ranks, merit badges, and awards online.
Guide to Oracle10G1 Introduction To Forms Builder Chapter 5.
Maintenance Modifying the data –Add records –Delete records –Update records Modifying the design –Add fields into tables –Remove fields from a table –Change.
1 Chapter 12 Working With Access 2000 on the Internet.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
Transaction Management and Concurrency Control
A Guide to Oracle9i1 Introduction To Forms Builder Chapter 5.
XP Tutorial 9 New Perspectives on JavaScript, Comprehensive1 Working with Cookies Managing Data in a Web Site Using JavaScript Cookies.
Introduction To Form Builder
SiS Technical Training Development Track Technical Training(s) Day 1 – Day 2.
The Designing of Web Services to Deliver Web Documents Associated with Historical Links David Chao College of Business San Francisco State University.
Figure 1. Hit analysis in 2002 of database-driven web applications Hits by Category in 2002 N = 73,873 Results Reporting 27% GME 26% Research 20% Bed Availability.
Meta Tags What are Meta Tags And How Are They Best Used?
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
Christopher M. Pascucci Basic Structural Concepts of.NET Browser – Server Interaction.
Section 13.1 Add a hit counter to a Web page Identify the limitations of hit counters Describe the information gathered by tracking systems Create a guest.
Refworks Presented by Margaret Clark, Reference Librarian FSU College of Law Library September 20, 2005.
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
Welcome 2013 User Group Meeting Voting on New Features.
JavaScript and The Document Object Model MMIS 656 Web Design Technologies Acknowledgements: 1.Notes from David Shrader, NSU GSCIS 2.Some material adapted.
FireRMS SQL Audit, Archiving & Purging Presented by Laura Small FireRMS Quality Assurance.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
NAMS Account Activation Training. 2 What is NAMS? The NASA Account Management System is NASA’s centralized process for requesting and maintaining accounts.
The purpose of this Software Requirements Specification document is to clearly define the system under development, that is, the International Etruscan.
Chapter 33 CGI Technology for Dynamic Web Documents There are two alternative forms of retrieving web documents. Instead of retrieving static HTML documents,
Server-side Scripting Powering the webs favourite services.
4-1 INTERNET DATABASE CONNECTOR Colorado Technical University IT420 Tim Peterson.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
9 Chapter Nine Compiled Web Server Programs. 9 Chapter Objectives Learn about Common Gateway Interface (CGI) Create CGI programs that generate dynamic.
© 2007 by Prentice Hall 1 Introduction to databases.
INTRODUCTION TO JAVASCRIPT AND DOM Internet Engineering Spring 2012.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
JavaScript, Fourth Edition Chapter 5 Validating Form Data with JavaScript.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CSC 2720 Building Web Applications Server-side Scripting with PHP.
NFIRS Data Entry Browser Interface (DEBI)
Dynamic web content HTTP and HTML: Berners-Lee’s Basics.
EndNote. What is EndNote? EndNote is referencing software that enables you to create a database of references from your readings.
ECDL. Word processing Work with documents and save them in different file formats Choose built-in options such as the Help function to enhance productivity.
3 Copyright © 2004, Oracle. All rights reserved. Working in the Forms Developer Environment.
The Management of a Website’s Historical Resources David Chao College of Business San Francisco State University.
Microsoft Access is a database program to manage sort retrieve group filter for certain records.
IS-907 Java EE World Wide Web - Overview. World Wide Web - History Tim Berners-Lee, CERN, 1990 Enable researchers to share information: Remote Access.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
8 Chapter Eight Server-side Scripts. 8 Chapter Objectives Create dynamic Web pages that retrieve and display database data using Active Server Pages Process.
Software Configuration Management SEII-Lecture 21
Since you’ll need a place for the user to enter a search query. Every form must have these basic components: – The submission type defined with the method.
Recent CMA Enhancements Java-based Scroller Component Sample Layout Fixed problem with Component Modifier when previewing Select List components Fixed.
Module 6: Administering Reporting Services. Overview Server Administration Performance and Reliability Monitoring Database Administration Security Administration.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
CHAPTER 7 LESSON C Creating Database Reports. Lesson C Objectives  Display image data in a report  Manually create queries and data links  Create summary.
1 Chapter 4 Unordered List. 2 Learning Objectives ● Describe the properties of an unordered list. ● Study sequential search and analyze its worst- case.
Module 5: Managing Content. Overview Publishing Content Executing Reports Creating Cached Instances Creating Snapshots and Report History Creating Subscriptions.
FHA Training Module 1 This document reflects current policy related to this topic. Its content is approved for use in all external and internal FHA-related.
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
Preface IIntroduction Course Objectives I-2 Course Content I-3 1Introduction to Oracle Reports Developer Objectives 1-2 Business Intelligence 1-3 Enterprise.
Chapter 13 Managing Transactions and Concurrency Database Principles: Fundamentals of Design, Implementation, and Management Tenth Edition.
EndNote by: fatimah alotaibi.
Programming Logic and Design Fourth Edition, Comprehensive
Tutorial 7 – Integrating Access With the Web and With Other Programs
New Technologies for Storage and Display of Meteorological Data
Presentation transcript:

The Design Of A Web Document Snapshots Delivery System David Chao College of Business San Francisco State University

What Is A Web Document Snapshot? A web document snapshot is the state of a web document at a point in time (snaptime).

Applications Of Web Document Snapshots It enables an organization to audit a web document’s contents in the past. Perform business analyses with historical information recorded in it. It is also an archived copy of a web document when it is changed.

The State Of A Web Document At The Snaptime The code that creates the contents The rendering of the web document as displayed with a browser: –Dynamic web documents

Factors Affecting The State Of A Web Document Web document code The state of internal resources it references: –Internal resources are files managed by a web site and are available in creating the web site’s contents. –Images, style sheet, components, script files, databases, etc. The state of external resources it references: –External resources are files not managed by the web site but can be referenced in creating the web site’s contents. Web site host environment variables: –System clock

Four Levels Of Web Document Snapshot Level 1 snapshot: A web document snapshot is the state of web document code at snaptime. –Creating level 1 snapshot enables a web site to trace the changes to the web document code over time. Level 2 snapshot: A level 2 snapshot is a level 1 snapshot with the additional requirement that all the internal resources it references are at least level 1 snapshots at the same snaptime. –Referencing database snapshots

Level 3 snapshot: A level 3 snapshot is a level 2 snapshot with the additional requirement that all the external resources it references are at least level 2 snapshots at the snaptime. Level 4 snapshot: A level 4 snapshot is a level 3 snapshot with the additional requirement that all the web site host’s environment variables are reset to their values at snaptime.

Objective Of This Research The level 3 and level 4 snapshots involve resources that are not managed by the web site and it’s difficult, if not impossible, for the web site to keep track of changes to these resources. This research develops a web document snapshot management system to deliver web documents’ level 1 and level 2 snapshots.

The Design Of The Web Document Snapshot Delivery System The system consists of two components: – Database Snapshot Manager for maintaining database snapshots –Web Document Snapshot Manager for maintaining web document snapshots. The system is designed to deliver snapshots with any snaptime requirement, and snapshots are created only when requested.

Database Snapshot Manager The objective of this module is to provide a database snapshot at any snaptime requested by users. This requires recording all updates in a log. The log uses time stamp to record update time, and use flags to indicate deletions and insertions where a modification is treated as the deletion of the old version followed by an insertion of the new version.

Database Snapshot Management Defining snapshots: CREATE SNAPSHOT snapshotname AS query AS OF snaptime Refreshing snapshots: REFRESH SNAPSHOT snapshotname AS OF new snaptime

Web Document Snapshot Manager The objective of the Web Document Snapshot Manager is to generate level 2 snapshots for all internal non-database files.

The M:M Relationship Between A URL And A Web Documdent

Historical Links The historical links of a web site include the URLs invalidated due to: – web site reorganization – document removal, renaming or relocation and links to document snapshots: –document’s contents as of a specific point in time.

Logging Scheme The log, named TemporalURLLog, is designed to keep the history of changes to web documents. It has four fields: –URL: document’s URL –PublishDate: document publish time –ExpireDate: document URL expire date –NewURL: document’s new URL if any

TemporalURLLog Maintenance Algorithm New document: An entry is entered with its URL and PublishDate; ExpireDate and NewURL are null. Deleted document: The ExpireDate of the document’s entry is changed to its deletion time. Modified document: The ExpireDate of the document’s entry is changed to its modification time and a new entry is entered with its URL and modification time as PublishDate; ExpireDate and NewURL are null. Renamed document: The ExpireDate of the document’s entry is changed to the time it is renamed and the NewURL is changed to its new URL. Then, it adds a new entry with its new URL and the PublishDate is set to the time the document is renamed.

Archiving Scheme Deleted document: The deleted document in the Archive with URL + PublishDate as file name. Modified document: The old version is saved in the Archive with URL + PublishDate as file name.

1.A URL P2 valid between T0 and T1 is deleted 2. A URL P3 has been modified repeatedly and is eventually deleted. 3. An old URL P5 is now renamed to P7. It has been modified at T 3 4. The log is able to determine that a historical link P1 is now renamed to P8. 5. A URL P12 has never existed in the web site. With the scheme we can determine:

Patterns of Log Entries for a URL 1. If a URL has a log entry with a non-null PublishDate and null ExpireDate field then it is a current URL; such as P6 in figure If all entries of a URL have a non-null PublishDate, ExpireDate and null NewURL field, then this URL is deleted from the web site; such as P3 in Figure If a URL has a log entry with a non-null NewURL field, then it has been renamed, and the log entries for the new URL may again have these three patterns of changes.

Backward Search For A Document’s Snapshot At A Specific Time This algorithm processes log entries backward starting from a document’s current entry to trace back its changes in order to locate the snapshot at time T. If the current URL’s PublishDate is less than T, then the current document itself is its snapshot at T. Otherwise the backward search starts.

An entry’s predecessor has one of the following properties: 1. If the entry has a null NewURL then its successor must have the same URL and the successor’s PublishDate must equal the entry’s ExpireDate. If no such successor is found, then this entry must have been generated due to a deletion. 2. If the entry has a non-null NewURL then it must have a successor with a URL equal to the NewURL and the PublishDate equals to the entry’s ExpireDate. Renaming or relocation must have generated the successor.

Retrieving Web Document P7’s Snapshot at time=T 2 Entries processed: –(P5, T 1, T 3, Null) –(P5, T 3, T 4, P7) –(P7, T 4, Null, Null) Document retrieved: –Archive(P5 + T 1 )

Retrieving Web Document P8’s Snapshot at time =T 0 Entries processed: –(P1, T 0, T 1, P4) –(P4, T 1, T 4, P8) –(P8, T 4, Null, Null) –P8 has been renamed at T 1 and T 4. It has not been modified since T 0. Document retrieved: –P8

Requesting Web Document Snapshot with Temporal URL A temporal URL is a URL submitted with temporal requirements of which the documents associated with the URL must meet. Entering temporal requirements with QueryString Example: –URL?SnapshotAsOf=date

Requesting Web Document Snapshot With A Web Service A web service is an application logic accessible via the Internet. The snapshot-retrieving algorithm can be implemented as a web service that takes URL and snaptime as its inputs and returns the document snapshot as output.

Creating A Web Document Snapshots Management Site This is a site designed to manage web document snapshots and handle the requests for snapshot. Its objectives are: –(1) maintaining web document snapshots. –(2) providing interface for users to enter request for snapshots. –(3) educating users about the web document snapshot systems.

Summary This paper has two contributions: –1. It presents an analysis for defining four levels of snapshots for a web document. –2. It presents a design of the web document snapshot management system that is capable of dynamically creating web document snapshots. Future research: –Designing a web document snapshots management site.