The Designing of Web Services to Deliver Web Documents Associated with Historical Links David Chao College of Business San Francisco State University.



Historical Links
The historical links of a web site include the URLs invalidated due to:
– web site reorganization
– document removal, renaming, or relocation
and links to document snapshots:
– a document’s contents as of a specific point in time.

Benefits of Maintaining Historical Links
Support applications that require historical data:
– Trend analysis
Audit web page content as of a specific point in time.
Preserve website content.
Retrieve web pages using old links:
– Bookmarks, favorites, published URLs in books

Related Research
Archiving and preserving websites:
– Internet Archive Wayback Machine
– Date/time-stamped website snapshots; unable to meet users’ need for different snapshot times
Versioning:
– Unable to track organizational changes to a page (renaming, relocation)
Search engines:
– Unable to find unpublished old pages and web page snapshots

Objectives of this research
Track changes to a web page using a log:
– Insertions, deletions, modifications
– Organizational changes (renaming, relocation)
Retrieve web pages using historical links.
Deliver the historical documents using web services.

Effects of web site reorganization and web document changes

Web site action        | Effect on current and historical links
Adding a new document  | A new URL is added to the current links
Modifying a document   | No change to the URL; the old document becomes a snapshot and is archived
Deleting a document    | A current link is deleted; a historical link is added and the document is archived
Renaming a document    | A new URL is added to the current links; the old URL becomes a historical link
Relocating a document  | A new URL is added to the current links; the old URL becomes a historical link
Reorganization         | All affected documents’ new URLs are added to the current links; all affected documents’ old URLs become historical links

The M:M Relationship Between a URL and a Web Document

Logging Scheme
The log, named TemporalURLLog, keeps the history of changes to web documents. It has five fields:
– URL: the document’s URL
– PublishDate: the time the document was published
– DocExpireDate: the time the document expired
– URLExpireDate: the time the URL expired
– NewURL: the document’s new URL, if any
Note: A document may continue to exist under a new URL after its old URL has expired.
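As a rough illustration, one log entry could be modeled as the record below. The field names follow the slide; the Python types, the use of None for null, and the two helper methods are assumptions made for this sketch, not part of the original design.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class TemporalURLLogEntry:
    """One row of TemporalURLLog. Field names follow the slide; types are assumed."""
    url: str
    publish_date: datetime
    doc_expire_date: Optional[datetime] = None  # None while this version is current
    url_expire_date: Optional[datetime] = None  # None while the URL has not expired
    new_url: Optional[str] = None               # set only when the document moved

    def url_is_current(self) -> bool:
        # Hypothetical helper: the URL has not been expired in this entry.
        return self.url_expire_date is None

    def document_is_current(self) -> bool:
        # Hypothetical helper: this version of the document has not expired.
        return self.doc_expire_date is None


# The case described in the note: the old URL has expired,
# but the document itself lives on under a new URL.
moved = TemporalURLLogEntry(
    url="/old/report.html",
    publish_date=datetime(2005, 1, 10),
    url_expire_date=datetime(2006, 3, 1),
    new_url="/reports/annual.html",
)
```

Keeping DocExpireDate and URLExpireDate as separate fields is what lets the log distinguish "the document changed" from "the address changed".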

TemporalURLLog Maintenance Algorithm
New document: An entry is added with its URL and PublishDate; DocExpireDate, URLExpireDate, and NewURL are null.
Deleted document: DocExpireDate and URLExpireDate are set to the time the document is deleted.
Modified document: DocExpireDate is set to the time the document is modified, and URLExpireDate is left null. A new entry is then added with the same URL and with PublishDate set to the modification time; its DocExpireDate, URLExpireDate, and NewURL are null.
Renamed or relocated document: URLExpireDate is set to the time of renaming or relocation, and NewURL is set to the document’s new URL. A new entry is then added with the new URL and with PublishDate set to that same time.
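The four maintenance rules can be sketched as operations on an in-memory list. This is a minimal sketch under stated assumptions: the class and method names (Entry, TemporalURLLog, add, delete, modify, rename) are hypothetical, and a real system would presumably keep the log in a database table.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Entry:
    url: str
    publish_date: datetime
    doc_expire_date: Optional[datetime] = None
    url_expire_date: Optional[datetime] = None
    new_url: Optional[str] = None


class TemporalURLLog:
    """In-memory stand-in for the log (an assumption of this sketch)."""

    def __init__(self) -> None:
        self.entries: List[Entry] = []

    def _open_entry(self, url: str) -> Entry:
        # The open entry is the one whose document and URL are both unexpired.
        for e in reversed(self.entries):
            if e.url == url and e.doc_expire_date is None and e.url_expire_date is None:
                return e
        raise KeyError(f"no current entry for {url}")

    def add(self, url: str, when: datetime) -> None:
        # New document: only URL and PublishDate are filled in.
        self.entries.append(Entry(url, when))

    def delete(self, url: str, when: datetime) -> None:
        # Deleted document: both expiry dates are set to the deletion time.
        e = self._open_entry(url)
        e.doc_expire_date = when
        e.url_expire_date = when

    def modify(self, url: str, when: datetime) -> None:
        # Modified document: expire the old version, keep the URL, open a new entry.
        e = self._open_entry(url)
        e.doc_expire_date = when
        self.entries.append(Entry(url, when))

    def rename(self, old_url: str, new_url: str, when: datetime) -> None:
        # Renamed or relocated document: expire the old URL and point to the new one.
        e = self._open_entry(old_url)
        e.url_expire_date = when
        e.new_url = new_url
        self.entries.append(Entry(new_url, when))


log = TemporalURLLog()
t0, t1, t2, t3 = (datetime(2006, 1, day) for day in (1, 2, 3, 4))
log.add("/p5", t0)
log.modify("/p5", t1)
log.rename("/p5", "/p7", t2)
log.delete("/p7", t3)
```

After these four calls the log holds three entries: the first version of /p5 (expired as a document at t1), the second version of /p5 (URL expired at t2, pointing to /p7), and /p7 (deleted at t3).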

Archiving Scheme
Deleted document: The deleted document is saved in the Archive with URL + PublishDate as its file name.
Modified document: The old version is saved in the Archive with URL + PublishDate as its file name.
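The slide does not say how URL and PublishDate are combined into a file name. The sketch below assumes one possible encoding (percent-encoding the URL and appending a compact timestamp); the function names, the separator, and the timestamp format are all illustrative choices, not the author's.

```python
import tempfile
from datetime import datetime
from pathlib import Path
from urllib.parse import quote


def archive_name(url: str, publish_date: datetime) -> str:
    """Combine URL + PublishDate into one file-system-safe name (assumed encoding)."""
    # safe='' also encodes '/', so the URL cannot create subdirectories.
    return f"{quote(url, safe='')}_{publish_date:%Y%m%dT%H%M%S}"


def archive_document(archive_dir: Path, url: str, publish_date: datetime,
                     content: bytes) -> Path:
    """Save a deleted document, or the old version of a modified one, in the Archive."""
    target = archive_dir / archive_name(url, publish_date)
    target.write_bytes(content)
    return target


# Usage: archive the version of /docs/p2.html that was published on 2006-01-01.
archive_dir = Path(tempfile.mkdtemp())
saved = archive_document(archive_dir, "/docs/p2.html",
                         datetime(2006, 1, 1), b"<html>old</html>")
```

Because a URL can carry several versions over time, the PublishDate suffix is what keeps the archived copies of the same URL from overwriting each other.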

With the scheme we can determine:
– A historical link P1 is now renamed to P8, and a current link P1 points to a new document.
– A URL P2 valid between T0 and T1 is deleted, and the document it pointed to is in the Archive with the name P2T0.
– A URL P3 has been modified repeatedly and is eventually deleted. All documents associated with P3 can be found in the Archive.
– An old URL P5 is now renamed to P7. It has been modified at T3, and a copy of its snapshot can be found in the Archive with the name P5T1.
– A URL P12 has never existed in the web site.

Design of Web Services to Retrieve Web Documents
Scenario 1: The user submits a URL and gets the current version of the document, but would like to view its previous versions.
Scenario 2: The user submits a URL and gets a file-not-found error. The service should:
– check whether the document ever existed in this web site
– retrieve the document associated with the invalid URL if it now has a new URL
– retrieve deleted documents from the Archive.
Scenario 3: The user submits a URL and gets an unrelated document. Under this scenario, the current URL is associated with a different document.

Web Service for Scenario 1
This web service offers two methods:
– 1. RetrieveSnapshotAsOf method: This method takes a valid URL and the snapshot valid time as inputs and returns the link to the snapshot valid at the specified time.
– 2. RetrieveAllSnapshots method: This method retrieves the links to all snapshots of a current document.
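The two Scenario 1 methods can be approximated as lookups over the log. In this sketch the log is a plain list, the link format returned by snapshot_link is hypothetical, and only versions stored under the submitted URL are considered (versions published under an earlier name are ignored for brevity).

```python
from datetime import datetime
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    url: str
    publish_date: datetime
    doc_expire_date: Optional[datetime]  # None for the current version


def snapshot_link(e: Entry) -> str:
    # Hypothetical link format: archive<URL>@<PublishDate>
    return f"archive{e.url}@{e.publish_date:%Y%m%d}"


def retrieve_snapshot_as_of(log: List[Entry], url: str, as_of: datetime) -> Optional[str]:
    """Return the link to the version of `url` that was live at `as_of`, if any."""
    for e in log:
        if e.url != url or e.publish_date > as_of:
            continue
        if e.doc_expire_date is None:
            return url  # the current version was already published at as_of
        if as_of < e.doc_expire_date:
            return snapshot_link(e)
    return None


def retrieve_all_snapshots(log: List[Entry], url: str) -> List[str]:
    """Return links to every archived (expired) version of `url`."""
    return [snapshot_link(e) for e in log
            if e.url == url and e.doc_expire_date is not None]


def day(n: int) -> datetime:
    return datetime(2006, 1, n)


log = [
    Entry("/p3", day(1), day(10)),
    Entry("/p3", day(10), day(20)),
    Entry("/p3", day(20), None),
]
mid_january = retrieve_snapshot_as_of(log, "/p3", day(15))
all_links = retrieve_all_snapshots(log, "/p3")
```

Here mid_january is the link to the second version (published on day 10), and all_links lists the two expired versions in publication order.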

Web Service for Scenario 2
This web service contains four methods:
– 1. IsURLEverExist method
– 2. IsURLValidAsOf method
– 3. FindCurrentURL method: This method traces the URL’s changes to locate the current URL if the document is still published.
– 4. FindLinkToOldDocument method: This method retrieves the old document associated with a historical link valid as of a user-specified time.
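One way FindCurrentURL might trace a URL's changes is to follow the chain of log entries until it reaches one that is still open. This is a sketch, not the author's implementation: it assumes the successor of an entry is the entry published at the moment the previous one expired, and it starts from the most recent entry logged for the submitted URL.

```python
from datetime import datetime
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    url: str
    publish_date: datetime
    doc_expire_date: Optional[datetime]
    url_expire_date: Optional[datetime]
    new_url: Optional[str]


def find_current_url(log: List[Entry], url: str) -> Optional[str]:
    """Follow modifications and renames from `url` to the document's current URL."""
    candidates = [e for e in log if e.url == url]
    if not candidates:
        return None  # the URL never existed in this web site
    entry = max(candidates, key=lambda e: e.publish_date)
    seen = set()
    while entry is not None and entry not in seen:
        seen.add(entry)  # guards against a malformed, cyclic log
        if entry.doc_expire_date is None and entry.url_expire_date is None:
            return entry.url  # still published
        if entry.new_url is not None:
            # Renamed or relocated: continue at the new URL.
            next_url, next_time = entry.new_url, entry.url_expire_date
        elif entry.url_expire_date is None:
            # Modified in place: continue with the next version at the same URL.
            next_url, next_time = entry.url, entry.doc_expire_date
        else:
            return None  # deleted: no current URL
        entry = next((e for e in log
                      if e.url == next_url and e.publish_date == next_time), None)
    return None


t0, t1, t2 = datetime(2006, 1, 1), datetime(2006, 1, 5), datetime(2006, 1, 10)
log = [
    Entry("/p5", t0, t1, None, None),    # first version, later modified
    Entry("/p5", t1, None, t2, "/p7"),   # second version, later renamed
    Entry("/p7", t2, None, None, None),  # current
    Entry("/p2", t0, t1, t1, None),      # deleted
]
current = find_current_url(log, "/p5")
```

In this example current is "/p7", while the deleted /p2 and the never-logged /p12 both yield None. A URL that was later reused for a different document would need an extra "as of" time to pick the right starting entry, which this sketch omits.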

Web Service for Scenario 3
This web service contains three methods:
– 1. IsURLValidBefore method
– 2. IsURLValidAsOf method
– 3. FindLinkToOldDocument method: This method retrieves the old document associated with a historical link.
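The slides name IsURLValidAsOf without defining it. A plausible reading, sketched below, is that a URL is valid at a given time if some log entry for it was published by then and had not yet been closed. The interval rule used here (an entry ends at its URLExpireDate or, failing that, at its DocExpireDate) is an assumption of this sketch.

```python
from datetime import datetime
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    url: str
    publish_date: datetime
    doc_expire_date: Optional[datetime]
    url_expire_date: Optional[datetime]
    new_url: Optional[str]


def is_url_valid_as_of(log: List[Entry], url: str, as_of: datetime) -> bool:
    """True if `url` pointed to a published document at time `as_of`."""
    for e in log:
        if e.url != url or as_of < e.publish_date:
            continue
        # A modified entry keeps URLExpireDate null; its successor entry covers
        # the later period, so the entry's own interval ends at DocExpireDate.
        end = e.url_expire_date if e.url_expire_date is not None else e.doc_expire_date
        if end is None or as_of < end:
            return True
    return False


def day(n: int) -> datetime:
    return datetime(2006, 1, n)


log = [
    Entry("/p5", day(1), day(5), None, None),    # modified on day 5
    Entry("/p5", day(5), None, day(10), "/p7"),  # renamed to /p7 on day 10
    Entry("/p7", day(10), None, None, None),     # current
]
```

With this log, /p5 is valid on day 3 and day 7 but not on day 12, and /p7 is valid only from day 10 onward.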

Summary
This paper makes two contributions:
– 1. It presents a logging and archiving scheme to track a document’s history of changes.
– 2. It designs web services that let users retrieve documents associated with historical links.