ROT Review and Treatment Susan Fagan, OEI, OIAA. What we will cover today What is ROT EPA.gov ROT objectives Content Type Review Cycle ROT Tools Next.

Slides:



Advertisements
Similar presentations
Planning Your web content
Advertisements

Getting Your Web Site Found. Meta Tags Description Tag This allows you to influence the description of your page with the web crawlers.
USING WORDPRESS. WEEK 1 1.Why WP? 2.Setting Up WP 3.Exploring the Admin screen 4.Page Organization 5.Posting 6.Polls.
Space Missions Can Your Library Automation Software Do This? David Hook MDA
FLEET User Manual July 1, Part One – User Names & Passwords I.User Names & Passwords A. Creating an Account B. Forgot Password C. Updating .
SEO Best Practices with Web Content Management Brent Arrington, Services Developer, Hannon Hill Morgan Griffith, Marketing Director, Hannon Hill 2009 Cascade.
OVERVIEW TEAM5 SOFTWARE The TEAM5 software manages personnel and test data for personal ESD grounding devices. Test and personnel data may be viewed/reported.
Crawling the WEB Representation and Management of Data on the Internet.
Publishing on the WWW Web Site Testing, Promotion and Maintenance.
Crawling The Web. Motivation By crawling the Web, data is retrieved from the Web and stored in local repositories Most common example: search engines,
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice An FAQ on FAQs for Libraries Pamela.
1 Archive-It Training University of Maryland July 12, 2007.
Agenda Overview 2.What is SharePoint? 3.NCDOT Websites 4.Roles 5.Search 6.SharePoint Interface.
Office of Technology Operations & Planning Understanding and Creating Aliases October 27, 2010.
Website Tutorial. Administration  Log on by clicking Login on the footer of almost any page  Your Username is.
An Introduction to Content Management. By the end of the session you will be able to... Explain what a content management system is Apply the principles.
1 EERE Communications EERE Web Coordinators Meeting Conference line: +1 (415) Access Code: Webinar ID: /15/2013.
Web Analytics at EPA Dr Stephen P Gant (CSC)
Unix Command Project Justin Rogers for LS 560 Spring 2015.
Presented by Chad Kafka This Month’s Topic: Wikispaces Advanced Today’s session is an introduction to what a WIKI is and how they can be used in education.
EPA Web Procedures and Standards October 26, 2010.
15 Maintaining a Web Site Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section.
1 How usable is your web site David Strom, MPA Seminar 10/1/98.
© 2011 Delmar, Cengage Learning Chapter 7 Managing a Web Server and Files.
Introduction to SEO and what’s hot in 2012 September 2011.
Drupal Jumpstart Information Systems 337 Prof. Harry Plantinga.
Eurotrace Hands-On The Eurotrace File System. 2 The Eurotrace file system Under MS ACCESS EUROTRACE generates several different files when you create.
Re-Implementing ERM MENA-IUG 5 th Annual Conference 1-2 November 2010.
Best Practices for Coding April 14, Best Practices Keep it simple –Plain Old Semantic HTML (POSH) Don’t recreate styles already in the EPA style.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
WHAT IS A SEARCH ENGINE A search engine is not a physical engine, instead its an electronic code or a software programme that searches and indexes millions.
Google Sitemaps Case Study Eric Papczun SES Chicago Bulk Submit 2.0 December 5 th, 2006.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
0 eCPIC User Training: Resource Library These training materials are owned by the Federal Government. They can be used or modified only by FESCOM member.
Continuing Education UCC Fall 2010 Search Engine Optimization.
1 Crawling The Web. 2 Motivation By crawling the Web, data is retrieved from the Web and stored in local repositories Most common example: search engines,
Module 10 Administering and Configuring SharePoint Search.
Publishing Your Web Pages Ann Emmanuel SIUE Web Administrator
SEO Brisbane. Index Topics Page No SEO Brisbane 3 SEO Brisbane Gets You On First Page Of Google! 4 SEO Brisbane: Without Monthly Fees And Charges 5 SEO.
6 th Annual Focus Users’ Conference 6 th Annual Focus Users’ Conference Import Testing Data Presented by: Adrian Ruiz Presented by: Adrian Ruiz.
DataFlow Diagram – Level 0
ACIS Introduction to Data Analytics & Business Intelligence Database s Benefits & Components.
CharMeck.org Contributer Training SharePoint 2013 Orientation and Basic Training.
| imodules.com Top 10 FAQ in Application Support Kelly Schmiedeler & Amber Quayle.
Overview of 3 rd Party Inspection Program for USTs.
1 NATO UNCLASSIFIED Joe Delorie DSPO NATO Standardization Tasking Review and Analysis Process (STRAP) DoD Standardization Conference March 16-18, 2004.
1 Advanced Archive-It Application Training: Reviewing Reports and Crawl Scoping.
Web Server Security: Protecting Your Pages NOAA OAR WebShop 2001 August 2 nd, 2001 Jeremy Warren.
SEO TIPS. Make the website about one thing  Get Your Domain Name  Choose a Web Host and Sign Up for an Account  Designing your Web Pages  Testing.
Introduction. Internet Worldwide collection of computers and computer networks that link people to businesses, governmental agencies, educational institutions,
SEARCH ENGINE OPTIMIZATION, SECURITY, MAINTENANCE.
SEO PROPOSAL FOR BY CHRIS NDUNGU (mkulima). Sample keyword ranking 1.Improve on keyphrases that are not appearing on page 1 2.Add more keyphrases with.
Technical SEO tips for Web Developers Richa Bhatia Singsys Pte. Ltd.
Search Engine Optimization (SEO) Presentation By Celina Jonesi Small Business Seo – KG Tech.
Search can be Your Best Friend You just Need to Know How to Talk to it IW 306 Ágnes Molnár.
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Information Architecture
3.02H Publishing a Website 3.02 Develop webpages..
Project Objectives Publish to a remote server
DLI Website.
Google Search Appliance: improving the search experience
Editing Your Website on SharePoint 2013
SharePoint Essentials Toolkit
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Gotcha! SharePoint Online Migration Mistakes to Avoid
Maximizing Exposure for Your Non-Profit
Training our 5124 Registered WebCMS Users
4.02 Develop web pages using various layouts and technologies.
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Yale Digital Conference 2019
Presentation transcript:

ROT Review and Treatment Susan Fagan, OEI, OIAA

What we will cover today What is ROT EPA.gov ROT objectives Content Type Review Cycle ROT Tools Next Steps

R.O.T. Redundant: characterized by verbosity or unnecessary repetition in expressing ideas Outdated: no longer in use or fashionable; out-of-date; outmoded; antiquated Trivial: of very little importance or value; insignificant

Redundant

Outdated

Trivial Sept 2009 Conference

Problems with ROT Creates confusion and erodes user confidence – Multiple versions, misleading content Causes problems with search, ours and others Costly to maintain; we pay for static file backup, so pay ROT storage twice Increases Metadata tagging project cost

Why ROT? Why Now? ROT removal is the single biggest thing that can be done to improve EPA web site performance and user experience EPA’s key drivers for ROT elimination – Adoption of a new Web strategy – Goal to tag and migrate static content into a WebCMS, which requires significant cost and effort Removing existing ROT avoids wasted tagging and migration costs EPA’s ROT objectives – Adopt a standard process for identifying and treating ROT on an ongoing basis to maintain site usability and validity

Current EPA.gov – 288 registered TSMSS domains – 555, 791HTML files (as of 4/1/10) – 262,838PDF files (as of 4/1/10) – Estimated $10M+ annual cost – Low user satisfaction based on ACSI scores Vision for new and improved EPA.gov – Streamlined entry-point with hooks into the larger collection – Well-tagged content for improved search and indexing – Better context for information access

Records Frequent Questions about Web Sites and Records

ROT Tools Webman – provides high level overview of file age Maxamine (now Accenture) QA Reports – provides duplicate files report Robots.txt ROTtweiler Reports

Webman Mile high view Total number of files Age of the files Also shows: – Number of external links – Pages with external links – Top level external links (We aren’t supposed to link to the top level page. We are supposed to link to related content.) – OMB requirement to review external links every 3 months

Maxamine Redundant File report – Shows duplicate files and their paths. Helps eliminate duplicates Idiosyncrasies – Our aliases confuse it. Appear as dups but they’re not – /test/ and /test/index.html appear as dups but they’re not

Maxamine Broken Link Report Broken links are ROT. They’re outdated. – Link Integrity Report – broken internal links – External Link Integrity – external broken links Maxamine QA report includes a broken link report. – Run monthly – OMB quarterly external link requirement Regular maintenance makes this an easy task.

Robots.txt This list shows the directories on the epa.gov website that are hidden from our search engine – Why are the files hidden? If you don’t want the search engine to find it, why is it posted on the public web site?

Using the ROTtweiler reports index.htm index.htm There are 3 reports in the ROTtweiler – File Inventory – Orphan – Unloved NOTE: Before taking action on your files, capture a copy of your files

File Inventory Report Listing of all files in a tssms area On initial ROT review by content owners, every file should be reviewed Spreadsheet for File Inventory Report: – Checking the remove column doesn’t do anything. Just a signifier. – To track your files, keep the spreadsheet that you start with. They are updated frequently.

Example Items to check – /test/ directories – Password protected directories – Made up extensions – Files labeled old, bak, backup – Duplicates – Old files – Local copies of Agency images (e.g.; exit and other common images) – Non-web formats (e.g.; PPT)

Items to Check continued Old conference material Local copies of Federal Register notices Local copies of EPA press releases Old newsletters Old calendars Non-EPA material (e.g.; NIH press releases).noindex directories. If you’re hiding it, do you need it?

Note: Things with an EPA report number, you do not want to delete. Make sure NEPIS has a copy.

Orphan Reports Appearing on this report doesn’t guarantee a file is an orphan. Verify that orphan files are not used in your web area. 404 and ThankYou files are orphans but used by your web area. /nerl/ vs /nerl/index.html. Program doesn’t know the difference. On the orphan list if: – It’s in a password protected directory – The crawler crawls the space before the file is moved

Unpopular/Unloved Reports Files that have not had any requests since June 8, Info comes from logs used for Maxamine reports. We do not have older data available. It will accumulate from this point on. Once loved, always loved.

Example: Duplicate Images

Example: Old test directory

Example: Duplicate Content/Out of Date Content

Example: Unloved and Old PPT not a Web format They are duplicate files One is named (old) They are 2004 docs no matter what the file server date says.

Contact: Susan Fagan Phone Judy Dew Phone