Heritrix 3: librarian features BnF proposal March 2015.

Slides:



Advertisements
Similar presentations
Setup MOC Auto Reports The MOC Auto Reports provides a method to notify people about the status of MOCs. In some jurisdictions, this step is required.
Advertisements

AIMSweb Benchmark Online Training For AIMSweb Teacher Users
Welcome to WebCRD.
RightNow 8 -- Adding a new report: New > Report: ORAnalytics > Reports > New Report
RightNow February Adding a New Report: RN icon > Report: OR Analytics > Reports > New Report
“The Honeywell Web-based Corrective Action Solution”
Task: Create a Non Pre-work Additional Pay Request To see this in PennWorks...click herehere Task Definition: Enter additional pay data and attach documents.
Enterprise Portal Training Creating Portal Pages Click on arrow to go forward or back.
Lesson 30: Maintaining a Database. Learning Objectives After studying this lesson, you will be able to:  Change the layout of a table by adjusting column.
Status and plans for the H3 release NetarchiveSuite 5.0.
Guide to MCSE , Enhanced 1 Activity 14-1: Browsing Security Templates Objective: To become familiar with built-in security templates Start  Run.
Educational Measurement and School Accountability Directorate Better informed, better positioned, better outcomes.
Managing User Settings with Group Policy
UNESCO ICTLIP Module 4. Lesson 3 Database Design, and Information Storage and Retrieval Lesson 3. Information storage and retrieval using WinISIS.
Proxy Self-editing design review Oct 20, Definition  Proxy self-editing is when a VIVO user has the authority to do "self-editing" on profile.
1 PER for M Prototype December 14, PER for M Disclaimer.
4.3 Searching for Patient Information 4-12 Medisoft offers two options for conducting searches for information: 1.Search for and Field boxes 2.Locate buttons.
Quick-Demo Tour Video. This demonstration will show basic zzusis portal functions and navigation.
South Dakota Library Network MetaLib Management Basics IP Ranges / Proxy Servers South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD.
How To: Create an Additional Pay Request (Individual) To see this in PennWorks...click herehere Task Definition: Enter Additional Pay data and attach documents.
1. 2 LXU800 User’s Manual 1.Installation – Windows XP UI Features Introduction Data Connection & Disconnection.
Deployment Management The following screens demonstrate how to: 1. Access and view deployments 2. Create a new local deployment 3. Create and modify a.
Snippet Management The following screens demonstrate how to: 1. Access and view snippets 2. Create a local standard snippet, or a local class snippet 3.
Login Screen This is the Sign In page for the Dashboard Enter Id and Password to sign In New User Registration.
Presented By: Product Activation Group Syndication.
Practice Insight Instructional Webinar Series Reporting
Chapter 3 Maintaining a Database
Create / Edit Competence Assessment Role: Employee.
Antalis-HQ USER GUIDE. Antalis, Europe’s leading distributor of paper, packaging solutions and visual communication products presents you its user web.
Oracle E-Business Suite Order Management: Presenting the HTML and Mobile User Experience Durgaprasad Bodapati Director, Product Management Bhavana Sharma.
— Customer Success Team July / 2015 Remedyforce Enablement Kit Migration from Remedyforce Self-Service 1.0 to 2.0.
SPSA Tool User Manual. Contents About the SPSA Tool……….…………………………………………………………………………… Login…………………………………………………………………………………………..……….……..……..8 Home.
For Users : Username & Password for logging in to system : CME proposal to be added in system For System Configuration : Initial budget or latest updated.
ERA Manager Training December 19, Propriety and Confidential. Do not distribute. 2 ERA Manager Overview In an effort to reduce the need for Providers,
Tags Pages 63 to 114 in your workbook. Tag Browser Review of the communication chain Polling Driver concepts Tag Browser in detail – Filtering – The tag.
1 EDIT 2013 User Interface Enhancements European Commission – Eurostat.
Division of Alcoholic Beverages and Tobacco Liquor Distiller’s and Rectifier’s Monthly Report.
SELF-SERVICE SELF-SERVICE EMPLOYER REGISTRAIONS AND SERVICES ONE-STOP MANAGEMENT INFORMATION SYSTEM (OSMIS)
Curator wishes for the roadmap november 2011 updates.
Warehouse Report. Log into EDS using your Address/User Id and Password. If you have forgotten your password, click on the Forgot Password? link.
Select Reports Console. Type in Progress, Click Search.
Quotation with Follow-up 1.0 THIS ADD-ON IS VERY USEFUL FOR BUSINESSES WHO ISSUE SALES ORDER TO THEIR CUSTOMERS ALONG WITH QUOTATION. THIS ADD-ON HELPS.
WaveMaker Visual AJAX Studio 4.0 Training Basics: Building Your First Application Binding Basics.
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
1 Create a Basic Self Service Layout  Log into the BI Portal with your Berkeley Lab Identity.
DRAFT ROSS Version /18/13 BASIC ROSSD-SL BASIC UNIT 2 ROSS USER BASICS.
Classifications Schemes and Class Scheme Items in the Curation Tool: Interface Design Audrey Lipps, User-Centered Design
WikiPlus Configurations Configure WikiPlus elements to your needs.
Lecture Capture and. Goal Link to D2L D2L Website
Using Project Portfolio Tool ‘PMFolio’ by AlNik Solutions, LLC Copyright 2011 ©
CPSC 203 Introduction to Computers T97 By Jie (Jeff) Gao.
2015 NetarchiveSuite Workshop Eesti Rahvusraamatukogu Tallinn, Estonia January
CLICK2EXPORT EXPORT TOOL FOR DYNAMICS CRM REPORTS.
1 Lesson 14 Sharing Documents Computer Literacy BASICS: A Comprehensive Guide to IC 3, 4 th Edition Morrison / Wells.
THIS IS A DEFAULT/ GENERIC TEMPLATE. CHANGE THE BACKGROUND COLOR AND ADD YOUR OWN PICTURES TO MATCH YOUR PRESENTATION. (Insert Title Here) (Insert your.
Orders and Invoices Supply Chain Platform: Rolls-Royce Training for Indirect Suppliers March 2016.
Invoices and Service Invoices Training Presentation for Raytheon Supply Chain Platform (RSCP) April 2016.
HTBN Batches These slides are intended as a starting point for further discussion of how eTime might be extended to allow easier processing of HTBN data.
Invoices Training Presentation for Supply Chain Platform: BAE Systems May 2015.
Division of Alcoholic Beverages and Tobacco Beer Manufacturer’s Monthly Report.
MANAGING EMPLOYER REGISTRATIONS AND SERVICES IN OSMIS ONE-STOP MANAGEMENT INFORMATION SYSTEM (OSMIS)
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
Boeing 787 SCMP Training June 2016
BnF - DLWEB - Umbra & Heritrix 3
BnF experiences in using NAS 5 And Heritrix 3
Software Testing With Testopia
Navigation Details Boeing 787 SCMP March 2018.
iCIMS 17.1 Release: Highlights
Update Budget Steps Screenshots Purpose:
Task: Create a Non Pre-work Additional Pay Request
Presentation transcript:

Heritrix 3: librarian features BnF proposal March 2015

Context Follow up of our NetarchiveSuite workshop in Tallinn: – Identified work packages: – tests – template migration – implementation of important but missing curator features for common operations in Heritrix 3 BnF will further describe use cases, share them with the community for feedback and implement the following features as a minimal Heritix UI add-on

From H1…

… to H3

Common curator operations Search crawl.log Add filter on current job (job configuration) Change domains/hosts budget (job configuration) View or delete frontier URIs

Search crawl.log (NASC61) Add a page with the same layout but with 2 additional form fields: – Regular expression: – Show matches: 1000 (default # of matching URIs) – Action => Display URIs (reversed order by default) Possibility to refresh display (F5)

Draft UI for « Search crawl log » Display URIs Status + job ID Home Forward Reversed Matching lines: 1000 Lines: displaying out of 12345

Common curator operations Search crawl.log Add filter on current job (job configuration) Change domains/hosts budget (job configuration) View or delete frontier URIs

Add filter on current job (DecideRule) (NASC60) Not necessary to view active filters that were included from job start (NASC59) Add a page containing a rejectTemporarily area working with the following parameters: – Decision: REJECT – List-logic: OR – Regexp-list : empty at job start, free textarea which can be manually edited and sorted (440 px wide, 20 lines) – Action => Save: save current filters and activate them for current job

Draft UI for « Add filter on current job » Status + job ID Home All URIs matching any of the following regular expressions will be rejected from the current job. Regular expressions: Save

Common curator operations Search crawl.log Add filter on current job (job configuration) Change domains/hosts budget (job configuration) View or delete frontier URIs

Change domains/hosts budget Works with queue-total-budget and quota- enforcer systems Add a page containing: – a list of domains/hosts (in domain alphabetical order) – their associated budget value (which can be edited) – only those which budget is not set by default – and a form field to add a new domain/host

Draft UI for « Change domains/hosts budget » Status + job ID Home Save Budget defined in job configuration: queue-total-budget of URIs. bnf.fr ina.fr cnc.fr Budgets of following domains/hosts have been changed in the current job: New domain/host: toto.fr – Save

Common curator operations Search crawl.log Add filter on current job (job configuration) Change domains/hosts budget (job configuration) View or delete frontier URIs

View or delete frontier URIs (NASC56 + NASC57 + NASC58) Add a page containing 2 form fields: – Regular expression: – Show matches: 1000 (default # of matching URIs) – Action A => Display URIs: displays the matching URIs, the # of matching URIs and gives the possibility to view the next bloc of matching URIs – Action B => Delete URIs: delete matching URIs and indicates the # of matching URIs

Draft UI for « View or delete frontier URIs » Status + job ID Home URIs: displaying out of Matching lines: 1000 URIs: displaying out of Pause the job first to view frontier

search Job configuration add filter – change budget

Comparaison with BAnQ