Reaching out… through IT R Document Store - Pilot 001 Presented to.


© by HTC Global Services, Inc. Do not copy or distribute

Objectives
- Index 5M+ MARC XML records
- Demonstrate the following features
  - Full-text search
  - Advanced search (fielded search)
  - Search results pagination
  - Sub-second query time on commercial hardware
- Set up Jackrabbit repository (MySQL persistent store)
- Load up to 5000 documents
- Analyze and optimize loading & storage
- Generate UUID
- Check-in, check-out and versioning
- Establish links between documents
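The search objectives above (full-text search, fielded search, and results pagination over MARC XML) can be illustrated with a minimal in-memory sketch. It is not the pilot's implementation: the slides do not name the search engine used. The sample records, the `FIELDS` mapping, and the function names `build_index` and `search` are all assumptions made for illustration; only the MARCXML namespace and the meaning of tags 100 (author) and 245 (title) come from the MARC standard.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = {"marc": "http://www.loc.gov/MARC21/slim"}  # standard MARCXML namespace

# Hypothetical two-record sample; the pilot's real data is not shown in the slides.
SAMPLE = """<collection xmlns="http://www.loc.gov/MARC21/slim">
  <record>
    <controlfield tag="001">rec1</controlfield>
    <datafield tag="100" ind1="1" ind2=" "><subfield code="a">Melville, Herman</subfield></datafield>
    <datafield tag="245" ind1="1" ind2="0"><subfield code="a">Moby Dick, a novel</subfield></datafield>
  </record>
  <record>
    <controlfield tag="001">rec2</controlfield>
    <datafield tag="100" ind1="1" ind2=" "><subfield code="a">Dickens, Charles</subfield></datafield>
    <datafield tag="245" ind1="1" ind2="0"><subfield code="a">Great Expectations, a novel</subfield></datafield>
  </record>
</collection>"""

# MARC tag -> illustrative field name used for fielded search (an assumption).
FIELDS = {"100": "author", "245": "title"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(marcxml):
    """Build {field: {term: set(record_ids)}}; the 'all' field backs full-text search."""
    index = defaultdict(lambda: defaultdict(set))
    root = ET.fromstring(marcxml)
    for record in root.findall("marc:record", NS):
        rec_id = record.find("marc:controlfield[@tag='001']", NS).text
        for datafield in record.findall("marc:datafield", NS):
            field = FIELDS.get(datafield.get("tag"))
            for sub in datafield.findall("marc:subfield", NS):
                for term in tokenize(sub.text or ""):
                    index["all"][term].add(rec_id)
                    if field:
                        index[field][term].add(rec_id)
    return index


def search(index, query, field="all", page=1, page_size=10):
    """Return one page of record ids that contain every query term in the given field."""
    terms = tokenize(query)
    if not terms:
        return []
    hits = set.intersection(*(index[field].get(t, set()) for t in terms))
    ordered = sorted(hits)  # stable order so pages do not overlap
    start = (page - 1) * page_size
    return ordered[start:start + page_size]


if __name__ == "__main__":
    idx = build_index(SAMPLE)
    print(search(idx, "moby"))                          # full-text search
    print(search(idx, "charles", field="author"))       # fielded search
    print(search(idx, "novel", page=2, page_size=1))    # second page of results
```

A production index at the 5M+ record scale would need on-disk storage and relevance ranking, which this sketch deliberately leaves out.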

Environment
- Hardware
  - CPU: Quad 2.93 GHz
  - Memory: 16 GB
  - Storage: 500 GB
- Software
  - 64-bit Windows 7 OS

Content Set

Data Type               # Records
Bibliographic (MARC)    ~5.5M
Authority (EAC)         ~100

Sample Document

Performance Metrics
- Indexing time for ~5.5M records is 1 hour and 42 minutes
- Index size for these records is 14 GB
- Extrapolated indexing time for 10M records is ~3 hours
- Loading time for 3569 records is 112 seconds
- Extrapolated loading time for 6M records is 55 hours (~2.31 days)
- Average response time for full-text search is 69 milliseconds
- Average response time for advanced search (3+ fields) is 200 milliseconds

Note: Basic setup with minimal or no tuning
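The extrapolated figures above can be checked with a simple linear scale-up, assuming the time per record stays constant as volume grows (an assumption; the slides do not say how the extrapolations were derived). Under that assumption, indexing 10M records comes to roughly 3.1 hours, in line with the reported ~3 hours, while loading 6M records comes to roughly 52 hours, slightly below the reported 55 hours. The gap may reflect overhead or a different sample that the slides do not describe. The function name `extrapolate_seconds` is illustrative.

```python
def extrapolate_seconds(sample_records, sample_seconds, target_records):
    """Linear extrapolation: assumes constant time per record."""
    return sample_seconds / sample_records * target_records


# Indexing: ~5.5M records took 1 hour 42 minutes (102 minutes).
index_hours = extrapolate_seconds(5_500_000, 102 * 60, 10_000_000) / 3600

# Repository loading: 3569 records took 112 seconds.
load_hours = extrapolate_seconds(3569, 112, 6_000_000) / 3600

if __name__ == "__main__":
    print(f"Indexing 10M records: ~{index_hours:.1f} hours")
    print(f"Loading 6M records: ~{load_hours:.1f} hours (~{load_hours / 24:.2f} days)")
```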

Work in Progress
- Faceted navigation and search suggest
- Simultaneously index and search multiple document types
- Index and search new document types by configuration
- Batch and online management (add, update, delete indexes)
- Repository document load, 5M documents
- Discovery and Repository integration
- Bulk and online operations (load, update)
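Faceted navigation, the first item in the list above, amounts to counting the distinct values of selected fields across a result set and letting the user narrow the results by one of those values. The sketch below shows the idea on a handful of hypothetical search hits. The field names (`format`, `language`), the sample data, and the function names are assumptions for illustration, not details from the pilot.

```python
from collections import Counter

# Hypothetical search hits with illustrative metadata fields.
hits = [
    {"id": "rec1", "format": "book", "language": "eng"},
    {"id": "rec2", "format": "book", "language": "fre"},
    {"id": "rec3", "format": "map", "language": "eng"},
]


def facet_counts(results, fields):
    """Count how many results carry each value of each facet field."""
    return {f: Counter(r[f] for r in results if f in r) for f in fields}


def apply_facet(results, field, value):
    """Narrow the result set to records whose field equals the chosen value."""
    return [r for r in results if r.get(field) == value]


if __name__ == "__main__":
    print(facet_counts(hits, ["format", "language"]))
    print([r["id"] for r in apply_facet(hits, "language", "eng")])
```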

World Headquarters
3270 West Big Beaver Road
Troy, MI 48084, U.S.A.

Thank You