Reaching out… through IT®
Document Store - Pilot 001
Presented to
© by HTC Global Services, Inc. Do not copy or distribute

Objectives
Index 5M+ MARC XML records
- Demonstrate the following features
  - Full-text search
  - Advanced search (fielded search)
  - Search results pagination
  - Sub-second query time on commercial hardware
Set up Jackrabbit repository (MySQL persistent store)
- Load up to 5000 documents
- Analyze and optimize loading & storage
- Generate UUID
- Check-in, check-out and versioning
- Establish links between documents
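To make the difference between the listed search features concrete, here is a minimal sketch in Python. It is not the pilot's implementation: the `Record` class, its three fields, and the function names are all hypothetical, and the linear scan below stands in for a real index, which is what would be needed for sub-second queries over millions of records.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class Record:
    """Hypothetical, simplified stand-in for an indexed document."""
    title: str
    author: str
    subject: str
    # Assumption: a random UUID per document; the pilot's scheme is not described.
    id: str = field(default_factory=lambda: str(uuid.uuid4()))


def full_text_search(records, term):
    """Match the term against every text field of each record."""
    term = term.lower()
    return [
        r for r in records
        if any(term in value.lower() for value in (r.title, r.author, r.subject))
    ]


def fielded_search(records, **criteria):
    """Advanced search: every named field must contain its own term."""
    def matches(record):
        return all(
            text.lower() in getattr(record, name).lower()
            for name, text in criteria.items()
        )
    return [r for r in records if matches(r)]


def paginate(results, page, page_size=10):
    """Return one page of results; pages are numbered from 1."""
    start = (page - 1) * page_size
    return results[start:start + page_size]
```

For example, `fielded_search(records, author="melville", subject="whaling")` narrows on two fields at once, while `full_text_search(records, "melville")` matches the term wherever it appears.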
Environment
Hardware
- CPU – Quad 2.93 GHz
- Memory – 16 GB
- Storage – 500 GB
Software
- 64-bit Windows 7 OS
Content Set
Data Type               # Records
Bibliographic – MARC    ~5.5M
Authority – EAC         ~100
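As an illustration of how a bibliographic MARC XML record can be turned into named fields for fielded search, the sketch below parses one record with Python's standard library. The sample record and the `extract_fields` helper are made up for this example and are not taken from the pilot's data or code. The namespace and the tags used (001 control number, 100 main entry personal name, 245 title statement) are standard MARC 21.

```python
import xml.etree.ElementTree as ET

# Standard MARCXML namespace.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Hypothetical sample record, not from the pilot's content set.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <controlfield tag="001">demo-0001</controlfield>
  <datafield tag="100" ind1="1" ind2=" ">
    <subfield code="a">Melville, Herman</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
    <subfield code="a">Moby Dick</subfield>
  </datafield>
</record>"""


def extract_fields(xml_text):
    """Pull a few searchable fields out of a single MARC XML record."""
    root = ET.fromstring(xml_text)

    def first_subfield(tag, code):
        # First matching subfield of the first matching datafield, or None.
        path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
        node = root.find(path, MARC_NS)
        return node.text.strip() if node is not None and node.text else None

    control = root.find("marc:controlfield[@tag='001']", MARC_NS)
    return {
        "id": control.text if control is not None else None,
        "author": first_subfield("100", "a"),
        "title": first_subfield("245", "a"),
    }


print(extract_fields(SAMPLE))
```

A dictionary like this is the kind of per-field structure an indexer could consume; which fields the pilot actually indexed is not stated in the slides.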
Sample Document
Performance Metrics
- Indexing time for ~5.5M records: 1 hour 42 minutes
- Index size for records: 14 GB
- Extrapolated indexing time for 10M records: ~3 hours
- Loading time for 3569 records: 112 seconds
- Extrapolated loading time for 6M records: 55 hours (~2.31 days)
- Average response time for full-text search: 69 milliseconds
- Average response time for advanced search (3+ fields): 200 milliseconds
Note: Basic setup with minimal or no tuning
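The extrapolated figures can be sanity-checked with simple proportional scaling. The sketch below assumes time grows linearly with record count, which is an assumption made here and not something the slides state. The function name is hypothetical.

```python
def extrapolate_seconds(measured_seconds, measured_records, target_records):
    """Scale a measured duration linearly to a larger record count."""
    return measured_seconds / measured_records * target_records


# Indexing: ~5.5M records took 1 hour 42 minutes (102 minutes).
indexing_hours = extrapolate_seconds(102 * 60, 5_500_000, 10_000_000) / 3600

# Loading: 3569 records took 112 seconds.
loading_hours = extrapolate_seconds(112, 3569, 6_000_000) / 3600

print(f"Indexing 10M records: {indexing_hours:.2f} hours")
print(f"Loading 6M records:   {loading_hours:.1f} hours ({loading_hours / 24:.2f} days)")
```

Under this assumption, indexing 10M records comes out at about 3.1 hours, in line with the ~3 hours quoted. Loading 6M records comes out at about 52 hours, slightly below the quoted 55 hours, so the quoted figure may include overhead or rounding that the slides do not spell out.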
Work in Progress
- Faceted navigation and search suggestions
- Simultaneously index and search multiple document types
- Index and search new document types by configuration
- Batch and online management (add, update, delete indexes)
- Repository document load, 5M documents
- Discovery and Repository integration
- Bulk and online operations (load, update)
World Headquarters
3270 West Big Beaver Road
Troy, MI 48084, U.S.A.
Phone:
Fax:
Web:

Thank You