Published by Emma Alexander. Modified over 9 years ago.
1
Googalize your Search with DirectInfo Documents
DirectInfo Documents - New Features
Author: Kiril Rusev, Software Architect, Semantec Bulgaria OOD
Semantec GmbH, Benzstr. 32, D-71083 Herrenberg, Germany
www.semantec.de
2
Agenda
Motivation
What is DirectInfo Documents?
What's new?
Live Demo
Future development
3
Motivation - The Need
4
Motivation - The Challenge
[Diagram labels: Database Data, Email, Local Files, Internet, Intranet]
5
Motivation - The Answer
[Diagram labels: Oracle Text Index, DirectInfo, Document Files, Database Data, Web Contents, Structured Search Results]
6
What is DirectInfo?
A framework based on Oracle Text
Can index and search various data sources
Can be extended
Can be adjusted to the customer's needs
7
Oracle Text - how does indexing work?
8
DirectInfo and Oracle Text
Oracle Text CONTEXT indexes with USER_DATASTORE
Full control over the indexing
Flexible and extensible filtering
Custom-defined document grouping
Regular index management
Effective caching mechanism
Fast and flexible searching
A lot of context information
Summarizing capabilities
[Diagram labels: Oracle, DirectInfo]
9
DirectInfo Architecture
10
What is DirectInfo Documents?
Based on the DirectInfo platform
A powerful document search tool
A web-based, Google-like application
Easily managed and deployed
11
What's new?
Speed improvement
Robustness
Manageability
Functional improvements
Look and feel (LF) and search results presentation improved
12
Speed improvement – Document Cache
[Diagram labels: User Datastore, PL/SQL Procedure, Null Filter, PDF, HTML, HTML Filtering, Document Cache, Store/Retrieve]
HTML filtering is done only once
The HTML version of the document is cached
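The "filter once, then cache the HTML" idea on this slide can be sketched in a few lines. This is only an illustration in Python, not DirectInfo's implementation (which, per the slide, involves a PL/SQL procedure behind a user datastore). The names `DocumentCache`, `get_html`, and `fake_filter` are hypothetical, and the in-memory dictionary stands in for whatever storage the real cache uses.

```python
from typing import Callable, Dict


class DocumentCache:
    """Keeps the HTML version of each document so filtering runs only once."""

    def __init__(self, html_filter: Callable[[bytes], str]) -> None:
        self._html_filter = html_filter  # hypothetical converter, e.g. PDF -> HTML
        self._cache: Dict[str, str] = {}  # document id -> cached HTML
        self.filter_calls = 0  # counts how often the expensive filter ran

    def get_html(self, doc_id: str, raw: bytes) -> str:
        # Filter only on a cache miss; later requests reuse the stored HTML.
        if doc_id not in self._cache:
            self.filter_calls += 1
            self._cache[doc_id] = self._html_filter(raw)
        return self._cache[doc_id]


def fake_filter(raw: bytes) -> str:
    """Stand-in for a real document filter (assumption for this sketch)."""
    return "<html><body>" + raw.decode("utf-8") + "</body></html>"


cache = DocumentCache(fake_filter)
first = cache.get_html("doc-1", b"hello")
second = cache.get_html("doc-1", b"hello")  # served from the cache
```

After both calls, `cache.filter_calls` is 1: the second request did not run the filter again.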
13
Speed improvement – Faster Crawling
[Diagram labels: DirectInfo, Internet, Local Files, Email, Crawler Interface, File Crawler, Web Crawler, Other…]
Crawlers are adjusted according to the target document sources
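One way to read the "Crawler Interface" box is as a common contract that source-specific crawlers implement. The Python sketch below shows that pattern under stated assumptions: the class names `Crawler`, `FileCrawler`, and `StaticListCrawler` and the function `collect` are invented for illustration, and the second crawler only yields a fixed list as a placeholder for a web or email crawler.

```python
import os
import tempfile
from abc import ABC, abstractmethod
from typing import Iterable, Iterator, List


class Crawler(ABC):
    """Common interface: every crawler yields locations of documents to index."""

    @abstractmethod
    def crawl(self) -> Iterator[str]:
        ...


class FileCrawler(Crawler):
    """Walks a local directory tree and yields file paths."""

    def __init__(self, root: str) -> None:
        self.root = root

    def crawl(self) -> Iterator[str]:
        for dirpath, _dirnames, filenames in os.walk(self.root):
            for name in sorted(filenames):
                yield os.path.join(dirpath, name)


class StaticListCrawler(Crawler):
    """Placeholder for a web or email crawler: yields a fixed list of locations."""

    def __init__(self, locations: Iterable[str]) -> None:
        self.locations = list(locations)

    def crawl(self) -> Iterator[str]:
        yield from self.locations


def collect(crawlers: Iterable[Crawler]) -> List[str]:
    """The caller depends only on the interface, not on the source type."""
    found: List[str] = []
    for crawler in crawlers:
        found.extend(crawler.crawl())
    return found


with tempfile.TemporaryDirectory() as root:
    open(os.path.join(root, "a.txt"), "w").close()
    found = collect([FileCrawler(root), StaticListCrawler(["http://example.com/"])])
```

Adding a new source then means adding one more `Crawler` subclass; `collect` stays unchanged.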
14
Robustness – Better Filtering
Before: Datastore, INSO Filter, PDF, HTML, XFilter
After: Datastore, PDF, HTML, NULL Filter, HTML Filter, Filter 1, Filter 2, …, Filter N
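The slide does not say how Filter 1 through Filter N are combined. One plausible reading, and it is only an assumption, is a fallback chain: try each filter in turn and use the first one that succeeds. The Python sketch below illustrates that reading; `filter_with_fallback`, `strict_pdf_filter`, and `plain_text_filter` are hypothetical names.

```python
import html
from typing import Callable, Iterable, Optional

Filter = Callable[[bytes], str]  # turns raw document bytes into HTML


def filter_with_fallback(raw: bytes, filters: Iterable[Filter]) -> Optional[str]:
    """Return the output of the first filter that succeeds, or None if all fail."""
    for convert in filters:
        try:
            return convert(raw)
        except Exception:
            continue  # this filter could not handle the document; try the next
    return None


def strict_pdf_filter(raw: bytes) -> str:
    """Hypothetical filter that accepts only data starting with the PDF signature."""
    if not raw.startswith(b"%PDF"):
        raise ValueError("not a PDF document")
    return "<p>PDF content</p>"


def plain_text_filter(raw: bytes) -> str:
    """Hypothetical last-resort filter: wraps decoded text in a <pre> element."""
    return "<pre>" + html.escape(raw.decode("utf-8")) + "</pre>"


result = filter_with_fallback(b"plain text", [strict_pdf_filter, plain_text_filter])
no_result = filter_with_fallback(b"plain text", [strict_pdf_filter])
```

With a chain like this, one failing filter no longer aborts indexing of the document, which matches the robustness goal named in the slide title.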
15
Manageability - Indexing in Chunks
Before: Dtx_Ddl.Sync_Index processes the index in one unstoppable run
After: Dtx_Ddl.Sync_Index processes the index in chunks
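Why chunking makes indexing stoppable can be shown with a small loop that checks a stop flag between chunks. This is a generic Python sketch, not the `Dtx_Ddl.Sync_Index` procedure itself; `sync_in_chunks`, `index_chunk`, and the stop event are assumptions made for illustration.

```python
import threading
from typing import Callable, List, Sequence


def sync_in_chunks(
    pending: Sequence[str],
    chunk_size: int,
    index_chunk: Callable[[Sequence[str]], None],
    stop: threading.Event,
) -> int:
    """Index pending documents chunk by chunk; return how many were indexed."""
    done = 0
    for start in range(0, len(pending), chunk_size):
        if stop.is_set():
            break  # a stop request takes effect at the next chunk boundary
        chunk = pending[start:start + chunk_size]
        index_chunk(chunk)
        done += len(chunk)
    return done


pending_docs = ["doc-%d" % i for i in range(10)]
indexed: List[str] = []
stop_flag = threading.Event()


def index_chunk(chunk: Sequence[str]) -> None:
    """Hypothetical indexing step; requests a stop once 8 documents are done."""
    indexed.extend(chunk)
    if len(indexed) >= 8:
        stop_flag.set()


done_count = sync_in_chunks(pending_docs, 4, index_chunk, stop_flag)
```

Here the run ends after two chunks of four documents, and the remaining two documents stay pending for a later run.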
16
Functional improvements - Duplicate File Detection
[Diagram: Found Files compared with Indexed Files, before and after duplicate detection]
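The slides do not describe how duplicates are recognised. A common technique, used here purely as an assumption, is to compare a hash of each file's content and index only the first file seen with a given hash. The function name `drop_duplicates` and the sample paths are hypothetical.

```python
import hashlib
from typing import Dict, Iterable, List, Tuple


def drop_duplicates(
    files: Iterable[Tuple[str, bytes]],
) -> Tuple[List[str], Dict[str, str]]:
    """Split found files into those to index and duplicates of earlier files."""
    first_seen: Dict[str, str] = {}  # content digest -> first path with that content
    to_index: List[str] = []
    duplicates: Dict[str, str] = {}  # duplicate path -> path of the indexed original
    for path, content in files:
        digest = hashlib.sha256(content).hexdigest()
        if digest in first_seen:
            duplicates[path] = first_seen[digest]
        else:
            first_seen[digest] = path
            to_index.append(path)
    return to_index, duplicates


found_files = [
    ("/docs/report.pdf", b"quarterly report"),
    ("/backup/report.pdf", b"quarterly report"),  # same content, different path
    ("/docs/notes.txt", b"meeting notes"),
]
to_index, duplicates = drop_duplicates(found_files)
```

In this example three files are found but only two are indexed; the backup copy is recorded as a duplicate of the original.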
17
Functional improvements - Summarizer
18
Look and feel (LF) and search results presentation improved
Deferred fragment loading
Skins support, XP look and feel
Visual and functional redesign - HTML frames
Searching made simpler
19
Live Demo
20
Future development
Defining and searching metadata
Search results clustering
Improved flexibility
Improved administration
Improved caching
Better summarizing
21
Thank You!