Presentation is loading. Please wait.

Presentation is loading. Please wait.

Googalize your Search with DirectInfo Documents DirectInfo Documents - New Features Author: Kiril Rusev Software Architect Semantec Bulgaria OOD Semantec.

Similar presentations


Presentation on theme: "Googalize your Search with DirectInfo Documents DirectInfo Documents - New Features Author: Kiril Rusev Software Architect Semantec Bulgaria OOD Semantec."— Presentation transcript:

1 Googalize your Search with DirectInfo Documents DirectInfo Documents - New Features Author: Kiril Rusev Software Architect Semantec Bulgaria OOD Semantec GmbH Benzstr. 32 D-71083 Herrenberg, Germany www.semantec.de

2 Agenda Motivation What is DirectInfo Documents? What's new? Live Demo Future development

3 Motivation - The Need ? ? ?

4 Motivation - The Challenge Database Data Email Local Files Internet Intranet

5 Motivation - The Answer Oracle Text Index DirectInfo Document Files Database Data Web Contents Structured Search Results

6 What is DirectInfo? A framework based on Oracle Text Can index and search into various data sources Can be extended Can be adjusted to the customer’s needs

7 Oracle Text - how does indexing work?

8 DirectInfo and Oracle Text Oracle Text Context indexes with USER_DATASTORE Full control over the indexing Flexible and extensible filtering Custom defined document grouping Regular index management Effective caching mechanism Fast and flexible searching A lot of context information Summarizing capabilities Oracle DirectInfo

9 DirectInfo Architecture

10 What is DirectInfo Documents? Based on DirectInfo platform A powerful document searching tool A web based “google-like” application Easily managed and deployed

11 What's new? Speed improvement Robustness Manageability Functional improvements LF and search results presentation improved

12 Speed improvement – Document Cache User Datastore PL/SQL Procedure NullF ilter PDF HTML Filtering HTML Document Cache Store/Retrieve HTML Filtering is done only once The HTML version of the document is cached

13 Speed improvement – Faster Crawling DirectInfo Interne t Local Files Email Crawler Interface File Crawler Web Crawler Other… Crawlers are adjusted according to the target document sources

14 Robustness – Better Filtering Before: Datastore INSO Filter PDF HTML XFilter After: Datastore PDFHTML NULL Filter HTML Filter 1Filter 2Filter N …

15 Manageability - Indexing in Chunks Before: Dtx_Ddl.Sync_Index Index Unstoppable !!! After: Index Dtx_Ddl.Sync_Index ………

16 Functional improvements - Duplicated Files Detection Before: Found FilesIndexed Files After: Found Files Indexed Files

17 Functional improvements - Summarizer

18 LF and search results presentation improved Deferred fragments loading Skins support, XP look and feel Visual and functional redesign - HTML Frames Searching made more simple

19 Live Demo

20 Future development Defining and searching of meta data Search results clustering Improved flexibility Improved administration Improved caching Better summarizing

21 Thank You!


Download ppt "Googalize your Search with DirectInfo Documents DirectInfo Documents - New Features Author: Kiril Rusev Software Architect Semantec Bulgaria OOD Semantec."

Similar presentations


Ads by Google