Integrate Full-Text Retrieval with Digital Archives System Reporter : Chia-Hao Lee Computer System and Communication Lab, Academia Sinica Institute of Information Science
Outline Current full-text retrieval system What needs ? What problems in current DB retrieval ? How to integrate ?
Full-text Retrieval - System Structure Language Detector Documents DB Language Translator Keyword Indexer Fulltext Indexer Indexer (IE) Searcher (ISAPI & ASP) Language Translator Keyword Searcher Fulltext Searcher In-Class Searcher This is optional Module Data Source Modules Media Parser Crawler
Full-text Retrieval - Platform MS Windows Web Server MS IIS (ASP) The part of core C Language (.dll,.exe) Database MS SQL Server
What Needs ? Web-pages retrieval Some problems Pages generated from PHP, flash… Database retrieval Management Interface Multi-fields retrieval in the same table Miss character retrieval
What Problems in Current DB Retrieval ? I Current management interface Select / Insert Server Select / Insert Database Select / Insert Table & Key-Field (primary key) What problems ? Need an account for the access of all databases in the server No delete function Key-Field must be INTEGER type
What Problems in Current DB Retrieval ? II Other problems As the indexed status of the table Keep the last indexed key value (primary key value) No support miss character retrieval The last presentation after search result Limits The maximum of multi-fields retrieval is 5 in the same table The maximum retrieval length of a field is 1xxx characters
How to Integrate? Current retrieval platform Windows 、 IIS 、 ASP 、 C(MFC…) 、 MS SQL Server Solution Keep the current retrieval platform Translate Window platform to Linux platform
Q & A Thanks for Your Attention !!