Published by Jordan Anderson. Modified over 9 years ago.
1
flexElink
Winter presentation, 26 February 2002
Flexible linking (and formatting) management software
Hector Sanchez
Universitat Jaume I, Ing. Informatica
CERN ETT-DH
2
Hector Sanchez 26 February 2002 @ CERN
Contents
- Introduction
- Project overview: definition, scenarios, architecture, technology
- Main features
- Benefits & results
3
Introduction
- Link in the scope of FlexElink
  - Reference to the fulltext version or an Internet resource related to a certain bibliographic record (not necessarily a URL)
- Stored vs. generated links
  - Generated links considerably reduce maintenance
- Link managers
  - Know when to create a link and build it from bibliographic data
  - Link managers @ CDS: SetLink, GoDirect, Dynamic Format
4
Project goals
- New link management tool
- Improvement of the formatting tool
- Integration of already existing LM technologies used at CDS
- Be able to adapt to new situations and needs
  - Independent of the formatter
  - Work over different types of inputs
  - Cover all possible formatting functions needed
- Reduce maintenance
  - Avoid 'hard-coded' maintenance
  - Make it easy to use for CDS clients
5
Scenario 1: Brief formats
- Input: Bunch of records in OAI MARC XML
- Output: Original XML record with its HTML version
[Diagram labels: Bibliographic DB ALEPH, 'CERN MARC', cv3t5, OAI MARC XML, flexElink, OAI MARC XML*, HTML, cxtm, SQL, Consultation DB MySQL]
6
Scenario 2: Detailed formats
- Input: record in OAI MARC XML
- Output: HTML version to be displayed or PHP to be saved to a file
[Diagram labels: CDS search, Consultation DB MySQL, OAI MARC XML, flexElink, HTML page, Links to fulltext & references, PHP file, setlink output, Pre-generated references inclusion]
7
Architecture overview
[Diagram]
- Components: Record Separator, Variable Extractor, Behavior Processor, Link Manager, Web configuration interface
- Repositories: Extraction rules, Link repository, Behavior repository
- Flow labels: input records, individual record, internal variables, solve links, Text output, admins
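The flow named in the architecture diagram (input records, individual record, internal variables, text output) can be pictured with a minimal Python sketch. This is not FlexElink's code: every function name and signature below is hypothetical, the toy stages only stand in for the real components, and the Link Manager and the repositories are left out.

```python
from typing import Callable, Dict, Iterable, Iterator, List

# Hypothetical stage signatures; the real FlexElink interfaces are not shown in the slides.
Separator = Callable[[str], Iterable[str]]
Extractor = Callable[[str], Dict[str, str]]
Processor = Callable[[Dict[str, str]], str]


def run_pipeline(raw_input: str, separate: Separator,
                 extract: Extractor, process: Processor) -> Iterator[str]:
    """Record Separator -> Variable Extractor -> Behavior Processor -> text output."""
    for record in separate(raw_input):   # individual record
        variables = extract(record)      # internal variables
        yield process(variables)         # text output


# Toy stand-ins so the sketch runs end to end.
def split_on_blank_lines(raw: str) -> List[str]:
    return [chunk for chunk in raw.split("\n\n") if chunk.strip()]


def key_value_extractor(record: str) -> Dict[str, str]:
    pairs = (line.split("=", 1) for line in record.splitlines() if "=" in line)
    return {key.strip(): value.strip() for key, value in pairs}


def title_only(variables: Dict[str, str]) -> str:
    return variables.get("title", "(no title)")


if __name__ == "__main__":
    raw = "title=First record\n\ntitle=Second record"
    print(list(run_pipeline(raw, split_on_blank_lines, key_value_extractor, title_only)))
```

Because each stage is passed in as a plain callable, any one of them can be swapped without touching the others, which is the de-coupling the design aims for.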
8
Technology
- OO analysis and design
- Implementation tools: 100% open source & freeware
- Component-based: delegation & collaboration lead to more de-coupled and re-usable software
- Almost any part of the system can be substituted, modified or extended without affecting the rest
9
Main features: Internal variables
- Maps the values in the input OAI MARC XML records into internal variables
- This mapping can be configured using the Extraction Rules
  - They tell the extraction module which values to extract from the input and to which variables it has to map them
- Makes the rest of the configuration independent of the input
- Developed for OAI MARC XML, but it can be adapted to other input types (DB) by specialising the extraction module
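As a rough illustration of the mapping step, the Python sketch below reads a simplified OAI MARC XML record and fills internal variables according to a rule table. The rule format, the variable names, the namespace-free XML and the title value are all assumptions made for the example; the slides do not show how FlexElink stores its Extraction Rules.

```python
import xml.etree.ElementTree as ET

# Simplified OAI MARC XML (no namespace); the title value is illustrative only.
RECORD = """
<oai_marc>
  <varfield id="245" i1="" i2="">
    <subfield label="a">Example title</subfield>
  </varfield>
  <varfield id="100" i1="" i2="">
    <subfield label="a">Racah, Giulio</subfield>
  </varfield>
</oai_marc>
"""

# Hypothetical extraction rules: (varfield id, subfield label) -> variable name.
RULES = {("245", "a"): "title", ("100", "a"): "author"}


def extract_variables(xml_text, rules):
    """Map the values selected by `rules` to lists of internal variable values."""
    variables = {name: [] for name in rules.values()}
    root = ET.fromstring(xml_text)
    for varfield in root.iter("varfield"):
        for subfield in varfield.iter("subfield"):
            key = (varfield.get("id"), subfield.get("label"))
            if key in rules:
                variables[rules[key]].append((subfield.text or "").strip())
    return variables


if __name__ == "__main__":
    print(extract_variables(RECORD, RULES))
```

Only `extract_variables` knows about XML. Anything that consumes the resulting dictionary stays independent of the input type, so a database-backed extractor could replace it.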
10
Main features: Internal Variables
OAI MARC XML extraction rules example
Input record values (XML tags not preserved): SCAN-0009119; Racah, Giulio; Guignard, G; editor; 11
Extraction rule: variable "author" with fields "name" and "editor"
Variable: author
  Value #0
    field: name = Racah, Giulio
  Value #1
    field: name = Guignard, G
    field: editor = editor
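The multi-valued "author" variable in this example can be modelled as a list of values, each holding named fields. The Python sketch below assumes that the two names come from MARC tags 100 and 700 and that the role sits in subfield e. The slide's markup did not survive, so those tags, the subfield codes and the rule layout are guesses made for illustration.

```python
# Pre-parsed datafields; tags 100/700 and the subfield codes are assumptions,
# since the slide's XML markup was not preserved.
DATAFIELDS = [
    ("100", {"a": "Racah, Giulio"}),
    ("700", {"a": "Guignard, G", "e": "editor"}),
]

# Hypothetical rule: the variable "author" is fed by two tags,
# and each subfield code maps to a field name.
AUTHOR_RULE = {
    "variable": "author",
    "sources": {"100": {"a": "name"}, "700": {"a": "name", "e": "editor"}},
}


def build_variable(datafields, rule):
    """Return one value per matching datafield; each value is a dict of named fields."""
    values = []
    for tag, subfields in datafields:
        mapping = rule["sources"].get(tag)
        if mapping is None:
            continue  # this datafield does not feed the variable
        value = {field: subfields[code]
                 for code, field in mapping.items() if code in subfields}
        if value:
            values.append(value)
    return values


if __name__ == "__main__":
    for index, value in enumerate(build_variable(DATAFIELDS, AUTHOR_RULE)):
        print("Value #%d: %s" % (index, value))
```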
11
Main features: Behaviours
- Behaviour: describes how the input has to be processed in order to achieve the desired output
- Support for multiple behaviours
- Structure: a Behaviour holds Condition 1 with its Actions, Condition 2 with its Actions, and so on
- Condition: expression that causes the associated actions to be applied only if it is TRUE for the current input record data
- Action: set of statements that describes how the output has to be built (e.g. formats) if the corresponding condition is met
- Conditions and actions are expressed using the Evaluation Language
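The condition/action structure can be sketched in Python as a list of pairs evaluated against the variables of one record. In FlexElink both parts are written in the Evaluation Language, so the lambdas below are stand-ins. The sketch also assumes that every condition that holds contributes output, in order, which the slides do not state.

```python
def evaluate_behaviour(behaviour, variables):
    """Concatenate the output of every action whose condition is true for this record."""
    parts = []
    for condition, action in behaviour:
        if condition(variables):
            parts.append(action(variables))
    return "".join(parts)


# Hypothetical behaviour with two condition/action pairs.
SIMPLE = [
    (lambda v: v.get("base") == "27", lambda v: "[standard] " + v["title"]),
    (lambda v: v.get("base") != "27", lambda v: v["title"]),
]


if __name__ == "__main__":
    print(evaluate_behaviour(SIMPLE, {"base": "27", "title": "Some title"}))
    print(evaluate_behaviour(SIMPLE, {"base": "11", "title": "Some title"}))
```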
12
Main features: Evaluation Language
- Specially designed for FlexElink
- Context-free grammar
- Extensible via User Defined Functions (UDFs)
  - Operations that are defined in PHP
- Simple Knowledge Base management
- Allows interaction with the Link manager
- Re-usability of expressions through Formats
- Enables the access to internal variables
13
Main features: Behaviours
Simple behaviour example
Behaviour: SIMPLE
  Condition: $909C0.b="27"
  Actions:
    " " $245.a " "
    forall($0248.a){ rep_prefix(" – ") $0248.a separator("; ") }
  Condition: ""=""
  Actions:
    " " $245.a " "
    forall($100.a){ rep_prefix("– Authors: ") $100.a separator("; ") }
Callouts: UDFs; Internal Variables
Internal Variables: 909C0.b base #; 245.a title; 0248.a standard ref; 100.a author name
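One plausible reading of the `forall` construct with `rep_prefix` and `separator` is: emit the prefix once, then every value of the variable, with the separator between consecutive values. The Python sketch below implements that reading only. The actual semantics of the two UDFs are not spelled out in the slides, so treat the behaviour for empty variables and the placement of the prefix as assumptions.

```python
def forall(values, prefix="", separator=""):
    """Emit `prefix` once, then every value, with `separator` between consecutive values.

    Assumed semantics: nothing at all is emitted when the variable has no values.
    """
    if not values:
        return ""
    return prefix + separator.join(values)


if __name__ == "__main__":
    authors = ["Racah, Giulio", "Guignard, G"]
    print(forall(authors, prefix="Authors: ", separator="; "))
```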
14
Main features: Link Manager
- Generates links from stored rules
  - These rules are also expressed using the Evaluation Language
- Supports different types of link solving
  - External linking: just generates the link from the rules
  - Internal linking: the link is always a file; it checks existence, access, formats, etc.
- Can be extended: the LM is just a framework to which new linking logic can be added
- Independent of the formatter
  - It has no access to Internal Variables; it receives data as parameters
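The difference between external and internal link solving can be sketched in Python as follows. The rule syntax (Python format strings), both function names and the example URL are assumptions; FlexElink expresses its rules in the Evaluation Language. Both functions receive their data purely as parameters, in line with the Link Manager having no access to internal variables.

```python
import os
import tempfile


def solve_external(rule, **params):
    """External linking: build the link from the rule alone, with no further checks."""
    return rule.format(**params)


def solve_internal(rule, formats, **params):
    """Internal linking: build candidate file paths and keep those that exist and are readable."""
    links = []
    for fmt in formats:
        path = rule.format(format=fmt, **params)
        if os.path.isfile(path) and os.access(path, os.R_OK):
            links.append({"format_id": fmt, "path": path})
    return links


if __name__ == "__main__":
    print(solve_external("http://example.org/{base}/{id}", base="27", id="123"))
    with tempfile.TemporaryDirectory() as directory:
        # Create one fulltext file so that only the "pdf" candidate resolves.
        open(os.path.join(directory, "123.pdf"), "w").close()
        rule = os.path.join(directory, "{id}.{format}")
        print(solve_internal(rule, ["pdf", "ps"], id="123"))
```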
15
Main features: Link Manager
Example: simple link definition and access from EL
Generation of records with already solved fulltext links
  " " $245.a " "
  link("FULLTEXT", $base, $categ, $id) {
    " Fulltext access: "
    forall($link){ " [" $link.format_id "] " }
  } else {
    "No link found"
  }
Callouts: FULLTEXT link definition; Link manager call
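The control flow of this example (call the Link Manager, list the format of each solved link, otherwise print a fallback) can be mirrored in a short Python sketch. The `solve_link` callable stands in for the Link Manager, and its signature and return shape are assumptions. The output spacing is also simplified, because the markup inside the original strings was lost.

```python
def render_record(variables, solve_link):
    """Render the title, then either the formats of the solved links or a fallback message."""
    links = solve_link("FULLTEXT", variables["base"], variables["categ"], variables["id"])
    if links:
        body = "Fulltext access: " + " ".join(
            "[{}]".format(link["format_id"]) for link in links)
    else:
        body = "No link found"
    return "\n".join([variables["title"], body])


if __name__ == "__main__":
    record = {"title": "Some title", "base": "27", "categ": "c", "id": "1"}

    # Stub Link Managers: one that finds two formats, one that finds nothing.
    def with_links(name, base, categ, record_id):
        return [{"format_id": "PDF"}, {"format_id": "PS"}]

    def without_links(name, base, categ, record_id):
        return []

    print(render_record(record, with_links))
    print(render_record(record, without_links))
```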
16
Benefits
- More modular and specialised CDS Search
- The OO approach eases maintenance and allows future extensibility
- Only one way of configuring formats and links
  - All the configuration is kept in a DB and separated from the logic
  - Possible to generate different configuration views
- Search Engine doesn't know anything about linking or formatting
[Diagram labels: users, query, results, Search Engine, flexElink, formats, links, format/link config]
17
Results
- It is already being used successfully for
  - Pre-generated CDS Search BRIEF formats
  - On-the-fly creation of CDS Search DETAILED formats
  - HTML pages of the references extracted from the fulltext
- Speed optimisation (test over 15'000 records)
  - BRIEF format creation (average): 0.05 sec/record
  - DETAILED format creation (average): 0.15 sec/record
- Testing for future replacement of GoDirect and SetLink
  - GoDirect: 'automatically' migrated 91% of journals
  - SetLink: ready for defining new fulltext rules
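To put the per-record averages in perspective, the following back-of-the-envelope calculation scales them to the size of the test set. It assumes the averages hold uniformly across all 15'000 records and that records are processed one after another; the totals are derived here and were not reported in the presentation.

```latex
\begin{align*}
t_{\text{BRIEF}} &\approx 15\,000 \times 0.05\ \text{s} = 750\ \text{s} = 12.5\ \text{min} \\
t_{\text{DETAILED}} &\approx 15\,000 \times 0.15\ \text{s} = 2250\ \text{s} = 37.5\ \text{min}
\end{align*}
```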