Presentation is loading. Please wait.

Presentation is loading. Please wait.

Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges.

Similar presentations


Presentation on theme: "Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges."— Presentation transcript:

1 Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges for Open Access Repositories Univ. of Glasgow, 18-20 October 2006 Friedrich Summann Bielefeld University Library

2 Open Scholarship 2006 BASE: concept and content Overview BASE user-interface and further visions BASE dataflow OAI harvesting challenges BASE interfaces Demo Overview:

3 Open Scholarship 2006 BASE uses Fast Data Search BASE uses Linux-based multi-node system BASE contains intellectual selected resources with focus on OAI Servers but also web crawled content BASE displays result lists as bibliographic data and full text hits BASE frontend is written in PHP using the search API from Fast Data Search BASE offers sorting, search refinement and search history BASE: concept and content http://www.base-search.net

4 Open Scholarship 2006 Search API Pipeline QUERY & RESULT PROCESSING DOCUMENT PROCESSING Pipeline FILE TRAVERSER FILTER SEARCH INDEX FILES CONNECTORS TUNING, ADMINISTRATION and DEBUGGING WEB CRAWLER BASE: concept and content

5 Open Scholarship 2006 BASE: concept and content At present 3.8 mio documents in 274 collections, 15 of them web crawled data

6 Open Scholarship 2006 Projekt Gutenberg-DE Internet Library of Early Journals Oxford Various Institutional Repositories Springer Link Metadata Cornell HistMath Fulltext Crawl University Michigan Historical Math CiteSeerZentralblatt Mathematik Bielefeld Univ: Math. Preprints ArXivOPAC UL Bielefeld Ifo Institute Munich PubMedJournals of Enlightment (Digital Collection of Bielefeld UL) BASE: concept and content

7 Open Scholarship 2006 Special view on IR server collections Collections are listed in configuration file [ftubirmingham] url = "http://eprints.bham.ac.uk/" desc_de = "The Univ. of Birmingham: Eprints Archive" desc_en = "The Univ. of Birmingham: Eprints Archive" descdd_de = "Birmingham Univ." descdd_en = "Birmingham Univ." Collections can be clustered for user-interface, e.g. Institutional Repositories Europe consists of [ftubarcelona], [ftubath], [ftubristol], [ftuhelsinki], … Parametric search possible Frontend is ready for multi view (independent views with own configuration and layouts on the same backend)

8 Open Scholarship 2006 BASE: end-user interface (1) Displays search results as bibliographic data and full text hits

9 Open Scholarship 2006 BASE: end-user interface (2) The result list (left hand side) If the document contains meta data (e.g. title, author, abstract) the displayed description is highlighted

10 Open Scholarship 2006 BASE: end-user interface (3) Various options to sort the result set Search refinement by author, keyword, document type, language etc. Search history comprises up to 10 queries The result list (right hand side)

11 Open Scholarship 2006 BASE: end-user interface (4) Search Refinement Select an author...... only documents by this author are displayed

12 Open Scholarship 2006 Check citations (citing articles) in Google Scholar... Google Scholar integration

13 Open Scholarship 2006 Vision: DDC Browsing

14 Open Scholarship 2006 OAI-Data Web Pages Database Records Harvesting Pre-Processing Processing Internal Index (FAST) User interface (PHP) BASE dataflow

15 Open Scholarship 2006 2 16 12 55 17 6 39 4 2 18 1 7 3 3 USA 82 Canada 14 South America 2 Africa 3 India 5 Australia 11 New Zealand 1 OAI-compliant university repositories in BASE 3 1 1

16 Open Scholarship 2006 OAI harvesting challenges Repositories do not response or deliver Error Messages Data contain only References without any Fulltext Links to the Document are not included or do not work Access to fulltext often is restricted XML file is not well-formed Field content varies

17 Open Scholarship 2006 Some Rules from the Harvesting Practice Standard repository software is great - for OAI harvesting as well Small collections – small problems Getting the related fulltext is complicated Libraries produce better metadata Data aggregation may produce problems Writing e-mails helps - sometimes

18 Open Scholarship 2006 BASE interfaces Search form HTTP calls Web Service

19 Open Scholarship 2006 <form action="http://www.base-search.net/index.php" method="post" accept-charset="UTF-8"> Local integration (via search form) E-Repository Integration

20 Open Scholarship 2006 Prototype: Search Based on SOAP interface (EU project DRIVER)

21 Open Scholarship 2006 Thank you!


Download ppt "Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges."

Similar presentations


Ads by Google