Towards an open library of relational metadata: the experience of RePEc (Research Papers in Economics) Thomas Krichel 2003-11-07.

Slides:



Advertisements
Similar presentations
Małgorzata Rychlik, Emilia Karwasińska 2009 Poznań University Library.
Advertisements

Zetoc.mimas.ac.uk Zetoc Electronic Table of Contents from the British Library Zetoc Support.
EPrints - Introducing EPrints 3 Software William J Nixon Digital Library Development Manager, University of Glasgow With many thanks to Les Carr and the.
1 of 16 Information Access The External Information Providers © FAO 2005 IMARK Investing in Information for Development Information Access The External.
1 of 15 Information Access Internal Information © FAO 2005 IMARK Investing in Information for Development Information Access Internal Information.
doi> Digital Object Identifier: overview
© 2008 EBSCO Information Services SUSHI, COUNTER and ERM Systems An Update on Usage Standards Ressources électroniques dans les bibliothèques électroniques.
Institutional Repositories an opportunity for IAMSLIC Pauline Simpson Southampton Oceanography Centre, University of Southampton, UK
IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
Open Archives and Free Online Scholarship Thomas Krichel (RePEc & Long Island University) Simeon M. Warner (ArXiv & Cornell University)
Anwendung von open source Ideen in digitalen Bibliotheken: die Beispiele von RePEc und rclis Thomas Krichel
From RePEc to 3lib. the long march for free bibliographic data Thomas Krichel
Digital scholarly communication in Economics: from NetEc to RePEc Thomas Krichel work partly sponsored by the Joint Information.
Acknowledgements Ellen Fischer for her hospitality. Michael Heinz for organizing the seminar.
The RePEc model for the academic digital library Thomas Krichel work partly sponsored by the Joint Information Systems.
RePEc, a digital commons for economics Thomas Krichel
Что делать? Thomas Krichel
RePEc, a case to illustrate the evolution and future trends of repositories and open access Thomas Krichel
RePEc: a public-access database that promotes scholarly communication in Economics Thomas Krichel
Designing for the Discipline: Open Libraries and Scholarly Communication Thomas Krichel
Rclis in vision and reality Thomas Krichel
RePEc and OLS Thomas Krichel prepared for the first retreat for disciplinary repositories Monterey
RePEc: An Open Library for Economics Thomas Krichel Work partly supported by the Joint Information Systems Committee of.
Transforming scholarly communities with open libraries Thomas Krichel
OA and commercial publishers Thomas Krichel
RePEc as frontier repository, the business model and what it means to survive as network in a more and more web-collaborative academia and a developing.
Bringing scholarly communication in kicking and screaming into the Internet age Thomas Krichel
Bringing scholarly communication in Economics kicking and screaming into the Internet age: NetEc, RePEc and more to come Thomas Krichel
Disintermediation of Academic Publishing through the Internet: An Intermediate Report from the Front Line Thomas Krichel
Information policy issues in RePEc Thomas Krichel
Open Archives and Open Libraries Thomas Krichel
RePEc: a early example of an open library Thomas Krichel
The future of scholarly communication in Economics Thomas Krichel work partly sponsored by the Joint Information Systems.
Academic self-organization on the Internet. The example of RePEc Thomas Krichel
Document data & personal data Thomas Krichel Long Island University & Novosibirsk State University
New Century, New Metadata Thomas Krichel University of Surrey, Hitotsubashi University and Long Island University.
How to become an 800 pound gorilla: the case of RePEc. Thomas Krichel 2008–10–29.
Use your bean. Count it. Thomas Krichel
My life and times Thomas Krichel LIU & НГУ
Four slides for the future Thomas Krichel given at 4 th International Socionet seminar Novosibirsk
Free author registration Thomas Krichel LIU & НГУ
LIS510 lecture 0 Thomas Krichel feeling nervous? So am I. It is my second time. Overall approach –I follow what has been done before. –I am.
Downloading and Document Delivery Accessing and using resources.
Creating Institutional Repositories Stephen Pinfield.
Building Repositories of eprints in UK Research Universities Bill Hubbard SHERPA Project Manager University of Nottingham.
Richard Jones The Edinburgh Research Archive The Edinburgh Research Archive: ERA Institutional Repository Theses & Dissertations Conference Papers/Posters.
Publication costs are research costs Robert Terry Senior Policy Adviser The Wellcome Trust
1 Advances In Web Technologies Brian Kelly UK Web Focus UKOLN University of Bath
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
LIS512 lecture 2 relational databases Thomas Krichel
Highlights from the Open Access Timeline (1) 1971, Project Gutenberg launched on the Internet (originally as an FTP site). There are now 18,000 free books.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Where I am coming from Thomas Krichel
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
Research evaluation requirements José Manuel Barrueco Universitat de València (SPAIN) Servei de Biblioteques i Documentació May, 2011.
LIS654lecture 1 Introduction Thomas Krichel
Building a discipline-specific aggregate for computing and library and information science Thomas Krichel Long Island University, NY, USA
Digital Archiving in the Hungarian Széchényi Library The story and the plans of the Hungarian Electronic Library Rome, 21. Oct István Moldován OSZK,
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
LIS654 lecture 1 Introduction Thomas Krichel
LIS618 lecture 0 Thomas Krichel Organization homepage Contents to be discussed today. Send mail.
Introduction to LIS508 Thomas Krichel
Open Access - an introduction, Aleppo, December Open Access – an introduction Ian Johnson.
Introduction to LIS508 Thomas Krichel
Economists Online researchers and libraries collaborate. A subject-specific service model. Benoit Pauwels Université Libre de Bruxelles.
CitEc as a source for research assessment and evaluation José Manuel Barrueco Universitat de València (SPAIN) May, й Международной научно-практической.
Quality Control in RePEc ... why it is so hard?
CONCERT (CONsortium on Core Electronic Resources in Taiwan).
The RePEc database about Economics
….part of the OSU Libraries' suite of digital library tools…
Presentation transcript:

Towards an open library of relational metadata: the experience of RePEc (Research Papers in Economics) Thomas Krichel

who is me? I was an economist. I was a leisure digital librarian. –NetEcsince 1993 –RePEcsince 1997 I am "just another Perl hacker" I am a visionary –but I'm not like St. John the Baptist

who is he?

he is "St. IGNUicus" A humoristic creation of Richard M. Stallman (RMS) RMS is the father of the free software movement –a geek –a visionary St. IGNUicus shows an emphasis on the moral case for free software, rather than the business case

moral case and business case Other folks in the free software movement avoid the "f" word –free can mean cheap –cheap can mean bad They stress the business case of free software They use the term "open source software", (OSS)

RMS and us Amen, I tell you: we librarians need to learn more from the OSS movement. We need to make the concepts coming of free software more a part of our business. Let us look at a key concept: free software.

free software according to RMS Free software comes with four freedoms –The freedom to run the software, for any purpose –The freedom to study how the program works, and adapt it to your needs –The freedom to redistribute copies so you can help your neighbor –The freedom to improve the program, and release your improvements to the public, so that the whole community benefits

what has this to do with us? Just replace free software with free information. Libraries are about free information. But the analogy is not quite as simple. –When we talk about free information, we usually mean things that we can freely read (download…). free as in: $0 –We do not usually mean free information as information we are free to do things with. Free as in freedom.

moral and business There is a moral case for free information. –We rely on it. There is a business case for free information. –We need to make our own.

we rely on the moral case The citizen should be informed… Individuals in the organization should have free access… This is how we justify resources given to us. Often, members of the community who pay get privileged access.

from moral case to business case To form the business case for free information, think of "free information" as "freedom to do things" rather than $0. Thus libraries can make a crucial business case for them as agents who transform information. Recall that there are whole industries out there that produces free information.

Now for something different RePEc is an example for an Open Library. An Open Library is loosely defined an application of the OSS principles to libraries. –vague –in the making –but has some history Looking at RePEc will fix ideas.

History It started with me as a research assistant an in the Economics Department of Loughborough University of Technology in a predecessor of the Internet allowed me to download free software without effort but academic papers had to be gathered in a painful way

CoREJ published by HMSO –Photocopied lists of contents tables recently published economics journal received at the Department of Trade and Industry –Typed list of the recently received working papers received by the University of Warwick library The latter was the more interesting.

working papers early accounts of research findings published by economics departments –in universities –in research centers –in some government offices –in multinational administrations disseminated through exchange agreements important because of 4 year publishing delay

I planned to circulate the Warwick working paper list over listserv lists I argued it would be good for them –increase incentives to contribute –increase revenue for ILL After many trials, Warwick refused. During the end of that time, I was offered a lectureship, and decided to get working on my own collection.

1993: BibEc and WoPEc Fethy Mili of Université de Montréal had a good collection of papers and gave me his data. I put his bibliographic data on a gopher and called the service "BibEc" I also gathered the first ever online electronic working papers on a gopher and called the service "WoPEc".

NetEc consortium BibEcprinted papers WoPEcelectronic papers CodEcsoftware WebEcweb resource listings JokEcjokes HoPEc a lot of Ec!

WoPEc to RePEc WoPEc was a catalog record collection WoPEc remained largest web access point but getting contributions was tough In 1996 I wrote basic architecture for RePEc. –ReDIF –Guildford Protocol

1996: RePEc principle Many archives –archives offer metadata about digital objects (mainly working papers) One database –The data from all archives forms one single logical database despite the fact that it is held on different servers. Many services –users can access the data through many interfaces. –providers of archives offer their data to all interfaces at the same time. This provides for an optimal distribution.

RePEc is based on 330+ archives WoPEc EconWPA DEGREE S-WoPEc NBER CEPR US Fed in Print IMF OECD MIT University of Surrey CO PAH

to form a 209k item dataset 119,000 working papers 87,000 journal articles 1,000 software components 600 book and chapter listings 3,500 author contact and publication listings 7,300 institutional contact listings

RePEc is used in many services BibEc and WoPEc Decomate Z39.50 service EconPapers NEP: New Economics Papers Inomics RePEc author service IDEAS RuPEc EDIRC LogEc

… describes documents Template-Type: ReDIF-Paper 1.0 Title: Dynamic Aspect of Growth and Fiscal Policy Author-Name: Thomas Krichel Author-Person: RePEc:per: :thomas_krichel Author- Author-Name: Paul Levine Author- Author-WorkPlace-Name: University of Surrey Classification-JEL: C61; E21; E23; E62; O41 File-URL: ftp:// pub/RePEc/sur/surrec/surrec9601.pdf File-Format: application/pdf Creation-Date: Revision-Date: Handle: RePEc:sur:surrec:9601

… describes persons (HoPEc) template-type: ReDIF-Person 1.0 name-full: MANKIW, N. GREGORY name-last: MANKIW name-first: N. GREGORY handle: RePEc:per: :N__GREGORY_MANKIW homepage: mankiw/mankiw.html workplace-institution: RePEc:edi:deharus workplace-institution: RePEc:edi:nberrus Author-Article: RePEc:aea:aecrev:v:76:y:1986:i:4:p: Author-Article: RePEc:aea:aecrev:v:77:y:1987:i:3:p: Author-Article: RePEc:aea:aecrev:v:78:y:1988:i:2:p: ….

… describes institutions Template-Type: ReDIF-Institution 1.0 Primary-Name: University of Surrey Primary-Location: Guildford Secondary-Name: Department of Economics Secondary-Phone: (01483) Secondary- Secondary-Fax: (01483) Secondary-Postal: Guildford, Surrey GU2 5XH Secondary-Homepage: Handle: RePEc:edi:desuruk

what do open libraries do? Identify records Relate identified records These actions require human control. They prepare for assessment of performance.

key to success Have a small group of volunteers Disseminate as widely as possible Demonstrate to authors and institutions that it works for them. –institutional registration –author registration

institutional registration It started by one sad geezer making a list of departments that have a web site. I persuaded him that his data would be more widely used if integrated into the RePEc database. Now he is a happy geezer and one of our three crucial volunteers.

author registration It started when funding allowed us to hire a crazy programmer to write an author registration system. system went online as "HoPEc" in late has been renamed "RePEc author service" (RAS) recent grant from OSI allows for a rewrite and expansion.

RePEc author service RePEc document data has author names as strings. The authors register with RAS to list contact details and identify the papers they wrote. This is classic access control, but done by the authors. In a ranking of 800 most important economists, 400 are registered with RAS.

authors' incentives Authors perceive the registration as a way to achieve common advertising for their papers. Author records are used to aggregate usage logs across RePEc user services for all papers of an author. Stimulates a "I am bigger than you are" mentality. Size matters!

KEY idea 1 RePEc attracts a community of users and contributors The community itself is the focus of attention RePEc describes the living rather than the dead. Forget about documents!

KEY idea 2 Forget about users! Disseminate widely Users will come through Google anyway. And Google loves RePEc services –puts RePEc services top when the query consists of the name of an author

open library idea: serials data Serial level information is a crucial component of academic library data. Idea: build and maintain free serial records. Two ways to build: –Use volunteers and collect in a decentralized way. –Make an expensive central collection, disseminate well, charge $$$ for record changes later.

another open library idea: law Much of the legal texts are de jure free. De facto there are two companies who have comprehensive collections and charge a lot of money for the free information bundled with proprietary information. Our moral case calls for a replacement! (it will also create jobs for us)

free legal open library Have all laws and cases –online as text –identified & related Have citation metadata, so that legal citations can verified be while composing case data. Registration procedure to verify the integrity of data.

open library idea II: drugs Collect data on the composition of all drugs –drugs composition reported by drug companies, using open archives –drug components documented by the governments, using an open archive Open library brings the two together!

Am I crazy? Money does not make the world go round. Ideas do. When RMS proposed a free replacement for UNIX in the early 80s, most people dismissed the idea. Today it is reality! Similarly, when I started to work on RePEc a totally free and improved A&I dataset in 1993, nobody gave it a high probability to succeed. It is a reality!

obstacles to open libraries lack of imagination & entrepreneurship inability to form alliances user-centered thinking document-centered thinking technical competence required –OAI PMH –XML and XML Schema –Unicode the "C" word

what I do for open libraries Create an open library for library science: the rclis (reckless) dataset. Create a supporting organization: the open library society. co-workers welcome!

Thank you for your attention!