Digital Libraries & Technology Trends ARD Prasad DRTC Indian Statistical Institute
Agenda Digital Libraries – Statistics Protocols – OAI-ORE, REST, RDF Persistent IDs – ORCID Discovery Tools – Metadata & Ontologies Related Software Islandora (Integration with Semantic Web) Vivo (Researchers and Research Projects) CKAN & DataVerse (Data repositories)
Statistics from DOAR registered Repositories India stands in 13th place
Worldwide Installations of DSpace Total: 2476 USA: 296 India: 179 Brazil: 116 India has the second largest install base
Protocols
OAI-ORE Open Archives Initiative – Object Reuse and Exchange DSpace supports OAI-ORE Allows not only metadata, but also the digital objects to be harvested DSpace can act both as Data Provider and Service Provider
REST Protocol Representational State Transfer Became more popular than SOAP (Simple Object Access Protocol) Compliant to HTTP DSpace supports REST Meant for developing Web APIs, so that many web based services can launched For example, one can interact offline with DSpace and retrieve and manipulate retrieved data
RDF Resource Description Framework Formats to serialize data following RDF RDF/XML, Turtle, N-Triples and N3-Notation DSpace prefers Turtle, but the code can should be able to handle other formats
Turtle RDF is expressed in Triples – Subject, Predicate, Object Ex: Bernard Shaw(subject) wrote(predicate) Saint Joan(Object) Similar to SPARQL (query language of RDF) Turtle ex: <http://example.org/person/Bernard_Shaw> <http://example.org/relation/author> <http://example.org/books/Saint_Joan>
Persistent Identifier
IDs everywhere Printed documents have ISBN, ISSN etc Digital Documents have DOI, CNRI handles Authors have ORCID Organisational ID: organizationally unique identifier (OUI) Library Science pioneered IDs a few decades back – Authority Files – Name, Corporate Name, Publishers, Geographical Area, subjects etc
ORCID Open Researcher and Contributor ID A unique and Persistent Id for researchers and authors. (Free and Non-proprietary) Similar to Name authority files with an added ID Managed by not-for-profit organization. http://orcid.org Resolves name ambiguity Spelling variation Transliteration Name Changes: Maiden or Married name Names with and without initials
ORCID Open Researcher and Contributor ID Advantages Helps in discoverability Eliminates ambiguity of researchers having same name Connects research with researchers
Where to Use ORCID Submitting research articles Applying for funding to support research projects Membership to learned bodies Digital/Data repositories Any information system for researcher Allows connecting research with research project, organisations Note: Recent versions of DSpace support ORCID
ORCID interoperability CrossRef DataCite EuroCRIS RefFund (FundRef) etc.
Should be Mandated UGC should mandate that all research students, faculty Funding agencies like DST, DBT, ICAR, ICSSR, ICHR etc Journals in various subjects including LIS Digital Repositories. DSpace supports ORCID
Metadata and Ontologies
Metadata Metadata is being used in much more broader sense than confined to documents Metadata of people (FOAF – Friend of a Friend) Metadata of Organizations Events Processes Products Every entity
Metadata A powerful Discovery tool There are more 200 metadata scheme Metadata is becoming highly domain specific Dublin Core is considered as the lowest common denominator
Ontologies Library Classification, Thesauri metamorphosis into Ontologies Major difference is making the relations more explicit to implement Description Logic so that inference engines can be applied Schema: SKOS (Simple knowledge Organisation System) OWL (Web Ontology Language)
Digital Library Related Software
DuraSpace Products A joint venture of DSpace Foundation and Fedora Commons Organization DSpace Fedora (Backend of Islandora) Vivo
Islandora Digital Repository Software Combines Fedora (digital asset management software) Drupal (Content Management System – one can use Drupal modules, functions, themes etc.) Solr (Indexing) Preservation metadata automatically generated
Islandora Solution Packs Supports Research Data Data Solution Pack Chemistry Solution Pack Biosciences Solution Pack Other Generic Packs Basic collection Solution Audio solutions pack Basic image solution pack, etc.
Vivo: A DuraSpace Project A semantic-web-based research and researcher discovery tool for people plus the research they do VIVO is an open source software and an ontology for representing scholarship. VIVO supports recording, editing, searching, browsing and visualizing scholarly activity. VIVO encourages research discovery, expert finding, network analysis and assessment of research impact. VIVO is easily extended to support additional domains of scholarly activity.
Data Repository Software CKAN Mostly used in publishing government data at International, National, even city level DataVerse Mostly popular with Research Data Management (RDM).
Concluding Remarks
Information Are catalogues confined to describing books of documents Trade catalogues deal with all kindly products where each product is described along with pictures of the product. Albeit not a structured way or following some standards We use classification in day to day life, need not be confined to classifying books for shelf arrangement or building thesauri for vocabulary control Persistent Identifiers are important in many contexts
Thank You ard@drtc.isibang.ac.in ardprasad@gmail.com