Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data.bnf.fr Françoise Leresche Bibliothèque nationale de France Bibliographic and Digital Information Department Standards and Models Unit Library Models.

Similar presentations


Presentation on theme: "Data.bnf.fr Françoise Leresche Bibliothèque nationale de France Bibliographic and Digital Information Department Standards and Models Unit Library Models."— Presentation transcript:

1 data.bnf.fr Françoise Leresche Bibliothèque nationale de France Bibliographic and Digital Information Department Standards and Models Unit Library Models and Standards and Their Availability in the Semantic Web Workshop LIDA 2012 Exposing heterogeneous data more efficiently, using Semantic Web tools

2 Bibliothèque nationale de France Millions of resources of any kind (books, scores, maps, still images, audiovisual materials…) published items / manuscripts and archival resources Made Web available through digitization

3 Different logics Published resources Bibliographic description (ISBD) and MARC format Archival resources Archival description (tree structure) and XML-EAD format Digitized resources Simple Dublin Core format

4 Catalogues BnF Catalogue Général 15 millions of bibliographic records Legal deposit of French edition BnF Archives et Manuscrits (BAM) Dozens of thousands of manuscripts and archival resources Mediaeval manuscripts, authors' archives Gallica 1,5 million of digital objects The largest digital library in the French-speaking world

5 Resources Often hidden in the "Deep Web," i.e.: buried in catalogues scattered among various addresses

6 Web practices Less people come in through home pages they access site pages directly Keyword research associated with final content e.g.: a book title Navigating links on blogs, sites, networks

7 Organizing information Gathering contents, links, and services Pages for human reading Data for machine reading Some tools are at hand: Authority records Description standards Perennial identifiers The project

8 Data clustering Published resources Archival resources Digital resources About authors, works, and topics Providing HTML pages RDF structured, open, W3C standards compliant data

9 For what purposes? To propose services Beyond warehouses For any audience To open data and make it free To provide reliable links

10 Theproject Links for authors, works, and topics Links to books, manuscripts, images… disseminated to any audience through user-friendly HTML pages

11 HTML pages In order to: provide content create links to resources lead users to services

12

13 HTML pages Clustering on one page all the information relating to an author a work a topic from heterogeneous formats using Semantic Web technologies

14 Semantic Web Combining a traditional Web interface with a Semantic web approach Gathering information around a concept In addition to resources, an address and meaningful machine-usable data RDF chosen as resource description framework

15 Elaboration on data Relying on authority records, and persistent identifiers Period of reflection on standards Defining a model Selecting vocabularies Data alignment and integration

16 How does it work? BnF Catalogue général Digitized resources Human-friendly Web pages BnF Archives et Manuscrits Machine-friendly data Clustering - Alignment

17 Functional principles Extracting - Transforming Mandragore Virtual exhibitions Gallica BnF Archives & manuscrits Catalogue général Clustering - Aligning Publication - control Autres sources Wikipedia. Other resources Wikipedia, etc. Web InterfaceLinked Data Page concepts leading to Other libraries

18 Authority records ID card for an entity (person, corporate body, work…) the preferred form variant forms (references) notes sources links to other records an identifier (at the BnF, ARK identifier)

19 Authority record for a work associated with an author Preferred access point title of work Link to bibliographic records Link to the authority record for the creator of the work Links to other works (whole/part relationships) Persistent identifier

20 Persistent identifiers Persistent identifier to recognize a resource uniquely on the long term Stable Reference Naming Finding Citing Two objectives Access Preservation

21 Persistent identifiers: ARK http://catalogue.bnf.fr/ark:/12148/cb11992684q/UNIMARC Name Mapping Authority Host (NMAH) Label Name Assigning Authority Number (NAAN) Name (assigned by the Name Assigning Authority) Qualifier (interpreted by the Name Mapping Authority)

22 FRBR model Item Manifestation Expression Work Creation (at the top level of abstraction) e.g.: Tales of the Grotesque and Arabesque, E. A. Poe Version under which a Work is available (type of content, language, contribution) e.g.: French translation by Baudelaire Publication (production of the book: publisher, printer) e.g.: Ed. Paris : M. Lévy frères,1875. xxxi-330 p. Physical copy e.g.: digitized copy ark:/12148/bpt6k2029183 ark:/12148/bpt6k2029183

23 Why FRBR? Organized display of the various versions of a Work of the various productions of a Person Better matches to Web users' various queries

24 Why FRBR? To interrelate persons and works at the right level of contribution Creator of a work author, composer… Contributor to an edition translator, author of a foreword… Producer of a publication publisher, printer, distributor Name associated with a single exemplar owner, annotator…

25 Vocabularies No "records," only data for conceptsSKOS for authorsFOAF for worksRDA implementation of the FRBR model http://www.rdatoolkit.org/backgroundfiles/Relationships Overview_10_9_09.pdf http://www.rdatoolkit.org/backgroundfiles/Relationships Overview_10_9_09.pdf definition of the roles associated with the various types of contribution

26 BnF data sources Alignments with external data BnF Authorities Virtual exhibitions

27 Romain WENZ BnF-IBN 27 Complex ontology

28 "Author" page

29 "Work" page

30 RDF data http://data.bnf.fr/11890582/charles_ baudelaire/rdf.xml http://data.bnf.fr/11890582/charles_ baudelaire/rdf.xml http://data.bnf.fr/11947965/charles_ baudelaire_les_fleurs_du_mal/rdf.xml http://data.bnf.fr/11947965/charles_ baudelaire_les_fleurs_du_mal/rdf.xml

31 RDF data DBF. - GDEL. - Laffont Bompiani, Auteurs Baudelaire / Isabelle Viéville-Degeorges, impr. 2011 Les Fleurs du mal / Charles Baudelaire, 1869 BN Cat. gén.. - BN Cat. gén. 1960-1969. - BN Cat. gén. suppl. Charles Baudelaire (1821-1867) Baudelaire- Dufays (1821-1867) Pierre Dufaÿs (1821-1867) Baudelaire (1821-1867) Charles Pierre Baudelaire (1821-1867) Authority records are regarded as concepts SKOS

32 RDF data Identifying the person FOAF Charles Baudelaire Littératures Baudelaire Charles Paris 04-09 Paris 31-08-1867 09-04-1821 Charles Baudelaire Poète. - Critique d'art. - Traducteur d'Edgar Allan Poe. - Signa ses premières oeuvres de son patronyme et du nom de sa mère Caroline Archenbaut- Dufays male 1821-1867

33 RDF data Baudelaire as translator [contributor] Poe, Edgar Allan. Histoires grotesques et sérieuses Baudelaire as author [creator] Baudelaire, Charles. Le spleen de Paris

34 Benefits Visibility Being on users' path Finding resources without knowing of them Knowledge enhancement Simple pages and large audience Creating links Enabling data reuse

35 Benefits For the BnF Experimentation FRBRizing catalogue records Future developments for the catalogue New mode of disseminating bibliographic information Showing up in the Linked Data world

36 Presence in the Linked Data cloud http://richard.cyganiak.de/2007/10/lod/lod-datasets_2011-09-19_colored.html

37 Thank you!


Download ppt "Data.bnf.fr Françoise Leresche Bibliothèque nationale de France Bibliographic and Digital Information Department Standards and Models Unit Library Models."

Similar presentations


Ads by Google