Download presentation
Presentation is loading. Please wait.
Published byWyatt Adams Modified over 11 years ago
1
data.bnf.fr Françoise Leresche Bibliothèque nationale de France Bibliographic and Digital Information Department Standards and Models Unit Library Models and Standards and Their Availability in the Semantic Web Workshop LIDA 2012 Exposing heterogeneous data more efficiently, using Semantic Web tools
2
Bibliothèque nationale de France Millions of resources of any kind (books, scores, maps, still images, audiovisual materials…) published items / manuscripts and archival resources Made Web available through digitization
3
Different logics Published resources Bibliographic description (ISBD) and MARC format Archival resources Archival description (tree structure) and XML-EAD format Digitized resources Simple Dublin Core format
4
Catalogues BnF Catalogue Général 15 millions of bibliographic records Legal deposit of French edition BnF Archives et Manuscrits (BAM) Dozens of thousands of manuscripts and archival resources Mediaeval manuscripts, authors' archives Gallica 1,5 million of digital objects The largest digital library in the French-speaking world
5
Resources Often hidden in the "Deep Web," i.e.: buried in catalogues scattered among various addresses
6
Web practices Less people come in through home pages they access site pages directly Keyword research associated with final content e.g.: a book title Navigating links on blogs, sites, networks
7
Organizing information Gathering contents, links, and services Pages for human reading Data for machine reading Some tools are at hand: Authority records Description standards Perennial identifiers The project
8
Data clustering Published resources Archival resources Digital resources About authors, works, and topics Providing HTML pages RDF structured, open, W3C standards compliant data
9
For what purposes? To propose services Beyond warehouses For any audience To open data and make it free To provide reliable links
10
Theproject Links for authors, works, and topics Links to books, manuscripts, images… disseminated to any audience through user-friendly HTML pages
11
HTML pages In order to: provide content create links to resources lead users to services
13
HTML pages Clustering on one page all the information relating to an author a work a topic from heterogeneous formats using Semantic Web technologies
14
Semantic Web Combining a traditional Web interface with a Semantic web approach Gathering information around a concept In addition to resources, an address and meaningful machine-usable data RDF chosen as resource description framework
15
Elaboration on data Relying on authority records, and persistent identifiers Period of reflection on standards Defining a model Selecting vocabularies Data alignment and integration
16
How does it work? BnF Catalogue général Digitized resources Human-friendly Web pages BnF Archives et Manuscrits Machine-friendly data Clustering - Alignment
17
Functional principles Extracting - Transforming Mandragore Virtual exhibitions Gallica BnF Archives & manuscrits Catalogue général Clustering - Aligning Publication - control Autres sources Wikipedia. Other resources Wikipedia, etc. Web InterfaceLinked Data Page concepts leading to Other libraries
18
Authority records ID card for an entity (person, corporate body, work…) the preferred form variant forms (references) notes sources links to other records an identifier (at the BnF, ARK identifier)
19
Authority record for a work associated with an author Preferred access point title of work Link to bibliographic records Link to the authority record for the creator of the work Links to other works (whole/part relationships) Persistent identifier
20
Persistent identifiers Persistent identifier to recognize a resource uniquely on the long term Stable Reference Naming Finding Citing Two objectives Access Preservation
21
Persistent identifiers: ARK http://catalogue.bnf.fr/ark:/12148/cb11992684q/UNIMARC Name Mapping Authority Host (NMAH) Label Name Assigning Authority Number (NAAN) Name (assigned by the Name Assigning Authority) Qualifier (interpreted by the Name Mapping Authority)
22
FRBR model Item Manifestation Expression Work Creation (at the top level of abstraction) e.g.: Tales of the Grotesque and Arabesque, E. A. Poe Version under which a Work is available (type of content, language, contribution) e.g.: French translation by Baudelaire Publication (production of the book: publisher, printer) e.g.: Ed. Paris : M. Lévy frères,1875. xxxi-330 p. Physical copy e.g.: digitized copy ark:/12148/bpt6k2029183 ark:/12148/bpt6k2029183
23
Why FRBR? Organized display of the various versions of a Work of the various productions of a Person Better matches to Web users' various queries
24
Why FRBR? To interrelate persons and works at the right level of contribution Creator of a work author, composer… Contributor to an edition translator, author of a foreword… Producer of a publication publisher, printer, distributor Name associated with a single exemplar owner, annotator…
25
Vocabularies No "records," only data for conceptsSKOS for authorsFOAF for worksRDA implementation of the FRBR model http://www.rdatoolkit.org/backgroundfiles/Relationships Overview_10_9_09.pdf http://www.rdatoolkit.org/backgroundfiles/Relationships Overview_10_9_09.pdf definition of the roles associated with the various types of contribution
26
BnF data sources Alignments with external data BnF Authorities Virtual exhibitions
27
Romain WENZ BnF-IBN 27 Complex ontology
28
"Author" page
29
"Work" page
30
RDF data http://data.bnf.fr/11890582/charles_ baudelaire/rdf.xml http://data.bnf.fr/11890582/charles_ baudelaire/rdf.xml http://data.bnf.fr/11947965/charles_ baudelaire_les_fleurs_du_mal/rdf.xml http://data.bnf.fr/11947965/charles_ baudelaire_les_fleurs_du_mal/rdf.xml
31
RDF data DBF. - GDEL. - Laffont Bompiani, Auteurs Baudelaire / Isabelle Viéville-Degeorges, impr. 2011 Les Fleurs du mal / Charles Baudelaire, 1869 BN Cat. gén.. - BN Cat. gén. 1960-1969. - BN Cat. gén. suppl. Charles Baudelaire (1821-1867) Baudelaire- Dufays (1821-1867) Pierre Dufaÿs (1821-1867) Baudelaire (1821-1867) Charles Pierre Baudelaire (1821-1867) Authority records are regarded as concepts SKOS
32
RDF data Identifying the person FOAF Charles Baudelaire Littératures Baudelaire Charles Paris 04-09 Paris 31-08-1867 09-04-1821 Charles Baudelaire Poète. - Critique d'art. - Traducteur d'Edgar Allan Poe. - Signa ses premières oeuvres de son patronyme et du nom de sa mère Caroline Archenbaut- Dufays male 1821-1867
33
RDF data Baudelaire as translator [contributor] Poe, Edgar Allan. Histoires grotesques et sérieuses Baudelaire as author [creator] Baudelaire, Charles. Le spleen de Paris
34
Benefits Visibility Being on users' path Finding resources without knowing of them Knowledge enhancement Simple pages and large audience Creating links Enabling data reuse
35
Benefits For the BnF Experimentation FRBRizing catalogue records Future developments for the catalogue New mode of disseminating bibliographic information Showing up in the Linked Data world
36
Presence in the Linked Data cloud http://richard.cyganiak.de/2007/10/lod/lod-datasets_2011-09-19_colored.html
37
Thank you!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.