Download presentation
Presentation is loading. Please wait.
Published byAndrea McKenzie Modified over 9 years ago
1
LINKED LITERATURE BHL DEVELOPMENTS CITEBANK Chris Freeland Technical Director, BHL
2
Biodiversity Heritage Library: http://biodiversitylibrary.org BHL Members
3
Biodiversity Heritage Library: http://biodiversitylibrary.org BHL Members: US/UK Academy of Natural Science (Philadelphia, PA) American Museum of Natural History (New York, NY) California Academy of Science (San Francisco, CA) The Field Museum (Chicago, IL) Harvard University Botany Libraries (Cambridge, MA) Harvard University, Ernst Mayr Library of the Museum of Comparative Zoology (Cambridge, MA) Marine Biological Laboratory / Woods Hole Oceanographic Institution (Woods Hole, MA) Missouri Botanical Garden (St. Louis, MO) Natural History Museum (London, UK) The New York Botanical Garden (New York, NY) Royal Botanic Gardens, Kew (Richmond, UK) Smithsonian Institution Libraries (Washington, DC)
4
Biodiversity Heritage Library: http://biodiversitylibrary.org BHL Members: BHL-Europe Museum für Naturkunde - Leibniz- Institut für Evolutions- und Biodiversitätsforschung an der Humboldt-Universität zu Berlin Natural History Museum, UK Narodni muzeum NMP CZ Angewandte Informationstechnik Forschungsgesellschaft mbH Freie Universität Berlin FUBBGBM Georg-August-Universität Göttingen Stiftung Öffentlichen Rechts Naturhistorisches Museum Wien Hungarian Natural History Museum Museum and Institute of Zoology, Polish Academy of Sciences University of Copenhagen Stichting Nationaal Natuurhistorisch Museum, Naturalis National Botanic Garden of Belgium Royal Museum for Central Africa, Royal Belgian Institute of Natural Sciences Bibliothèque nationale de France Museum national d’histoire naturelle Consejo Superior de Investigaciones Cientificas Università degli Studi di Firenze Royal Botanic Garden, Edinburgh Species 2000 John Wiley & Sons limited Helsingin yliopisto UH-Viikki
5
Biodiversity Heritage Library: http://biodiversitylibrary.org Stats: Now Online 15,000 titles 40,000 volumes 16.4mil pages Soon: 34,000 titles 65,000 volumes 24mil pages Oldest book: Schöffer’s Herbarius, 1484.Herbarius
6
Biodiversity Heritage Library: http://biodiversitylibrary.org Stats: Usage Jan – Sep 2009 266,000 visitors 436,000 visits 2.1million pageviews Daily average 970 visitors 1,600 visits / day 7,700 pageviews / day Jan – Sep 2009 Launch to 30 Sep 2009
7
Biodiversity Heritage Library: http://biodiversitylibrary.org Cloud storage & computing DuraCloud Pilot Test of cloud storage & computing with DuraSpace Foundation & New York Public Library Early alpha stage More info: http://www.duraspace.org/duracloud.php http://www.duraspace.org/duracloud.php Inhouse solution demo’d by MOBOT & MBL Redundant high performance storage on commodity hardware Cloud computing pub crawl working group tonight http://www.tdwg.org/conference2009/program/working-sessions/ http://www.tdwg.org/conference2009/program/working-sessions/
8
Biodiversity Heritage Library: http://biodiversitylibrary.org Global, coordinated development New functionality from BHL-Europe Improved deduplication tools Semantic interface OAIS-compliant preservation infrastructure Building a community of developers Funded & volunteer RubyBHL: http://github.com/mjy/rubyBHL http://github.com/mjy/rubyBHL PyBHL: http://linux.softpedia.com/get/Programming/Libraries/pybhl- 51612.shtml http://linux.softpedia.com/get/Programming/Libraries/pybhl- 51612.shtml New partners, new content
9
Biodiversity Heritage Library: http://biodiversitylibrary.org Open Source Pageturning UI http://github.com/openlibrary/bookreader
10
Biodiversity Heritage Library: http://biodiversitylibrary.org Open Software & Development BHL Bits: Portal code, utilities, services http://code.google.com/p/bhl-bits/ http://code.google.com/p/bhl-bits/ Taxonomic Literature Group Google Group for discussion of “taxonomic literature & the services required to make literature interoperable within biodiversity research and biodiversity informatics.” http://groups.google.com/group/taxonlit http://groups.google.com/group/taxonlit
11
Biodiversity Heritage Library: http://biodiversitylibrary.org Open Data Downloads Simple tab-delimited exports of core data http://www.biodiversitylibrary.org/data/BHLExportSchema.pdf http://www.biodiversitylibrary.org/data/BHLExportSchema.pdf Data model DB schema as ERD http://bhl-bits.googlecode.com/files/20090930_BHLDataModel.pdf http://bhl-bits.googlecode.com/files/20090930_BHLDataModel.pdf
12
Biodiversity Heritage Library: http://biodiversitylibrary.org Services Names Service Return all occurrences of a name throughout BHL digitized corpus Documentation: http://bit.ly/2e6sg9http://bit.ly/2e6sg9 Access to 51million name strings using TaxonFinder 1.4million unique names Working out a strategy for obscure species Algorithm improvements to detect nomenclatural & taxonomic acts OpenURL Facilitate links to citations: protologues, articles, references Documentation: http://www.biodiversitylibrary.org/openurlhelp.aspxhttp://www.biodiversitylibrary.org/openurlhelp.aspx Useful to Nomenclators, Reference Systems IPNI Tropicos
13
Biodiversity Heritage Library: http://biodiversitylibrary.org Services: OpenURL http://www.biodiversitylibrary.org/openurl? pid=title:3934&volume=14&issue=&spage=301&date=1879 http://www.biodiversitylibrary.org/openurl? pid=title:3934&volume=14&issue=&spage=301&date=1879 http://www.tropicos.org/Name/1200408
14
Biodiversity Heritage Library: http://biodiversitylibrary.org Services: OpenURL Disambiguation Looking for: BHL returns:
15
Biodiversity Heritage Library: http://biodiversitylibrary.org Services: OpenURL Results
16
Biodiversity Heritage Library: http://biodiversitylibrary.org How? Tropicos maintains internal authority list of publications: Each protologue/reference tied to authority: Matched Tropicos TitleIDs to BHL TitleIDs: Throw citations at resolver at regular intervals & cache data in Tropicos http://www.tropicos.org/Publication/775 http://www.biodiversitylibrary.org/title/3934 http://www.tropicos.org/Publication/775http://www.tropicos.org/Publication/775 = http://www.biodiversitylibrary.org/openurl? pid=title:3934&volume=14&issue=&spage=301&date=1879 http://www.biodiversitylibrary.org/openurl? pid=title:3934&volume=14&issue=&spage=301&date=1879
17
Biodiversity Heritage Library: http://biodiversitylibrary.org Encyclopedia of Life 522,000 species pages linked to BHL #1 referring site
18
Biodiversity Heritage Library: http://biodiversitylibrary.org Other Consumers EarthCape Labs Sort/Search capabilities with harvested names YouTube demo: http://www.youtube.com/watch?v=qw7qw87JTOs http://www.youtube.com/watch?v=qw7qw87JTOs BioGUID / iPhylo BHL Name Timeline & Comparison http://bioguid.info/bhl/ http://bioguid.info/bhl/compare.php New Viewer Tagging So much cool stuff we can’t keep up! http://iphylo.blogspot.com/search/label/BHL @rdmpage
19
http://bioguid.info/bhl/compare.php?name1=Physeter+catodon&name2=Physeter+macrocephalus
20
Biodiversity Heritage Library: http://biodiversitylibrary.org Crowdsourced Articles http://www.biodiversitylibrary.org/pdfgen/17298 http://www.biodiversitylibrary.org/pdfgen/17298 Demo: http://youtube.com/watch?v=oidf3b26jVshttp://youtube.com/watch?v=oidf3b26jVs
21
Biodiversity Heritage Library: http://biodiversitylibrary.org Crowdsourced Articles 12,000 PDFs generated through September 2009 4,900 submitted with article metadata Analysis: http://bit.ly/4Jqu9 http://bit.ly/4Jqu9
22
Biodiversity Heritage Library: http://biodiversitylibrary.org Great, but how to… display / manage? meet community demands for bibliography / citation management? build from more open source tools?
23
Biodiversity Heritage Library: http://biodiversitylibrary.org Development goals re: citations Create a repository for community-vetted taxonomic bibliographies. Ability to ingest, display, download, and index articles so that the BHL can operate as an article repository. Build from existing community of work around Drupal / Biblio. In use by collaborators
24
“something like GenBank or NameBank for citations…” So, CitationBank…or CiteBank (saves chars) Need…
25
http://citebank.biodiversitylibrary.org/
26
Biodiversity Heritage Library: http://biodiversitylibrary.org Crowdsourced Articles PDFs from BHL pushed into Drupal/Biblio:
27
Biodiversity Heritage Library: http://biodiversitylibrary.org http://citebank.biodiversitylibrary.org/search
28
Biodiversity Heritage Library: http://biodiversitylibrary.org http://citebank.biodiversitylibrary.org/node/47423
29
Biodiversity Heritage Library: http://biodiversitylibrary.org PDF http://www.biodiversitylibrary.org/pdf1/000295100017298.pdf
31
Biodiversity Heritage Library: http://biodiversitylibrary.org CiteBank boundaries Book Citation Pageturning UI PDF OCR eBook/Kindle Stored *somewhere* & retrievable via HTTP URI Citation Bibliography CiteBank
32
BHL Data Flow – Sep 2009 CiteBank
33
Biodiversity Heritage Library: http://biodiversitylibrary.org Copyright Bold statements that need some good legal counsel: Citations don’t have copyright Unless you get them from OCLC, other services Bibliographies have copyright They’re a scholarly work Underlying content has copyright Except when it doesn’t
34
Up for discussion…
35
Biodiversity Heritage Library: http://biodiversitylibrary.org Who can upload & edit? Trusted repositories? Approved specialists? BHL Librarians? People in this session? Citizen scientists? 6 th graders? Rod Page? Discussion: Session participants thought it important that BHL get as many citations as possible, then find ways of implementing trust mechanisms for users such as iSpot (Drupal module), ratings systems, ways of tagging inappropriate materials.
36
Biodiversity Heritage Library: http://biodiversitylibrary.org What about duplicates? 3 Bibliographies had Syst. Nat. All 3 in different reference manager formats All 3 had variant forms of title: Syst. Nat. Systema Naturae Systema naturae per regna tria naturae Library catalogues: Caroli Linnaei...Systema naturae per regna tria naturae :secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis. Discussion: Important to have all the ways in which materials have been referred to over time, then have algorithms & people aggregate titles/articles (translations) into reconciliation groups, resulting in a master index.
37
Biodiversity Heritage Library: http://biodiversitylibrary.org Accuracy How clean is clean? How dirty is dirty? What’s good enough? How to Rank Gold/Platinum? Dirty Bucket/Clean Bucket? Discussion: Let users decide which is the “right” form for use; may differ from project to project. BHL should take it all in, then refine using our libraries’ collected knowledge + involvement from domain specialists.
38
Biodiversity Heritage Library: http://biodiversitylibrary.org Right technologies? “But Drupal’s awful…just ask ___ for their bad experience.” “Drupal’s great!” “MySQL won’t scale” “MySQL’s great!” Discussion: Drupal has limitations, but a large community of developers & implementers. There may be a “Montpellier Declaration” to centralize efforts within biodiversity informatics around the framework. Drupal/Biblio is a good starting point for CiteBank, needs further evaluation after more data are loaded & site is used.
39
Biodiversity Heritage Library: http://biodiversitylibrary.org Next steps Bring hardware online at MBL Have one point of redundancy By Q1 2010 Bring BHL-Europe & other nodes online In conjunction with DuraCloud & other solutions Release CiteBank for beta & sandbox testing Beta at http://citebank.biodiversitylibrary.org http://citebank.biodiversitylibrary.org Sandbox at http://sandcite.biodiversitylibrary.org http://sandcite.biodiversitylibrary.org Production release by Q2 2010 Integration of BHL-Europe tools & content
40
Biodiversity Heritage Library: http://biodiversitylibrary.org Coming soon Darwin’s Library AMNH, NHM, CUL, BHL (MOBOT) Funded by NEH/JISC Digitization of Darwin’s personal library, with annotations New interfaces for recording, indexing, displaying annotations Inhouse scanning from partners/contributors
41
Biodiversity Heritage Library: http://biodiversitylibrary.org Fun: BHL In Your Pocket! Content now available in EPUB format Used by Stanza, transferable to Kindle Blog post by John Mignault (NYBG): http://john.mignault.net/blog/2009/10/28/first-bhl-e-book-experiments/ http://john.mignault.net/blog/2009/10/28/first-bhl-e-book-experiments/
43
Biodiversity Heritage Library: http://biodiversitylibrary.org Links & such Biodiversity Heritage Library http://biodiversitylibrary.org http://biodiversitylibrary.org CiteBank beta http://citebank.biodiversitylibrary.org http://citebank.biodiversitylibrary.org CiteBank sandbox http://sandcite.biodiversitylibrary.org http://sandcite.biodiversitylibrary.org Go play! http://twitter.com/BioDivLibrary Follow BHL on
44
Biodiversity Heritage Library: http://biodiversitylibrary.org Thanks! Chris Freeland Technical Director, BHL Director, Center for Biodiversity Informatics, Missouri Botanical Garden chris.freeland@mobot.org http://twitter.com/chrisfreeland Presentation online through TDWG & at http://www.chrisfreeland.com http://www.chrisfreeland.com
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.