Crowd-sourcing the creation of “articles” within the Biodiversity Heritage Library Bianca Crowley Trish Rose-Sandler

Slides:



Advertisements
Similar presentations
0 DIGITIZING GREY LITERATURE FROM THE ANTARCTIC BIBLIOGRAPHY COLLECTION Tina Gheen and Sue Olmsted National Science Foundation Arlington, Virginia USA.
Advertisements

Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital.
1 PORTO Open Repository Publications TORINO Technical architecture of U-GOV Pubblications Archive and PORTO Open Repository Publications Maddalena Morando.
Knowledge is Empowerment Tutorial Guide no. 18 SEARCH IN JSTOR LANGUAGE & LITERATURE and ART & SCIENCE.
Trish Rose-Sandler, Missouri Botanical Garden TDWG Oct 2013 Florence Italy Art of Life project Finding a goldmine of natural history illustrations within.
Periodicals BooksNewspapers Reference tools Online Databases Printed Version Electronic Version Annual reports and other publications.
Searching for Scientific Information 3rd March 2015 Kirsi Heino.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Resources to Answer Questions eModule 2 LSI Curriculum, Year 1 Content Authors Stephanie Schulte, MLIS, Assistant Professor, Health Sciences Library Carol.
Information-Seeking Behavior in the High-Energy Physics Community Tamar Sadeh School of Informatics, City University, London Ex Libris HCI conference,
JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS The Library behind the scene Opportunities for Scientific.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
IEEE/IEE Electronic Library Journals Proceedings Standards and More MCIT Library Consortia, Delhi Global Information Systems Technology Pvt. Ltd
OpenUp! A New Project on Opening up the European Natural History Heritage for EUROPEANA W. G. Berendsohn, A. K. Michel, A. Güntsch, W.-H. Kusber (2011)
ISP 433/533 Week 8 IR in libraries. Goal Universal Access to Information Vannevar Bush 1945 article Memex A memex is a device in which an individual stores.
Biodiversity Heritage Library by Connie Rinaldo. Overview History EOL/BHL: WHY? Members/Collaborators Process Governance Sustainability: Legal and Financial.
Extending the Lifecycle of Scientific Field Notes: Making Hidden Collections Reusable Riccardo Ferrante Smithsonian Institution Rusty.
Click here for getting your Student User Id & password.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
Link yourself or perish? PhytoKeys, the next generation journal in systematic botany Lyubomir Penev 1, W. John Kress 2, Sandra Knapp 3, De-Zhu Li 4, Susanne.
International Services and Tools for Content, Metadata and IPR Management Wen Gao Department of Computer Science 10/24/2013.
1 Newspaper Digitisation Workflows Rose Holley- Manager ANDP Presentation to Cultural Heritage Digitisation professionals 26 November 2008.
1 Australian Newspapers Digitisation Program Development of the Newspapers Content Management System Rose Holley – ANDP Manager ANPlan/ANDP Workshop, 28.
Instructional Technology & Design Office or Zotero & Mendeley Workshop Presented by Aisha Conner-Gaten.
Online Resources From Oxford University Press This presentation gives a brief description of Oxford Scholarly Authorities on International Law. It tells.
MIRA to TDIL Workflows Alicia Morris October 2, 2014.
IDENTIFYING OPEN ACCESS ARTICLES: VALID AND INVALID METHODS David Goodman Palmer School of Library and Information Science, Long Island University Kristin.
Gathering and Analyzing Web Use Statistics: A Practical Tutorial for Archivists Michael Szajewski, Ball State University, Archivist for Digital Development.
The Pensoft Journal System and XML-based workflow Lyubomir Penev Life and Literature Conference, Chicago 2011 ViBRANT Virtual Biodversity.
Value to organisations: the research library view point Susan APA, Frascati, Nov 6, 2012.
Alma 1 year after STP: implementing batch services IGeLU Budapest Sep 2, 2015 Bart Peeters Head Operations LIBIS.
Breakouts. Penguins: Skunks: Cacti: Beetles: Classroom A - Suzanne Classroom C - Chris Lecture Hall 2 - Connie Ward Lecture Hall - Marie (Theme: Content.
Global BHL Meeting Fez - Morocco, May, 2013 BHL Brazil [and LA&C] via SciELO BHL Network Abel L. Packer SciELO, Coordinator Technical support: Fabiana.
University of North Texas Enhancing the Quality of Metadata: Modular Approach to Digital Resource Lifecycle Management Daniel Gelaw Alemneh & Mark E. Phillips.
MAINTAINING QUALITY METADATA: TOWARD EFFECTIVE DIGITAL RESOURCE LIFECYCLE MANAGEMENT Daniel Gelaw Alemneh University of North Texas.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Track 1 – Part 1 What can we do to prepare the library of the future for researchers ? The Europeana Library Conference Madrid, December 2012.
Personal information: Name: Naif bin Abdullah bin Ahmed Alhabas. Current job: Librarian, Faculty of Science. Major: Library and Information. Grade: seventh.
Keele Pathfinder Project CLA Reporting of Scanned Material in a Repository Pathfinder - Tim Denning - Project Leader Catering VLE Powerlink - Boyd Duffee.
This tutorial will help you to search for articles in the JSTOR database. To begin, we’ll visit the library homepage.
Collection as the Cornerstone of Presented by Sara Bishop, Administrative Systems Development West Virginia University Office of Information Technology.
Challenges for Academic Libraries in the Networked World Christine L. Borgman Professor & Presidential Chair in Information Studies UCLA & Visiting Professor.
TDWG 2006 Conference, St Louis Digitizing the legacy literature of biodiversity An introduction to the Biodiversity Heritage Library (BHL) Neil Thomson.
Making search simpler The NHM Library and Archives Virtual Library Project.
Online Resources From Oxford University Press This presentation gives a brief description of Oxford Public International Law It tells you what Oxford.
Automated (meta)data collection – problems and solutions Grete Christina Lingjærde and Andora Sjøgren USIT, University of Oslo.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Welcome to de Gruyter Reference Global. De Gruyter Reference Global provides you with comprehensive access to high quality academic content Run a quick.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
Identifying, Creating, Managing and Preserving Home-Grown Collections Leticia Camacho, BYU, Business Librarian Shellie Dean, BYU Copyright Licensing Office.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Extending Discovery: Help Others Find Your Conference’s Content Adam Philippidis, 26 July 2008 IEEE Indexing & Database Production.
Smart Linking With SFX SFX Training, Intranet Internet range of authorities, technologies A&I e-print FTXT OPAC FTXT A&I Electronic Scholarly Information.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
Taxonomic Name Recognition (TNR) in Biodiversity Heritage Library (生物多样性图书馆分 类学名称识别) Qin Wei (魏琴), Chris Freeland, P. Bryan Heidorn Missouri Botanical.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Biodiversity Heritage Library: A Successful Collaboration, A Fully Open Access Collection Marty Schlabach Mann Library, Cornell University Upstate New.
Freeland, LAPI II, 18 NOV 2008 Digital Libraries for Science: Botanicus & Biodiversity Heritage Library Chris Freeland Director of Bioinformatics, Missouri.
World wide access to biodiversity literature The Biodiversity Heritage Library Henning Scholz 1 & Tom Garnett 2 1 Museum für Naturkunde, Berlin, Germany.
Trove Tufts Digital Image Library
The High Energy Physics information platform: Introduction
Open access, the academic library and collection management : new problems, new responsibilities, new challenges Dorette Snyman Unisa Library
Review Key Teaching Points
Online Resources From Oxford University Press.
Networked Information Resources
New Platform to Support Digital Humanities in the Czech Republic
Library Research for the Annotated Bibliography
Presentation transcript:

Crowd-sourcing the creation of “articles” within the Biodiversity Heritage Library Bianca Crowley Trish Rose-Sandler

The BHL is… A consortium of 13 natural history, botanical libraries and research institutions An open access digital library for legacy biodiversity literature. An open data repository of taxonomic names and bibliographic information An increasingly global effort BHL LITA 2011

Problem: Books vs. Articles Librarians manage booksUsers need articles BHL LITA 2011

Solution: “Article-ization” Creating articles manually, through the help of our users: BHL PDF Generator Creating articles through automated means: BioStor BHL LITA 2011 Page, R. (2011). Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library. BMC Bioinformatics, 12(187). Retrieved from

LITA 2011 BHL

Create-your-own PDF BHL LITA 2011

Citebank today: BHL LITA 2011

What is an “article” anyway? BHL LITA 2011

the Good, the Bad, the Ugly BHL LITA 2011

the Good, the Bad, the Ugly BHL LITA 2011

the Good, the Bad, the Ugly BHL LITA 2011

Questions for Data Analysis What is the quality, or accuracy, of user provided metadata? What kinds of content are users creating? How can we improve the PDF generator interface? BHL LITA 2011

Stats Jan 2010-Apr 2011 –Approx 60,000 pdfs created from PDF Generator –40% of those (approx 24,000) were ingested into CiteBank (PDFs without user-contributed metadata excluded) 5 reviewers analyzed 945 pdfs (approx 3.9% of the 24,000+ articles going into Citebank) **Thanks to reviewers Gilbert Borrego, Grace Costantino, and Sue Graves from the Smithsonian Institution BHL LITA 2011

Methodological approach Quantitative – numerical rating system Rated titles, authors, beg/end pages Its “findability” within CiteBank search often determined how it was rated BHL LITA 2011

Ratings System Title 1=has all characters in title letter for letter 2=does not have all characters in title letter for letter but still findable in CiteBank search 3= does not have all characters in title letter for letter and is NOT findable via the CiteBank search LITA 2011 BHL

Ratings System Author 1=has all characters in author(s) last name letter for letter 2=has at least one author’s last name spelled correctly 3=has no authors or none of the author’s last names are spelled correctly LITA 2011 BHL

Ratings System Article beginning & ending pages 1=has all text pages for an article, from start to end 2=subset of pages from a larger article 3=a set of pages where the intellectual content has been compromised. LITA 2011 BHL

Analysis steps LITA 2011

Results Title average 1.68 Title average1.68 Author(s) average1.33 Beg/End pg average1.41 Title & Author average1.50 Overall average (combines first 3 above) 1.47 LITA 2011 BHL

What did we learn? Ratings were better than we expected Many users took the time to create decent metadata “good enough” is not great but is still “findable” LITA 2011 BHL

BHL-Australia’s new portal there’s always room for improvement Other factors But of course….. BHL LITA 2011

Changes we made for UI so far Asking users if they want to contribute their article to CiteBank Making article title a required field and validating it so its at least 2 or more characters Review button for users to review page selections and metadata (inspired by BHL- AUS) Reduced text and increased more intuitive graphics (inspired by BHL-AUS) BHL LITA 2011

Brief survey of proposed changes Overwhelmingly positive response to proposed change there’s always room for improvement But of course….. BHL LITA 2011

Success Factors Monitor the creation of the metadata to look at user behavior and patterns Engage with your users Incentivize your users LITA 2011

@BioDivLibrary /pages/Biodiversity-Heritage-Library/ /photos/biodivlibrary/sets/ /group/biodiversity-heritage-library Bianca Crowley Trish Rose-Sandler trish.rose- BHL LITA 2011