GSLIS Research Showcase, April 9, 2010

Slides:



Advertisements
Similar presentations
1 Leveraging Your Existing Campus Systems to Access Resource Partners: Federated Identity Management and Tales of Campus Participation EDUCAUSE 2006 October.
Advertisements

Access & Identity Management “An integrated set of policies, processes and systems that allow an enterprise to facilitate and control access to online.
UR Research University of Rochester River Campus Libraries UR Research Development and Use of the University of Rochester’s Institutional Repository Judi.
Connected Histories Sources for Building British History, Funded under the JISC eContent Capital Programme for 18 months Partners:  Prof. Tim.
Idiosyncrasy at Scale Data Curation and the Digital Humanities John Unsworth December 7, 2010 IDCC Man walks around.
Developing a System for Web Based Data Dissemination CSO Experience Strategies for Web based Data Dissemination Ghusoon M. Hameed IRAQ.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
UC Irvine’s Pre-Shib Attribute Setup PH / QI Directory Provides Authoritative Attribute Store –Had both Faculty / Staff and Student Information UCI’s Campus.
InCommon and Federated Identity Management 1
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
A4: Bringing the Library to the User: The Practice David Lindahl University of Rochester Libraries
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Online the Library Michaelmas Term 2011 Trinity College Library Dublin 1 1.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Archive-It and CINCH tool: Using web harvesting to facilitate born- digital preservation Kathleen Kenney Archive-It Partners Meeting 2012.
New Jersey Digital Video Initiative 1 NJ Digital Video Initiative: NJVid Grace Agnew, Associate University Librarian for Digital Library Systems, Rutgers.
EXtensible Catalog David Lindahl University of Rochester.
Project Builder and MediaMatrix: Redefining Access in the Digital Age Dean Rehberger and Michael Fegan MERLOT August 7-10, 2006 New Orleans, LA.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Illinois Research Connections Researcher Information System Project Rebecca Bryant, PhD
Amy Jackson UNM Technology Days July 22,  An institutional repository (IR) is a web-based database of scholarly material which is institutionally.
GridShib: Grid/Shibboleth Interoperability September 14, 2006 Washington, DC Tom Barton, Tim Freeman, Kate Keahey, Raj Kettimuthu, Tom Scavo, Frank Siebenlist,
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
Capture the Movement: Banner 7.0 and Beyond Susan LaCour, Senior Vice President, Solutions Development California Community Colleges Banner Group.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
Tools Carousel: Sakaibrary Jon Dunn Digital Library Program Indiana University Bill Dueber University Library University of Michigan Jon Dunn Digital Library.
Holly Eggleston, UCSD Beyond the IP Address: Shibboleth and Electronic Resources InCommon Library/Shibboleth Project.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Illinois Research Connections Researcher Information System Project Rebecca Bryant, PhD
The Evolving Scholarly Record in the Campus Context Sarah M. Pritchard March 23, 2015.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Semantically-Rich Tools for Text Exploration Andrew Ashton Center for Digital Scholarship Brown University.
Managing ETDs with Associated Complex Digital Objects Gabrielle V. Michalek Director, Scholarly Publishing, Archives and Data Services Carnegie Mellon.
Digital Library Development: Springboard to State-Wide Access Barbara I. Dewey Dean of Libraries University of Tennessee.
TOP SCHOLAR Digital Research WKU Mike Binder Dean of Libraries Western Kentucky University Presentation at Council of Academic Deans Retreat,
8 November 2012, Penn State Harrisburg Linda Friend University Libraries Publishing & Curation Services.
Tom Barton, Senior Director for Integration, University of Chicago
Arabic Collections Online (ACO)
Our Digital Showcase Scholars’ Mine Annual Report from July 2015 – June 2016 Providing global access to the digital, scholarly and cultural resources.
Federated Identity Management at Virginia Tech
Reusing and repurposing metadata in a Current Research Information System and Institutional Repository 3 June 2010 Robin Armstrong Viner Cataloguing.
Matt Link Associate Vice President (Acting) Director, Systems
IU Digital Library Program
? What is Institutional Repository for Rutgers University
John O’Keefe Director of Academic Technology & Network Services
Avalon's Role in the Digital Collections Ecosystem
Marketplace & service catalog concepts, first design analysis
Sakaibrary Project Update: Subject Research Guides
The Hosted Model Charl Roberts Good morning again,
Your Key to Privacy, Security, and Access to Services
Flexible Extensible Digital Object Repository Architecture
Federated Identity to Support Collaboration in the CIC
Flexible Extensible Digital Object Repository Architecture
Better than it was Finding what works for processing born-digital archives at the Bentley Historical Library Mike Shallcross U-M Bentley Historical Library.
Best Practices for Electronic Theses and Dissertations
Digital Measures Replacement
CNI Spring 2010 Membership Meeting
Data stewardship life cycle
Managing ETDs with Associated Complex Digital Objects
Registrars are a Barrier to Collaboration: Truth or CIO Pretext?
HathiTrust And Its Research Center
Supporting Institutions Towards a Shibbolized Infrastructure
Institutional Repositories
Slides showing what we have working now in Monk Last updated May 6, 2008 (by Catherine) Based on slides used at NEH meeting May 5th for a quick demo.
ArchivesSpace – Archivematica – DSpace Workflow Integration
Presentation transcript:

GSLIS Research Showcase, April 9, 2010 Moving from science experiment to library service GSLIS Research Showcase, April 9, 2010 John Unsworth, Dean Graduate School of Library and Information Science

Funded for two years (2007-2009) by the Andrew W. Mellon Foundation Focus: apply text-mining tools to digital libraries in the humanities; facilitate “reading at library scale.” Funded for two years (2007-2009) by the Andrew W. Mellon Foundation Involved faculty and staff at Illinois (GSLIS and NCSA), Northwestern, Nebraska, Maryland, Alberta, McMaster Content (150M words of literary text) contributed by Virginia, Indiana, UNC, ProQuest, Cengage Coverage: literature of many genres, in English, from 1600- 1920s 2010 GSLIS Research Showcase

In 2009, project was complete, meaning: All texts had been normalized to TEI-A markup and modern spelling (with old spellings preserved), part-of-speech tagged, provided with enhanced item-level metadata, and ingested into a database. A web-based user interface had been produced, allowing users to define collections, select analytic routines and parameters, and examine or export results as tables or visualizations. 2010 GSLIS Research Showcase

However, one of the deliverables promised in the original grant proposal was: Beta installations of MONK alongside several large collections provided by libraries or publishers, hosted on their servers Tim Cole at the UIUC Library had been keeping up with the project as it progressed, and after wrapping up the work on interface and content, I worked with him and Mike Grady (CITES) to produce this deliverable. 2010 GSLIS Research Showcase

However, one of the deliverables promised in the original grant proposal was: Beta installations of MONK alongside several large collections provided by libraries or publishers, hosted on their servers Tim Cole at the UIUC Library had been keeping up with the project as it progressed, and after wrapping up the work on interface and content, I worked with him and Mike Grady (CITES) to produce this deliverable. 2010 GSLIS Research Showcase

Authentication was required because the texts from ProQuest and Cengage (about 100M of our 150M words) were licensed to CIC universities…. All the CIC libraries licensed EEBO and ECCO, but only about half licensed ProQuest’s 19th-century fiction collection ProQuest agreed to allow all CIC institutions access to these texts in MONK, so all we needed was a mechanism for authentication. CIC CIOs had recently agreed to deploy a federated identity management system, called InCommon, and they were willing to provide some funding for a proof-of-concept integration of MONK with InCommon, allowing MONK to be presented as a library service. 2010 GSLIS Research Showcase

2010 GSLIS Research Showcase

“In an email last week, I hinted that we had some collective work to do to complete the Shibboleth access protocol for MONK text analysis functionality.  For Shibboleth to work, authenticated user information has to be pre-loaded.  That’s relatively easy when we have a couple of hundred known users as is the case for CICme, but is a project of different order when we’re trying to authenticate 400,000 users.  Somehow, someway, the University of Illinois Library—which manages the MONK servers— needs to know who on each of the CIC campuses “deserves” to be granted access to MONK. “  2010 GSLIS Research Showcase

-- email from Mark Sandler, CIC “That list will look a lot like your library circulation or e- license authenticated user lists, but in this case you’d be transferring the data to another agency and that will make some of your campus colleagues—registrars and H.R. folks—nervous.  They’ll want to know who is getting the data and why, what the privacy policies are for the University of Illinois, what “attributes” (name, email address, campus status, etc) are being released, plans to refresh the data, etc. “ -- email from Mark Sandler, CIC 2010 GSLIS Research Showcase

At this point, we’re still working on getting access opened up to all CIC users. The problems have much more to do with policy, law, and institutional process than they have to do with technical challenges, although there are certain technicalities in MONK that raise policy problems—for example, the need to have identity information persist, so that work can be conducted in multiple sessions. Moral of the story? 2010 GSLIS Research Showcase