What’s next with the HathiTrust Research Center?

Presentation on theme: "What’s next with the HathiTrust Research Center?"— Presentation transcript:

1 What’s next with the HathiTrust Research Center?
How you can mine over 14 million volumes of one of the largest digital libraries ever built!
Robert H. McDonald | IU Libraries | Data to Insight Center

2 About the HathiTrust Digital Library (HTDL)
Repository
14.7+ million volumes
5.1+ billion pages
50% of volumes are in English
Material from the 15th C. on | 20th C. concentration
70% in copyright or undetermined | 30% open
Interface
Search and read books in the public domain | 1:1
Thanks to the efforts of the IUCat enterprise library systems team, you can link directly, one to one, to any title available within HathiTrust straight from IUCat.

3 About the HathiTrust Research Center
Facilitates text analysis of HTDL content
Large-scale, computational research
Research & Development
Conducting user studies
Finding technical solutions
Building tools and services
Located at the University of Illinois and Indiana University
Franco Moretti Patten Lecture - Feb

4 HTRC Eco-System

5 HTRC Growth

6 HTRC Analysis Tasks Portal 2014-2016

7 Summer 2016 ACS Projects
Fighting Fever in the Caribbean: Medicine and Empire, – University of Iowa
Inside the Creativity Boom – Brown University
The Chicago School: Wikification as the First Step in Text Mining in Architectural History – Illinois Institute of Technology
Signal and Noise and Pride and Prejudice: Toward an Information History of Romantic Fiction – Augsburg College

8 What’s New for
HTRC has access to all 14.7 million HT volumes (April 2016)
Next round of ACS (late Fall 2016) will feature use of all content
Integration of Bookworm with HTRC Portal
Currently indexes 4.4 million HT volumes
Will index 14.4 million HT volumes by mid-Fall 2016
Extracted Features set with all 14.7 million HT volumes by mid-Fall 2016

9 HTRC Useful Links
HTRC Portal
HTRC Extracted Features Dataset
HTRC FAQ
HTRC+BW

10 Acknowledgements
HTRC @ Indiana: Beth Plale (Co-PI), Inna Kouper, Robert McDonald, Angela Courtney, Marie Ma, Nicholae Cline, Samitha Liyanage, Leanne Mobley, Leena Unnikrishnan, Guangchen Ruan, Zong Peng, Milinda Pathirage
HTRC @ Illinois: J. Stephen Downie (Co-PI), Eleanor Dickson, Ryan Dubnicek, Beth Sandore Namachchivaya, Harriett Green, Peter Organisciak, Tim Cole, Megan Senseney, Loretta Auvil, Sayan Bhattacharyya, Boris Capitanu

11 THANK YOU!

