Published by Vernon Walker. Modified over 9 years ago.
1
HTRC Workshop 101
THATCamp Gainesville
April 24, 2014
2
Outline
HathiTrust and HathiTrust Research Center overview
How to Use the HTRC Portal
– Workset Builder
– Algorithm Analysis
Opportunities to connect you with the HathiTrust Research Center
3
HathiTrust “Wow” Numbers
11,135,776 total volumes
5,801,121 book titles
290,893 serial titles
3,897,521,600 pages
499 terabytes
132 miles
9,048 tons
Public Domain: 3,743,574 volumes (~34% of total)
http://www.hathitrust.org
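The public-domain share quoted on this slide can be checked with a few lines of Python. This is a minimal sketch that uses only the figures on the slide; the variable names are illustrative. The page total also divides evenly into 350 pages per volume, which suggests (an inference, not something the slide states) that the page count is an estimate based on an average volume length.

```python
# Figures copied from the slide (April 2014 snapshot).
total_volumes = 11_135_776
public_domain_volumes = 3_743_574
total_pages = 3_897_521_600

# Share of the collection that is in the public domain.
public_domain_share = public_domain_volumes / total_volumes

# Average number of pages per volume implied by the two totals.
pages_per_volume = total_pages / total_volumes

print(f"Public domain share: {public_domain_share:.1%}")
print(f"Average pages per volume: {pages_per_volume:.0f}")
```

The computed share comes out slightly under 34%, consistent with the rounded "~34%" on the slide.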
4
Content Distribution
5
Dates
6
Language Distribution
The top 10 languages make up ~86% of all content
7
Board of Governors
Executive Committee
Executive Director
HathiTrust Digital Library
90+ partners
University of Illinois
Indiana University
HathiTrust Research Center
University of Michigan
Data Copy #1
Data Copy #2
Indiana University
8
HathiTrust Collection Builder
9
HTRC Portal
10
www.hathitrust.org/htrc
11
Log in to HTRC Portal
12
Create a Log In
13
How To Start a Workset
14
Log In Again to Workset Builder
15
Workset Builder
16
Why Worksets?
The result of a first-level, rough filter
Better scale for intensive analytics
Provides essential scope for certain analytics
– Word frequency scope over Bacon’s essays
Some tools (are trained to) work best on a narrow, homogeneous workset
Eliminate noise that would otherwise arise by asking questions across the whole of HT
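The scoping idea can be sketched in Python: a workset is a selected subset of volumes, and a word-frequency count over it differs from a count over everything. This is a toy illustration only, not the HTRC Portal's algorithm or API. The volume IDs, the `corpus` dictionary, and the `word_frequencies` helper are all hypothetical, and the texts are short quotations standing in for full volumes.

```python
import re
from collections import Counter

# Hypothetical toy corpus: the volume IDs are invented for illustration.
corpus = {
    "vol-001": "Reading maketh a full man; conference a ready man; and writing an exact man.",
    "vol-002": "Call me Ishmael. Some years ago, never mind how long precisely.",
    "vol-003": "Studies serve for delight, for ornament, and for ability.",
}

# A workset here is just a list of volume IDs picked by a rough first filter
# (for example, "Bacon's essays").
workset = ["vol-001", "vol-003"]


def word_frequencies(texts):
    """Count lowercase word tokens across an iterable of text strings."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z']+", text.lower()))
    return counts


# Frequencies scoped to the workset versus the whole corpus.
workset_counts = word_frequencies(corpus[vol_id] for vol_id in workset)
corpus_counts = word_frequencies(corpus.values())

print(workset_counts.most_common(3))
print("ishmael" in workset_counts, "ishmael" in corpus_counts)
```

Counting over the workset leaves out vocabulary from unrelated volumes, which is the "noise" the slide refers to.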
17
Workset Search
18
Select Items
19
Create Worksets
20
Analysis in the HTRC Portal
21
Choose Algorithm
22
Choose Collection(s) for Analysis
23
Run the Analysis…
24
Results!
25
View Results
26
Looking into the future
Non-consumptive research on copyrighted texts
Bookworm tool development: http://sandbox.htrc.illinois.edu/bookworm/
Improvement of metadata through Workset Creation for Scholarly Analysis (WCSA) study
Documentation and user guides forthcoming
27
Acknowledgements: HTRC Team
HTRC @ Illinois (GSLIS and the University Library): Stephen Downie, Tim Cole, Loretta Auvil, Sayan Bhattacharyya, Boris Capitanu, Colleen Fallaw, Katrina Fenlon, Harriett Green, Peter Organisciak, Megan Senseney, Craig Willis
Indiana University: led by Beth Plale
28
Get Involved!
HTRC Announcements: htrc-announce-l@list.indiana.edu
HTRC User Group: htrc-usergroup-l@list.indiana.edu
29
Questions?
Harriett Green
English and Digital Humanities Librarian
University of Illinois at Urbana-Champaign
green19@illinois.edu