Presentation is loading. Please wait.

Presentation is loading. Please wait.

HTRC Workshop 101 THATCamp Gainesville April 24, 2014.

Similar presentations


Presentation on theme: "HTRC Workshop 101 THATCamp Gainesville April 24, 2014."— Presentation transcript:

1 HTRC Workshop 101 THATCamp Gainesville April 24, 2014

2 Outline HathiTrust and HathiTrust Research Center overview How to Use the HTRC Portal – Workset Builder – Algorithm Analysis Opportunities to connect you with the HathiTrust Research Center

3 HathiTrust “Wow” Numbers 11,135,776 total volumes 5,801,121 book titles 290,893 serial titles 3,897,521,600 pages 499 terabytes 132 miles 9,048 tons Public Domain: 3,743,574 volumes(~34% of total) http://www.hathitrust.org

4 Content Distribution

5 Dates

6 Language Distribution The top 10 languages make up ~86% of all content

7 Board of Governors Executive Committee Executive Director HathiTrust Digital Library 90+ partners University of Illinois Indiana University HathiTrust Research Center University of Michigan Data Copy #1 Data Copy #2 Indiana University

8 HathiTrust Collection Builder

9 HTRC Portal

10 www.hathitrust.org/htrc

11 Log in to HTRC Portal

12 Create a Log In

13 How To Start a Workset

14 Log In Again to Workset Builder

15 Workset Builder

16 Why Worksets? The result of a first-level, rough filter Better scale for intensive analytics Provides essential scope for certain analytics – Word frequency scope over Bacon’s essays Some tools (are trained to) work best on a narrow, homogeneous work-set Eliminate noise that would otherwise arise by asking questions across whole of HT

17 Workset Search

18 Select Items

19 Create Worksets

20 Analysis in the HTRC Portal

21 Choose Algorithm

22 Choose Collection(s) for Analysis

23 Run the Analysis…

24 Results!

25 View Results

26 Looking into the future Non-consumptive research on copyrighted texts Bookworm tool development: http://sandbox.htrc.illinois.edu/bookworm/ http://sandbox.htrc.illinois.edu/bookworm/ Improvement of metadata through Workset Creation for Scholarly Analysis (WCSA) study Documentation and user guides forthcoming soon

27 Acknowledgements: HTRC Team HTRC @ Illinois (GSLIS and the University Library): Stephen Downie, Tim Cole, Loretta Auvil, Sayan Bhattacharyya, Boris Capitanu, Colleen Fallaw, Katrina Fenlon, Harriett Green, Peter Organisciak, Megan Senseney, Craig Willis Indiana University: led by Beth Plale

28 Get Involved! HTRC Announcements: htrc-announce-l @ list.indiana.edu HTRC User Group: htrc-usergroup-l @ list.indiana.edu

29 Questions? Harriett Green English and Digital Humanities Librarian University of Illinois at Urbana-Champaign green19@illinois.edu


Download ppt "HTRC Workshop 101 THATCamp Gainesville April 24, 2014."

Similar presentations


Ads by Google