SEASR Analytics for Zotero
Loretta Auvil
Automated Learning Group
Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation

Outline
Brief Zotero Introduction
Brief SEASR Introduction
SEASR Analytics for Zotero Plugin
Hands-on Learning Exercises
A Little More Advanced Information

The Zotero Picture
[Diagram labels: The Web, Zotero Store]

What is Zotero? (from Zotero Quick Start Guide)
A citation manager. It is designed to store, manage, and cite bibliographic references, such as books and articles. In Zotero, each of these references constitutes an item.
An extension for the Firefox web browser by the Center for History and New Media at George Mason University.
Installed by visiting zotero.org and clicking the download button on the page.

Zotero Features (from zotero.org)
Automatically capture citations
Remotely back up and sync your library
Store PDFs, images, and web pages
Cite from within Word and OpenOffice
Take rich-text notes in any language
Wide variety of import/export options
Free, open source, and extensible
Collaborate with group libraries
Organize with collections and tags
Access your library from anywhere
Automatically grab metadata for PDFs
Use thousands of bibliographic styles
Instantly search your PDFs and notes
Advanced search and data mining tools
Interface available in over 30 languages

What is SEASR?
This project will focus on developing, integrating, deploying, and sustaining a set of reusable and expandable software components and a supporting framework.
SEASR will provide a broad set of data mining applications for scholars in the humanities.
The key goals:
–Support the development of a state-of-the-art software environment for unstructured data management and analysis of digital libraries, repositories, and archives
–Develop user interfaces, a data flow engine, and demonstration flows that provide data management, analysis, and visualization capabilities
–Support education and training through workshops to promote its usage among scholars

The SEASR Picture

SEASR Enables Scholarly Research
Discovery
–What are the words used in the corpus?
–What named entities (people, locations, dates) can be extracted?
–What hypotheses or rules can be generated from the “features” of the corpus?
–What “features” or language of the corpus best describe the corpus?
–What are the “similarities” among elements, documents, or corpora?
–What patterns can be identified?

Enables Scholar to Ask…
Pattern identification using automated learning
–Which patterns are characteristic of the English language?
–Which patterns are characteristic of a particular author, work, topic, or time?
–Which patterns based on words, phrases, sentences, etc. can be extracted from literary bodies?
–Which patterns are identified based on grammar or plot constructs?
–When are correlated patterns meaningful?
–Can they be categorized based on specific criteria?
–Can an author’s intent be identified given an extracted pattern?

Meandre: Workbench
Web-based UI
Components and flows are retrieved from server
Additional locations of components and flows can be added to server
Create flow using a graphical drag and drop interface
Change property values
Execute the flow
[Screenshot labels: Locations, Components, Flows, Existing Flow]

The Zotero + SEASR Picture
[Diagram labels: The Web, Zotero Store]

SEASR Analytics for Zotero
An extension for the Firefox web browser by the SEASR Team
Uses your Zotero Collections
Performs analysis using SEASR Services

SEASR Analytics for Zotero Interface

Tag Cloud Viewer
Given: Zotero item(s)
Creates a tag cloud for all submitted items that have a URL
–Stop words and common tokens (punctuation) are filtered out
–The top 100 words are displayed in the tag cloud viewer
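The filtering and ranking steps behind a tag cloud can be sketched in a few lines of Python. This is only an illustration, not the SEASR flow itself: the stop-word list, the tokenizer regex, and the `top_words` helper are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; a real flow would use a fuller one.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "it", "for", "on", "that"}

def top_words(text, limit=100):
    """Return up to `limit` (word, count) pairs, most frequent first."""
    # Keeping only alphabetic runs drops punctuation tokens as a side effect.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    counts = Counter(t for t in tokens if t not in STOP_WORDS)
    return counts.most_common(limit)

sample = "The cat and the hat. The cat sat on the mat; the cat slept."
tag_cloud = top_words(sample)
print(tag_cloud[:3])
```

A viewer would then scale each word's font size by its count; that rendering step is left out here.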

Date Entities to Simile Timeline
Extracts date entities (using OpenNLP) from all submitted items that have a URL, and plots every date it can place on the Simile Timeline

HITS Summarizer
Finds the top sentences and tokens from all submitted items that have a URL and displays them
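The slide does not say how the flow builds its graph, so the following is only a sketch of one common way to apply HITS to summarization: sentences act as hubs, tokens act as authorities, and a sentence is linked to every token it contains. The function name `hits_summary`, the tokenizer, and the iteration count are assumptions for illustration.

```python
import math
import re

def hits_summary(text, iterations=50):
    """Rank sentences (hubs) and tokens (authorities) with HITS on a bipartite graph."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    bags = [set(re.findall(r"[a-z]+", s.lower())) for s in sentences]
    tokens = sorted(set().union(*bags))
    hub = [1.0] * len(sentences)
    auth = {t: 1.0 for t in tokens}
    for _ in range(iterations):
        # Authority update: a token is important if important sentences use it.
        auth = {t: sum(hub[i] for i, bag in enumerate(bags) if t in bag) for t in tokens}
        norm = math.sqrt(sum(v * v for v in auth.values())) or 1.0
        auth = {t: v / norm for t, v in auth.items()}
        # Hub update: a sentence is important if it contains important tokens.
        hub = [sum(auth[t] for t in bag) for bag in bags]
        norm = math.sqrt(sum(v * v for v in hub)) or 1.0
        hub = [v / norm for v in hub]
    ranked_sentences = sorted(zip(sentences, hub), key=lambda p: p[1], reverse=True)
    ranked_tokens = sorted(auth.items(), key=lambda p: p[1], reverse=True)
    return ranked_sentences, ranked_tokens

sample = ("Zotero stores citations. "
          "Zotero stores citations and notes for scholars. "
          "Cats sleep.")
ranked_sentences, ranked_tokens = hits_summary(sample)
print(ranked_sentences[0][0])
print([t for t, _ in ranked_tokens[:3]])
```

Stop words are not filtered in this sketch; in practice they would probably be removed first so that they do not dominate the token ranking.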

Flesch-Kincaid Readability Test
Given: Zotero item(s)
Results show scores for each item selected
–Designed to indicate comprehension difficulty when reading a passage of contemporary academic English
–Flesch Reading Ease: higher scores indicate material that is easier to read; lower numbers mark passages that are more difficult to read
–Flesch–Kincaid Grade Level: result is a number that corresponds with a grade level
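Both scores come from the standard published formulas, which use only the average words per sentence and the average syllables per word. The sketch below applies them in Python; the syllable counter is a rough vowel-group heuristic chosen for the example, so its scores will differ slightly from tools that use a pronunciation dictionary, and it is not the SEASR implementation.

```python
import re

def count_syllables(word):
    """Rough heuristic: count vowel groups, discounting a silent trailing 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)

def readability(text):
    """Return (Flesch Reading Ease, Flesch-Kincaid Grade Level) for `text`."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    ease = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    grade = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return ease, grade

simple_ease, simple_grade = readability("The cat sat. The dog ran.")
hard_ease, hard_grade = readability(
    "Institutional repositories facilitate interdisciplinary collaboration among universities."
)
print(round(simple_ease, 2), round(simple_grade, 2))
print(hard_ease < simple_ease, hard_grade > simple_grade)
```

Very short, simple passages can score above 100 on Reading Ease or below zero on Grade Level; the formulas are not clamped.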

Authorship Analysis
Given: Zotero Collection (or a selection of multiple items) with Author/Co-Author Information
Determines the importance of the given authors in the collection
–Each author is a vertex in the graph
–Authors are connected with an edge if they are co-authors of an item
–List of Authors ranked by the Betweenness Centrality Measure
–Betweenness is a centrality measure of a vertex within a graph. Vertices that occur on many shortest paths between other vertices have higher betweenness than those that do not.
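To make the ranking concrete, here is a small sketch that builds a co-author graph and computes betweenness with Brandes' algorithm for unweighted, undirected graphs. The author names and the helper names `coauthor_graph` and `betweenness` are invented for the example, and the slide does not say which algorithm or normalization the SEASR flow uses.

```python
from collections import deque
from itertools import combinations

def coauthor_graph(items):
    """Build an adjacency map: authors are vertices, co-authorship is an edge."""
    graph = {}
    for authors in items:
        for author in authors:
            graph.setdefault(author, set())
        for a, b in combinations(sorted(set(authors)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph

def betweenness(graph):
    """Unnormalized betweenness centrality (Brandes' algorithm, BFS version)."""
    score = {v: 0.0 for v in graph}
    for source in graph:
        order = []                       # vertices in order of discovery
        preds = {v: [] for v in graph}   # predecessors on shortest paths
        paths = {v: 0 for v in graph}    # number of shortest paths from source
        dist = {v: -1 for v in graph}
        paths[source], dist[source] = 1, 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    paths[w] += paths[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):        # accumulate dependencies back to source
            for v in preds[w]:
                delta[v] += paths[v] / paths[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints, so halve the totals.
    return {v: s / 2 for v, s in score.items()}

items = [["Ada", "Ben"], ["Ben", "Cy"], ["Cy", "Dee"], ["Ben", "Eve"]]
scores = betweenness(coauthor_graph(items))
ranking = sorted(scores.items(), key=lambda p: p[1], reverse=True)
print(ranking)
```

In this toy collection Ben connects the most pairs of otherwise separated authors, so he ranks first even though every item has only two authors.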

The Value Added
Analytical Results are saved as Zotero items (View Snapshot)
–Includes metadata
–Item naming strategy identifies the item or collection processed
–Creator indicates the Menu Label of the SEASR Analysis
Related Tab links to the items processed in the Analysis
No need to install the analysis; it runs as a web service

Learning Exercises
Add items to your Zotero Collection
Run some of the Zotero-enabled flows on your collection
–Tag Cloud Viewer
–Date Entities to Simile Timeline
–HITS Summarizer
–Flesch-Kincaid Readability Test
–Authorship Analysis

How to Set Up Your Machine
Install/Open Firefox
Install Zotero
Install the SEASR Zotero plugin
–The plugin points to the default services provided by SEASR (running on our server)

Extensible to Analysis that You Create
You can deploy the flows we have on your server or request your university to host this analysis
You can modify these flows and redeploy
You can create new flows
–Perhaps you want to see only nouns or verbs
–Perhaps you want to see a list of extracted entities
You can share these flows back to the community

SEASR Plugin Preferences
Configuration files are managed in a list
Each configuration file can be enabled or disabled
Reload will refresh the plugin with the flows in the configuration files

Configuration File (XML or json)
Contains 2 attribute-value pairs
–name: label to use in the Zotero drop-down display
–url: url for where to send the post
json
{"seasr_flows":[
  {"name":"Author Centrality Analysis", "url":" ice-head-post/instance/shp"},
  {"name":"Flesch-Kincaid Readability Test", "url":" ice-head-post/instance/shp"}
]}
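As a rough illustration of how a client might read the json form of this file, the sketch below parses a `seasr_flows` list and checks that every entry carries both fields. The URLs are placeholders on example.org, not real SEASR endpoints, and `load_flows` is a hypothetical helper, not part of the plugin.

```python
import json

# Hypothetical configuration; the URLs are placeholders, not real SEASR endpoints.
CONFIG_TEXT = """
{"seasr_flows": [
  {"name": "Author Centrality Analysis",
   "url": "http://example.org/flows/author-centrality"},
  {"name": "Flesch-Kincaid Readability Test",
   "url": "http://example.org/flows/readability"}
]}
"""

def load_flows(config_text):
    """Return (name, url) pairs, rejecting entries that lack either field."""
    data = json.loads(config_text)
    flows = []
    for entry in data.get("seasr_flows", []):
        if "name" not in entry or "url" not in entry:
            raise ValueError(f"flow entry needs both 'name' and 'url': {entry!r}")
        flows.append((entry["name"], entry["url"]))
    return flows

menu = load_flows(CONFIG_TEXT)
for name, url in menu:
    # `name` would label the drop-down entry; `url` is where the POST would go.
    print(f"{name} -> {url}")
```

Failing loudly on a malformed entry is a design choice for the example; a plugin might instead skip the bad entry and keep the rest of the menu.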

Zotero Service Flow
Components that read Zotero data from the web service
Zotero Author Extractor
–Extracts the author-coauthor from each item
Zotero URL Extractor
–Extracts the url from each item

Zotero Flows and Fedora Services
Store and share your collections via Fedora
–Works the same way you run an analysis
–Just select, upload, and share

Zotero to SEASR: Fedora
[Diagram labels: Zotero Upload to Repository, Repository Search & Browse Web Service, Interactive Web Application]

Community Hub
Explore existing flows to find others of interest
–Keyword Cloud
–Connections
Find related flows
Execute flow
Comments

SEASR Central
Sharing and finding flows and components
[Screenshot: SEASR Central mock-up with featured components and flows (e.g., Word Counter, FPGrowth), each with author, description, and rights (NCSA/UofI open source license); browsable by type (Component, Flows), category (Image, JSTOR, Zotero), and name (Author Centrality, Readability, Upload Fedora)]

Discussion Questions
What kinds of data assets would you be creating in Zotero?
What other analysis would you like to use against this data?