COMPARING STATISTICAL PACKAGES IN DSPACE BILL ANDERSON, SARA FUCHS, CHRIS HELMS GEORGIA TECH LIBRARY & ANDY CARTER UNIVERSITY OF GEORGIA Reliable Facts.

Presentation transcript:

COMPARING STATISTICAL PACKAGES IN DSPACE
BILL ANDERSON, SARA FUCHS, CHRIS HELMS, GEORGIA TECH LIBRARY & ANDY CARTER, UNIVERSITY OF GEORGIA
Reliable Facts from Unreliable Figures
OPEN REPOSITORIES 2011, JUNE 11, 2011

Outline
- Why this project
- Georgia Tech’s perspective
- UGA’s perspective
- Problems with SMARTech statistics
- Test plan
- Initial results
- Next steps

What do we want to learn?
1) What is the best way to capture statistics for a DSpace repository?
2) What statistics do we want to capture?
3) How do we best display these statistics to the end user?

Statistical Packages
We chose to focus on the following four:
- DSpace with Solr statistics
- DSpace statistics pre-Solr
- AWStats 7.0
- Google Analytics

SMARTech – Georgia Tech’s Repository

Why did we initiate this project?
- Lack of trust in the numbers we were generating
- Create buy-in from submitters
- Popular content as basis of collection development decisions
- Rationale for existence of repository/future funding
- History of problems with DSpace statistics
- Solr problems meant we couldn’t display stats to the author
- Lack of understanding of current numbers

Confessions of a Repository Manager

Fiscal Year Statistics
  Items viewed        2,693,150
  Bitstreams viewed   4,046,314
  Searches              789,327
  OAI requests           42,799

AWStats for May 2011
  Pages                 399,153
  Hits                1,135,003

Univ. of Georgia Knowledge Repository
- Launched in August of 2010
- Contains about 10,000 items

Statistics and the new repository Institutional context at Univ. of Georgia

Stats and the new repository manager
- Do I know what I need to know? (Do I know what you need to know?)
- What do I know about what I do know?

Stats and the new repository manager

What Do Statistics Mean?

What’s Wrong With This Picture?

The Hobgoblin of Little Minds

SOLR ATTACKS!

Points to Consider
- Software can’t fix wetware
- Where are visitors coming from?
- Are they really looking?
- Different packages count different things – changing software changes numbers
- Are we counting useful events?
- Are we counting them accurately?
- Spiders, harvesters, administrators, and other deadly enemies
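To make the point about spiders and administrators concrete, here is a minimal sketch of how raw access-log lines might be sorted into visitors, robots, and staff before counting. It is not how any of the compared packages work internally. The user-agent pattern, the staff IP address, the handle path, and the function names are all hypothetical, and the log format is assumed to be the Apache combined format.

```python
import re

# Hypothetical patterns: a real deployment would use a maintained spider list.
BOT_PATTERN = re.compile(r"bot|crawl|spider|slurp|harvest", re.IGNORECASE)
ADMIN_IPS = {"192.0.2.10"}  # assumed staff address (documentation range)

# Assumes the Apache "combined" log format.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def classify(line):
    """Label one log line as administrator, robot, visitor, or unparsed."""
    m = LOG_LINE.match(line)
    if m is None:
        return "unparsed"
    if m.group("ip") in ADMIN_IPS:
        return "administrator"
    if BOT_PATTERN.search(m.group("agent")):
        return "robot"
    return "visitor"

def count_by_class(lines):
    """Tally log lines by the label that classify() assigns."""
    counts = {}
    for line in lines:
        label = classify(line)
        counts[label] = counts.get(label, 0) + 1
    return counts

if __name__ == "__main__":
    sample = [
        '203.0.113.5 - - [11/Jun/2011:10:00:00 -0400] '
        '"GET /handle/123456789/1 HTTP/1.1" 200 5120 "-" '
        '"Mozilla/5.0 (Windows NT 6.1)"',
        '198.51.100.7 - - [11/Jun/2011:10:00:01 -0400] '
        '"GET /handle/123456789/1 HTTP/1.1" 200 5120 "-" '
        '"Mozilla/5.0 (compatible; Googlebot/2.1)"',
    ]
    print(count_by_class(sample))
```

Two packages that draw this line differently (or not at all) will report different totals for the same traffic, which is one reason changing software changes the numbers.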

Test Environment
A virtual host running under ESX
VM setup:
- OS: Red Hat Enterprise Linux 6.0 (64-bit)
- 2x Intel Xeon Core 2
- 2048 MB of memory
- 30 GB of disk space
DSpace 1.7.1, PostgreSQL 8.4.7, Java 1.6, Tomcat, Maven 2.2.1, Ant
XMLUI Mirage theme
91 items in archive

Configuration Notes
- Tomcat + mod_jk + Apache
- JAVA_OPTS for Tomcat:
  JAVA_OPTS="-server -Xmx600M -Xms600M -XX:+UseParallelGC -Dfile.encoding=UTF-8 -XX:PermSize=128M -XX:MaxPermSize=192M -d64"
- Defined xmlui.google.analytics.key within dspace.cfg
- Solr-specific settings:
  solr.statistics.logBots = false
  solr.statistics.query.filter.spiderIp = false
  solr.statistics.query.filter.isBot = true
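For readers who want to check the Solr numbers directly, the sketch below builds a query URL for the statistics core that counts item views and, like the isBot filter setting above, leaves out records flagged as bots. The host, port, and core path are assumptions about a default local install. The `type:2` filter assumes the statistics core stores DSpace's object-type constant for items in a `type` field, and the function name is hypothetical. The code only builds the URL and does not contact a server.

```python
from urllib.parse import urlencode

# Assumed location of the statistics core; adjust to the local install.
SOLR_SELECT_URL = "http://localhost:8080/solr/statistics/select"

def item_view_query(exclude_bots=True):
    """Build a Solr select URL that counts item-view records."""
    filters = ["type:2"]  # assumption: type 2 denotes item views
    if exclude_bots:
        filters.append("-isBot:true")  # drop records flagged as bots
    # rows=0 asks only for the hit count, not the records themselves.
    params = [("q", "*:*"), ("rows", "0"), ("wt", "json")]
    params += [("fq", f) for f in filters]
    return SOLR_SELECT_URL + "?" + urlencode(params)

if __name__ == "__main__":
    print(item_view_query())
    print(item_view_query(exclude_bots=False))
```

Comparing the hit counts returned with and without the bot filter gives a rough sense of how much of the recorded traffic is flagged as robots.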

Candidate I
              Solr   AWStats   Google Analytics
Page Views    5      5         4
File Visits   104    105       N/A

Candidate II
              Solr   AWStats   Google Analytics
Page Views    4      4         4
File Views    3      3         N/A

Candidate III
              Solr   AWStats   Google Analytics
Page Views    46     47        46
File Views    2      N/A       N/A
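One way to put a number on how far the packages disagree is the relative spread between the highest and lowest count. The sketch below applies it to the Candidate III page views (47, 46, and 46). The function name and the choice of metric are illustrative assumptions and were not part of the test plan.

```python
def relative_spread(counts):
    """Return (max - min) / max over the packages that reported a count."""
    values = [v for v in counts.values() if v is not None]
    low, high = min(values), max(values)
    if high == 0:
        return 0.0
    return (high - low) / high

# Page-view counts reported for Candidate III.
candidate_iii_page_views = {"AWStats": 47, "Solr": 46, "Google Analytics": 46}

spread = relative_spread(candidate_iii_page_views)
print(f"Page-view spread across packages: {spread:.1%}")  # prints 2.1%
```

A package that does not report a metric (N/A) can be passed as `None` and is skipped.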

Moving Forward
- Outstanding issues
- Refining our reporting capabilities
- Stabilizing Solr
- Displaying statistics to users
- Usability study
- Gathering feedback

Contact Bill Anderson Andy Carter Sara Fuchs Chris Helms