Example of a bibliomining system: logs.library.cornell.edu Adam Chandler Data Discussion on Library Data Cornell University Library June 1, 2012.

Slides:



Advertisements
Similar presentations
Teaching Data Warehousing and Business Intelligence at a Graduate Business Program Nenad Jukic, Ph.D. Loyola University Chicago.
Advertisements

Monitoring a web sites health. Web Analytics - Definition Measurement of the behavior of visitors to a website Which aspects of the website work towards.
Back to Table of Contents
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse.
CNStats © 2008 State of the art solution for recording and analyzing site attendance statistics Why Choosing CNStats ?
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Distributed DBMSs A distributed database is a single logical database that is physically distributed to computers on a network. Homogeneous DDBMS has the.
McGraw-Hill/Irwin Copyright © 2008, The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin Copyright © 2008 The McGraw-Hill Companies, Inc.
12/11/01 Matt Bridges Advisor: Ralph Morelli. What is Web Analytics? In traditional commerce, store owners can observe their customers habits: What time.
WebSphere -DB2 Integration Web Browser Web Server (Apache) WebSphere –JSP/Servlet/EJB DB2 JDBC, SQL HTTP.
Institut für Softwarewissenschaft - Universität WienP.Brezany 1 Toward Knowledge Discovery in Databases Attached to Grids Peter Brezany Institute for Software.
Principles of Marketing
Chapter 14 The Second Component: The Database.
Social Network Analysis: Tasks and Tools Steven Loscalzo and Lei Yu Department of Computer Science Watson School of Engineering and Applied Science State.
Knowledge is Power Marketing Information System (MIS) determines what information managers need and then gathers, sorts, analyzes, stores, and distributes.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Lecture-8/ T. Nouf Almujally
Business Intelligence
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Operational Data Tools Chapter Eight. Copyright © Houghton Mifflin Company. All rights reserved.8–28–2 Chapter Eight Learning Objectives To learn database.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.
Exploring Marketing Research William G. Zikmund Chapter 2: Information Systems and Knowledge Management.
Moving into the 21 st Century: A New Strategy for Disseminating IMF Statistics and Metadata Working Session on Emerging Trends and Best Practices in Data.
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
Database Systems – Data Warehousing
CHAPTER 4 Marketing Information and Research: Analyzing the Business Environment Off-line and Online M A R K E T I N G.
Bibliomining: An Introduction
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Dr. Mike Lowndes, Interactive Media Manager, Natural History Museum, London – Houses 350-permanent scientific staff, plus postgraduate students; one of.
Web Analytics Unit 4-1(2005 Fall) Managing the Digital Enterprise By Professor Michael Rappa.
©2010 John Wiley and Sons Chapter 12 Research Methods in Human-Computer Interaction Chapter 12- Automated Data Collection.
Case 2: Emerson and Sanofi Data stewards seek data conformity
UNESCO ICTLIP Module 1. Lesson 61 Introduction to Information and Communication Technologies Lesson 6. What is the Internet?
Levels of Collection Services ICPSR Four Services Data Services PreservationServices Collections Access.
Aspect Mining Eclipse Plug-in Provide the integrated aspect mining environment in the Eclipse IDE. Consists of the following functional components –Flexible.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
There are interesting reports in the CUL bibliomining system (logs.library.cornell.edu) Adam Chandler Electronic Resources User Experience Librarian Library.
Data resource management
The Impact of Evolving IT Security Concerns On Cornell Information Technology Policy.
Organizing Data and Information
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Chapter 6.  Problems of managing Data Resources in a Traditional File Environment  Effective IS provides user with Accurate, timely and relevant information.
Google Analytics Workshop
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Why BI….? Most companies collect a large amount of data from their business operations. To keep track of that information, a business and would need to.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
Web Analytics and Reporting Michal Neuwirth Product Manager – Kentico Software.
Submitted by: Moran Mishan. Instructed by: Osnat (Ossi) Mokryn, Dr.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Data Resource Management Chapter 5 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
Social Networking for Library and Information Science Literature John Meier Science Librarian The Pennsylvania State University University Libraries LFO.
WEB BASED DSS Aaron Atuhe. KEY CONCEPTS When software vendors propose implementing a Web-Based Decision Support System, they are referring to a computerized.
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Automated ad placement
Latest Updates on BlackHawk Mines Music : Privacy Policy
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Social Media Data Mining
Web Traffic Analysis Script PHP Web Traffic Analysis Script PHP Web Traffic Analysis Software.
Google Analytics & Search Console
Chapter 12: Automated data collection methods
Data Mining.
Big DATA.
Presentation transcript:

Example of a bibliomining system: logs.library.cornell.edu Adam Chandler Data Discussion on Library Data Cornell University Library June 1, 2012

What is it? logs.library.cornell.edu2

3

4

Live demo 1 logs.library.cornell.edu5

That’s it? Why bother? Just use Google Analytics logs.library.cornell.edu6

7

Why not Google Analytics? logs.library.cornell.edu: 1.Uses Cornell single sign for security and convenience 2.gives us the freedom to export and use the data anyway we want for our special reporting needs 3.requires no changes to our websites. Google Analytics requires a section of Javascript code that sends information about each request to Google where it is recorded. Repeated privacy violations from commercial sites such as Facebook are driving some users towards widgets such as ghostery ( that block javascript based web tracking. 4.our flexible design allows us to store logs which cannot easily be tracked with javascript: examples: PURL, checkip, flickr logs.library.cornell.edu8

'bibliomining' logs.library.cornell.edu9 Nicholson, S. (2003) The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making. Information Technology and Libraries 22 (4)

logs.library.cornell.edu10 “The term 'bibliomining' was first used by Nicholson and Stanton (2003) in discussing data mining for libraries. In the research literature, most works that contain the terms 'library' and 'data mining' are not talking about traditional library data, but rather using library in the context of software libraries, as data mining is the application of techniques from a large library of tools. In order to make it more conducive for those concerned with data mining in a library setting to locate other works and other researchers, the term 'bibliomining' was created. The term pays homage to bibliometrics, which is the science of pattern discovery in scientific communication.”

logs.library.cornell.edu11

logs.library.cornell.edu12

logs.library.cornell.edu13 “Bibliomining is the application of statistical and pattern- recognition tools to large amounts of data associated with library systems in order to aid decision-making or justify services.”

logs.library.cornell.edu14 “The bibliomining process consists of · determining areas of focus; · identifying internal and external data sources; · collecting, cleaning, and anonymizing the data into a data warehouse; · selecting appropriate analysis tools; · discovery of patterns through data mining and creation of reports with traditional analytical tools; and · analyzing and implementing the results.”

logs.library.cornell.edu15 Nicholson, S. (2003) The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making. Information Technology and Libraries 22 (4)

logs.library.cornell.edu16 “The process is cyclical in nature: as patterns are discovered, more questions will be raised which will start the process again. As additional areas of the library are explored, the data warehouse will become more complete, which will make the exploration of other issues much easier.”

Apache Log logs.library.cornell.edu17

Apache Log logs.library.cornell.edu18

CU (Campus) logs.library.cornell.edu19 CU Lib (Staff) CU Lib (Public) CU (Weill) CU (Qatar) Ithaca not CU NY not Ithaca USA not NY Overseas CUL Logs IP Address Groups

logs.library.cornell.edu20

Live demo 2 logs.library.cornell.edu21

WhoWho uses it and for what?what logs.library.cornell.edu22

How do I get help? logs.library.cornell.edu23

How do I get help? logs.library.cornell.edu24

logs.library.cornell.edu25

Credits logs.library.cornell.edu26

Credits logs.library.cornell.edu27 System designAdam Chandler and Pete Hoyt SoftwarePete Hoyt User interfaceAdam Chandler and Nancy Solla DocumentationAdam Chandler, Glen Wiley, Nancy Solla In-library IP address lookup tables Linda Miller and Assessment, Pete Magnus and Desktop Services

logs.library.cornell.edu