Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse.

Slides:



Advertisements
Similar presentations
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA
Advertisements

ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Chapter 3 Databases and Data Warehouses Building Business Intelligence
Decision Support Chapter 10. Overview Databases are really information technology Decision Support is a business application that actually uses databases.
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
DATA WAREHOUSING.
L The Difference Between Logical and Physical Views of Information l Databases and Database Management Systems l How You Can Develop Database Applications.
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Business Driven Technology Unit 2
Database Processing for Business Intelligence Systems
Chapter 13 The Data Warehouse
1 Introduction Introduction to database systems Database Management Systems (DBMS) Type of Databases Database Design Database Design Considerations.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Business Intelligence Instructor: Bajuna Salehe Web:
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Understanding Data Analytics and Data Mining Introduction.
MIS DATABASE SYSTEMS, DATA WAREHOUSES, AND DATA MARTS MBNA ebay
Database Systems COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI.
Database Systems – Data Warehousing
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Slide 1 D2.TCS.CL5.04. Subject Elements This unit comprises five Elements: 1.Define the need for tourism product research 2.Develop the research to be.
Bibliomining: An Introduction
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Twelfth Edition.
Databases Topic 4 Text Materials Chapter 3 – Databases and Data Warehouses.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Meet and Confer Rule 26(f) of the Federal Rules of Civil Procedure states that “parties must confer as soon as practicable - and in any event at least.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
3-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Databases and Data.
Human Subject Research View from the IRB Anthony J. Filipovitch Minnesota State University Mankato.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Copyright © 2004 Pearson Education, Inc.. Chapter 28 Overview of Data Warehousing and OLAP.
Chapter 4 Data and Databases. Learning Objectives Upon successful completion of this chapter, you will be able to: Describe the differences between data,
There are interesting reports in the CUL bibliomining system (logs.library.cornell.edu) Adam Chandler Electronic Resources User Experience Librarian Library.
Patron Privacy Issues Cindy Cunningham – OHSU. Overview Balancing demands (academic environment) Privacy in Libraries System Privacy  Vendor  Institution/Library.
LEGAL ASPECTS OF DIGITAL LIBRARIES By TALWANT SINGH ADDL DISTT. & SESSIONS JUDGE; DELHI.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Chapter 3: Databases and Data Warehouses Building Business Intelligence Management Information Systems for the Information Age.
Data resource management
Next Back MAP 3-1 Management Information Systems for the Information Age Copyright 2002 The McGraw-Hill Companies, Inc. All rights reserved Chapter 3 Data.
DATABASES AND DATA WAREHOUSES
Chapter 3 Databases and Data Warehouses: Building Business Intelligence Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Organizing Data and Information
Managing Records: Good government, Better business. FOI Presentations to Boards & Committees Cayman Islands National Archive November 2008.
DATA RESOURCE MANAGEMENT
Information Systems in Organizations Managing the business: decision-making Growing the business: knowledge management, R&D, and social business.
Chapter 1 Overview of Databases and Transactions.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Advanced Database Concepts
Speaking the Same Language Serials Standards and e-Resource Data Interactions Diane Hillmann Cornell University.
6-1 Copyright © 2013 Pearson Canada Inc. Databases and Information Management CHAPTER SIX.
Example of a bibliomining system: logs.library.cornell.edu Adam Chandler Data Discussion on Library Data Cornell University Library June 1, 2012.
Chapter 6 Discovering the Scope of the Incident Spring Incident Response & Computer Forensics.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Michael Spiegel, Esq Timothy Shimeall, Ph.D.
Fundamentals of Information Systems
Chapter 9 Database and Information Management.
Business Intelligence
Introduction of Week 9 Return assignment 5-2
Analytics, BI & Data Integration
Presentation transcript:

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Striking a Balance: Bibliomining and Privacy Scott Nicholson Assistant Professor Syracuse University School of Information Studies

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies What is Bibliomining? Bibliomining is the combination of Bibliometrics and Data Mining used on the data produced during the operation of libraries (physical and digital)

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies What is Bibliomining?  Application of advanced analysis tools to data produced by libraries  May include Data mining Bibliometrics (patterns in scholarship) Online analytical processing (OLAP) Other statistical techniques

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Goals of Bibliomining  Improved decision-making through better understanding of Patron Behavior Library Staff Behavior Behavior of outside organizations  Can provide justification for Library management policies and decisions Acquisitions and ILL source selection Collection development decisions Use of library services (funding bodies)

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Steps in Bibliomining  Determine areas of focus Prediction vs. Description  Determine data source needs Internal and External  Gather data  Create data warehouse  Select appropriate analysis tools  Create & test models / Create reports  Analyze results

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Creating the data warehouse  A data warehouse is a collection of cleaned and anonymized data in a relational database and a point for queries  Outside of the operational systems  Connects disparate data sources into easily accessible database  Can be one time or updated on a regular basis

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Steps in the Warehousing Process  Identify fields of interest  Determine fields that contain personally identifiable information (PII)  Determine combinations of fields that create PII (dept. + level + gender)

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Methods for dealing with Personally Identifiable Information  Use codes, Ids for matching and discard

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Codes for PII  One typical suggestion – code the PII fields, and then record the codes in the database Appropriate for other parties  Do not use a reversible encoding procedure to encode variables. This does not protect patron’s information from an investigation.

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Coding and not discarding  Use a code when there is some aspect of the ID that is important Example – IP addresses  Think about the use of the field, and code appropriately  Do not generate code from original; rather, use other methods for code that capture key information

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Methods for dealing with Personally Identifiable Information  Use for matching and discard

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Dealing with categories Make sure that combinations of categories don’t identify an individual.

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Dealing with Textual data  Digital Reference transactions  Easy to deal with the metadata  Hard to deal with the text Manual cleaning of PII Natural Language Processing research  Similar problem with deidentification of medial records

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies People to Involve  Institutional Research Board (IRB)  Legal counsel Ensures you are following state laws for library data  Library administration / Board  Patrons If there are policies, follow them If there are not, create them

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Benefits to creating the Data Warehouse  Cleaned resource, ready for analysis Outside of operational system  Use for regular reports and research  Forces library to examine the life of data Are there backup tapes created? How long are backups kept?

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies Striking a Balance  A well-designed data warehouse strikes the balance between Protecting Privacy and Maintaining a Data-Based History

Please do not reuse these slides without prior permission from Copyright 2004 Scott Nicholson, Syracuse University School of Information Studies For more information  About bibliomining:  About an active data warehouse project: datafarm/  About this presentation: “The Bibliomining process: Data warehousing and data mining for library decision-making”