Co-Cited Author Maps as Interfaces to Digital Libraries: Kohonen and PFNet Displays for the Humanities Howard D. White Jan Buzydlowski Xia Lin College.


Co-Cited Author Maps as Interfaces to Digital Libraries: Kohonen and PFNet Displays for the Humanities Howard D. White Jan Buzydlowski Xia Lin College of Information Science and Technology Drexel University, Philadelphia, PA

Co-Citation Analysis
Co-citation is the mentioning of any two earlier documents in the bibliographic references of a later third document. The count of mentions may grow over time as new writings appear. Thus, co-citation counts can reflect citers’ changing perceptions of documents as more or less strongly related. Documents shown to be related by their co-citation counts can be mapped as proximate in intellectual space.
[Diagram: Doc 1, Doc 2, Doc 3]

Co-Citation Analysis
Lin, Xia. Map Displays for Information Retrieval. Journal of the American Society for Information Science 48:
Chen, Chaomei. Bridging the Gap: The Use of Pathfinder Networks in Visual Navigation. Journal of Visual Languages and Computing 9:
• Document co-citation counts the times two papers are cited together.
• Author co-citation counts the times two authors, e.g., Lin and Chen, are cited together.
• Journal co-citation counts the times two journals are cited together.

Co-Citation Analysis
• Data on co-citation are readily obtainable from databases of the Institute for Scientific Information (ISI) in Philadelphia, PA:
    Scisearch (Science Citation Index)
    Social Scisearch (Social Sciences Citation Index)
    Arts & Humanities Search (Arts & Humanities Citation Index)
• These databases are searchable online through, e.g., the Dialog Corporation.

Author Co-Citation Analysis (ACA)
Detects patterns in the frequency with which any works by any two authors are jointly cited in later works.
• Only recurrent co-citation is significant: the more times authors are cited together, the more strongly related they are in the eyes of citers.

Author Co-Citation Analysis
• If Ben Shneiderman and Shakespeare are cited together in one article, it probably means little.
• If Ben Shneiderman and Stuart Card are cited together in 205 articles,* it means a lot: their names have jointly come to symbolize something like “interactive interfaces for digital libraries.” Possibly no subject heading captures this concept.
• In a cited-author (CA) search on Dialog, SELECT CA=SHNEIDERMAN B AND CA=CARD SK would retrieve the 205 citing articles.
*Actual count, 7/10/00
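
The counting idea behind ACA can be sketched in a few lines of Python. This is only an illustration, not the project's code: the article ids and cited-author sets below are made-up toy data, and the helper names `author_cocitation_counts` and `citing_both` are hypothetical. The second helper mimics the effect of ANDing two cited-author sets, as in the Dialog search above.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each citing article maps to the set of authors it cites.
CITING_ARTICLES = {
    "art1": {"SHNEIDERMAN B", "CARD SK", "LIN X"},
    "art2": {"SHNEIDERMAN B", "CARD SK"},
    "art3": {"SHNEIDERMAN B", "SHAKESPEARE W"},
    "art4": {"CARD SK", "LIN X"},
}

def author_cocitation_counts(articles):
    """For every unordered author pair, count the articles that cite both."""
    counts = Counter()
    for cited in articles.values():
        # Sort so each pair is always stored in the same (alphabetical) order.
        for a, b in combinations(sorted(cited), 2):
            counts[(a, b)] += 1
    return counts

def citing_both(articles, author_a, author_b):
    """Return ids of the articles that cite both authors (a set intersection)."""
    return sorted(
        aid for aid, cited in articles.items()
        if author_a in cited and author_b in cited
    )

counts = author_cocitation_counts(CITING_ARTICLES)
pair_hits = citing_both(CITING_ARTICLES, "SHNEIDERMAN B", "CARD SK")
```

In this toy set the Shneiderman/Card pair recurs while the Shneiderman/Shakespeare pair occurs once, which is the distinction the slide draws between significant and incidental co-citation.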

Underlying Database and Software
• ISI gave our college 10 years’ worth of data from the Arts & Humanities Citation Index (AHCI) as a research grant.
    Has 1.26 million bibliographic records on articles and other items from humanities journals.
• For retrievals from AHCI, we bought BRS Search, an industrial-strength engine, from Dataware, Inc.
• Buzydlowski and Lin have written several special programs in Java and C to implement our system on top of the BRS Search software.

Our Project
• Produces co-cited author maps in real time (a few seconds) on a Web site.
• Low cognitive load: User merely has to enter name of a single author of interest as a “seed.”
    E.g., Dickinson-E for Emily Dickinson
• System responds with the top authors co-cited with that seed: about 25 names ranked by frequency of co-occurrence.
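
The seed-to-ranked-list step might look roughly like the sketch below. It is an assumption-laden illustration, not the actual system (which runs on BRS Search): the toy articles, the author name forms, the function name `top_cocited`, and the alphabetical tie-break are all hypothetical.

```python
from collections import Counter

# Hypothetical toy data: citing article id -> set of cited authors.
ARTICLES = {
    "a1": {"DICKINSON E", "WHITMAN W", "EMERSON RW"},
    "a2": {"DICKINSON E", "WHITMAN W"},
    "a3": {"DICKINSON E", "PLATH S", "WHITMAN W"},
    "a4": {"WHITMAN W", "EMERSON RW"},
    "a5": {"DICKINSON E", "EMERSON RW"},
}

def top_cocited(articles, seed, n=25):
    """Rank the authors most often cited together with the seed author."""
    counts = Counter()
    for cited in articles.values():
        if seed in cited:
            # Every other author in this reference list is co-cited with the seed.
            counts.update(a for a in cited if a != seed)
    # Highest count first; ties broken alphabetically (an arbitrary choice here).
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:n]

ranking = top_cocited(ARTICLES, "DICKINSON E", n=25)
```

Article "a4" does not cite the seed, so it contributes nothing to the ranking even though it cites two of the ranked authors.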

Quick Visualizations of a Database
• User can choose to display the top 25 as either a Kohonen feature map (SOM, self-organizing map) or a Pathfinder network map (PFNET).
• User can use either map as
    An aid to retrieving articles from AHCI that cite authors in various combinations. Combinations are made through drag-and-drop.
    Reproducible artwork in a new study, such as a review of a literature or a commentary on the author used as “seed.”

Maps in the Humanities
• We are able to produce maps of authors in the humanities with high face validity.
    Can build maps around great names in literature, philosophy, history, religion, the fine arts.
      E.g., Dante, Picasso, D. H. Lawrence, Martin Luther, Edward Gibbon, Emily Dickinson, Plato, Vladimir Nabokov.
    Can also build maps around noted scholars, critics, or commentators.
      E.g., Simon Schama, Garry Wills, Elaine Showalter, Camille Paglia, Derek de Solla Price.
    System will work with authors in other ISI databases in the natural and social sciences.
    Also with other kinds of co-occurring terms: journal names, descriptors, etc.

Advantages of Maps
• Ranked list of top 25 co-cited authors often contains names not previously known to user.
• Both Kohonen maps and PFNETs show interconnections of the 25 authors not apparent in the one-dimensional ranking of a simple list.

Interpretation of Maps
• Kohonen maps show high co-citation counts of authors by placing them closer in space.
• PFNETs show highest co-citation counts of authors directly, as links between nodes bearing authors’ names. The counts themselves can be made to appear above the links.

Kohonen Feature Maps
• Are a variety of neural network.
• Are produced by an algorithm for unsupervised computer learning in which data points “compete” for the position on the output grid that best represents their numeric weights (co-citation counts) relative to all other points.
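
A minimal self-organizing map can be sketched as follows. This is the standard textbook formulation (grid cells compete to be the best match for each input vector, and the winner's neighbourhood is pulled toward that vector), not the project's C implementation. The author labels, the toy co-citation matrix, the 4x4 grid, the decay schedules, and the function names are all assumptions chosen for illustration.

```python
import math
import random

# Hypothetical toy data: each author is described by a row of co-citation
# counts with the other authors (diagonal set to 0).
AUTHORS = ["AUTHOR A", "AUTHOR B", "AUTHOR C", "AUTHOR D"]
VECTORS = [
    [0, 10, 3, 2],
    [10, 0, 8, 0],
    [3, 8, 0, 5],
    [2, 0, 5, 0],
]

def best_matching_unit(grid, v):
    """Return the grid cell whose weight vector is closest to v (the 'winner')."""
    return min(grid, key=lambda cell: sum((w - x) ** 2 for w, x in zip(grid[cell], v)))

def train_som(vectors, rows=4, cols=4, steps=500, seed=0):
    """Train a small SOM; returns {(row, col): weight_vector}."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    hi = max(max(v) for v in vectors)
    # One weight vector per grid cell, initialised randomly within the data range.
    grid = {(r, c): [rng.uniform(0, hi) for _ in range(dim)]
            for r in range(rows) for c in range(cols)}
    for t in range(steps):
        frac = t / steps
        lr = 0.5 * (1 - frac)                              # learning rate decays
        radius = max(rows, cols) / 2 * (1 - frac) + 0.5    # neighbourhood shrinks
        v = rng.choice(vectors)
        bmu = best_matching_unit(grid, v)
        for cell, w in grid.items():
            d2 = (cell[0] - bmu[0]) ** 2 + (cell[1] - bmu[1]) ** 2
            h = math.exp(-d2 / (2 * radius ** 2))          # Gaussian neighbourhood
            for k in range(dim):
                w[k] += lr * h * (v[k] - w[k])
    return grid

grid = train_som(VECTORS)
# Place each author on the cell that best matches their co-citation profile.
placement = {a: best_matching_unit(grid, v) for a, v in zip(AUTHORS, VECTORS)}
```

Authors with similar co-citation profiles tend to land on the same or neighbouring cells, which is what lets the map express relatedness as proximity.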

PFNETs
• Are algorithmically connected graphs based on finding the “minimum-cost” path between any two nodes.
• In ACA, this is generally the highest single co-citation count between author pairs (all pairs are examined).
• Results in useful simplification of graph.
• Use spring embedder algorithm to produce layout.
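
The link-pruning step can be sketched as below. The slides do not state the Pathfinder parameters, so this sketch assumes the commonly used setting r = infinity, q = n - 1, and works directly on similarities (co-citation counts) rather than on distances: a direct link survives only if no indirect path has a stronger weakest link. The function name and the toy matrix are hypothetical, and the spring-embedder layout step is not shown.

```python
def pfnet_edges(sim):
    """Return the links kept by a Pathfinder network, assuming r=inf, q=n-1.

    sim is a symmetric matrix of co-citation counts (0 means no link).
    """
    n = len(sim)
    # best[i][j] = strongest "weakest link" over all paths from i to j,
    # computed with a Floyd-Warshall style pass using (max, min).
    best = [row[:] for row in sim]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via = min(best[i][k], best[k][j])
                if via > best[i][j]:
                    best[i][j] = via
    # Keep a direct link only if no alternative path beats it.
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if sim[i][j] > 0 and sim[i][j] >= best[i][j]]

# Hypothetical toy co-citation matrix for four authors (indices 0..3).
SIM = [
    [0, 10, 3, 2],
    [10, 0, 8, 0],
    [3, 8, 0, 5],
    [2, 0, 5, 0],
]
edges = pfnet_edges(SIM)
```

Here the 0-2 link (count 3) is dropped because the path 0-1-2 has a weakest link of 8, and the 0-3 link (count 2) is dropped because the path 0-1-2-3 has a weakest link of 5. What remains is the chain of strongest associations.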

PFNETs
• Make sense as pictures of relations in databases!
• Independent observers have found them highly intelligible:
    Xia Lin on Chinese philosophers
    Kate McCain on historians of science & technology
    Howard White on various literary figures and artists
• Buzydlowski’s research will test interpretability of PFNETs and Kohonen maps as interfaces for domain experts and naïve users.

Interface Design Considerations
• Link interface to valuable digital libraries (ISI citation databases and the journal literatures they lead to).
• Focus on intellectual content: meaningful words, meaningfully presented.
• Stress quick and flexible presentations over long-term displays.

Evidence We’re on Right Track
• US Patent 6,038,574: “Method and Apparatus for Clustering Collection of Linked Documents Using Co-Citation Analysis”
• Filed: March 18, 1998
• Awarded: March 14, 2000
• Inventors: James E. Pitkow, Peter L. Pirolli, Jock D. Mackinlay, Stuart K. Card, all of Xerox PARC

PFNET of authors co-cited with F. Schleiermacher in AHCI, (Biblical and literary hermeneutics)

AuthorLink System Structure
[Diagram components: Web Interface; Java Applet; Web Server; Application Server; Java Servlets; cgi; Kohonen Mapping Procedures in C; PFNET Mapping Procedures in C; …….. Procedures; BRS Search Engine / ISI Data]