Bibliometric Analysis with Sci2: Choose Your Own Adventure
Laura Ridenour
School of Library and Information Science, Indiana University
Overview
Paper citation network using Sci2
– Gathering data
– Examining and understanding file format
– Adjusting program parameters for speed
– Using the Network Analysis Toolkit
– Visualizing a network
– Exporting data for use in other programs
Gathering Data
ISI file format
– Text file with special formatting
– Downloadable from Web of Science (WoS), 500 records at a time
– Other sources include the Scholarly Database (http://sdb.cns.iu.edu/search/), but data from this source works slightly differently than shown in this tutorial
Can also use any data with nodes and edges, as long as it’s a clean data set
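Because WoS exports at most 500 records per download, a larger result set has to be fetched as consecutive record ranges. A minimal Python sketch of working out those ranges; the function name `export_ranges` and the total of 1,234 records are illustrative assumptions, not part of WoS or Sci2:

```python
def export_ranges(total_records, batch_size=500):
    """Return 1-based inclusive (first, last) record ranges, one per download."""
    ranges = []
    first = 1
    while first <= total_records:
        last = min(first + batch_size - 1, total_records)
        ranges.append((first, last))
        first = last + 1
    return ranges


if __name__ == "__main__":
    # Hypothetical search result with 1,234 records.
    for first, last in export_ranges(1234):
        print(f"Download records {first} to {last}")
```

Each range corresponds to one export, and therefore to one ISI file to be combined later.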
Required tools
Sci2
– Must create an account in order to download
A text editor
Helpful:
– Oxygen, Gephi
Allocating Additional Memory
In the Sci2 directory, open the .ini file for Sci2 and change the settings to the following:
– -vmargs
– -Xms30m
– -Xmx1028m
Save and close the file, then re-open Sci2
Note: you may need to adjust the memory parameters slightly depending on system capabilities and settings
Web of Knowledge
Download Settings
What an ISI File Looks Like
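In broad terms, an ISI flat file is plain text in which each line starts with a two-letter field tag (for example AU for authors or TI for title), indented lines continue the previous field, ER closes a record, and EF closes the file. A minimal Python sketch of reading that layout; the sample record is invented for illustration, and the parser is a rough aid for inspecting files, not the loader Sci2 itself uses:

```python
SAMPLE = """FN Thomson Reuters Web of Knowledge
VR 1.0
PT J
AU Doe, J
   Roe, R
TI An invented example title
SO JOURNAL OF EXAMPLES
PY 2010
ER

EF
"""


def parse_isi(text):
    """Parse ISI flat-format text into a list of {tag: [values]} records."""
    records = []
    current = {}
    tag = None
    for line in text.splitlines():
        if not line.strip():
            continue  # blank separator line between records
        if line[:2].strip() == "":
            # Indented line: continuation of the previous field, if any.
            if tag is not None:
                current[tag].append(line.strip())
            continue
        tag, value = line[:2], line[3:].strip()
        if tag == "ER":  # end of record
            records.append(current)
            current, tag = {}, None
        elif tag == "EF":  # end of file
            break
        elif tag in ("FN", "VR"):  # file-level header, not part of a record
            tag = None
        else:
            current.setdefault(tag, []).append(value)
    return records


if __name__ == "__main__":
    for record in parse_isi(SAMPLE):
        print(record["AU"], record["TI"])
```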
Using Multiple ISI Files
When combining files, strip out all headers and end-of-file tags except the header at the beginning and the tag at the end of the combined file
– can do this manually, or with a script
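The script option could look roughly like the Python sketch below. It assumes the usual ISI layout in which each file opens with FN and VR header lines and closes with an EF line; the function names and the file names in the usage note are hypothetical:

```python
from pathlib import Path


def merge_isi_texts(texts):
    """Combine ISI file contents, keeping one header and one EF end tag."""
    merged = []
    for index, text in enumerate(texts):
        # Drop a byte-order mark if one survived decoding.
        for line in text.lstrip("\ufeff").splitlines():
            is_header = line.startswith(("FN ", "VR "))
            is_end_of_file = line.strip() == "EF"
            # Keep the header only from the first file; drop every EF.
            if is_end_of_file or (is_header and index > 0):
                continue
            merged.append(line)
    merged.append("EF")  # single end tag for the combined file
    return "\n".join(merged) + "\n"


def merge_isi_files(input_paths, output_path):
    """Read several ISI files and write the combined result to output_path."""
    texts = [Path(p).read_text(encoding="utf-8-sig") for p in input_paths]
    Path(output_path).write_text(merge_isi_texts(texts), encoding="utf-8")
```

For example, `merge_isi_files(["batch1.txt", "batch2.txt"], "combined.isi")` would write one file suitable for loading; it is worth opening the result in a text editor to confirm that only one header and one end tag remain.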
Sci2
File > Load > (select your file and click ‘Open’) > choose ‘ISI flat format’ > click ‘Select’
In the Data Manager
Right-click the loaded file and select “view” to check and/or modify the file
If you modify the file, be sure to save it to a permanent location and reload it into the Data Manager
Extracting the Network
Data Manager
Using the Network Analysis Toolkit (NAT)
Choose “Network Analysis Toolkit”
NAT Analysis is Displayed in the Console
Examining the Output
Checking the NAT results before visualizing can save time. If the network is particularly densely connected, visualizing it on a faster system may save time
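One way to gauge how densely connected a network is: the ratio of edges present to edges possible. A minimal Python sketch, assuming a simple network with no self-loops or parallel edges; the function name and the example counts are made up, and NAT's own figures may be defined slightly differently:

```python
def network_density(node_count, edge_count, directed=True):
    """Return the fraction of possible edges that are present."""
    if node_count < 2:
        return 0.0  # no edges are possible with fewer than two nodes
    possible_edges = node_count * (node_count - 1)
    if not directed:
        possible_edges //= 2  # each undirected pair counts once
    return edge_count / possible_edges


if __name__ == "__main__":
    # Hypothetical citation network: 1,000 papers and 5,000 citation links.
    print(f"density = {network_density(1000, 5000):.4f}")
```

A value close to 1 means nearly every pair of nodes is linked, which is the situation where layout and rendering tend to be slow.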
Visualization
Select the extracted network to visualize from the Data Manager
Visualization (cont’d)
Visualization > Networks > GUESS
Visualization (GUESS)
Choosing a layout (GUESS)
Layout > GEM (or whichever layout you would like to use)
Initial GEM Layout (GUESS)
Graph Modifier Dashboard in GUESS
Play!!
Take some time to play with different properties in the network. Feel free to use the interpreter, which uses Jython (examples on next slide, from the CNS wiki)
Jython Examples
Exporting Data
From the Data Manager, right-click the appropriate graph file and select “Save”
Exporting Data (cont’d)
Choose GraphML for importing a graph into Gephi
– After saving, change the file extension from .XML to .graphml to make the file usable by Gephi
Choose .net for Pajek
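The extension change for Gephi can be done by hand in a file manager; for several files, a small Python sketch such as the following may help. The function names and the example file name are hypothetical, and the check for a name that already contains .graphml before the .xml suffix is a defensive assumption rather than documented Sci2 behavior:

```python
from pathlib import Path


def graphml_name(saved_path):
    """Return the path a saved .xml GraphML file should have for Gephi."""
    path = Path(saved_path)
    if path.suffix.lower() != ".xml":
        raise ValueError(f"expected an .xml file, got: {path.name}")
    without_xml = path.with_suffix("")
    if without_xml.suffix.lower() == ".graphml":
        # Name already looks like "network.graphml.xml": just drop ".xml".
        return without_xml
    return path.with_suffix(".graphml")


def rename_for_gephi(saved_path):
    """Rename the saved file on disk and return its new path."""
    source = Path(saved_path)
    target = graphml_name(source)
    source.rename(target)
    return target


if __name__ == "__main__":
    # Only computes the new name; nothing on disk is touched here.
    print(graphml_name("citations.XML"))
```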
Additional Resources