Science of Science Research and Tools Tutorial #10 of 12 Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization.

Presentation transcript:

Science of Science Research and Tools Tutorial #10 of 12 Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization Laboratory, Director School of Library and Information Science Indiana University, Bloomington, IN With special thanks to Kevin W. Boyack, Micah Linnemeier, Russell J. Duhon, Patrick Phillips, Joseph Biberstine, Chintan Tank, Nianli Ma, Hanning Guo, Mark A. Price, Angela M. Zoss, and Scott Weingart Invited by Robin M. Wagner, Ph.D., M.S. Chief Reporting Branch, Division of Information Services Office of Research Information Systems, Office of Extramural Research Office of the Director, National Institutes of Health Suite 4090, 6705 Rockledge Drive, Bethesda, MD a-noon, July 27, 2010

12 Tutorials in 12 Days at NIH: Feedback from Tutorial #8
What was the most valuable thing you learned today?
- How large data can be processed and visualized (2x)
- Intro to large-scale network analysis; totally new to me.
- Identification of specific tools needed to do this.
- TARL and DrL algorithms
What was irrelevant for your work/needs?
- The listing of other tools; let’s focus on 1-2 things and actually learn them.
- Nothing is irrelevant because all helps us think about what we can do with the tools and what they could be used for in our work.
What topics or examples would you like to explore in more detail?
- What the various capabilities of Sci2 actually DO. We’ve been clicking a lot of buttons without knowing what the tool is doing or how to interpret results.
- Requirements to identify most relevant tool.
- Cytoscape visualizations

12 Tutorials in 12 Days at NIH: Feedback from Tutorial #8
What can the instructor do to improve the tutorials?
- Don’t spend so much time on advanced research; we are trying to learn the basics.
- Discussion of some types of decision making that would use the forms of visualization being presented.
- More structure in the lecture in terms of principles.
- Ensure demo computer is fast enough to process large scale networks during class.
Do you have any other comments or suggestions on today’s tutorial?
- Work more with NIH data.
- Focus more on non-publication analysis.
- Please end on time.
For tutorial 12
- Please show epidemiology collaboration analysis, see Sci2 Tutorial and NWB Workshop slides.

12 Tutorials in 12 Days at NIH: Overview
1. Science of Science Research
2. Information Visualization
3. CIShell Powered Tools: Network Workbench and Science of Science Tool
4. Temporal Analysis: Burst Detection
5. Geospatial Analysis and Mapping
6. Topical Analysis & Mapping
7. Tree Analysis and Visualization
8. Network Analysis
9. Large Network Analysis
10. Using the Scholarly Database at IU
11. VIVO National Researcher Networking
12. Future Developments
1st Week | 2nd Week | 3rd Week | 4th Week

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations Recommended Reading  La Rowe, Gavin, Ambre, Sumeet, Burgoon, John, Ke, Weimao and Börner, Katy. (2007) The Scholarly Database and Its Utility for Scientometrics Research. In Proceedings of the 11th International Conference on Scientometrics and Informetrics, Madrid, Spain, June 25-27, 2007, pp  Scholarly Database home page, 12 Tutorials in 12 Days at NIH—Overview 5

[#11] VIVO National Researcher Networking  Motivation  Users, Their Needs, and Usage Scenarios  Development  Implementation  Usage  Outlook  Exercise: Identify Promising VIVO Collaborations Recommended Reading VIVO home page, VIVO Conference in NYC in August 2010, 12 Tutorials in 12 Days at NIH—Overview 6

[#12] Future Developments  Validation Studies  Needed Data/Documentation  Needed and New Tool Functionality  Needed Documentation/Tutorials  Promising Research Questions  Exercise: Identify Promising Collaborations Recommended Reading Börner, Katy (2010) Atlas of Science. MIT Press. Börner, Katy, Bettencourt, Luis M. A., Gerstein, Mark & Uzzo, Stephen Miles (Eds.), Knowledge Management and Visualization Tools in Support of Discovery. (2009). NSF CDI Initiative Workshop Report, National Science Foundation, Indiana University Tutorials in 12 Days at NIH—Overview 7

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 8

9 Börner, Katy (2010) Atlas of Science. MIT Press. Number of Awards/Funding and Researchers Over Time

10 Börner, Katy (2010) Atlas of Science. MIT Press. Number of Books & Patents Over Time

11 Börner, Katy (2010) Atlas of Science. MIT Press. Number of Journal Publications (Wikipedia entries) Over Time

Data Needs
Informed science and technology policy (and Science of Science Studies) depend on comprehensive and useful data that has high
- Accuracy
- Integrity (structured & managed)
- Consistency
- Validity (rules, standards are followed)
- Reliability
However, publications, patents, and grants are kept in data silos with few interlinkages, incompatible formats, and unknown quality and coverage.
Obama Administration is committed to evidence-based policymaking and making data used for policymaking accessible, relevant, and timely. … Data and analyses should be factual and policy-neutral. us-science-engineering-and-tec

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 13

Scholarly Database: “From Data Silos to Wind Chimes”
- Create public databases that any scholar can use. Share the burden of data cleaning and federation.
- Interlink creators, data, software/tools, publications, patents, funding, etc.
La Rowe, Gavin, Ambre, Sumeet, Burgoon, John, Ke, Weimao and Börner, Katy. (2007) The Scholarly Database and Its Utility for Scientometrics Research. In Proceedings of the 11th International Conference on Scientometrics and Informetrics, Madrid, Spain, June 25-27, 2007, pp
Nianli Ma

15 Online interface: Register for free access or test it via tutorial account: Password: nwb

16 Scholarly Database: About

Scholarly Database: # Records, Years Covered
Dataset | # Records | Years Covered | SDB 2.0 Release, Fall 10 | Restricted Access
Medline 17,764, ,072,547 ( )
PhysRev 398, Yes
PNAS 16, Yes
JCR 59, , 1979, 1984, Yes
USPTO 3, 875, ,178,196 ( )
NSF 174, ,687 ( )
NIH 1,043, ,770,770 ( )*
Total 23,167,
* NIH awards are not aggregated by base project. Some have up to 3,000 subprojects.

18 Scholarly Database: Records Per Year

19 Scholarly Database: Records Per Year

Scholarly Database: Web Interface Search across publications, patents, grants. Download records and/or (evolving) co-author, paper-citation networks. 20 Search for RNAi

Scholarly Database: Browse Search Results 21

Scholarly Database: Download Results 22

Since March 2009: Users can download networks: - Co-author - Co-investigator - Co-inventor - Patent citation and tables for burst analysis in NWB. 23

Mapping the Field of RNAi Research (SDB Data) (Sci2 Tutorial, Section 5.2.7) How many papers, patents, and funding awards exist on a specific topic? Here we selected research on RNA interference (RNAi), a system within living cells that helps to control which genes are active and how active they are. The data for this analysis comes from a search of the Scholarly Database (SDB) for “RNAi” in “All Text” from MEDLINE, NSF, NIH and USPTO. A copy of this data is available in ‘*yoursci2directory*/sampledata/scientometrics/sdb/RNAi’. The default export format is .csv, which can be loaded in the Sci2 Tool directly.

Mapping the Field of RNAi Research (SDB Data) (Sci2 Tutorial, Section 5.2.7) The Scholarly Database at Indiana University provides free access to 23,000,000 papers, patents, and grants. Since March 2009, users can also download networks, e.g., co-author, co-investigator, co-inventor, patent citation, and tables for burst analysis. For more information and to register, visit Password: nwb

Mapping the Field of RNAi Research (SDB Data) (Sci2 Tutorial, Section 5.2.7): Co-Author Network
Load ‘*yoursci2directory*/sampledata/scientometrics/sdb/RNAi/Medline_co-author_table_(nwb_format).csv’ as a standard csv file. SDB tables are already pre-normalized, so now simply run ‘Data Preparation > Text Files > Extract Co-Occurrence Network’ using the default parameters. The Network Analysis Toolkit (NAT) reports 21,578 nodes with 131 isolates and 77,739 edges. Extract only the largest component by running ‘Analysis > Networks > Unweighted and Undirected > Weak Component Clustering’. Visualize with GUESS using ‘Layout > GEM’. Use a custom Python script to color and size the network.
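The co-occurrence extraction and largest-component steps can be illustrated outside the Sci2 Tool with a small standard-library sketch. This is not the Sci2 implementation; the column names (`paper_id`, `authors`), the `|` separator, and the sample rows are all assumptions made up for illustration and may not match the actual SDB export.

```python
import csv
import io
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical sample: one row per paper, authors separated by "|".
# Column names and values are illustrative assumptions, not SDB data.
SAMPLE = """paper_id,authors
1,Smith J|Lee K|Chen Y
2,Lee K|Chen Y
3,Garcia M|Patel R
4,Novak P
"""


def co_author_edges(rows, column="authors", sep="|"):
    """Count how often each pair of authors appears on the same paper."""
    nodes = set()
    edges = Counter()
    for row in rows:
        authors = sorted({a.strip() for a in row[column].split(sep) if a.strip()})
        nodes.update(authors)
        for a, b in combinations(authors, 2):
            edges[(a, b)] += 1
    return nodes, edges


def largest_component(nodes, edges):
    """Return the node set of the largest (weakly) connected component."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    seen, best = set(), set()
    for start in sorted(nodes):
        if start in seen:
            continue
        component, stack = {start}, [start]
        while stack:  # depth-first traversal from the start node
            for nxt in neighbors[stack.pop()]:
                if nxt not in component:
                    component.add(nxt)
                    stack.append(nxt)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
nodes, edges = co_author_edges(rows)
isolates = {n for n in nodes if not any(n in pair for pair in edges)}
giant = largest_component(nodes, edges)
print(len(nodes), "nodes,", len(isolates), "isolates,", len(edges), "edges")
```

Edge weights count shared papers, so the same table could also feed a weighted layout; the node, isolate, and edge counts printed here play the role of the NAT summary for the toy data only.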

Mapping the Field of RNAi Research (SDB Data) (Sci2 Tutorial, Section 5.2.7): Patent Citation Network
To visualize the citation patterns of patents on RNAi, load ‘*yoursci2directory*/sampledata/scientometrics/sdb/RNAi/USPTO_citation_table_(nwb_format).csv’ as a standard csv file and follow the instructions in the tutorial.

Mapping the Field of RNAi Research (SDB Data) (Sci2 Tutorial, Section 5.2.7): Topic Bursts
Load ‘*yoursci2directory*/sampledata/scientometrics/sdb/RNAi/Medline_master_table.csv’. This table includes full records of MEDLINE papers and can be used to find bursting terms in MEDLINE abstracts dealing with RNAi. Load the file as a standard csv and run ‘Preprocessing > Topical > Normalize Text’ with the default separator and the “abstract” box checked. Run ‘Analysis > Topical > Burst Detection’ with “date_cr_year” in the Date Column and “abstract” in the Text Column, leaving the rest of the values at their defaults. Right-click on “Burst detection analysis (date_cr_year, abstract): maximum burst level 1” in the Data Manager and view the file. There are more words than can easily be viewed with the horizontal bar graph, so sort the list by “Strength” and prune all but the strongest 10 words. Save the file as a new .csv and load it into the Sci2 Tool as a standard csv file. Select the new table in the Data Manager and visualize it using ‘Visualize > Temporal > Horizontal Bar Graph’.
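The sort-and-prune step can also be scripted instead of done by hand in a spreadsheet. The following is a minimal sketch, assuming the burst table has a numeric “Strength” column as named above; the other column names and every value in the sample are made up for illustration and are not results from the RNAi data.

```python
import csv
import io

# Hypothetical burst table. "Word", "Start", "End" and all values are
# illustrative assumptions; only the "Strength" column name comes from the text.
SAMPLE = """Word,Strength,Start,End
alpha,3.2,2001,2003
beta,7.9,2002,2004
gamma,1.1,2003,
delta,5.0,2004,2006
"""


def strongest_bursts(csv_text, keep=10, column="Strength"):
    """Return the `keep` rows with the highest burst strength."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    rows.sort(key=lambda r: float(r[column]), reverse=True)
    return rows[:keep]


def to_csv(rows):
    """Serialize rows back to csv text with the original column order."""
    if not rows:
        return ""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


top = strongest_bursts(SAMPLE, keep=2)  # use keep=10 for the tutorial step
pruned_csv = to_csv(top)
print(pruned_csv)
```

Writing `pruned_csv` to a new .csv file gives a table that can be loaded back into the Sci2 Tool for the horizontal bar graph.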

Mapping “Artificial Intelligence” Research using SDB Data
Börner, Katy,, Duhon, Russell Jackson &. (2009). Science & Technology Assessment Using Open Data and Open Code. IEEE Intelligent Systems. Vol. 24(4), 78-81, IEEE Computer Systems.

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 33

Scholarly Database: Architecture
Solr full-text search server
- Open source
- Uses the Lucene search library
Interface developed in Django
- Open source
- Particularly suited for content-focused web applications
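To make the role of the Solr search server concrete, here is a minimal sketch of how a web front end might build a query URL for Solr's standard select handler. The host, port, and core name (`sdb_demo`) are hypothetical and say nothing about how SDB itself is configured; `q`, `rows`, `start`, and `wt` are standard Solr query parameters. The sketch only builds the URL and does not contact a server.

```python
from urllib.parse import urlencode


def solr_select_url(base_url, core, text, rows=10, start=0):
    """Build a URL for Solr's /select handler (no request is sent)."""
    params = {
        "q": text,       # query string, matched against the default search field
        "rows": rows,    # number of results per page
        "start": start,  # offset of the first result, for paging
        "wt": "json",    # response format
    }
    return f"{base_url.rstrip('/')}/solr/{core}/select?{urlencode(params)}"


# Hypothetical host and core name, chosen only for illustration.
url = solr_select_url("http://localhost:8983", "sdb_demo", "RNAi", rows=5)
print(url)
```

A Django view could send such a request and render the JSON response as a result page, which is one plausible division of labor between the two components named above.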

NIH Grants 36

Medline Publications 37

NSF Grants 38

US Patents 39

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 40

Scholarly Database: Documentation Demo  Wikipedia documentation with table schemas, e.g.,  SDB About page,  Data dictionaries at  Sample data files at  Tutorials, e.g., NWB Tool Tutorial, Sci2 Tool Tutorial at  Peer reviewed publications, see These types of documentation are needed for scientifically valid studies that are used to inform decision making. 41

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 42

Planned SDB Extensions
- Regular update of SDB data (add NIH ExPORTER data)
- Adding linkage data, e.g., awards -> publications, grants, news
- Adding job market data
- Exposing SDB data to the Linked Open Data
- Extending SDB-Sci2 Tool synergies

Adding NIH ExPORTER Data
Source: NIH ExPORTER
Year coverage: 2000 to June 2010
Update schedule: monthly for 2010 awards
File formats available: xml/csv
Description from Web site: ExPORTER makes downloadable versions of the data accessed through the RePORT Expenditures and Results (RePORTER) interface available to the public. This site is a key component of NIH "open government" initiatives to provide more transparency in NIH activities, improve the quality of the data we collect, and increase its utility.
NIH ExPORTER is now in beta. Originally, data were released only from FY 2005 onward. In June 2010, the historical data were extended back to cover FY 2000 to FY 2004, and record formats were refined in response to user feedback. Release notes describing these changes will be posted until both the xml and csv record formats are finalized on Oct 1.
Data Fields: Please see the NIH ExPORTER data dictionary.

[#10] Using the Scholarly Database at IU  Motivation  Functionality / Sample Usage  Implementation  Documentation  Outlook  Exercise: Identify Promising SDB Collaborations 45

Exercise Please identify promising SDB usages and/or collaborations. Document it by listing  Project title  User, i.e., who would be most interested in the result?  Insight need addressed, i.e., what would you/user like to understand?  Data used, be as specific as possible.  Analysis algorithms used.  Visualization generated. Please make a sketch with legend. 46

All papers, maps, cyberinfrastructures, talks, press are linked from 47