Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Kevin W. Boyack Mapping Examples Sandia is a multiprogram.

Slides:



Advertisements
Similar presentations
Overlay Maps of Science (2010 update)
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Paris, May 2007 How good is the research base? New approaches to research indicators Colloque de l’Académie des sciences "Évolution des publications scientifiques"
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
SciTech Strategies, Inc. William Pickering Dick Klavans Marjorie M.K. Hlava IEEE SciTech Strategies Access Innovations / Data Harmony March 23, 2010 Found.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
IDENTIFYING HOT BRAZILIAN SCIENCE AND TECHNOLOGY: TECH MINING METHODS FOR RELATING SOURCES OF KNOWLEDGE AND EMERGING RESEARCH AREAS EU-SPRI CONFERENCE,
Research evaluation at CWTS Meaningful metrics, evaluation in context
Asha Balakrishnan Vanessa Peña Bhavya Lal Task Leader November 5, 2011
Evaluation of the Humanities at the ERC Alain Peyraube CNRS and EHESS (FR) ERC Scientific Council  Relevance and Impact of the Humanities University of.
Mapping the Structure and Evolution of Chemistry Research Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Cyberinfrastructure.
Shou Ray Information Service Co., Ltd.
Disciplinary Diversity of Units These slides expand the article How journal rankings can suppress interdisciplinary research: A comparison between innovation.
Mutual Information Mathematical Biology Seminar
Experimental Evaluation in Computer Science: A Quantitative Study Paul Lukowicz, Ernst A. Heinz, Lutz Prechelt and Walter F. Tichy Journal of Systems and.
Chapter One The Nature of Probability and Statistics.
Who am I and what am I doing here? Allan Tucker A brief introduction to my research
D2K: Data to Knowledge Institute – Natural and Human System Responses to Environmental Change Genomics and Bioinformatics Sustainability, People, and Policy.
Health and CS Philip Chan. DNA, Genes, Proteins What is the relationship among DNA Genes Proteins ?
Jake Blanchard – University of Wisconsin – August 2007.
SciTech Strategies, Inc. BETTER MAPS BETTER DECISIONS Science Mapping and Applications: Choices and Trade-offs Kevin W. Boyack, SciTech Strategies Standards.
Wojciech Fenrich Interdisciplinary Centre for Mathematical and Computational Modelling (ICM) University of Warsaw Prague, KRE 12,
Kevin W. Boyack Sandia National Laboratories Sackler Colloquium on Mapping Knowledge Domains May 11, 2003 An indicator-based characterization of PNAS Sandia.
SAINT2002 Towards Next Generation January 31, 2002 Ly Sauer Sandia National Laboratories Sandia is a multiprogram laboratory operated by Sandia Corporation,
Educational Knowledge Domain Visualizations: Tools to Navigate, Understand, and Internalize the Structure of Scholarly Knowledge and Expertise Peter A.
Standards in science indicators Vincent Larivière EBSI, Université de Montréal OST, Université du Québec à Montréal Standards in science workshop SLIS-Indiana.
RQF outcomes in sciences Gavin Moodie, Principal Policy Adviser Vice Chancellor’s office.
An alternative way of looking at research performance, based on citation patterns (co-citation analysis) rather than traditional journal classifications.
Students Becoming Scientists in the World: Integrating Research and Education for Sustainable Development Dr. James P. Collins Directorate for the Biological.
Bibliometrics: coming ready or not CAUL, September 2005 Cathrine Harboe-Ree.
______________________________________ Öz/Atıf Veri Tabanlarında Yeni Gelişmeler, Veri Grubu Analizleri Yoluyla Akademik Rekabete Yeni Bir Bakış “ SCIVAL.
Information Visualization Tools Ketan Mane Ph.D. Candidate Member of Information Visualization Lab Member of Cyberinfrastructure for Network Science School.
Impact factorcillin®: hype or hope for treatment of academititis? Acknowledgement Seglen O Per (BMJ 1997; 134:497)
Mapping the Disciplinary Diffusion of Information Understanding Complex Systems 2005 Peter A. Hook Doctoral Student, Indiana University Bloomington
Computing and Communications and Biology Molecular Communication; Biological Communications Technology Workshop Arlington, VA 20 February 2008 Jeannette.
Towards a Science of Science (Policy) Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization Laboratory, Director.
Computational Scientometrics Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization Laboratory, Director School.
Computational Scientometrics: Mapping the Structure and Evolution of Science Katy Börner & the InfoVis Lab School of Library and Information Science.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
The Scholarly Database and Its Utility for Scientometrics Research Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information.
Mapping New Strategies: National Science Foundation J. HicksNew York Academy of Sciences4 April 2006 Examples from our daily life at NSF Vision Opportunities.
Mapping Medline Papers, Genes, and Proteins Related to Melanoma Research Kevin Boyack †, Ketan Mane ‡, Katy Börner ‡ † VisWave LLC, Albuquerque, NM
Designing Insightful (Network) Visualizations of Scholarly Activity Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information.
Science and Technology (S&T) Studies Can be conducted at different levels:  local (individual),  meso (local, e.g., one institute, one funding agency),
Overlay Maps of Science Ismael Rafols 1,2, Alan Porter 2 and Loet Leydesdorff 3 2 SPRU, University of Sussex, Brighton 2 School of Public Policy, Georgia.
EuroCRIS Platform Meeting - Vienna 2-3 October 1998 CRIS as a source for tracking science publication patterns Fulvio Naldi - Carlo Di Mento Italian National.
Katy Börner, Knowledge Domain Visualizations in Support of Scholarly Knowledge and Expertise Management, SRI International, Oct 21,
SciVal Spotlight Training for KU Huiling Ng, SciVal Product Sales Manager (South East Asia) Cassandra Teo, Account Manager (South East Asia) June 2013.
Deutsche Forschungsgemeinschaft DFG The Use of Research Funding Databases for Research Assessment Information Systems Presented at the 8th international.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
ESSENTIAL SCIENCE INDICATORS (ESI) James Cook University Celebrating Research 9 OCTOBER 2009 Steven Werkheiser Manager, Customer Education & Training ANZ.
Shaping a Health Statistics Vision for the 21 st Century 2002 NCHS Data Users Conference 16 July 2002 Daniel J. Friedman, PhD Massachusetts Department.
Managing Humanity’s Knowledge and Expertise: The InfoVis Cyberinfrastructure Katy Börner & the InfoVis Lab School of Library and Information Science
1 Making a Grope for an Understanding of Taiwan’s Scientific Performance through the Use of Quantified Indicators Prof. Dr. Hsien-Chun Meng Science and.
Mapping Knowledge Domains Katy Börner School of Library and Information Science Talk at IU’s Technology Transfer Office Indianapolis,
The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology.
Computational Scientometrics That Informs Science Policy Dr. Katy Börner Cyberinfrastructure for Network Science Center, Director Information Visualization.
National Research Council Of the National Academies
Computational Scientometrics: Mapping the Structure and Evolution of Science Katy Börner & the InfoVis Lab School of Library and Information Science Indiana.
1 Academic Disciplines and Level of Academic Challenge Gary R. Pike University of Missouri–Columbia.
Information Visualization, Human-Computer Interaction, and Cognitive Psychology: Domain Visualizations Kevin W. Boyack Sandia National Laboratories.
Finding References for NSF Proposals: What’s been done? Why should you care? Liz Current address: Washington University in St.
Katy Börner Teaching & Research Teaching & Research Katy Börner
Performing Fault-tolerant, Scalable Data Collection and Analysis James Jolly University of Wisconsin-Madison Visualization and Scientific Computing Dept.
The Thomson Reuters Journal Selection Policy – Building Great Journals - Adding Value to Web of Science Maintaining and Growing Web of Science Regional.
Publication Pattern of CA-A Cancer Journal for Clinician Hsin Chen 1 *, Yee-Shuan Lee 2 and Yuh-Shan Ho 1# 1 School of Public Health, Taipei Medical University.
THE BIBLIOMETRIC INDICATORS. BIBLIOMETRIC INDICATORS COMPARING ‘LIKE TO LIKE’ Productivity And Impact Productivity And Impact Normalization Top Performance.
Mapping Knowledge Domains
Educational Knowledge Domain Visualizations: Tools to Navigate, Understand, and Internalize the Structure of Scholarly Knowledge and Expertise Peter A.
Designing Insightful (Network) Visualizations of Scholarly Activity
Presentation transcript:

Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Kevin W. Boyack Mapping Examples Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under Contract DE-AC04-94AL85000.

2 Macromodel: “Best” Map  Each dot is one journal  Journals group by discipline  Labeled by hand Boyack, K.W., Klavans, R., & Börner, K. (2005, in press). Mapping the backbone of science. Scientometrics.

3 Modified from Boyack et al. by Ian Aliman, IU

4 Macromodel: Structural Map  Clusters of journals denote disciplines  Lines denote strongest relationships between disciplines  Enables disciplinary diffusion studies  Enables comparison of institutions by discipline Boyack, K.W., Klavans, R., & Börner, K. (2005, in press). Mapping the backbone of science. Scientometrics.

5 Paper-level Map

6 Disciplinary S&T Model - Details  Uses combined SCIE/ SSCI/ISI Proceedings from 2003 –7445 journals, 1198 proceedings –journals, proceedings treated equivalently –Bib coupling of journals  Initial ordination and clustering of journals gave 852 clusters  Coupling counts reaggregated at the journal cluster level; ordination of journal clusters –(x,y) positions for each journal cluster, journal Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences

7 Research Community Model - Details  Uses combined SCIE/ SSCI/ISI Proceedings from 2003 –997,775 papers from 8643 sources –Bib coupling of papers  Initial ordination and clustering of journals gave 117,433 clusters  Cluster positions calculated using journal positions from the disciplinary map Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences

8

9

10 Accuracy

11 Journal-level: Local Accuracy  For each similarity measure, journal pairs were assigned a 1/0 binary score if they were IN/OUT of the same ISI category  Accuracy vs. coverage curves were generated for each similarity measure  For each similarity measure, distances (in the VxOrd layouts) between journal pairs were calculated  Accuracy vs. coverage curves were generated for each re-estimated (distance) similarity measure  Results after running through VxOrd were more accurate than the raw measures  Inter-citation measures are best Similarity measures After VxOrd Klavans, R., & Boyack, K.W. (in press). Identifying a better measure of relatedness for mapping science. Journal of the American Society for Information Science and Technology.

12 Journal-level: Regional Accuracy  For each similarity measure, the VxOrd layout was subjected to k- means clustering using different numbers of clusters  Resulting cluster/category memberships were compared to actual category memberships using entropy/mutual information method  Increasing Z-score indicates increasing distance from a random solution  Most similarity measures are within several percent of each other Boyack, K.W., Klavans, R., & Börner, K., (submitted). Mapping the backbone of science. Scientometrics.

13 Paper-level: Local Accuracy  Two maps (current and reference), two measures (raw and modified cosine), two aggregation levels  For each similarity measure, paper pairs, ordered by distance on the map, were assigned a 1/0 binary score if they were IN/OUT of the same ISI category  Accuracy vs. coverage curves were generated for each similarity measure  K50 measures have high accuracy at high coverage Klavans, R., & Boyack, K.W. (under review). Quantitative evaluation of large maps of science. Scientometrics. Local Accuracy (Current Papers) K50 (aggregated) RawFreq (aggregated) K50 (paper) RawFreq (paper) 60% Local Accuracy 70% 60% Local Accuracy (%yes) 70% % Coverage Local Accuracy (Reference Papers) K50 (aggregated) RawFreq (aggregated) K50 (paper) RawFreq (paper) % Coverage 60% Local Accuracy (%yes) 70%

14 Disciplinary Bias (Reference Papers) Disciplinary Bias Small[1999] RawFreq (paper) RawFreq (aggregated) K50 (paper) K50 (aggregated) Disciplinary Bias (Current Papers) Disciplinary Bias RawFreq (paper) RawFreq (aggregated) K50 (paper) K50 (aggregated) Paper-level: Disciplinary Bias  Two maps (current and reference), two measures (raw and modified cosine), two aggregation levels  Disciplinary bias measures effect of thresholding with coverage  K50 measures have lowest bias Klavans, R., & Boyack, K.W. (under review). Quantitative evaluation of large maps of science. Scientometrics.

15 Cluster Size (Reference Papers) RawFreq (paper) K50 (paper) RawFreq (aggregated) K50 (aggregated) Log (Rank Size) Log (Size of Cluster) Cluster Size (Current Papers) Log (Size of Cluster) Log (Rank Size) K50 (paper) RawFreq (paper) RawFreq (aggregated) K50 (aggregated) Paper-level: Cluster Distributions  Two maps (current and reference), two measures (raw and modified cosine), two aggregation levels  Cluster size distributions – smaller clusters are usually better – chaining can create communities that are too large  K50 measures have fewer large clusters Klavans, R., & Boyack, K.W. (under review). Quantitative evaluation of large maps of science. Scientometrics.

16 Paper-level: Text Analysis  Current map, K50 similarity  Multidocument summarization –cluster cohesiveness calculated using abstracts for all clusters –compared to values from random clusters  98.3% of the actual clusters of size 10 are more cohesive than random at p< each

17 Uses

18 Worldwide S&T Circle size – number of topics Percent conference papers 0-25% 25-50% 50-75% % Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences

19 Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Circle size – number of topics Vitality (use of newer ideas) >10% more vital than world 0-10% more vital than world 0-10% less vital than world >10% less vital than world DOE Profile

20 Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Circle size – number of topics Vitality (use of newer ideas) >10% more vital than world 0-10% more vital than world 0-10% less vital than world >10% less vital than world Sandia Profile

21 Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Circle size – number of topics Vitality (use of newer ideas) >10% more vital than world 0-10% more vital than world 0-10% less vital than world >10% less vital than world A Specific University Profile

22 Math CS Physics Chemistry Biology Engineering Medical Brain/Neuro Psych Earth Sciences Social Sciences Circle size – number of topics Vitality (use of newer ideas) >10% more vital than world 0-10% more vital than world 0-10% less vital than world >10% less vital than world Potential SNL/Univ Collaborations

23 Identifying Opportunities/Threats Policy Economics Statistics Math CompSci Physics Biology GeoScience Microbiology BioChem Brain Psychiatry Environment Vision Virology Infectious Diseases Cancer MRI Bio- Materials Law Plant Animal Phys-Chem Chemistry Psychology Education Computer Tech GI (36 Research Communities that will impact GI Research… that GI Researchers are least likely to be aware of)

24 Identifying Core Competency Policy Economics Statistics Math CompSci Physics Biology GeoScience Microbiology BioChem Brain Psychiatry Environment Vision Virology Infectious Diseases Cancer MRI Bio- Materials Law Plant Animal Phys-Chem Chemistry Psychology Education Computer Tech GI Funding patterns of the National Science Foundation (NSF)

25 Identifying Core Competency Policy Economics Statistics Math CompSci Physics Biology GeoScience Microbiology BioChem Brain Psychiatry Environment Vision Virology Infectious Diseases Cancer MRI Bio- Materials Law Plant Animal Phys-Chem Chemistry Psychology Education Computer Tech GI Funding patterns of the National Institutes of Health (NIH)

26 Identifying Core Competency Policy Economics Statistics Math CompSci Physics Biology GeoScience Microbiology BioChem Brain Psychiatry Environment Vision Virology Infectious Diseases Cancer MRI Bio- Materials Law Plant Animal Phys-Chem Chemistry Psychology Education Computer Tech GI Funding patterns of the US Department of Energy (DOE)

27 Potential Uses of Science Maps  Overlays –Topic distribution –Opportunity/threat assessment –Core competency identification –Funding (amount) patterns –Impact patterns  Relationships –Interdisciplinary –Community level between disciplines

28 Related Publications –Boyack, K. W., Klavans, R. & Börner, K. (2005, in press). Mapping the backbone of science. Scientometrics. –Klavans, R. & Boyack, K. W. (2005, in press). Identifying a better measure of relatedness for mapping science. Journal of the American Society for Information Science and Technology. –Boyack, K. W., & Rahal, N. (2005, in press). Evaluation of LDRD investment areas at Sandia. Technological Forecasting and Social Change. –Boyack, K. W., Mane, K. & Börner, K. (2004). Mapping Medline papers, genes and proteins related to melanoma research. IEEE Information Visualization 2004, –Boyack, K. W. (2004). Mapping knowledge domains: Characterizing PNAS. Proceedings of the National Academy of Sciences 101(S1), –Boyack, K. W., & Börner, K. (2003). Indicator-assisted evaluation and funding of research: Visualizing the influence of grants on the number and quality of research papers. Journal of the American Society for Information Science and Technology 54(5), 447. –Börner, K., Chen, C., & Boyack, K. W. (2003). Visualizing knowledge domains. Annual Review of Information Science and Technology 37, –Werner-Washburne, M., Wylie, B., Boyack, K., Fuge, E., Galbraith, J., Fleharty, M., Weber, J., & Davidson, G.S. (2002). Concurrent analysis of multiple genome-scale datasets. Genome Research 12(10), –Boyack, K. W., Wylie, B. N., & Davidson, G. S. (2002). Information visualization, human-computer interaction, and cognitive psychology: Domain visualizations. Lecture Notes in Computer Science 2539, –Boyack, K. W., Wylie, B. N., & Davidson, G. S. (2002). Domain visualization using VxInsight for science and technology management. Journal of the American Society for Information Science and Technology, 53(9), –Davidson, G. S., Wylie, B. N., & Boyack, K. W. (2001). Cluster stability and the use of noise in interpretation of clustering. Proc. IEEE Information Visualization 2001, –Boyack, K.W., Wylie, B.N., Davidson, G.S. & Johnson, D.K., Analysis of patent databases using VxInsight. Presented at New Paradigms in Information Visualization and Manipulation 2000, McLean, VA, Nov. 10, 2000.