 CiteGraph: A Citation Network System for MEDLINE Articles and Analysis Qing Zhang 1,2, Hong Yu 1,3 1 University of Massachusetts Medical School, Worcester,

Slides:



Advertisements
Similar presentations
PubMed/How to Search, Display, Download & (module 4.1)
Advertisements

Introduction to Mendeley. What is Mendeley? Mendeley is a reference manager allowing you to manage, read, share, annotate and cite your research papers...
PubMed.
1 SUBJECT DATABASES ENGLISH 115 Hudson Valley Community College Marvin Library Learning Commons.
Searching for and Obtaining Scientific Literature.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Search Tools & Tips for PSC 231 Money in Politics Prepared by Ann Marshall February 5, 2013.
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
PubMed/How to Search, Display, Download & (module 4.1)
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE KIEV, 31 JANUARY.
BY OBAJE, ALFRED MICHAEL (DBA., B.SC., M.L.S.) MEDICAL SCIENCES LIBRARIAN 1 Library E-resources Sensitization and Demonstrations for 400 Level Medical.
Orientation to Web of Science Dr.Tariq Ashraf University of Delhi South Campus
How to do a literature search Saharuddin Ahmad Aida Jaffar Department of Family Medicine.
NIH RePORT: report.nih.gov | RePORTER: projectreporter.nih.gov NIH ExPORTER Data NIH OFFICE OF EXTRAMURAL RESEARCH The Health Datapalooza.
Rajesh Singh Deputy Librarian University of Delhi Measuring Research Output.
1 How to find literature - A very short introduction SMED 8004 Medicine and Health Library October 2014.
Introduction to Mendeley. What is Mendeley? Mendeley is a reference manager allowing you to manage, read, share, annotate and cite your research papers...
PubMed Overview From the HINARI Content page, we can access PubMed by clicking on Search inside HINARI full-text using PubMed. Note: If you do not properly.
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE LVIV, 11 SEPTEMBER.
University of Antwerp Library TEW & HI UA library offers... books, journals, internet catalogue -UA catalogue, e-info catalogue databases -e.g.
Find Full Text Journal Articles Using Pubmed Nancy B. Clark, M.Ed. Director of Medical Informatics Education FSU College of Medicine 1 All recourses are.
Bibliometrics for your CV Web of Science Google Scholar & PoP Scopus Bibliometric measurements can be used to assess the output and impact of an individual’s.
Research Resources Eugene Tseytlin Department of Biomedical Informatics University of Pittsburgh.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
Anomalies in Open-Access & Traditional Biomedical Literature: A Comparative Analysis Abstract This research compares rates of anomaly and post-publication.
 Major part of psychology for researchers, students, clinicians, etc…  Difference between journal article and popular press articles  Scholarly Journal-
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
Retrieval of Highly Related Biomedical References by Key Passages of Citations Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan.
Journal Searching Nancy B. Clark, M.Ed. Director of Medical Informatics Education FSU College of Medicine 1 All recourses are available online in Medical.
Database collection evaluation An application of evaluative methods S519.
Trinity College Dublin, The University of Dublin GE3M25: Bioinformatics Karsten Hokamp, PhD Genetics TCD, 05/11/2015.
EBI is an Outstation of the European Molecular Biology Laboratory. Literature Resources at the EBI Information Workshop on European Bioinformatics Resources.
1 SEMEF : A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Delroy Cameron Masters Thesis Computer Science, University of Georgia.
Copyright OpenHelix. No use or reproduction without express written consent1.
Title Authors Introduction Text, text, text, text, text, text Background Information Text, text, text, text, text, text Observations Text, text, text,
Scopus Fueling Research, Driving Innovation. Scopus Introduction What is Scopus ? Why do you need Scopus? Why do our customers use Scopus?
Assessing Hyperthermia and Cancer Research Productivity Shu-Wan Yeh 1 *, Shih-Ting Hung 1, Yuan-Hsin Chang 1, Yee-Shuan Lee 2 and Yuh-Shan Ho 1# 1 School.
Publication Pattern of CA-A Cancer Journal for Clinician Hsin Chen 1 *, Yee-Shuan Lee 2 and Yuh-Shan Ho 1# 1 School of Public Health, Taipei Medical University.
Sul-Ah Ahn and Youngim Jung * Korea Institute of Science and Technology Information Daejeon, Republic of Korea { snowy; * Corresponding Author: acorn
CitEc as a source for research assessment and evaluation José Manuel Barrueco Universitat de València (SPAIN) May, й Международной научно-практической.
Conclusions  A high percentage share of meeting abstracts (36%) and a low percentage share of articles (40%) was found in the ten journals in the category.
Text and Data Mining for Systematic Reviews Investigating Trends to Update Collaboration Services Virginia Pannabecker Virginia Tech, University Libraries.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Google Scholar and ShareLaTeX
Our Digital Showcase Scholars’ Mine Annual Report from July 2015 – June 2016 Providing global access to the digital, scholarly and cultural resources.
Scientometric Analysis of Annual Review of Immunology
Bibliometric Analysis of Herbal Medicine Publications, 1991 to 2004
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Ming-Yao Chen#, Wen-Ta Chiu and Yuh-Shan Ho*
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Jian Wang Assistant Professor Science Based Business Program LIACS, Leiden University
3School of Public Health, Taipei Medical University, Taipei, Taiwan
Bibliometric Analysis of Water Research
Reference management soft wares Endnote & Mendeley
TITLE Authors Institution RESULTS INTRODUCTION CONCLUSION AIMS METHODS
خشنه اتره اهورهه مزدا شيوۀ ارائه مقاله 17/10/1388.
Funding and Disclosures
Review Key Teaching Points
Introduction of KNS55 Platform
TITLE Authors We appreciate the support of the:
Indication of Publication Pattern of Scientometrics
Bibliometric Analysis of Process Safety and Environmental Protection
Citation-based Extraction of Core Contents from Biomedical Articles
Abstract (Maximum 500 words)
Lívia Vasas, PhD 2018 The Nation Library of Medicine and its databases Mozilla Firefox or Google Chrome Lívia Vasas, PhD.
An Overview of Depression-related Research in the Asia Tigers
PubMed Database Interface (Basic Course: Module 4)
Citation databases and social networks for researchers: measuring research impact and disseminating results - exercise Elisavet Koutzamani
PubMed/How to Search, Display, Download & (module 4.1)
Search for Article Citation
Presentation transcript:

 CiteGraph: A Citation Network System for MEDLINE Articles and Analysis Qing Zhang 1,2, Hong Yu 1,3 1 University of Massachusetts Medical School, Worcester, MA, USA 2 University of Wisconsin Milwaukee, Milwaukee, Milwaukee, WI, USA 3 VA Central Massachusetts, Leeds, MA, USA

Outline  Introduction  Background  Method  Evaluation  Analysis CiteGraph, MedInfo 2013

Introduction  Citation network is important for  Information retrieval  Journal Impact Factor, H-index  Co-authorship network is important  Few citation networks are available for research  We built CiteGraph CiteGraph, MedInfo 2013

Background  Citation network analysis  Power law distribution in citation networks  Article ranking, HITS and PageRank  Community structure of physics fields  Citation network tool for given legal issue using legal document citation network  Co-authorship network analysis  Research collaboration patterns  Author authority : Erdös Number  Literature search  CiteSeer X, Google Scholar CiteGraph, MedInfo 2013

The CiteGraph Data CiteGraph, MedInfo 2013

Citation Network Example CiteGraph, MedInfo 2013

Challenges CiteGraph, MedInfo 2013 (1)Yu, H and Lee M Accessing Bioscience Images from Abstract Sentences. Bioinformatics. Vol 22 No. 14, pages e547–e556. (2) Hong Yu and Minsuk Lee. Accessing Bioscience Images from Abstract Sentences. Bioinformatics. Vol 22 No. 14, pages e547–e (3) Yu H, Lee H Accessing Bioscience Images from Abstract Sentences. Bioinformatics: 22 (14), e547–e556.

Methods  Mapping between articles  Mapping articles to the PubMed ID  Author name disambiguation CiteGraph, MedInfo 2013

Methods  If two of the following matching result are true, we consider the two entities (for example the citation and the article) are matched  Title matching  the set of tokens contained in one title field is a subset of the tokens in the other, or  the number of tokens common to both fields is more than 80% of the size of the larger of the two fields.  Author list matching  two lists of surnames have one-on-one mapping  surnames in one entity (citation) is fully contained in the surname set of the second (article).  Journal name matching  remove stop words such as “of”  if the number of common initials in the journal titles was greater than 80% of the tokens in the longer journal name, they were considered equivalent.

Evaluation Results TaskPrecisionRecallF1Inter-Annotator Agreement (Kappa) Citation Mapping PMID Mapping CiteGraph, MedInfo Annotators are invited to annotate the citation mapping and PMID mapping results Each annotator is presented with 20 matching results of each task

The CiteGraph Statistics CiteGraph, MedInfo M articles 6.35 M citations 1.37 M authors

The CiteGraph Statistics CiteGraph, MedInfo 2013 log y = 1.06 – 2.45* log x (p<0.05 t-test) Livak KJ., Schmittgen TD., Analysis of relative gene expression data using real- time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods Dec;25(4):402-8.

The CiteGraph Statistics CiteGraph, MedInfo 2013 Largest connected component : 1.27 million authors (92.7%) The second largest connected component: 35 authors

The CiteGraph Statistics CiteGraph, MedInfo 2013 Co-authorship spans from 1 to 35 years, while 83.7% of author pairs just appear once.

The CiteGraph Statistics CiteGraph, MedInfo 2013 MeasureMeanMedianStdMaxMin # of Co-authors Co-authorship Year Span * The largest component is excluded when calculating the statistics in the table. Its size is 1.27 million (92.7% authors)

Trends CiteGraph, MedInfo 2013

Conclusion  We created a citation/co-authorship networks with biomedical full text literature  Our networks have high accuracy and large scale, and it can benefit biomedical text mining communities  Article ranking  Research collaboration recommendation  Social network analysis  The network database can be downloaded per request CiteGraph, MedInfo 2013

Acknowledgement  National Institute of Health 1R01GM to Hong Yu  A start-up fund from University of Massachusetts Medical School to Hong Yu  National Center for Advancing Translational Sciences of the National Institute of Health under award number UL1TR CiteGraph, MedInfo 2013