Download presentation
Presentation is loading. Please wait.
Published byValerie Fallas Modified over 9 years ago
1
Vermelding onderdeel organisatie May 3, 2015 1 Literature Search IN 3305 Created by Tomas Klos. Edited by Alexandru Iosup. Parallel and Distributed Systems Groep http://www.pds.ewi.tudelft.nl/
2
May 3, 20152 Introduction From the IN3305 study goals: “kennismaken met wetenschappelijke literatuur” “problemen oplossen door te zoeken in literatuur” What is “scientific literature”? To read or not to read? Literature is output and input Measuring and assessing Quality Useful sites Recommendations and tips
3
How to Talk About Books You Haven’t Read “There is more than one way not to read” Not opening the book You cannot read everything How many books can you read? How many books can a librarian read? Librarians can talk about every book in the library (every book out of millions) There exists a system to (not) read
4
May 3, 20154 Literature = output “Publish or perish”: quality / quantity (“80% of all published papers are not cited”) Peer-review (for conferences, journals): (double) blind review: Accept, with/without (major) revisions Reject Acceptance rate, e.g. 25% (Nature: 10% is reviewed) Measuring scientific output: “scientometrics”
5
May 3, 20155 Scientometrics Scientometrics, “measuring and analyzing science”, Bibliometrics, “study or measurement of texts and information” In particular citation analysis: Which papers cite a paper / does a paper cite? Authority of authors, journals, papers Same principle: Google PageRank Web: network of sites, linking to each other Science: network of papers, citing each other
6
May 3, 20156 World Wide Web
7
May 3, 20157 Citation Networks Time
8
May 3, 20158 Citation Databases Commercial: ScienceCitation Index (Inf.Sci. Inst.) Scopus (Elsevier) Free: Google Scholar: bettercoveragethan ISI CiteSeer (computer science) RePec (economics)
9
May 3, 20159 Indices Journals: Journal Impact Factor Personal: h-index (Hirsch, 2005): “I propose the index h, defined as the number of papers with citation number ≥h, as a useful index to characterize the scientific output of a researcher.” A scientist has index h if h of his/her N papers have at least h citations each, and the other (N − h) papers have no more than h citations each. Extensions: g-index, h-b-index
10
May 3, 201510 Journal Impact Factor (JIF) Many journals have no impact factor JIF is the average number of citations in a given year, to papers in a journal in the 2 previous years. For journal x, 2008 number of citations in 2008 to papers in journal x from the period 2006 – 2007 JIF (x, 2008) = Total number of papers in journal x in the period 2006 – 2007 What does an average value mean?
11
May 3, 201511 Journal Impact factors, 2004 ≥1 citation/publication (last 2 years) JIF Journal Rank Highest JIF ~30 Very high JIF ≥15
12
May 3, 201512 CS impact factors, 2005 Journal Rank JIF Highest JIF ~8 Very high JIF ≥2 Highest JIF ~30 Very high JIF ≥15 CSAll
13
May 3, 201513 Google Scholar “cited by” Relevant authors TU Delft SFX linking Import into bibtex
14
May 3, 201514
15
May 3, 201515
16
May 3, 201516 From home: use vpn!
17
May 3, 201517
18
May 3, 201518 DBLP “lists more than one million articles” (april 2008) Indexes: Authors Now also “Faceted search”, “CompleteSearch” Conferences Journals Series Subjects
19
May 3, 201519
20
May 3, 201520
21
May 3, 201521
22
May 3, 201522
23
Harzing’s Publish or Perish Uses google scholar data Calculates many indices Number of citations (also per year / article / author /…) Hirsch’s h-index Zhang’s e-index (excess in h-index set) Egghe’s g-index … May 3, 201523
24
May 3, 201524 Publish or Perish (http://www.harzing.com/pop.htm)
25
May 3, 201525 Off-topic: How to Game the Citation System? (part of) Collaboration graph
26
May 3, 201526 All authors with Erdős number 1
27
May 3, 201527 Collaboration Graph Degree Distribution Erdős
28
May 3, 201528 Collaboration Graph: Connected Components Distribution Giant Component
29
May 3, 201529 Even further off topic: Kevin Bacon oracle
30
May 3, 201530 Interested? Erdős Number Project http://www.oakland.edu/enp/ Kevin Bacon Oracle http://oracleofbacon.org/ Mark Newman: “who is the best connected scientist?”
31
May 3, 201531 Literature = input Citations Place your work in context Give credit to previous work Support your arguments Show your marginal contribution Prevent plagiarism Read what you cite! (prevent superfluous citing) This does NOT mean: “You should read everything” “You cannot also read what you don’t cite”
32
May 3, 201532 Sources: peer-reviewed Textbook/monograph: for teaching and background Complete treatment of a topic Cite a textbook? Mention chapter or page number Journal article More space, detail, thorough than conference paper Sometimes old news at publication date (lag) Paper in edited volume: Multiple papers, review of state-of-the-art Cite individual papers Paper in conference proceedings Recent results Conference quality; publisher of proceedings?
33
May 3, 201533 Sources: not peer-reviewed Working papers, Preprints Up-to-date, spread ideas “Open access” Computing Research Repository (CoRR) http://arxiv.org/corr/home Websites ‘Personal communication’
34
May 3, 201534 Quality? Reputation: ACM, IEEE, Springer, Elsevier, MIT/Princeton/Oxford/… University Press SCIgen - An Automatic CS Paper Generator http://pdos.csail.mit.edu/scigen/ accepted (non-reviewed) for: 2005 World Multi- Conference on Systemics, Cybernetics and Informatics (another one: an Elsevier journal!)
35
May 3, 201535 Finding Sources Browse: DBLP: http://dblp.uni-trier.de/ CiteSeer: http://citeseerx.ist.psu.edu/ Google Scholar: http://scholar.google.com/ Author homepages Follow links and citations (forward and backward)
36
May 3, 201536 TU Delft Library Search http://www.library.tudelft.nl/ws/search/ e.g. “information by subject” -> computer science TUlib “how to find and use scientific information” http://www.library.tudelft.nl/tulib/
37
Demo Vincent Conitzer, TuomasSandholm, Jérôme Lang, When Are Elections with Few Candidates Hard to Manipulate? Journal of the ACM, 2007. May 3, 201537
38
How to Talk About Books You Haven’t Read There exists a system to (not) read 1.Know where to find the sources Trustworthy: DBLP, ACM DL, Google Scholar Less trustworthy: CoRR, … 2.Know how to find the good sources Number of citations: ACM DL, Google Scholar H-index: Publish or Perish (the program) Try to avoid or weight citation cliques 3.Select from the good sources
39
May 3, 201539 Questions?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.