AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 1 The Geography of arXiv.org Rui Carvalho and Michael Batty University College London.

Slides:



Advertisements
Similar presentations
PHYSICS AND THE CITY. Carvalho and Batty: Scaling in the Geography of US Computer Science 1 Scaling in the Geography of US Computer Science Rui Carvalho.
Advertisements

What Do We Know about Scientists’ Use of Information? Carol Tenopir Donald W. King
AMJ 1 Academy of Management Journal Academy of Management Meetings August, 2008.
Prepared for: GEOG 4020, Geographic Research Methodology University of Denver, Department of Geography.
New Features Update ISI Web of Knowledge. Copyright 2006 Thomson Corporation 2 New features added Mozilla Firefox web browser is now supported New access.
1 Quality Control in Scholarly Publishing. What are the Alternatives to Peer Review? William Y. Arms Cornell University.
1 2 HEP aims to understand how our Universe works: -Experimental HEP : builds the largest scientific instruments ever to reach.
Information-Seeking Behavior in the High-Energy Physics Community Tamar Sadeh School of Informatics, City University, London Ex Libris HCI conference,
GIS: The Grand Unifying Technology. Introduction to GIS  What is GIS?  Why GIS?  Contributing Disciplines  Applications of GIS  GIS functions  Information.
Dr. Iris Berdrow Bentley College, Harvard Summer School.
Journal of Experimental and Clinical Medicine (JECM) Annual Report Kuang-Sheng Yeh, Ph.D. Executive Editor JECM Taipei Medical University Taipei, Taiwan.
The Future of Physics Publications in the American Physical Society Stewart C. Loken Lawrence Berkeley National Laboratory.
International Business Review The official journal of.
11/18/02Travis Brooks-ASIST The Unpublishing of High Energy Physics Travis Brooks SPIRES Scientific Databases Manager Stanford Linear Accelerator.
AsiaCrypt Program Committee Report Chi Sung Laih Nov.30~Dec.4,2003 Taipei, Taiwan.
« Open Archives (OA) and direct scientific communication (DSC) between scientists » Franck Laloë, laboratoire Kastler Brossel (ENS, Paris) Colloque « Evolution.
Proquest. Digital Commons/Institutional Repository at Pace.
Chais, Feb. 2006Communities1 Mechanisms of Internet-based Collaborations Complex Network Analysis Approach Reuven Aviv, Chais Research Center & Department.
ArXiv: Eprint Repository and OAI Data-Provider Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient.
Data Mining, Information Theory and Image Interpretation Sargur N. Srihari Center of Excellence for Document Analysis and Recognition and Department of.
Changing the Service Paradigm: the HEP- SPIRES Evolution Patricia A. Kreitz and Abraham Wheeler Stanford Linear Accelerator Center Library June 25, 2006.
ArXiv.org 250,000 documents 47,000 registered users 1 million+ downloads per year Cost Per Paper $10000 Commercial Journal $1000 Non-Profit Journal $10.
Introduction to Information Retrieval Got a question concerning literature? Ask! Marion Bierhahn (4630) Where is the library? Bldg:1d.
Electronic or Print: Are Scholarly Journals Still Important? Carol Tenopir, University of Tennessee, USA.
1 What is Scientific Productivity ? = INPUTOUTPUT Scientists Buildings Equipments Communication Tools Salary etc. Publications Patents Books Technology.
Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,
Information systems for HEP: INSPIRE, arXiv and more Annette Holtkamp CERN ASP 2012 Kumasi, Ghana, Aug 3, 2012.
Jake Blanchard – University of Wisconsin – August 2007.
1 IEEE Intelligent Transportation Systems (ITS) Council, International Collaboration to help People Travel Smarter Daniel J Dailey Ph.D. Past President.
Divide and Conquer: Challenges in Scaling Federated Search Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC SearchEngine Meeting.
Here comes your footer  Page 1 INDIAN CONTRIBUTION TO GREEN COMPUTING RESEARCH: A BIBLIOMETRIC STUDY By D. HEMAVATHY Under the Guidance of M. SURULINATHI.
The Use of Usage Michael J. Kurtz Harvard-Smithsonian Center for Astrophysics.
Solar Physics Board Meeting Rio de Janeiro July, 2009.
1 The Chemistry Preprint Server: An Experiment in Scientific Communication James Weeks, ChemWeb Inc. 84 Theobalds Road, Holborn, London WC1X 8RR
Self-archiving The term usually refers to the self-archiving of peer reviewed research journal and conference articles as well as theses, deposited in.
Open Access Ayesha Abed Library BRAC University October 30, 2011.
Manuel Calderón de la Barca Sánchez Professor of Physics September 17, 2013, Videoconference at ITESM for IFI students Physics Department Opportunities.
A Comparison of On-line Computer Science Citation Databases Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G. Councill, C. Lee Giles
THOMSON REUTERS RESEARCH IN VIEW Philip Purnell September 2011 euroCRIS symposium Brussels.
M. Barnett – Sept Summary, Budget and Personnel Issues.
The Division of Computing and Mathematics at UHCL Barrios Tecnologies April 25, 2008 Dr. Kwok-Bun Yue Professor and Chair, CS Chair, Division of Computing.
Science Publishing An Elsevier Perspective Presented by: Carl Schwarz Location: Moscow Date:9 December 2006.
Writing Scientific Papers Additional materials required for manuscript preparation and submission Prof Steve Leharne.
Journal candidates for conversion to OA JournalPublisherImpact Factor ArticlesHEP Articles HEP Fraction Phys.Rev.DAPS % Phys.Lett.BElsevier %
Martin Dodge CASA & Department of Geography, University College London Martin Dodge CASA & Department of Geography, University College London Background.
E.R. Prakasan Anil Sagar Anil Kumar V.L. Kalyane and Vijai Kumar by Scientific Information Resource Division Knowledge Management Group Bhabha Atomic.
Improving Postgraduate Learning – LaTeX Advanced Dr. WONG Tsz Yeung Department of Computer Science and Engineering, CUHK.
How Scientists Use Journals: Electronic and Print Carol Tenopir Donald W. King
1 Making a Grope for an Understanding of Taiwan’s Scientific Performance through the Use of Quantified Indicators Prof. Dr. Hsien-Chun Meng Science and.
Scientists’ Use of Journals: Differences (and Similarities) Between Print and Electronic Carol Tenopir Donald W. King, Randy Hoffman,
Open Archive Workshop, CERN th March 2001 Peer Review - the HEP View Mick Draper, CERN ETT Division
OAI and peer review Workshop (CERN 22/03/2001) Thomas Baron – Tibor Simko CERN Document Server: Validation & OAI WORKSHOP on the Open Archives initiative.
How to publish paper in journal. Step 1.Familiarize yourself with potential publications.
Patricia Renfro Columbia University January 28, 2010.
CNRS Documentation project : CCSD (Center for Direct Scientific Communication ) Htask meeting (Madrid) 06/12/ Lyon Daniel Charnay / Hélène Jamet.
The Structure of Scientific Collaboration Networks by M. E. J. Newman CMSC 601 Paper Summary Marie desJardins January 27, 2009.
Institutional Repositories and Licensing of Research Output advanced information management laboratory university of cape town department of computer science.
Publishing Undergraduate Research Electronically Dennis DeTurck Richard Griscom University of Pennsylvania ©Copyright Dennis DeTurck and Richard Griscom,
Warsaw University of Technology History since 1826 the main building of WUT Students: Academic staff: 2500 other staff: faculties Welcome.
Acad. Tengiz F. URUSHADZE.  International English languages journal “ANNALS OF AGRARIAN SCIENCE” was founded in issues were published during.
Warsaw University of Technology history since 1826 students academic staff2 500 other personnel faculties Warsaw University of Technology.
PCT Statistics PCT Working Group Tenth Session
Planning Research Outputs from your PhD
Assessment of the contribution of IIT’s:
CS 100 Mount Union College Fall, 2002
The Most Visited Countries
APS and INSPIRE Mark Doyle May 20, 2008.
Gwyn P. Williams and Kim Kindrew Pizza Seminar, September 18, 2013
Distribution of confirmed measles cases in the European Region,
XULA Digital Commons Purpose and Uses
Presentation transcript:

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 1 The Geography of arXiv.org Rui Carvalho and Michael Batty University College London

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 2 What is arXiv.org? Founded by Paul Ginsparg in ‘91 at LANL, moved to Cornell in ‘01; Self-archive of physics, maths and computer science preprints since ‘91; Quantitative biology added Sep ‘03; Papers have a time stamp, so authors can claim ownership; Typically, papers appear in refereed journals about 12 months after journal submission; Some data for calendar year ‘04: –total number of submissions (Aug ’91 through Dec ’04): –average submission rate (’04): 3644 papers/month –18 mirror-sites in 16 countries; –submission rates (’04): hep 20.5%, cond-mat 20.5%, astro-ph 18.9%, math 11.8 %, quant-ph 4.8%, gr-qc 4.3%, nucl 3.9%, physics(other) 3.1%, nlin 2.3%, cs 1.5%, q-bio 0.2%; –submissions by country (’00-’04): US edu and gov (27.5%), Germany (9.9%), Italy (6.3%), United Kingdom (5.8%), Japan (5.7%), France (5.6%), Russian Federation (3.2%);

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 3 arXiv monthly submission rate stats (Dec ’04) “hep” = High Energy Physics, “cond-mat” = Condensed Matter Physics, “astro-ph” = Astrophysics, cross-listings in clear

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 4 Why study the Geography of arXiv.org? Papers often submitted in LaTeX. LaTeX is a text-based document preparation system for high-quality typesetting (it’s not a word processor!); In that case, LaTeX source code available for download from arXiv.org; Typically (but not always!), LaTeX source encodes author and address data in specific fields; These fields can be parsed using custom scripts (e.g. written in Perl) to extract the geographical location of the authors; Problem: can we parse author/address fields, extract papers with one or more US authors, and map the zip codes in their addresses?

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 5 Problems with Zip Code extraction Identifying zip code look-alikes: –Easy: Kiev 03028, Ukraine Roma 00185, Italy –Not so easy: Iran Israel Could not process: –Physics Department, Northeastern University, Boston MA USA –address/author fields not found (as in PhD thesis or commentaries) Errors (found 6 in a random sample of 400 papers (1.5%)) –Fargo 58105, ND –Theoretical Division and Center for Nonlinear Studyes, Los Alamos, New Mexico~87545 –Zip not in database (found 1 in 400)

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 6 Mapping cond-mat in 2004 Total: 7957; one or more US authors: 2326 (29.2%); couldn’t process: 517 (6.5%)

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 7 The Geography of cond-mat

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 8 The Geography of cond-mat

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 9 The Geography of cond-mat

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 10 Rank-order plot of paper output by zip (preliminary)

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 11 Next Steps Extend study to larger sample of arXiv.org; Study spatial dynamics of arXiv papers for the period ’91—’05 (knowledge diffusion?); Compare with NSF, ARPA, etc data by state; Extract geography of collaboration networks.

AAG, Denver, 2005: Carvalho and Batty: The Geography of arXiv.org 12 To find out more Spatially Embedded Complex Systems Engineering (SECSE): members: UCL, Leeds, Southampton, Sussex