October 1, 1999 Two Catalysts for Qualitative Change Richard Snodgrass.

Slides:



Advertisements
Similar presentations
COUNTER: improving usage statistics Peter Shepherd Director COUNTER December 2006.
Advertisements

Partnering with Faculty / researchers to Enhance Scholarly Communication Caroline Mutwiri.
NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
© 2012 Association for Computing Machinery Intro to the ACM Digital Library February 24, 2012 Intro to the ACM Digital Library February 24, 2012.
PUBLICATIONS BOARD REPORT Joe Konstan SGB Publications Advisor.
In the Format section, we have activated the Bibliographic style drop down menu. From this page, you can choose a specific journal or format (e.g. BMC.
EndNote Web Reference Management Software (module 5)
" OPEN ACCESS INITIATIVE IN ONE OF THE PALESTINIAN UNIVERSITIES: BIRZEIT UNIVERSITY" Prepared by Mrs. Diana Sayej-Naser Library Director Birzeit University.
How the University Library can help you with your term paper
ISI Web of Knowledge – Innovative Solutions ISI Web of Knowledge / Web of Science – coming developments BIOSIS Archive Web Citation Index – New product.
Web of Science Search and Navigation in the Web of Knowledge
We have displayed the Browse publisher drop down menu. This You have full access to: list for an institution where all the material is included in the.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2005.
1 Adaptive Management Portal April
If We Build It, Will They Come (Eventually)? : Scholarly Communication and Institutional Repositories A Presentation to the NASIG 2005 Conference May 20.
ISP 433/533 Week 8 IR in libraries. Goal Universal Access to Information Vannevar Bush 1945 article Memex A memex is a device in which an individual stores.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Greenstone Digital Library Usage and Implementation By: Paul Raymond A. Afroilan Network Applications Team Preginet, ASTI-DOST.
Bibliometrics in Computer Science MyRI project team.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
PubMed/History; Accessing Full-Text Articles (module 4.4)
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Title of the Poster. “Digital library services and their impact with reference to a developing country: The case of the Faculty of Health Sciences library,
Managing your References Sue Bird Bodleian Bio- & Environmental Sciences October 2010.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
IL Step 1: Sources of Information Information Literacy 1.
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
SIG Orientation: Publications Bernard Rous Deputy Director of Publications October 25, 2009.
25-27 June 2003Clearing House Workshop, Paris1 Direct access to UNESCO Documents UNESDOC.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
Meta-Knowledge Computer-age study skill or What kids need to know to be effective students Graham Seibert Copyright 2006.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
Publisher’s Perspective: Digitization of print resources, and archiving of digital resources Judy Best, June 13, 2006.
GeNii New Contents Services of NII
GEO: a special collection for Earth Science community *Stefania Biagioni, *Silvia Giannini, **Cecilia Giussani *CNR-ISTI, **CNR-IGG Pisa, Italy GL13 Conference,
Technology Choices for the JSTOR Online Archive Presented by Chang Feng Department of Computer Engineering and Computer Science, University of Missouri-Columbia,
1 JACoW Joint Accelerator Conferences Website Presented by J. Vigen on behalf of John Poole, JACoW.
We have displayed the Browse publisher drop down menu. This You have full access to: list for an institution where all the material is included in the.
S YCAMORE S CHOLARS ISU Institutional Repository.
EndNote Web Reference Management Software (module 5.1)
Chapter 8 Browsing and Searching the Web. 2Practical PC 5 th Edition Chapter 8 Getting Started In this Chapter, you will learn: − What is a Web page −
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
2/08/2006 2:56 pm Introduction to the Digital LibrarySlide 1 of 40 Introduction to The Digital Library.
CiNii Articles is a service that provides information on scholastic articles, with an emphasis on Japanese papers. It allows users to find the articles.
By Addison, Jessica, and Lauren. Management The Mountain West Digital Library is a program of the Utah Academic Library Consortium (UALC) Three Governing.
Researching the African Diaspora and Creolité on the Internet Karen Hartman Information Resource Officer U.S. Embassy, Nairobi, Kenya February 5, 2008.
1 Overview Finding and importing data sets –Searching for data –Importing data_.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
IN THE NAME OF GOD. Reference Citing Software.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Using Content Presented by Karen Andrews Physical Sciences & Engineering Librarian, U.C. Davis Tuesday, September 13, :30-9:30 ASIDIC Fall 2005 Meeting.
Libraries in the digital age Collection & preservation for generational access part two The LOCKSS Program.
Definition, purposes/functions, elements of IR systems Lesson 1.
CS276B Text Information Retrieval, Mining, and Exploitation Practical 1 Jan 14, 2003.
Searching the Web for academic information Ruth Stubbings.
Gain Global Exposure: Partner with EBSCO to Promote your Scholarship
Reusing and repurposing metadata in a Current Research Information System and Institutional Repository 3 June 2010 Robin Armstrong Viner Cataloguing.
Using computers to search electronic databases
DIGITAL LIBRARY.
Quick guide < Keyword search >
Pricing from an open-access publisher’s perspective
IL Step 3: Using Bibliographic Databases
Preservation Strategy Proposals for Licensed Resources of CSDL
AUC’s Role In Facilitating Access To Knowledge In The Arab World
Presentation transcript:

October 1, 1999 Two Catalysts for Qualitative Change Richard Snodgrass

October 1, 1999 SGB MeetingRichard T. Snodgrass2 Location City and State, 2000 BCE Longitude, 1773 CE GPS + cell phone, 1999 CE

October 1, 1999 SGB MeetingRichard T. Snodgrass3 Confluences Underlying technologies –Highly accurate atomic clocks –Geosynchronous satellites –Advances in micro-circuitry –Proliferation of cell phones Demonstrated need Catalyst: companies able to produce in quantity at low price Qualitative change

October 1, 1999 SGB MeetingRichard T. Snodgrass4 The Vision The ACM Computing Portal A web-based repository of bibliographic information –contains information on all papers and books in the computing literature –contains a pointer to the digitized version, if available

October 1, 1999 SGB MeetingRichard T. Snodgrass5 Objectives Qualitatively increase the effectiveness of scientific research into computing Continue to place ACM as the premier scientific and educational organization for computing Increase service of ACM and the SIGs to the scientific community Provide a concrete illustration of the scope of computer science

October 1, 1999 SGB MeetingRichard T. Snodgrass6 Presentation Components –Bibliographic Entries –Abstracts and Keywords –Full Text and Bitmapped Images –Citation Linking Demonstration Realizing the Computing Portal –Revisit the components The Next Step

October 1, 1999 SGB MeetingRichard T. Snodgrass7 Step 1: Bibliographic Entries Collect all bibliographic entries from all computer science journals, conferences, workshops, technical bulletins, and books. –Over the period from 1940 to 2000, then continuing –Approximately 1M entries –Provide free searching on the web. –Provide citations in multiple formats: HTML, BiBTeX, refer, Word, XML,...

October 1, 1999 SGB MeetingRichard T. Snodgrass8 Step 2: Abstracts and Keywords Collect keywords, and later, abstracts, for all entries. Copyright restrictions on some abstracts?

October 1, 1999 SGB MeetingRichard T. Snodgrass9 Step 3: Full Text and Images Collect full text of each available paper and book for – use in searching –to develop classification maps and lexicons –other analyses

October 1, 1999 SGB MeetingRichard T. Snodgrass10 Step 3, cont. Encourage acquisition of digitized version of each paper in web-accessible digital libraries (e.g., the ACM DL) –Collect bit-mapped image of each page of each paper to retain formatting, equations, and figures. –Each paper can then be reproduced as an exact copy. –Can provide structure on full text sections, figures, citations in running prose

October 1, 1999 SGB MeetingRichard T. Snodgrass11 Step 4: Citation Linking Start with full text of paper’s bibliography. Out linking: identify bibliographic entry of papers referenced by the paper In linking: identify bibliographic entries of papers referencing the paper Use for citation analysis, knowledge diffusion studies

October 1, 1999 SGB MeetingRichard T. Snodgrass12

October 1, 1999 SGB MeetingRichard T. Snodgrass13 Demonstration

October 1, 1999 SGB MeetingRichard T. Snodgrass14 Papers with “wavelet”

October 1, 1999 SGB MeetingRichard T. Snodgrass15

October 1, 1999 SGB MeetingRichard T. Snodgrass16

October 1, 1999 SGB MeetingRichard T. Snodgrass17

October 1, 1999 SGB MeetingRichard T. Snodgrass18

October 1, 1999 SGB MeetingRichard T. Snodgrass19

October 1, 1999 SGB MeetingRichard T. Snodgrass20

October 1, 1999 SGB MeetingRichard T. Snodgrass21

October 1, 1999 SGB MeetingRichard T. Snodgrass22

October 1, 1999 SGB MeetingRichard T. Snodgrass23 INSPEC

October 1, 1999 SGB MeetingRichard T. Snodgrass24

October 1, 1999 SGB MeetingRichard T. Snodgrass25 Some Numbers Years remaining of lifetime for the average SIG $ per member (over required fund balance) $M total SIG fund balance (over required) $K per SIG fund balance (over required) SIG members lost last year (52.1K  46.8K, > 10%)

October 1, 1999 SGB MeetingRichard T. Snodgrass26 Step 1: Bibliographic Entries Propose that each SIG be responsible for ensuring correctness of relevant entries. relevance based on SIG interests reduce overlap between SIGs Software for provided to SIGs –data entry, validation, conversion –presentation (HTML, BiBTex, …, XML) –searching –precomputed lists (e.g., bibliographic home page for every author)

October 1, 1999 SGB MeetingRichard T. Snodgrass27 Stage 1: Bibliographic Entries 1M entries / 36 SIGs = 30K entries per SIG –e.g., SIGMOD: approximately 50K entries Many resources –DBLP: 2^17 (130K) entries –Propose that ACM donate the ACM Guide to Computing Literature: 300K entries –Collection of Computer Science Bibliographies: 930K entries

October 1, 1999 SGB MeetingRichard T. Snodgrass28 Step 2: Keywords and Abstracts May need copyright permission, negotiated by ACM HQ Collection of CS bibliographies has 100K abstracts

October 1, 1999 SGB MeetingRichard T. Snodgrass29 Step 3: Full Text and Bitmapped Images Full text is used for searching and citation linking in the Computing Portal. Bit-mapped images, stored in a Digital Library, is used to display and print actual paper. Propose SIGs fund populating entire ACM Digital Library. –PDF files containing encapsulated TIFF and OCRed full text –99% accuracy –$1.25 per page –Could go to SGML or XML, 99.9% accuracy: $8-$10 per page.

October 1, 1999 SGB MeetingRichard T. Snodgrass30 Populating ACM DL already in DL Journals: about 110K pages Conferences – : 76K pages –pre-1985: about 200K pages Newsletters –120K pages Total: 500K pages at $600K –$20K per SIG

October 1, 1999 SGB MeetingRichard T. Snodgrass31 Step 3: Full Text, cont. ACM papers: 500K pages, or about 40K papers –This represents perhaps 5% of total of 1M papers. For remaining conference proceedings and journals –Offer URL into their DL in exchange for full text, only for searching ACM Computing Portal provides valuable entry into their DL, enhancing their revenue stream. –Offer full CD Rom package at cost in exchange for inclusion in CD Rom and use of full text for searching. –Pay for digitization out of conference profits –SIGs pay for integration: $ $0.50 per page.

October 1, 1999 SGB MeetingRichard T. Snodgrass32 Step 3: Full Text, cont. Use standard IR indexing and search techniques on full text. Partner with DL and IR research efforts to come up with new search strategies. Search software provided to each SIG

October 1, 1999 SGB MeetingRichard T. Snodgrass33 Step 4: Citation Linking Manual out-linking –about $5-$6 per paper, or $0.30 per page of digitized text Can be done semi-automatically for much less, if the appropriate linking software is developed In-linking is simply a database search. All bibliographic entries must be present.

October 1, 1999 SGB MeetingRichard T. Snodgrass34 SIG-Specific Portals Possibly provide CD Roms, containing the relevant portion of ACM CS Portal, to members of the SIG especially useful for international members, or those working at home or traveling. Web-based Portals –some papers hosted on ACM server (clearly labeled as to source), with copies of papers provided for a fee –URL to other DLs

October 1, 1999 SGB MeetingRichard T. Snodgrass35 Open Architecture Free searching via web interface, including full text search, at ACM site and SIG portals Bibliographic data and full text available for other search engines, and for use in research in information retrieval, knowledge propagation, and other disciplines Portal should be available for mirroring, on both geographical and institutional bases Encourage digitization of corpus

October 1, 1999 SGB MeetingRichard T. Snodgrass36 Previous Efforts SIGDA CD Rom Project –9 CD Roms –$1.5M project –SGML, proprietary display software on CD Rom POPL CDRom –10 years of POPL, given out as a SIGPlan member benefit –PDF files Many conferences distribute CD-ROMs of papers

October 1, 1999 SGB MeetingRichard T. Snodgrass37 Previous Efforts, cont. SIGMOD Anthology –10 CD Roms (later 1-2 DVD Roms), $105K –SIGMOD, PODS, KDD, VLDB, ICDE, SSDBM, COMAD,... –SIGMOD Record, Data Engineering Bulletin –TODS, VLDB Journal –Given out as member benefit SIGMOD DiSC yearly CD-ROM –1999: 2 CD-ROMs, about $30K per year –all relevant conferences and workshops for that year, ancillary material, such as powerpoint presentations, audio, video –Given out as a member benefit (Consumer Reports model)

October 1, 1999 SGB MeetingRichard T. Snodgrass38

October 1, 1999 SGB MeetingRichard T. Snodgrass39 The Next Step SGB Portal Committee –Determine appropriate data format(s). –Negotiate coverage of corpus among SIGs. –Identify appropriate software (paper incorporation, search, citation linking, notification). –Specify new infrastructure to be developed. –Propose specific projects for opportunity fund consideration. –Work with Pubs Board e.g., interaction with Computing Reviews, CoRR. –Work with DL and IR research communities. –Identify new capabilities.

October 1, 1999 SGB MeetingRichard T. Snodgrass40 SGB Portal Committee Rick Snodgrass (University of Arizona, CS), chair Steve Cunningham (Cal State University-Stanislaus, CS) Carol Hutchins (Courant Institute of Math. Sci. Library) Bob Krovetz (NEC Research Institute) Michael Ley (University of Trier, CS) Andreas Paepcke (Stanford University) Kathy Preas (KP Pubs on CDROM) Bernie Rous (ACM Publications) Charles Viles (Univ. of North Carolina, Info and Lib Sci)

October 1, 1999 SGB MeetingRichard T. Snodgrass41 Individual SIG Commitments Collect and capture SIG-relevant bibliographic entries, abstracts, and keywords, in appropriate format. Allocate funds to populate the ACM DL: journals, conference and workshop proceedings, SIG newsletter. –Roughly $20K for each SIG –SIGDA matching funds: $50K Negotiate with steering committees of associated conferences and workshops.

October 1, 1999 SGB MeetingRichard T. Snodgrass42 SGB Opportunities Use opportunity fund to subsidize content development in areas associated with low-fund SIGs. –SIG would request allocation for CSP content development. –Propose that SGB would then control accessibility of created material. Use opportunity fund to subsidize infrastructure. –software acquisition and development/customization –Such proposals would originate at the CSP editorial board.

October 1, 1999 SGB MeetingRichard T. Snodgrass43 ACM HQ Commitments Donate entries from ACM Guide to Computing Literature. Negotiate cross-use agreements with associated societies. Acquire full text of books copyrighted by ACM. Provide hardware and software to host CSP. Provide staff to manage CSP, with content provided by SIGs.

October 1, 1999 SGB MeetingRichard T. Snodgrass44 ACM HQ Opportunities Integrate CSP with CoRR Provide print and CD-ROM versions of the expanded ACM Guide to Computing Literature Fully populated DL Increased visibility of ACM

October 1, 1999 SGB MeetingRichard T. Snodgrass45 The ACM Computing Portal Free searchable access to the entire computer science corpus Links to a fully populated ACM DL and to other DLs Capability to purchase papers and to register queries Possibly ancillary SIG-provided benefits, such as CD-ROMs and SIG-specific portals

October 1, 1999 SGB MeetingRichard T. Snodgrass46 Confluences Underlying technologies –Inexpensive scanning, OCR, disk space, high capacity CD-ROM and DVD-ROM, and widely available www access Demonstrated need Catalysts: SIG Governing Board, ACM Council, ACM Publications Board, HQ staff Qualitative change