ETD 2005 International Accesses to a Digital Library of ETDs.




ETD 2005 Ana Pavani Departamento de Engenharia Elétrica Pontifícia Universidade Católica do Rio de Janeiro

Presentation outline
- Profile of the digital library
- Generation of data
- Combination and analysis of data – interesting results
- Next steps

Profile of the digital library
- Beginning of the collection – 2nd semester of 1995
- Items to start the collection – courseware (texts, exercises, technical manuals, tests, etc.)

The digital library is part of a system that:
- Is an LMS (Learning Management System)
- Has administrative functions that allow data exchange with the university's administrative system
- Is linked (in both directions) to CNPq's Lattes Platform (curricula database with more than 595 K CVs)
- Allows the control of series collections
- Is multilingual and has interfaces in 3 languages

Evolution of the collection:
- Administrative documents
- Preprints, published papers & online articles
- Interactive courseware
- ETDs (2000)
- Online journals (2003)
- Senior projects (2003)
- Online bulletins – distributed through mailing lists, archived and published automatically (2004)
- Books (Oct. 2005)

Numbers of titles in the collection:
- Courseware (many types) – 2,700+
- Administrative documents – 33
- Technical documents – 94
- ETDs – 1,873 (PUC-Rio) + 31 (UNICAP)
- Preprints, published papers & online articles – 280
- Senior projects – 305
- Online journals – 3 (+ 1 in Oct in Dec. 2005)
- Online bulletins – 2
- Books – 1 (to be published in Oct. 2005)
- Total number of digital objects (DOs): 16,400+

Technological characteristics:
- Machine – IBM RS/6000
- Operating system – IBM AIX
- Web server – Apache
- DBMS – IBM DB2
- The Apache log contains info on accesses to ALL digital contents on the system, besides all transactions that users perform (clicking buttons, reading posts, reading help pages, etc.). Data on transactions with contents must be extracted from the server log to generate the numbers to be analyzed.

Generation of data
- Data have 2 different natures: production and accesses
- Production data come from functions of the system that are not related to the Apache server but only to the DB

[Example table of production data]

(*) PUC-Rio started requiring ETDs in Aug. 2002; (*) UNICAP does not require ETDs.

Access data are obtained from both the Apache Server log and the DB:
- Logs are mined (according to the following definitions) and the results are stored on the DB
- Mined data are combined with production data (metadata) already in the database (types of contents, authors, programs, areas of knowledge, dates, countries, etc.) to yield results

Definitions for mining the log
- When access statistics came into discussion, it was necessary to define how data should be mined from the log and how they should be combined afterwards
- The definitions follow – (M) mining definitions and (C) combining definitions

(M) Visits and complete visits
An ETD can have one or many digital objects. The number of visits is the sum of all accesses to all digital objects in a given month. A complete visit is a set of visits to all digital objects from a country in a given month.
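The two definitions above could be sketched as follows. This is only an illustration under stated assumptions: the record layout `(month, etd_id, digital_object, country)`, the `ETD_OBJECTS` catalog and the function name are hypothetical, and a "complete visit" is read here as "every digital object of the ETD was accessed from that country in that month".

```python
from collections import defaultdict

# Hypothetical catalog: ETD id -> the set of its digital objects.
ETD_OBJECTS = {
    "etd-1": {"ch1.pdf", "ch2.pdf"},
    "etd-2": {"full.pdf"},
}

def visits_and_complete_visits(accesses, month):
    """accesses: iterable of (month, etd_id, digital_object, country)."""
    visits = defaultdict(int)   # etd_id -> sum of accesses to all its objects
    seen = defaultdict(set)     # (etd_id, country) -> objects accessed
    for acc_month, etd, obj, country in accesses:
        if acc_month != month:
            continue
        visits[etd] += 1
        seen[(etd, country)].add(obj)
    # Complete visit: the country reached every digital object of the ETD.
    complete = {key for key, objs in seen.items()
                if objs >= ETD_OBJECTS[key[0]]}
    return dict(visits), complete
```

Keying completeness on the country rather than the IP address follows the (M) Country x IP address definition.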

(M) Country x IP address
The decision to use the country and not the IP address to establish a visit was based on the fact that the visits to an ETD can be made at different times (and reconnecting may assign a new IP address) and from different locations (with fixed IP addresses).

(M) Counting visits from the same IP address
Visits from the same IP are counted individually because a network with many machines can be identified by the IP address of its firewall.

(M) Counting visits to restricted digital objects
Some ETDs are totally or partially restricted – approximately 30% have some type of permanent or temporary restriction. Metadata, abstracts included, are publicly available for all of them. It was decided that attempts followed by denials of access would be counted as accesses. Note: this is stated in the help pages of the system; it is suggested that authors consider allowing their contents to become public if many attempts occur.
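A minimal sketch of this rule, assuming denials show up in the Apache log as HTTP status 401 or 403 and successful deliveries as 200 or 206. The slides do not say which status codes the real system uses, so the set below is an assumption.

```python
# Assumption: 200/206 mean the object was served, 401/403 mean access was
# denied. Both groups count as accesses under the definition above.
COUNTED_STATUSES = {200, 206, 401, 403}

def counts_as_access(status):
    """Return True if a log line with this HTTP status counts as an access."""
    return status in COUNTED_STATUSES
```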

(C) Lines to mine
Since the interest was in accesses to digital objects, the decision was to get the lines with extensions .dcr, .doc, .htm, .pdf, etc. All possible extensions on the database are considered, as long as the corresponding item is cataloged on the digital library, so that an occasional static HTML system page is not counted.
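The selection rule might look like the sketch below. The extension set is only the part of the list shown on the slide (it ends with "etc."), and `CATALOGED` with its single path is a hypothetical stand-in for a lookup in the digital library's database.

```python
from pathlib import PurePosixPath
from urllib.parse import urlsplit

# Extensions named on the slide; the real list is longer ("etc.").
EXTENSIONS = {".dcr", ".doc", ".htm", ".pdf"}

# Hypothetical stand-in for "the item is cataloged on the digital library".
CATALOGED = {"/etd/1234/chapter1.pdf"}

def should_mine(request_path):
    """Keep a log line only if it points at a cataloged digital object."""
    path = urlsplit(request_path).path          # drop any query string
    ext = PurePosixPath(path).suffix.lower()
    return ext in EXTENSIONS and path in CATALOGED
```

The catalog check is what keeps a static system page such as a help file out of the counts, even though its extension is on the list.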

Observations
(1) Statistics were planned on a monthly basis. The model treats data as sequences of points with discrete-time intervals of a month. Past months' data are unchanged and the current month is updated according to the Update definition.
(2) IPs are resolved using a plug-in called GeoIP Free that is available with AWStats.

(C) Information to get from a log line
The month and the year are extracted along with the identification of the digital object and the country of the IP address that accessed the digital object.
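As an illustration, the extraction could be done on a line in Apache's Common Log Format as sketched below. The log format is an assumption (the slides do not show a sample line), and `lookup_country` is a hypothetical placeholder for the GeoIP resolution, not the AWStats plug-in's actual interface.

```python
import re
from datetime import datetime

# Common Log Format: host ident user [timestamp] "request" status bytes
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+'
)

def lookup_country(ip):
    # Hypothetical placeholder for the IP-to-country resolution.
    return {"200.20.0.1": "BR"}.get(ip, "unknown")

def parse_line(line):
    """Return (year, month, digital_object, country), or None if no match."""
    m = LOG_RE.match(line)
    if m is None:
        return None
    ts = datetime.strptime(m.group("ts"), "%d/%b/%Y:%H:%M:%S %z")
    return (ts.year, ts.month, m.group("path"), lookup_country(m.group("ip")))
```

Only the four fields named in the definition are kept; the day, the time and the IP address itself are discarded once the country is known.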

(C) Update of the DB
The lines are read every hour on the hour (00:00, 01:00, etc.); only the incremental lines are mined. Accesses are summed for each month-year-DO-country, so the table is not very big – in the first 6 months of 2005 the average number of lines per month was 10,000.
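A rough sketch of the incremental update, assuming the position reached in the log file is remembered between hourly runs and that the counts live in an in-memory `Counter` rather than in DB2. Function names and the record layout are hypothetical.

```python
from collections import Counter

def read_new_lines(path, offset):
    """Return the complete lines appended since `offset` and the new offset."""
    with open(path, "rb") as fh:
        fh.seek(offset)
        data = fh.read()
    end = data.rfind(b"\n") + 1      # ignore a trailing half-written line
    lines = data[:end].decode("utf-8", errors="replace").splitlines()
    return lines, offset + end

def update_counts(counts, records):
    """Sum accesses per (year, month, digital_object, country)."""
    for year, month, obj, country in records:
        counts[(year, month, obj, country)] += 1
    return counts
```

Because one row is kept per month-year-DO-country combination, repeated accesses only increase a counter, which is why the table stays small.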

(C) When to start computing
The log of the Apache Server started being saved on Jun 01, 2004. So, either this date could be used or a later one, for example Jan 01. The decision was to use all available monthly logs. When the process started, some days of offline processing were required. Afterwards the update became automatic according to the Update definition.

Observations
(1) Maybe these were not the best definitions – we are willing to discuss alternatives!!
(2) The (original) logs are stored and saved offline in case some change in the mining strategy is decided (we have not sunk the ships!!).

Definitions for computing statistics
- By author
- Visited ETDs by year, month and country
- Visited ETDs by country, month and year
- 25 most visited ETDs (on the system = PUC-Rio + UNICAP)
- 20 most visited ETDs by institution

- 10 most visited ETDs by graduate program
- Visited ETDs by institution, program, year and month
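Rankings such as "25 most visited ETDs" can be derived from the monthly counts. The sketch below assumes the counts are keyed by `(year, month, etd_id, country)`; the key layout and the function name are illustrative, not taken from the system.

```python
from collections import Counter

def top_etds(counts, n):
    """counts: {(year, month, etd_id, country): visits} -> n most visited ETDs."""
    totals = Counter()
    for (_year, _month, etd_id, _country), visits in counts.items():
        totals[etd_id] += visits
    return totals.most_common(n)
```

Restricting the input to one institution or one graduate program before calling the function would give the per-institution and per-program variants of the list.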

Initial Results

[Chart] Access to ETDs is increasing (Sep 28, 2005). May to Sep: # ETDs up 13%; # accesses up 54.6%.

[Chart] Number of total visits is increasing (Sep 28, 2005). May to Sep: # ETDs up 13%; # accesses up 54.6%.

[Chart] Accumulated average of total visits is increasing (Sep 28, 2005). May to Sep: # ETDs up 13%; # accesses up 54.6%.
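The "accumulated average" in the last chart is presumably the running mean of the monthly totals up to each month; that reading is an assumption. Under it, the series could be computed as below (the input values in any call are whatever monthly totals the DB holds; none are given on the slides).

```python
from itertools import accumulate

def accumulated_average(monthly_totals):
    """Running mean: average of all monthly totals up to and including month i."""
    running_sums = accumulate(monthly_totals)
    return [total / (i + 1) for i, total in enumerate(running_sums)]
```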

But…
Brazil + pt speaking + es speaking = 75%
Brazil + US + pt speaking + es speaking = 87%
Brazil accounts for 55% of the accesses since Jun 01, 2004 (Sep 28, 2005)

On Jun 15, 2007, the numbers of ETDs in Iberian languages on the NDLTD DB were as listed below. Brazilian ETDs were 83% of all ETDs in Iberian languages (total number 13,369).

Institution | Country | Language(s) | Number
National Library | Portugal | Portuguese | 185
IBICT (includes PUC-Rio) | Brazil | Portuguese | 11,118
UAB | Spain (Catalunya) | Catalan or English or Spanish | 1,011
UIB | Spain (Catalunya) | Catalan or English or Spanish | 22
UJI | Spain (Catalunya) | Catalan or English or Spanish | 42
UOC | Spain (Catalunya) | Catalan | 1
UPC | Spain (Catalunya) | Catalan or English or Spanish | 415
UPF | Spain (Catalunya) | Catalan or English or Spanish | 67
URL | Spain (Catalunya) | Spanish | 1
URV | Spain (Catalunya) | Catalan or English or Spanish | 106
UdG | Spain (Catalunya) | Catalan or English or Spanish | 131
UdL | Spain (Catalunya) | Catalan or English or Spanish | 70
UV | Spain (Catalunya) | Catalan or English or Spanish | 200
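The 83% figure can be checked directly from the numbers on this slide. The short script below only re-adds the listed values; the variable names are illustrative.

```python
# Numbers of ETDs per institution, as listed on the slide.
etds = {
    "National Library": 185, "IBICT": 11118, "UAB": 1011, "UIB": 22,
    "UJI": 42, "UOC": 1, "UPC": 415, "UPF": 67, "URL": 1, "URV": 106,
    "UdG": 131, "UdL": 70, "UV": 200,
}

total = sum(etds.values())                     # matches the stated 13,369
brazil_share = 100 * etds["IBICT"] / total     # Brazilian share in percent
print(total, round(brazil_share, 1))
```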

Percentage of visits from Brazil is decreasing (Sep 28, 2005)

Accumulated percentage averages of visits from Brazil (Sep 28, 2005)

Total accesses, top 10 countries (Sep 28, 2005)
# identified countries; unidentified countries + satellite access hosts

Country | Visits
Brazil | 12,845
USA | 2,795
Portugal | 1,489
Spain | 679
Peru | 652
Mexico | 432
Chile | 364
France | 245
Colombia | 225
Argentina | 224

Some interesting results
- Some ETDs are permanent 'best sellers'
  - They are on specific subjects (examples: a specific philosopher and history of modern architecture in Brazil)
  - They are linked from sites on the subjects (examples: the first from the US & Brazil and the second from Germany)
  - They are accessed from different countries
- Some topics are permanent 'best sellers' (example: energy)

- Some ETDs are temporary 'best sellers' – this seems to happen when they are displayed by the 'last published ETDs' functions (system and graduate program)
- Some graduate programs are permanent 'best sellers'
  - They research topics that are very specific to the country (examples: education and history of culture)
  - They are indexed in other sites and/or digital libraries (example: Universia in Spain for social sciences and humanities)
  - They are accessed from different countries

The 25 most visited ETDs have a large number of visits. No average is lower than 100 visits per month.

Next steps
- Find out how readers got to ETDs (BDTD, NDLTD, SCIRUS, etc.) – an online survey is planned
- Interview faculty to check if some ETDs are recommended reading in courses
- Gather more data and analyze in a 'more scientific' manner (must find a student!!)

- Develop additional functions comparing accesses with production
- Extend to other digital contents (at the moment only ETDs and online journals have access statistics)

Thank you! Muito obrigada!