Tema 3 INEbase history Statistical books 1858-1997 available on the web Celia Santos

Slides:



Advertisements
Similar presentations
Step 1 Start your web browser (Internet Explorer or Firefox). Step 2 Type: in the Address box Step 3 Press Enter on the keyboard.
Advertisements

Content 15.1 Basic features Types of database Data structures 15.2 Creating a database Screen layout Entering data Editing data 15.3 Displaying data Searching.
High level QA strategy for SQL Server enforcer
Make the CUT April 30, 2014 Required Readings In House, AERO, Publishers, Ares Supplementary Readings ACE, JMC Transcription Services at.
JobTracker™ A Job Tracking System for Architects & Engineers Produced by LA Solutions.
Disseminating Statistics: Internet and Publications INE – Madrid, 3-5 March 2008 Ulrich Wieland, Eurostat How to link publications and Internet in order.
Building The Rare book Collection at Rijeka University Library in the Digital Age Ines Cerovac, Senka Tomljanović, Rijeka University Library Seminar The.
© 2011 Adobe Systems Incorporated. All Rights Reserved. Adobe Confidential. Kiran Kaja | Accessibility Engineer Ensuring Accessibility in Document Conversion.
Selecting Preservation Strategies for Web Archives Stephan Strodl, Andreas Rauber Department of Software.
Previous Lectures: Planning of a Web site: Discussing the strategic issues of Web site engineering process –Models used for Web site planning –Compare.
Project Report1 Dave Inman Project report. Project Report2 Ways to write a report Top down: Write the structure of the report (maybe use the web templates.
Disseminating statistics: Internet and Publications. A strategy for publications Madrid, 3-5 March 2008 A strategy for publications. Part I Maria-Luz SEOANE.
Web Development & Design Foundations with XHTML
Online resources in TCD Library:
Overview of New Behind the Blackboard for Blackboard Customers APRIL 2012 TM.
Disseminating statistics: Internet and Publications Madrid, 3-5 March 2008 Digital Preservation (E-Archiving) Marta Melgar García
Introduction to Genealogy By Al Barron Slidell Branch Library November 17, 2004.
11 The Ultimate Upgrade Nicholas Garcia Bell Helicopter Textron.
Web Developer & Design Foundations with XHTML
OARE Module 3: OARE Portal.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
1 Web Developer Foundations: Using XHTML Chapter 8 Web Site Development.
TPT EPAF Temporary Part-Time Rehire EPAF. What is a TPT EPAF? The EPAF for Temporary Part-Time (TPT) is an electronic process allowing for paperless personnel.
1 Please log-in to your Comprehensive Plan  Visit:  Under “I Would Like To”, select “Access My PDE Applications”
Statistical Databases: A short review Slide: 1 Demonstration of the Prototype model for ECA Statistical Database Statistical Database Application March.
Norges bank’s bicentenary project Producing the commemorative publication of the Swiss National Bank The Swiss National Bank
25-27 June 2003Clearing House Workshop, Paris1 Direct access to UNESCO Documents UNESDOC.
UDoCument: Electronic Scrapbook for the Information Era Soufiane Berouel, Undergraduate Student Supervised by Prof. Lily Liang Department of Computer Science.
PubMed/History, Advanced Search and Review (module 4.3)
POPULATION AND HOUSING CENSUSES IN SLOVAKIA ON THE WEBSITE Miroslav Hudec Pavol Büchler INFOSTAT – Bratislava MSIS Geneva
The Development of the Ceramics and Glass website Mia Ridge Museum Systems Team Museum of London.
LLP-LDV/TOI/07/IT/016. F-MU.S.EU.M. PROJECT: STATE OF THE ART AND INTERMEDIATE REPORT TO SUBMIT AT 15 DECEMBER Marco Merlini F-M U.S.EU.M. (Form Multimedia.
1 UNOG Library Digitization and Microform Unit (DMU) – December 2009.
PatentScope - Electronic Publication World Intellectual Property Organization.
Rev.04/2015© 2015 PLEASE NOTE: The Application Review Module (ARM) is a system that is designed as a shared service and is maintained by the Grants Centers.
© 2012 Adobe Systems Incorporated. All Rights Reserved. Copyright 2012 Adobe Systems Incorporated. All rights reserved. ® DESIGN PROJECT PRODUCTION PHASES.
Consultative process for finalizing the Guidance Document to facilitate the implementation of the clearing-house mechanism regional and national nodes.
United Nations Economic Commission for Europe Statistical Division International migration data sharing: What can be done in the UNECE region? Paolo Valente.
PRESERVING YOUR PAST AND YOUR PRESENT FOR THE FUTURE.
GALILEO Tutorial ProQuest Search Basics Press a key or click the mouse button to advance to the next slide. July 2008.
Ian F. C. Smith Preparing a thesis document. 2 Disclaimer This is mostly opinion. Suggestions are incomplete. There are other ways to prepare a thesis.
We now view some Internet-based sources of E-books besides those available from HINARI. Using the Internet Addresses (url) at the top of the slides, you.
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
Ip4inno 1 Module 3B: Part 1 Using the free worldwide patent database fast & easy patent searching EPO Roland Feinäugle.
Pre-Course Assignment
U.S. Department of Energy Consolidated Audit Program
Egyptian Language School
Reviewing the concept of the UNECE Statistical Yearbook
Dawsonera guide.
Conducting the performance appraisal
by Dr. Nikolas Stylianides
Conducting the performance appraisal
OPERATE A WORD PROCESSING APPLICATION (BASIC)
DESIGN PROJECT PRODUCTION PHASES DEFINE, STRUCTURE, DESIGN, BUILD AND TEST, DELIVERY ® Copyright 2012 Adobe Systems Incorporated. All rights reserved.
Web Site Project Management
PubMed Database Interface (Basic Course Module 4 Part B)
INE´s editorial policy and handbook
IMAODBC, The Hague, 5-9 sept 2005
DESIGN PROJECT PRODUCTION PHASES DEFINE, STRUCTURE, DESIGN, BUILD AND TEST, DELIVERY ® Copyright 2012 Adobe Systems Incorporated. All rights reserved.
Sharing of Eurostat predefined tables
How to Use CheckMyWork Guide
Sharing of Eurostat predefined tables
EUROPEAN STATISTICS ON THE INE WEBSITE
RAMON Re-engineering An Update
DISSEMINATION WORKING GROUP Luxembourg, November 2011 Using Statistics Explained to produce the Eurostat Yearbook Jukka PIIRTO.
Portrait of the Regions State of Play
Research Paper Step-by-step Process.
Dissemination and Communication Introductory course
ECONOMIC CLASSIFICATIONS Advanced course Day 1 – third afternoon session Tools for assisting the use of classifications Zsófia Ercsey - KSH – Hungary.
The new approach to publications in 2011 and beyond
Presentation transcript:

Tema 3 INEbase history Statistical books available on the web Celia Santos

Tema : what shall we do with past information only available in printed format?  Target: opening up to the public historical collection of INE publications only available on paper INEbase history: background INEbase history: Statistical books available on the web 1996: The INE joins the Internet 2000: INEbase birth  all statistical production offered on the Internet

Tema 3 INEbase history: a new section of INEbase Different alternatives: Tables in pc-axis format Complete PDF versions of the books INEbase history INEbase history: Statistical books available on the web

Tema 3 Project phases: Phase 1. 2nd half of –What should be published? Most symbolic and representative volumes of public statistical activity: Statistical Yearbooks (1858 – 1997) Population Censuses (1900 – 1970) –Outsource scanning ( + de 100,000 pages) –Outsource the software development Phase 2. 1st half of 2005 –Cataloguing starts –Software improvements suggested by use –20 publications catalogued before publishing INEbase history: Statistical books available on the web

Tema 3 Project phases: Phase 3. July 2005 –Internet launch takes place with 20 Yearbooks and 1 Census Phase 4. October 2006 –Cataloguing and web publications of 78 Yearbooks and 9 Censuses (34 volumes) INEbase history: Statistical books available on the web

Tema 3 Project phases: INEbase history: Statistical books available on the web Phase 5. Under development, 2007 Incorporation of new publications in st six months:  Scan the Agrarian Census and VS statistics  Programme adaptation 2nd six months: cataloguing & publication

Tema 3 1. Scanning and OCR Scanning using the originals –Unbinding (old and non-unique) –Guillotining (repeated and unimportant) –Microfiche (rare, old copies) TIFF files obtained OCR programme used to generate txt files  used for search engine Once PDF file is obtained  ready to be catalogued The technical process in 3 steps INEbase history: Statistical books available on the web

Tema 3 2. Cataloguing books into the system : “cataloguer” role INEbase history: Statistical books available on the web 1st step: create index with categories until we get to the final node: the statistical tables 2nd step: associate one or more PDF documents to each node

Tema 3 INEbase history: Statistical books available on the web How is cataloguing done? Practical example Creation of a virtual book: Statistical Yearbook 2010 Node blocked

Tema 3 INEbase history: Statistical books available on the web Creation of the index publication Creating as many chapters as needed

Tema 3 INEbase history: Statistical books available on the web Creation of the tables and association to the corresponding PDF-doc.

Tema 3 INEbase history: Statistical books available on the web Recreating the hierarchical tree All the publication´s documents appear associated to their corresponding table Cataloguer’s work ends here Nodes unblocked

Tema 3 3. Revision before publishing Cataloguing should be revised before being published Who revises?  there is a specific role, the “proof-reader”, but…. this role has not really been used and …in reality another cataloguer does the revision Once the proof-reading work is finished, the book is ready for publication INEbase history: Statistical books available on the web Proof-reader’s work ends here

Tema 3 INEbase history: Statistical books available on the web 4. Publisher Main task: to publish books; other tasks: user and trasmission control, nodes translation Blocked node Published node Unblocked node Book ready to be shown on the Internet And the translation process begins

Tema 3 INEbase history: Statistical books available on the web Cataloguing Server Dissemination Server Trasmission process: synchronization of servers This step might not be needed

Tema 3 INEbase history: Statistical books available on the web 5. Visualisation on the Internet:

Tema 3 INEbase history: Statistical books available on the web Yearbooks ordered by decades

Tema 3 INEbase history: Statistical books available on the web On the dissemination server The hierarchical tree ….. On the cataloguing programme

Tema 3 INEbase history: Statistical books available on the web And just a click on the required table And a 9 page PDF document is shown

Tema 3 INEbase history: Statistical books available on the web Anything else to be taken into account? Search engine Change language No. of tables Size of pdf file

Tema 3 INEbase history: Statistical books available on the web The search engine: INEbase history has its own Direct access to the pdf document

Tema 3 INEbase history: Statistical books available on the web The search engine is based on the table titles (sorry, only in Spanish) and the hierarchical tree (in English as well) Of course, you might as well use INE’s general search engine:

Tema 3 INEbase history: Statistical books available on the web Population Censuses: everything is also valid

Tema 3 1- Economic data : Initial scanning stage: 12,000 Euros, 110,000 pages External development: 90,000 Euros INEbase history: Statistical books available on the web Some interesting data… 3- Amount of scanned pages Yearbook: 70,000 pages Census: 30,000 pages Total: 100,000 pages 2- Deadlines Scaning + development programme: 6 months Cataloguing: 20 months

Tema 3 4- Personnel used: Cataloguing: 0 – 3 Recording assistants Indexes translator: 1 trainee Publisher: 1 – 2 Statisticians IT support team INEbase history: Statistical books available on the web Some interesting data… 5- How many people use INEbase History? Page views in october: 77,623 (1.2 % of total)

Tema 3 IT infrastructure: a reasonably simple system: A cataloguing server houses a copy of the work from the database and the collection of PDF pages; multiple cataloguer PCs provided with a "client" application connect to the server One of the components of the family of web servers at houses the dissemination server (the software, plus a copy of the database and a copy of the collection of PDF pages). This is the system that serves Internet files There are copy and safety mechanisms between one environment and the other The environment is similar to a content management programme INEbase history: Statistical books available on the web IT data

Tema 3 IT infrastructure: a reasonably simple system: Client programmes developed with Microsoft.Net. Server programme developed with Java. Catalogue and dissemination database, Oracle 9i. Programmes for working with PDF files obtained from a manufacturer specialised in this kind of software. Conceptual design. Setting requirements, selection of platforms: National Statistics Institute. Scanning of originals: Proco S.A. Tecnological partner development: Sopra Group. INEbase history: Statistical books available on the web IT data

Tema 3 Thank-you for your attention! Any questions?