DIGITIZATION OF GOVERNMENT INFORMATION RESOURCES : A CASE STUDY OF CENTRAL SECRETARIAT LIBRARY BY S. MAJUMDAR DIRECTOR CENTRAL SECRETARIAT LIBRARY BY S.

Slides:



Advertisements
Similar presentations
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Advertisements

ESDS Qualidata Libby Bishop, ESDS Qualidata Economic and Social Data Service UK Data Archive ESDS Awareness Day Friday 5 December 2003Royal Statistical.
Paperless & E-Learning Environments SCHOOL DIGITAL LIBRARY Development, Configuration, maintenance, CD ROM Publishing. E-LEARNING DRIVEN WEBSITES ENVIRONMENTS.
CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
Capacity Building Passing on the Experience Dr. Noha Adly World Digital Library Arab Peninsula Regional Group meeting.
GOVERNMENT LIBRARIES AND ELECTRONIC RESOURCES BY S. MAJUMDAR UNIVERSITY LIBRARIAN UNIVERSITY OF DELHI AND FORMER DIRECTOR CENTRAL SECRETARIAT LIBRARY BY.
DIGITIZATION OF GOVERNMENT INFORMATION RESOURCES : A CASE STUDY OF CENTRAL SECRETARIAT LIBRARY BY S. MAJUMDAR DIRECTOR CENTRAL SECRETARIAT LIBRARY BY S.
Digital Content Solutions Digital content management technology has transformed the way to manage content and knowledge, in this knowledge era. Research.
LEARNING RESOURCE CENTER Carolyn C. Oakley, Director.
Got Paper? Thinking about going paperless or at least as paperless as possible? NAMVBC-2013.
Building The Rare book Collection at Rijeka University Library in the Digital Age Ines Cerovac, Senka Tomljanović, Rijeka University Library Seminar The.
Digitisation projects and preserving digital documents in Hungary Current trends in digitisation DELOS, Turin, 3-4. febr István Moldován Hungary,
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
EAD in A2A Bill Stockting, Senior Editor A2A and EAD Working Group: Central Archives of Historical Records, Warsaw, 26 April 2003.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
Online training for professionals: how this is being addressed by AccessIT Adam Dudczak Poznań Supercomputing and Networking Center
Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
N ew Stage of the Digital Library of the National Diet Library of Japan: Digitization of Japanese Books and Digital Archive Portal By Kazuharu Honda Assistant.
Organic Digital Library Aboriginal Studies Internet Librarian International March 18, 2002 Darlene Fichter, Data Librarian University of Saskatchewan library.usask.ca/~fichter/
GOVERNANCE ELECTRONIC. ” “ E-Governance is the application of Information and Communication Technology (ICT) for delivering government services, exchange.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
1 Accessible Italy The new provisions to support the access to Information Technologies for the disabled Domenico Gargani Research Unit Ministry for Innovation.
Mark Phillips Digital Projects Department University of North Texas Annexation of Texas Project.
Changing the culture: Ethiopia’s commitment to dissemination and the multi-media approach By Yakob Mudesir Seid
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Ela Majocha Toby Burrows ARC Network for Early European Research University of Western Australia Building e-Research infrastructures for Collaboration.
Introduction to Worldcat (OCLC) Presentation for PGDILIT Course By Dr.D.N.Phadke Coordinator,PGDILIT Contact: Mob
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
Digitization Panel August 12, 2010 Christopher C. Brown, coordinator Mike Culbertson, Colorado State U. James Mauldin, GPO.
Digitising Journals, March 2000, Copenhagen Astrid Wissenburg Information Services and Systems King’s College London
What Agencies Should Know About PDF/A September 20, 2005 Susan J. Sullivan, CRM
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation.
May 16, 2007 Enterprise Content Management- Paperless Government: Association of Governmental Accountants Enterprise Content Management- Paperless Government:
11-15 April 2011 Mauritius Institute of Health S.S.Pillai
TECHNOLOGY SUPPORT FOR ESSSS Progress, Issues, and Challenges Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Technology Choices for the JSTOR Online Archive Presented by Chang Feng Department of Computer Engineering and Computer Science, University of Missouri-Columbia,
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Eurocris Membership Meeting Lisbon 9-11 November 2005 Sérgio Tenreiro de Magalhães Luís Amaral University.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
P. Schirmbacher Humboldt-Universität zu Berlin The Changing Process of Scholarly Publishing or the Necessity of a New Culture of Electronic.
Robin L. Dale Director of Digital & Preservation Services LYRASIS Getting Started with the Digital Commonwealth.
Digitization An Introduction to Digitization Projects and to Using the Montana Memory Project.
International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library | Lisbon – Portugal DIGITAL ARCHIVE OF PORTUGUESE ART.
1 UNOG Library Digitization and Microform Unit (DMU) – December 2009.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman - Jordan 16 – 19 May 2011 Determination of the scope and form of.
PAN-European Exploitation of the Results of the Libraries Programme - EXPLOIT German Libraries Institute Berlin EXPLOIT 1 Electronic library materials.
1/16/2016I. Revels Digital Imaging Workshop 1 Selection Considerations For Digital Imaging Projects.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Changing Role of Librarians in Digital Era and Need of Professional skills, Efficiency & Competency By Goutam Biswas
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
FACES General Overview ViRR (Virtueller Raum Reichsrecht) Software Solutions Kristina Büchner and Bastien Saquet Contact:Kristina Buechner:
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
The Role of Libraries and Information Centres in the Global Forest Information Service Roger Mills Oxford Forestry Institute August 2000.
MICHAEL and the European Digital Library: promoting teaching, learning and research The MICHAEL Project is funded under the European Commission eTEN Programme.
Dissemination Working Group
DIGITAL LIBRARY.
IMAODBC, The Hague, 5-9 sept 2005
Metadata to fit your needs... How much is too much?
New Platform to Support Digital Humanities in the Czech Republic
Presentation transcript:

DIGITIZATION OF GOVERNMENT INFORMATION RESOURCES : A CASE STUDY OF CENTRAL SECRETARIAT LIBRARY BY S. MAJUMDAR DIRECTOR CENTRAL SECRETARIAT LIBRARY BY S. MAJUMDAR DIRECTOR CENTRAL SECRETARIAT LIBRARY

The course of human development has taken new dimension with the introduction of information and communication technology (ICT)

 For India, the rise of Information and Communication Technology is an opportunity to overcome historical disabilities and to become the master of one's own national destiny

 The GOI has recognised the potential of ICT for rapid and all-round national development.  The National Agenda for Governance, which is the Government's policy blueprint, has taken due note of the ICT Revolution that is sweeping the globe

GOVERNMENTAL PUBLICATIONS IN INDIA  The sphere of governmental activity in India has expanded considerably. Many burning issues like population control, health management, economic and social condition of rural and urban masses, education, basic requirements. In every field the government has intervened

Interventions are Visible Through  Administrative Reports  Governmental Notifications  Statistical Reports  Budget Documents  Committee and Commission Reports  Research Reports

Interventions  Bills, Acts, Laws, Codes, Rules and Regulations, Law Reports, Digests and Parliamentary Debates  Reports of various Parliamentary Committees

CENTRAL SECRETARIAT LIBRARY MISSION  “ take an Indian Initiative to ICT through its libraries to promote, facilitate the development of Indian tangible heritage from printed form to machine readable collections and provide services for optimal utlisation of resources and provide life long accessibility of information through vast library resources.”

Central Secretariat Library : Types of Machine Readable Database  Machine readable catalogue of bibliographical information for its document resources;  Creating digital documents of the Annual Reports, Budget Documents of the parent Ministry(15 thousand pages);  Creating digital documents of the Government of India Gazette (2 million pages) and Commission and Committee Reports (1 million pages);  Developing machine readable annotated bibliography of rare books;  Developing Digital documents of selected government publications

DIGITAL LIBRARY CONCEPTUALISATION  PILOT PROJECT  ANNUAL REPORTS;  PERFORMANCE BUDGET;  DEMANDS FOR GRANTS;  EXPENDITURE BUDGET OF DEPARTMENT OF CULTURE TO

OBJECTIVES  TO DEVELOP CD BASED FULL TEXT ENGLISH DATABASE  HIGHLY INSTINCTIVE DATABASE WHICH IS 100% SEARCHABLE, NAVIGABLE AND CAN BE BROWSED ON ALL GRAPHICS, TABLES, FIGURES AND PHOTOGRAPHS

OBJECTIVES  TO DEVELOP WEB ENABLING INTERFACES TO BE HOSTED ON CSL SERVER CO-LOCATED AT NIC

SECOND PHASE CONCEIVED  ANNUAL REPORTS, ETC. OF DEPARTMENT OF CULTURE, TO

SECOND PHASE CONCEIVED  GOVERNMENT OF INDIA (CENTRAL GOVERNMENT) GAZETTE  COMPLETE SETS ARE AVAILABLE FROM 1950 ONWARDS.  MORE THAN 70% QUERRIES OF THE TOTAL USAGE OF INDIAN OFFICIAL PUBLICATIONS PERTAIN TO GAZETTE OF INDIA

UNIQUENESS OF GAZETTE  ORGANISED INTO DIFFERENT SECTIONS AND PARTS  SEARCH REQUIREMENTS ARE ON ANY ONE OR COMBINATION OF SUBJECTS, PART NUMBERS, SECTIONS, SUBSECTIONS, DATE, GSR NUMBERS, AND S. O. NUMBER

OBJECTIVES  DEVELOP VALUE BASED PRODUCT  USE OF NEW TECHNOLOGY  SCANNING  DATABASE GENERATION  CONTENT ANALYSIS  CONTENT MANAGEMENT

OBJECTIVES  CONSERVATION AND PRESERVATION  WEB ENABLING TECHNOLOGY  PORTAL APPLICATION

ADMINISTRATIVE FORMALITIES  OPEN TENDER SYSTEM  EVALUATION PROCESS  TECHNICAL EVALUATION  FINANCIAL EVALUATION

TECHNICAL EVALUATION  CONSTITUTION OF TECHNICAL COMMITTEE  TWO PART EVALUATION  CREDIBILITY, TURNOVER, WORK FORCE,SIMILAR WORK EXPERIENCE  TECHNICAL EXPERIENCE

TECHNICAL EXPERIENCE  ELEVEN PARAMETERS  Digitizing technique : scanning, OCR, Proofing or any other with technical specifications;  Format for metadata creation;  File format : TIFF and PDF or any other  Organization of Images and the corresponding metadata in DBMS

TECHNICAL DETAILS …  ELEVEN PARAMETERS …..  Standard DBMS used or it is a proprietary software  Retrieval interface with standard search strategy  Provision for Web enabling of data  Resolution : Scanning and Output/Display  Portability from one platform to another  Platform used  Work flow

PROTOTYPES  ANNUAL REPORTS  GAZETTE DOCUMENTS

TECHNICAL RESULTS  COULD SHORTLIST TWO AGENCIES  COULD ACHIEVE BEST PRACTICES TO BE FOLLOWED  COULD UNDERTAKE BENCHMARKING OF THE BEST PRACTICES

BENCHMARK  SCANNING  OCRing  PROOFING  FORMATING  CONTENT ANALYSIS

BENCHMARK…..  METADATA CREATION USING UNIMARC AND DUBLIN CORE (Great Emphasis)  CONTENT MANAGEMENT SYSTEM  PLATFORM PORTABILITY  WEB ENABLING  DELIVERABLES  PRESERVATION AND CONSERVATION

DC vs UNIMARC  MAPPING THE TAGS OF DUBLIN CORE WITH UNIMARC

Costing  COSTING TECHNIQUE USED BY CSL : BASE IS THE WORK FLOW - STAGES OF WORK

DELIVERABLES  TIFF images to PDF with text ;  PDF on CD-ROM after OCRing in RTF ( Rich Text Format) for full text search;  TIFF on CD-ROM in raw form;

DELIVERABLES….  XML based ODBC compliant database archives on CD-ROM;  Installation of Portal Application Infrastructure;  Deliver the tools required for managing and updating the Portal;

STATUS  1.9 million pages of Gazette have already been scanned, cleaned, OCRed;  Segregated the Devnagri version with English before OCRing;  Capturing minimum fields of Meta Data using DC and UNIMARC;  Deliverables at each stage is being received in different storage media

COMMISSION AND COMMITTEE REPORTS  Best Practices used in Gazette Documents were repeated;  More emphasis on the Meta Data creation using the mapping process of UNIMARC with DC;  Emphasis on the Metadata Professionals based on the tools and techniques used by traditional Libraries

FIRST PHASE  BASED ON THE PAPERS SUBMITTED  The capabilities of the Agencies;  The resources available and to be deployed of the Agencies;  The time frame, work flow and performance guarantee; and  CORE COMPETENCY FACTOR WAS IDENTIFIED

SECOND PHASE  The prototype demonstration based on the scope of work and the deliverables envisaged  1. Scanning techniques used and quality, details of equipments used  2. Cleaning, OCRing, Proofing method and compression techniques for output document in PDF and RTF  3. XML coding process followed and files created for the purpose

SECOND PHASE  Understanding of Meta Data in relation to  a. General feature of Committee and Commission Reports;  b. In relation to DC elements, mapping of DC with General Features;  c. Mapping of DC into UNIMARC tags;  d. Standard used for subject heading.

SECOND PHASE  XML coding process followed and files created for the purpose  Search mechanism design, developed using content management system  Work Flow chart and its quality  Overall quality of the Demo

OTHER ISSUES  DELIVERABLES  TIFF (CLEANED AND UNCLEANED)  PDF (IMAGE AND DOCUMENT)  XML CODED DATA  RTF FOR CERTAIN PORTION  DC/ UNIMARC BASED METADATA  WEB ENABLED DATABASE WITH TOOLS AND TRAINING

As we move into the electronic era of digital objects it is important to know that there are new barbarians at the gate and that we are moving into an era where much of what we know today, much of what is coded and written electronically, will be lost forever. We are, to my mind, living in the midst of digital Dark Ages; consequently, much as monks of times past, it falls to librarians and archivists to hold to the tradition which reveres history and the published heritage of our times. Terry Kuny, Consultant, National Library of Canada 1998

THANKS