Export/Import in DSpace & Backup
ARD Prasad

Where DSpace stores data
The /dspace/assetstore directory holds all the bitstreams and licenses.
The PostgreSQL database contains:
- Metadata
- Information about communities
- Information about collections
- Information about e-groups and authorizations
- Information about e-persons and authorizations
- A host of other information

Export/Import in DSpace
Export and import deal only with bitstreams, metadata, licenses and handles. They do NOT cover information about communities, collections, members, reviewers, access permissions/restrictions, etc.
You can export or import:
- An item, or
- All items in a collection

Export command syntax
    /dspace/bin/dsrun org.dspace.app.itemexport.ItemExport \
        --type=COLLECTION --id=collID \
        --dest=dest_dir --number=seq_num
Where:
- --type is either COLLECTION or ITEM
- --id is the handle or ID of the collection or item, e.g. 1849/2 (or /2 in case you do not have a handle)
- --dest is the destination directory (it must be created before running the script)
- --number is the sequence number; it can be just 1
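
As a minimal sketch of the syntax above, the following wrapper exports a single item rather than a whole collection. The function name export_item and the DRY_RUN variable are hypothetical, introduced only for illustration; the dsrun path, the class name and the four options are the ones shown on the slide.

```shell
#!/bin/bash
# export_item HANDLE DEST_DIR
# Hypothetical helper: exports one item with the ItemExport options shown
# on the slide. With DRY_RUN=1 it only prints the command it would run.
export_item() {
    local handle="$1" dest="$2"
    local cmd=(/dspace/bin/dsrun org.dspace.app.itemexport.ItemExport
               --type=ITEM "--id=$handle" "--dest=$dest" --number=1)
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "${cmd[*]}"
        return 0
    fi
    # The destination directory must exist before the export runs.
    mkdir -p "$dest" && "${cmd[@]}"
}
```

For example, DRY_RUN=1 export_item 1849/123 /tmp/export (1849/123 being a made-up item handle) prints the command without touching the repository, which is a cheap way to check the arguments first.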

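Shell Script for exporting
    #!/bin/bash
    # Exports every listed collection into its own subdirectory of $1.
    # (bash is required: the script uses arrays and a C-style for loop.)
    if test $# != 1
    then
        echo "Usage: $0 "
        exit 1
    fi
    declare collection_id[5]=( )
    # Loop over however many collection ids are listed above.
    for ((i = 0; i < ${#collection_id[@]}; i++))
    do
        mkdir "$1/${collection_id[$i]}"
        /dspace/bin/dsrun org.dspace.app.itemexport.ItemExport \
            --type=COLLECTION \
            --id=1849/${collection_id[$i]} \
            --dest="$1/${collection_id[$i]}" \
            --number=1
    done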

In the shell script...
- Look for the line that begins with declare collection_id and replace the ids listed there with your own collection ids.
- Clue: a collection id is the number that appears in the browser URL after the handle prefix. If you have not registered with CNRI, it is the number that appears after /
- Also create the directory where the data should be exported to.

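Shell Script for Import
    #!/bin/bash
    # Imports each exported collection directory under $1 back into DSpace.
    # (bash is required: the script uses arrays and a C-style for loop.)
    declare collection_id[5]=( )
    # Loop over however many collection ids are listed above.
    for ((i = 0; i < ${#collection_id[@]}; i++))
    do
        # -e must be followed by the DSpace admin e-mail address.
        /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport \
            -a -e \
            -c /${collection_id[$i]} \
            -s "$1/${collection_id[$i]}" \
            -m mapfile
    done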

Here also, change the collection ids in the import program. The -e option should have the DSpace admin id (i.e. the e-mail address).

What is exported
The following files will be created for every item:
- dublin_core.xml (metadata)
- handle (one line holding the handle number)
- license.txt
- The actual file (bitstream: could be a PDF, a DOC or an image file)
- contents (two lines: the license file name and the actual bitstream name)
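
The file list above lends itself to a quick sanity check after an export. The sketch below is only an illustration: check_item_dir is a hypothetical name, and it assumes the files carry exactly the names listed on the slide, written in lowercase on disk (handle, contents). It does not look inside the files or check the bitstream itself.

```shell
#!/bin/bash
# check_item_dir DIR
# Hypothetical helper: reports which of the per-item files listed on the
# slide are missing from one exported item directory.
# Returns 0 when all four are present, 1 otherwise.
check_item_dir() {
    local dir="$1" missing=0 name
    for name in dublin_core.xml handle license.txt contents; do
        if [ ! -f "$dir/$name" ]; then
            echo "missing: $dir/$name"
            missing=1
        fi
    done
    return "$missing"
}
```

Running it over every item directory of an export, for instance in a for loop over the subdirectories, gives an early warning before the export is relied on as a partial backup.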

However
Import and export are meant for data exchange. They can, however, be used for a partial backup:
- They take care of items only.
- They do not back up your communities, collections, e-groups or e-persons.

How to back up PostgreSQL
Run pg_dump as the dspace user. Example:
    $ pg_dump dspace > backupfile
Note: dspace is the name of the database. The backup file will have all the table definitions and contents. pg_dump has lots of options.
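
A small wrapper can make the dump above repeatable. This is a sketch under assumptions, not a prescribed procedure: backup_db and the dated file-name pattern are hypothetical, and only the plain pg_dump DBNAME > FILE form from the slide is used.

```shell
#!/bin/bash
# backup_db DBNAME OUT_DIR
# Hypothetical helper: dumps DBNAME with pg_dump into a dated file under
# OUT_DIR and prints the file name. Run it as a user that may read the
# database (the slide uses the dspace user).
backup_db() {
    local db="$1" out_dir="$2"
    local file="$out_dir/$db-$(date +%Y%m%d).sql"
    mkdir -p "$out_dir" || return 1
    if ! pg_dump "$db" > "$file"; then
        rm -f "$file"    # do not leave a partial dump behind
        return 1
    fi
    echo "$file"
}
```

Calling backup_db dspace /some/backup/dir (the directory is a placeholder) writes one file per day, so an older dump is not overwritten by a later one taken on a different date.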

How to restore the database
    psql -d dspace -f dumpedfile
Note: psql has lots of options; to know more about the options, you can use

Alternative (using tar)
To dump a database called mydb that contains large objects to a tar file:
    $ pg_dump -Ft -b mydb > db.tar
To reload this database (with large objects) into an existing database called newdb:
    $ pg_restore -d newdb db.tar

Upgrading
This backup procedure should be the first step when you are upgrading DSpace to a newer version. Even if the upgrade fails, you have a backup to fall back on.

Upgrading Tip
Use a different database and a different user, so that you do not have to touch the existing DSpace installation.

Extra care
It is a good idea to take a tape (or hard disk) backup of:
- The entire /dspace directory
- The pg_dump output file
- The export directory
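
One way to gather the three pieces above before copying them to tape or another disk is a single compressed tar archive. The helper below is a hypothetical sketch (archive_backup is not part of DSpace); it relies only on the standard tar options -c, -z and -f.

```shell
#!/bin/bash
# archive_backup OUT_FILE PATH...
# Hypothetical helper: packs the given files and directories into one
# gzip-compressed tar archive.
archive_backup() {
    local out="$1"
    shift
    # Refuse to create an empty archive.
    [ "$#" -gt 0 ] || { echo "nothing to archive" >&2; return 1; }
    tar -czf "$out" "$@"
}
```

A call might look like archive_backup full-backup.tar.gz /dspace backupfile export_dir, where full-backup.tar.gz and export_dir are placeholder names for the archive and the export directory. The archive can be inspected afterwards with tar -tzf.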

Final Lesson
Learning DSpace is easy:
- It can be learnt in a week
- It can be mastered in a month
Creating content is a continuous, long-term task, perhaps without end.
Be more careful with the content.

Thank You