- iRODS admin account
- connected to iRODS, see How to connect to iRODS using iCommands
- SURF iRODS BagIt rule installed (see https://github.com/ccacciari/irods-bagit-integration; needs to be done by SURF iRODS admins)
- bdbag library (https://github.com/fair-research/bdbag) installed on the resource server (needs to be done by SURF iRODS admins)
Packaging data can be useful when a dataset or folder/collection consists of many files (small or large) and needs to be archived, either for publishing or to reduce costs. However, packaging data before upload can be tedious for most users. Here we show how to enable the packaging workflow using the SURF BagIt iRODS ruleset. Once it is enabled, iRODS users can package datasets as described in How to package and archive datasets using BagIt workflows in iRODS.
Enabling the packaging workflow
iRODS users will be able to mark collections to be packaged. The SURFbagitBatch iRODS rule (installed by SURF iRODS admins) will search for collections that are marked for packaging, and perform the packaging workflow asynchronously in the background.
There are two ways to enable this workflow. The first is to run the SURFbagitBatch rule manually, which finds all candidate collections once. Note that this rule must be run as an iRODS admin.
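A one-off manual run could be sketched as follows. The file name run_bagit_batch.r and the wrapper rule name runBagitBatch are hypothetical, and calling SURFbagitBatch without arguments is an assumption; check the installed ruleset for its actual signature before using this.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical rule file name; any writable path works.
RULE_FILE="run_bagit_batch.r"

# Write a small wrapper rule that calls SURFbagitBatch once.
# Assumption: SURFbagitBatch takes no arguments.
cat > "$RULE_FILE" <<'EOF'
runBagitBatch {
    SURFbagitBatch;
}
INPUT null
OUTPUT ruleExecOut
EOF

# Execute the rule as the currently authenticated iRODS user,
# which must be an iRODS admin (rodsadmin) account.
# The call is skipped when iCommands are not installed on this machine.
if command -v irule >/dev/null 2>&1; then
    irule -F "$RULE_FILE"
else
    echo "irule not found; wrote $RULE_FILE only" >&2
fi
```

Running `iuserinfo` beforehand is a quick way to confirm that the active session belongs to an admin account.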
However, you typically want the SURFbagitBatch rule to run regularly, without invoking it manually each time. To do this, you can wrap the rule in a delayed rule that iRODS re-executes at a fixed interval, for example every 2 hours.
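A delayed rule with a 2-hour repeat interval might look like the sketch below. As before, the file name and the wrapper rule name scheduleBagitBatch are hypothetical, and the argument-free call to SURFbagitBatch is an assumption. Depending on the iRODS version, the delay hints may also need an INST_NAME tag naming the rule engine plugin instance.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical rule file name; any writable path works.
RULE_FILE="schedule_bagit_batch.r"

# delay() queues the enclosed block instead of running it immediately:
#   PLUSET = wait before the first execution (here 1 minute)
#   EF     = execution frequency for repeats (here every 2 hours)
# Assumption: SURFbagitBatch takes no arguments.
cat > "$RULE_FILE" <<'EOF'
scheduleBagitBatch {
    delay("<PLUSET>1m</PLUSET><EF>2h</EF>") {
        SURFbagitBatch;
    }
}
INPUT null
OUTPUT ruleExecOut
EOF

# Submit the delayed rule as an iRODS admin, then list the rule queue.
# Both calls are skipped when iCommands are not installed on this machine.
if command -v irule >/dev/null 2>&1; then
    irule -F "$RULE_FILE"
    iqstat
else
    echo "irule not found; wrote $RULE_FILE only" >&2
fi
```

`iqstat` shows the queued rule together with its ID, and `iqdel` with that ID removes it again, which is useful when the schedule needs to change.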
Enabling the unpackaging workflow
The unpackaging workflow is similar to the packaging workflow: