What are PIDs?
A Persistent Identifier (PID) is a unique string registered to an object. It provides a single consistent way to point to or reference the object even as the object’s location, underlying infrastructure, or web address changes with time. Typically, a PID does not only provide the location of the data the PID is referring to but also additional metadata (e.g. identifier type, owner, etc.).
The PID string itself consists of a prefix and a suffix separated by a forward slash (/
). The prefix is a single number and groups together one or more suffixes maintained by an institution or individual. The suffix is a unique alphanumeric string of characters. The combination of the pre- and suffix uniquely identifies a specific object at a specific location.
Example of a PID from the SURF repository:
11112/SURF.82df4163-f7c855ca
Prefix: 11112
Suffix: SURF.82df4163-f7c855ca
To retrieve the object or any metadata about it, a resolver is required.
Why PIDs?
PIDs ensure that all elements of your research are uniquely identifiable and accessible: by humans as well as machines. Research data has grown exponentially over the last couple decades and may be housed across various data centres and recorded within various types of media, making it difficult to consolidate. The storage locations and infrastructures of the data themselves are also subject to change, which can make published references to data obsolete. This makes it increasingly difficult and laborious to guarantee the findability and accessibility of your data. At the same time, guaranteeing findability and accessibility is important to facilitate reproducibility and reuse. PIDs offer a simple way to consolidate and trace your data no matter what changes time may bring.
Resolving PIDs
The string of the PID itself has no inherent meaning or information attached to it beyond it’s uniqueness. Rather it is used by the resolver to retrieve the object, it’s location and/or it’s metadata. PIDs registered with SURF, through the ePIC PID consortium, use the Handle service to resolve. You can read more about the Handle service below.
As an example, the PID 11304/2e873bd8-b988-11e3-8cd7-14feb57d12b9
can be resolved using the Handle service proxy URL followed by the PID itself:
http://hdl.handle.net/11304/2e873bd8-b988-11e3-8cd7-14feb57d12b9
Which currently redirects to the webpage: https://b2share.eudat.eu/records/e764dd9162c84a7e929887d3989fe4d1
(And formerly redirected to the webpage: https://b2share.eudat.eu/record/14)
This webpage functions as a landing page and contains corresponding metadata as well as links to the raw dataset files.
ePIC consortium
SURF offers a highly reliable, redundant and performant PID provisioning service in cooperation with the Persistent Identifiers for eResearch (ePIC) consortium, formerly known as the European Persistent Identifier Consortium (EPIC).ePIC provides new prefixes in the namespace “21.” through the GWDG’s membership in DONA. ePIC provides a redundant and load-balanced setup between the international consortium’s datacenters and guarantees all time and global PID resolution availability.
Handle PIDs
The Handle System is run by the DONA foundation (Digital Object Network Architecture) on behalf of the Corporation for National Research Initiatives (CNRI) and provides a comprehensive and flexible system for assigning, managing, and resolving PIDs for digital objects and other resources on the Internet. The Handle System includes an open set of protocols, an identifier space, and an implementation of the protocols.
More info
ePIC consortium website: http://www.pidconsortium.net
Persistent Identifier Guide by the Nationaal Archief: https://www.pidwijzer.nl/en/
A Persistent Identifier (PID) policy for the European Open Science Cloud (EOSC): https://data.europa.eu/doi/10.2777/926037
PID architecture for the EOSC: https://data.europa.eu/doi/10.2777/525581
Knowledge Exchange’s report on risks and trust in pursuit of a well functioning PID infrastructure for research: https://www.knowledge-exchange.info/event/pids-risk-and-trust
Research Data Alliance’s (RDA) PID interest group: https://www.rd-alliance.org/groups/pid-interest-group.html