Persistent Identifiers











What is a Persistent Identifier (PID)?
• PIDs are used when citing and managing research data and information
• A PID always identifies something: for instance, a publication, document, person or organization
• A PID is a unique and permanent identifier that is functional on the Internet
• In an ideal situation, when clicking on the PID the user is always taken to the original, individualized information, and the machine can also interpret this link and understand what type of information is at hand
• A good identifier is linked to
  o A landing page
  o Access information
  o Metadata
  o Tombstone page

Why are PIDs needed?
• Traceability and reliability of information are absolute requirements for good data management
• Most web links break over time. When research sources are digital, we would like the links to them to be persistent
• Planning and common standards, as well as PIDs, are needed to secure the availability and localization of the information
• Information must be verifiable and traceable; data must be individualized, and its life cycle must be managed in a systematic way. Using PIDs is therefore essential!

PIDs
• URN, DOI and Handle are common PIDs for research data
• The DOI for research data is called a DataCite DOI and is registered with metadata in the DataCite service
• The DataCite Finland Consortium offers access to minting DOIs
• A DOI in a data archive provides, for instance, access to metadata even if the data is no longer available
• URN identifiers from trusted repositories such as the Data Archive (Tietoarkisto), the Language Bank (Kielipankki) or Etsin are also good identifiers
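As a rough illustration of how the three identifier types differ in form, the sketch below guesses the type of an identifier string and, where a common public resolver exists, builds the corresponding web link. The function name classify and the patterns are assumptions made for this example, not part of any identifier standard; the patterns are simple heuristics. The resolver addresses https://doi.org/ and https://hdl.handle.net/ are the public DOI and Handle proxies. No link is built for URNs, because the right resolver depends on the namespace.

```python
import re
from typing import Optional, Tuple

# Heuristic patterns (assumptions for this sketch, not official grammars).
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")      # e.g. 10.5281/zenodo.3574203
HANDLE_PATTERN = re.compile(r"^[^/\s]+/\S+$")       # prefix/suffix


def classify(pid: str) -> Tuple[str, Optional[str]]:
    """Return (identifier type, resolvable URL or None) for an identifier string."""
    text = pid.strip()
    if text.lower().startswith("urn:"):
        # URNs have no single common resolver, so no URL is built here.
        return ("URN", None)
    if "://" in text:
        # Plain web addresses are not treated as PIDs in this sketch.
        return ("unknown", None)
    if text.lower().startswith("doi:"):
        text = text[4:]
    if DOI_PATTERN.match(text):
        return ("DOI", "https://doi.org/" + text)
    if HANDLE_PATTERN.match(text):
        return ("Handle", "https://hdl.handle.net/" + text)
    return ("unknown", None)
```

For example, classify("10.5281/zenodo.3574203") yields the type "DOI" together with the link https://doi.org/10.5281/zenodo.3574203.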

Resolvers
• Resolvers are used to make identifiers last through changes in organizations, web domain names or technical solutions
• Resolvers keep track of where the data is currently stored and redirect you to a landing page with metadata about the dataset
• An identifier that can be resolved is called functional
• A resolver (or resolution service) is an application that is in charge of resolving functional PIDs
  o e.g. the Handle system is used with DOI systems
  o URN doesn't have one common application package
• A resolution service can include many kinds of intelligence and additional services
• It is important to choose a secure service that is the most suitable for the specific need
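The idea of a resolver can be sketched with a small in-memory lookup table: the PID stays the same while the location behind it changes. This is only a toy model under stated assumptions. The class ToyResolver, its methods and the example.org addresses are hypothetical and do not correspond to the API of the Handle system or any other real resolution service.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Record:
    landing_page: Optional[str]          # current landing page, None if withdrawn
    tombstone: Optional[str] = None      # page explaining what happened to the data


class ToyResolver:
    """In-memory stand-in for a resolution service (hypothetical, for illustration)."""

    def __init__(self) -> None:
        self._table: Dict[str, Record] = {}

    def register(self, pid: str, landing_page: str) -> None:
        # A PID is registered once and never reused for another target.
        if pid in self._table:
            raise ValueError(f"{pid} is already registered")
        self._table[pid] = Record(landing_page)

    def move(self, pid: str, new_landing_page: str) -> None:
        # The data moved (new domain, new system); only the table entry changes.
        self._table[pid].landing_page = new_landing_page

    def withdraw(self, pid: str, tombstone: str) -> None:
        # The data is gone, but the PID keeps resolving to a tombstone page.
        record = self._table[pid]
        record.landing_page = None
        record.tombstone = tombstone

    def resolve(self, pid: str) -> str:
        record = self._table[pid]
        if record.landing_page is not None:
            return record.landing_page
        return record.tombstone
```

A citation that uses the PID keeps working after move() and withdraw(), because readers follow the PID through the resolver rather than a fixed web address.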

Persistent identifiers are necessary for research and a key element in FAIR data
[Diagram labels: PID, Resolver, Data catalog, Data file, Read me, License, Configuration file]

Recommendations for using PIDs
1. If the dataset already has a PID, use that
2. If a format has a standardized identifier system, use that
3. Utilize existing, open and widely used technologies
4. If there is no identifier system, plan one carefully: avoid unnecessary semantics, and choose a solution and design the identifier so that it can be used as a functional PID
5. Take care of your identifiers' uniqueness and persistence
6. Follow the guidelines and practices of the identifier system you have chosen
7. When there is a new version of an identified target, give it a new identifier and take care of linking it to the old versions (or their metadata)
8. Don't reuse identifiers or destroy them
9. If you use Cool URI identifiers, make them distinct and manage them carefully
10. Document and publish the practices and principles of your identifiers' sharing
11. Act responsibly
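Recommendations 4, 5, 7 and 8 can be sketched in a few lines, assuming a self-managed identifier scheme. The class IdRegistry, its methods and the prefix "example-prefix" are hypothetical names chosen for this illustration. Random UUIDs are used here only as one possible way to obtain identifiers without semantics; the recommendations above do not prescribe this.

```python
import uuid
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class IdRecord:
    pid: str
    target: str                              # description of the identified object
    previous_version: Optional[str] = None   # PID of the older version, if any


class IdRegistry:
    """Mints opaque identifiers and links new versions to old ones (sketch)."""

    def __init__(self, prefix: str) -> None:
        self.prefix = prefix
        self._records: Dict[str, IdRecord] = {}   # entries are never deleted

    def mint(self, target: str, previous_version: Optional[str] = None) -> str:
        if previous_version is not None and previous_version not in self._records:
            raise KeyError(f"unknown previous version: {previous_version}")
        # Opaque suffix: no titles, dates or organization names in the identifier.
        pid = f"{self.prefix}/{uuid.uuid4()}"
        while pid in self._records:               # guard uniqueness explicitly
            pid = f"{self.prefix}/{uuid.uuid4()}"
        self._records[pid] = IdRecord(pid, target, previous_version)
        return pid

    def history(self, pid: str) -> List[str]:
        """Return the PID followed by the PIDs of all earlier versions."""
        chain: List[str] = []
        current: Optional[str] = pid
        while current is not None:
            chain.append(current)
            current = self._records[current].previous_version
        return chain
```

A new version of a dataset gets its own identifier through mint(..., previous_version=old_pid), while the old identifier stays in the registry and remains reachable through history().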

PID network
• CSC's PID network is an open expert network for people who manage persistent identifiers
• In the network there are currently over 50 members from different organizations
• In practice, joining the network means subscribing to a mailing list
• You can join the network by contacting the network coordinator Jessica Parland-von Essen by email at parland@csc.fi

Further Information:
• PID Forum:
  o https://www.pidforum.org/
• EOSC PID Policy:
  o https://doi.org/10.5281/zenodo.3574203
• CSC PID Policy:
  o https://research.csc.fi/pid-policy
CSC provides guidance and services for organizations for allocating and minting PIDs: https://www.csc.fi/en/researchadministration
For more information and support, contact CSC PID services at pid-support@listat.csc.fi