This documentation is derived from https://www.uu.nl/en/research/yoda and adjusted to Yoda hosted at SURF.

The Yoda metadata form consists of approx. 33 fields. The table below lists all metadata elements with a description of their function. The values in the Man. column state whether or not the field is mandatory when archiving a data package.

NoElementMan.DescriptionExplanation
1TitleYThe title of your datasetWhen you publish your data package, the title will be harvested by other catalogues. Maximum length: 255 characters.
2DescriptionYA description of your datasetDescribe your dataset, e.g., the subject, the sample size, methodology, etc. It is best to keep this description concise. A more elaborate description can be added in a readme.txt and/or a codebook file. Maximum length: 2,700 characters.
3Discipline
The (sub)discipline of the studyThe list contains a combination of research disciplines and sub-disciplines. The standard used is the OECD FOS 2007. This field can have multiple values — use the plus sign to add more values.
4VersionYThe version number of the datasetYoda does not automatically assign version numbers to data packages. If you create multiple versions, you can register the version number yourself, according to your own versioning scheme.
5Language of the data
The (main) language of the data in the datasetThis element is thought of as a possible aid to assess the usability of a dataset for a specific person. The standard used is ISO 639/1.
6aCollection Process - Start Date
Indicate when you’ve started collecting the data for this datasetFormat: YYYY-MM-DD
6bCollection Process - End Date
Indicate when you’ve finished collecting the data for this datasetFormat: YYYY-MM-DD
7Location(s) covered
Indication of the geographical entities, such as countries, regions and cities, covered by this datasetEnglish naming convention preferred. It is recommended to use the preferred spelling from the Getty Thesaurus of Geographic Names whenever possible. One location per line. This field can have multiple values — use the plus sign to add more values. Maximum length: 255 characters.
8aPeriod Covered - Start period
An indication of the start date of the period covered by your datasetFormat: YYYY-MM-DD
8bPeriod Covered - End Period
An indication of the end date of the period covered by your datasetFormat: YYYY-MM-DD
9Tag
Free text field for adding (searchable) keywords to your datasetYou can choose the keywords freely. It is best to add only one keyword per line. This field can have multiple values — use the plus sign to add more values. Maximum length: 255 characters.
10aRelated Data package
The way in which the present data package is related to another data packageIn this section you can enter a 'related' dataset and the nature of that relation. For instance, you can indicate that another dataset contains the raw data by selecting IsSourceOf in this field and entering the information of the other data package in the fields below. This field can have multiple values — use the plus sign to add more values.
10bRelated Data package - TitleIf 10aTitle of the data package related to the present data packageThere is no automatic check for whether or not title and persistent identifier match. Maximum length: 255 characters.
10dRelated Data package – TypeIf 10eThe type of the persistent identifier of the related data packageExample: "DOI".
10eRelated Data package – IdentifierIf 10dThe persistent identifier of the related data package
11Retention PeriodYThe minimal number of years the data will be kept in the archive

In this field you can only enter integers. The default value is 10 years. The policies for handling expiry of the retention period of a data package are still to be defined.

12Retention Information
To be used for remarks about the retention periodPlease provide a reason if you deviate from the default value of 10 years. If you want to ensure that data is retained longer, then data management might request extra care for choosing sustainable file formats.
13Embargo end date
If the dataset has an embargo, on what date does the embargo end?This functionality is not yet fully implemented. Please contact the data manager if you intend to publish a data package with an embargo.
14Data ClassificationYPlease indicate the classification of the dataThe data-classification according to the BIV guidelines.
15Name of Collection
If this data package is part of a larger (conceptual) collection of data packages, you can enter the collection name hereThe research group should ensure that all other data packages in the collection are archived with the same collection name. Maximum length: 255 characters.
16Funder
The name(s) of the organization(s) funding the researchExample: "NWO". This field can have multiple values — use the plus sign to add more values. Maximum length: 255 characters.
17Award Number
The grant number issued by the funding organization
18Creator of Data packageYThe name of the person(s) who created (this version of) the datasetPreferred format: Family Name, First Name . This field can have multiple values — use the plus sign to add more values. Maximum length: 255 characters.
19AffiliationYThe organizational or institutional affiliation of the creatorExample: "Utrecht University". The affiliation of the creator of a data package could be of importance when it is unclear who owns the data. In general, the organization to which the creator was affiliated is regarded as the owner. Each creator can have multiple affiliations — use the plus sign to add more values. Maximum length: 255 characters.
20aCreator of Data package – Persistent Identifier: Type
Please indicate the type of persistent person identifierE.g. AuthorID, ORCID or ResearcherID. Multiple values are possible. If available, enter at least an ORCID.
20bCreator of Data package – Persistent Identifier: Identifier
The persistent identifierIf you are not sure whether someone has a persistent identifier, you can check with the big three providers: AuthorID, ORCID, ResearcherID. Each creator can have multiple persistent identifier — use the plus sign to add more values. Maximum length: 255 characters.
21Contributor(s) to Data Package
The name of the person(s) who contributed to this datasetPreferred format: Family Name, First Name. Multiple values are possible — use the plus sign to add more values. Maximum length: 255 characters.
22Contributor Type
Enter what type of contribution the registered person has made to this data packageExamples: Project lead or Project member. This field can have multiple values — use the plus sign to add more values.
23Affiliation
The organizational or institutional affiliation of the contributorE.g. Utrecht University. Each contributor can have multiple affiliations — use the plus sign to add multiple values. Maximum length: 255 characters.
24aContributor of Data package – Persistent Identifier: Type
Please indicate the type of persistent person identifierEach contributor can have multiple persistent identifiers — use the plus sign to add more values. Maximum length: 255 characters.
24bContributor of Data package – Persistent Identifier: Identifier
The persistent identifierEach contributor can have multiple identifiers — use the plus sign to add more values. Maximum length: 255 characters.
25LicenseYThe license under which you offer the data package for use by third parties. The preferred value for open data is CC By 3.0Every package needs to be archived with a license — even when you’re not planning to publish the data or have it reused in any form. We offer a number of possible licenses in a drop-down list. If you do not know which license to choose, contact the data manager. At the moment of publishing a data package the relevant license text will be copied into the data package. If you opt for a custom license, you will need to store the custom license text in a file titled License.txt in the root folder. Please contact the data manager first if you want to work with custom licenses.
26Data Package AccessYOnce archived, will your dataset be accessible to third parties?Open Access means that the dataset is accessible to everyone. Restricted Access means that the dataset can only be obtained on request. Closed Access means that, in principle, the dataset cannot be shared.
  • No labels