Schema.org dataset metadata doesn't include file PIDs on Harvard Dataverse #5458

Closed
jggautier opened this issue Jan 11, 2019 · 1 comment

jggautier commented Jan 11, 2019

The Dataverse 4.10 release includes additions to the dataset metadata that Dataverse publishes in schema.org JSON (#4371). On Harvard Dataverse, all of those additions appear to show up as expected except for the file persistent IDs (PIDs). (If the latest version of a dataset was published before Harvard Dataverse was upgraded on Jan 8, its schema.org metadata won't have the extra fields yet.)

This is what some of the file metadata looks like for this dataset:

"distribution": [
    {
      "@type": "DataDownload",
      "name": "ESCOManalyses_SR.do",
      "fileFormat": "application/x-stata-syntax",
      "contentSize": 7436,
      "description": "Stata do file that replicates analyses reported in the article and appendix, with the exception of the Bayesian analyses that were conducted using R.",
      "contentUrl": "https://dataverse.harvard.edu/api/access/datafile/3317147"

Below "contentUrl" there should be "@id" and "identifier" properties.
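A quick way to spot the gap is to scan each "distribution" entry for the two properties. The following is only a minimal sketch: the function name `files_missing_pids` is hypothetical, and the sample dictionaries are trimmed copies of the excerpts quoted in this issue, not full exports.

```python
# Properties expected on each DataDownload entry when file PIDs are enabled.
PID_PROPERTIES = ("@id", "identifier")


def files_missing_pids(schema_org):
    """Map each file name to the PID properties absent from its entry."""
    missing = {}
    for entry in schema_org.get("distribution", []):
        absent = [prop for prop in PID_PROPERTIES if prop not in entry]
        if absent:
            missing[entry.get("name", "<unnamed>")] = absent
    return missing


# Trimmed from the Harvard Dataverse excerpt (description omitted).
harvard = {
    "distribution": [
        {
            "@type": "DataDownload",
            "name": "ESCOManalyses_SR.do",
            "fileFormat": "application/x-stata-syntax",
            "contentSize": 7436,
            "contentUrl": "https://dataverse.harvard.edu/api/access/datafile/3317147",
        }
    ]
}

# Trimmed from the Demo Dataverse excerpt (description omitted).
demo = {
    "distribution": [
        {
            "@type": "DataDownload",
            "name": "CallForSponsors2019.pdf",
            "fileFormat": "application/pdf",
            "contentSize": 146336,
            "@id": "https://doi.org/10.5072/FK2/ALQKV2/F4EVJR",
            "identifier": "https://doi.org/10.5072/FK2/ALQKV2/F4EVJR",
        }
    ]
}

print(files_missing_pids(harvard))
print(files_missing_pids(demo))
```

With these samples, the Harvard entry is reported as missing both properties and the Demo entry is not reported at all.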

Demo Dataverse was also upgraded to 4.10 and does include the file PIDs, e.g. for this dataset (which doesn't include "contentUrl" because it's a restricted file):

"distribution": [
    {
      "@type": "DataDownload",
      "name": "CallForSponsors2019.pdf",
      "fileFormat": "application/pdf",
      "contentSize": 146336,
      "description": "File description",
      "@id": "https://doi.org/10.5072/FK2/ALQKV2/F4EVJR",
      "identifier": "https://doi.org/10.5072/FK2/ALQKV2/F4EVJR"

I'm not sure why the file PIDs show up in the schema.org metadata on Demo Dataverse but not on Harvard Dataverse.
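To compare the two installations, one could pull each dataset's schema.org export and inspect its "distribution" list. The sketch below only builds the request URL. It assumes the metadata export endpoint from the Dataverse native API guide (`/api/datasets/export` with `exporter=schema.org`), assumes that Demo Dataverse lives at `demo.dataverse.org`, and infers the dataset DOI from the file DOI quoted above. The helper name `export_url` is hypothetical.

```python
from urllib.parse import urlencode


def export_url(server, persistent_id, exporter="schema.org"):
    """Build the URL for a dataset's metadata export (endpoint assumed)."""
    query = urlencode({"exporter": exporter, "persistentId": persistent_id})
    return f"{server.rstrip('/')}/api/datasets/export?{query}"


# Assumed host and dataset DOI, inferred from the Demo Dataverse file DOI.
print(export_url("https://demo.dataverse.org", "doi:10.5072/FK2/ALQKV2"))
```

Fetching the resulting URL for a dataset on each installation and comparing the keys of the "distribution" entries would show whether "@id" and "identifier" are present.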

jggautier changed the title from "Schema.org metadata doesn't include file PIDs on Harvard Dataverse" to "Schema.org dataset metadata doesn't include file PIDs on Harvard Dataverse" on Jan 11, 2019
jggautier commented

@pdurbin mentioned that we're not seeing the file PIDs in the dataset metadata on Harvard Dataverse because Harvard Dataverse turned off file PIDs recently. Closing this ticket.
