I’ve discovered a case where an article cites a dataset that has a CrossRef DOI, but the dataset DOI is not included in the article’s CrossRef metadata. In other words, a data citation has been missed. For context, this came up as part of the Make Data Count Kaggle challenge (see “Make Data Count - Finding Data References” on Kaggle).
The example is “A conserved allosteric element controls specificity and activity of functionally divergent PP2C phosphatases from Bacillus subtilis” (https://doi.org/10.1016/j.jbc.2021.100518), which cites Protein Data Bank record 3F7A. That record has a DOI (wwPDB: pdb_00003f7a) issued by CrossRef. The DOI is cited in the web version of the article, but it is missing from the corresponding reference in the CrossRef metadata:
{
  "year": "2009",
  "series-title": "Structure of Orthorhombic Crystal Form of Pseudomonas aeruginosa RssB",
  "author": "Levchenko",
  "key": "10.1016/j.jbc.2021.100518_bib25"
},
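For anyone who wants to reproduce this, here is a rough sketch of how the unlinked references could be listed. It assumes the public CrossRef REST API at `https://api.crossref.org/works/{doi}`, and that a reference CrossRef has matched carries a `"DOI"` field while an unmatched one (like the entry above) does not. The function names `fetch_references` and `references_without_doi` are my own, not part of any CrossRef library.

```python
import json
from urllib.request import urlopen

# Assumed base URL of the public Crossref REST API "works" endpoint.
API = "https://api.crossref.org/works/"


def fetch_references(doi):
    """Fetch the reference list deposited for an article DOI.

    Returns an empty list if the record has no "reference" field.
    """
    with urlopen(API + doi) as resp:
        message = json.load(resp)["message"]
    return message.get("reference", [])


def references_without_doi(references):
    """Return the references that have no "DOI" field, i.e. no resolved link."""
    return [ref for ref in references if not ref.get("DOI")]
```

Calling `references_without_doi(fetch_references("10.1016/j.jbc.2021.100518"))` should then return the unlinked entries, which can be compared by `key` with the reference list in the web version of the article. This only shows which references lack a DOI; it says nothing about why the link was not made.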
I wonder if this is because the data record has type “component” and is hence overlooked when making the DOI links?
It would be interesting to see what happened here, because data citations are currently a topic of great interest, and here is a case where CrossRef seems to have all the information needed to make the link between paper and data, but doesn’t!