Hi,
It seems some titles encode ampersands as HTML entity &. I wonder if all special characters are converted to HTML entities at registration, or are they registered as submitted? Any suggestions on how to clean the titles are much appreciated!
Hi,
It seems some titles encode ampersands as HTML entity &. I wonder if all special characters are converted to HTML entities at registration, or are they registered as submitted? Any suggestions on how to clean the titles are much appreciated!
Hello,
The titles and metadata would be registered as they are submitted to us and so if the ampersand is encoded then that will be registered as such but any tool using that title might then convert that title itself.
If you have examples of titles that you would like us to take a look at any further questions regarding the cleaning up of titles then please do add them here.
Thank you,
Paul
Hi! I have a similar problem with a journal title that has a comma. Encoding as an HTML entity )`%2C` did not the trick, as the previous comment also suggests.
How can I tell the API to decode the `,` and use the journal name âGeoscientific Instrumentation, Methods and Data Systemsâ in a query as a filter?
Example: /works?filter=prefix:10.5194,container-title:Geoscientific Instrumentation%2C Methods and Data Systems
(sorry, not allowed to insert links yet)
Error:
| type | âpair-list-form-invalidâ |
|---|---|
| value | âprefix:10.5194,container-title:Geoscientific Instrumentation, Methods and Data Systemsâ |
| message | âPair list form specified as âprefix:10.5194,container-title:Geoscientific Instrumentation, Methods and Data Systemsâ but should be of the form: key:val,âŚ,keyN:valNâ |
Hello @nuest ,
Great question! Thanks for asking it here. Before I answer it, I have increased your privileges in the forum so you can paste links and fully participate. Sorry for those extra hoops. Weâve had to deal with a high volume of spam over the years and putting in some protections at the beginning has helped us greatly reduce pesky and time-consuming spam here.
How can I tell the API to decode the `,` and use the journal name âGeoscientific Instrumentation, Methods and Data Systemsâ in a query as a filter?
Example: /works?filter=prefix:10.5194,container-title:Geoscientific Instrumentation%2C Methods and Data Systems
Iâd recommend querying by ISSN instead of title. Youâll get better results. The ISSN of the journal Geoscientific Instrumentation, Methods and Data Systems is: 2193-0864, so your best bet here is a query like this:
https://api.crossref.org/works?filter=issn:21930864&select=DOI,title,published,ISSN
Iâm selecting for only a few of the most relevant elements, but if you wanted full metadata for all 438 DOIs registered with us using that ISSN:
https://api.crossref.org/works?filter=issn:21930864&rows=500
It also looks like a number of DOIs registered for this journal did not include that 2193-0864 ISSN in the metadata registered with us, so querying for records including that ISSN is giving us an incomplete picture of all DOIs registered in our system for this journal. Because of that, the journalâs depositor report is also helpful in getting a full picture of what has been registered:
https://data.crossref.org/depositorreport?pubid=J183305
Warm regards,
Isaac
Thanks - thatâs good advice and solves my problem for now. Iâll revisit the issue of completeness later. I assume I can find the pubid in tje metadata, too.
Thanks also for bumping my account!