Hi!
We’re looking to improve our system’s ability to fetch up-to-date metadata about journal articles and preprints, including changes that accrue over time as new forward links and cross-links between preprints and journal articles are processed.
We currently use the REST API, but the OAI-PMH service is intriguing, since it seems like a potentially better fit for our use case of incremental metadata harvesting.
However, it’s not clear what triggers the inclusion of a record in the OAI-PMH list of updated records for a given date. I’d love to know if there is an easily described set of rules that determines this.
Our primary motivating examples are:
- Journal article X has a new inbound citation, so its cited-by count (is-referenced-by-count in the REST API) goes up by 1. Does article X show up in the OAI-PMH incremental update for today?
- Preprint X has now been published in a journal, so a new related-item link exists. Does preprint X show up in the OAI-PMH incremental update for today?
If the clear guidance is to stick with the REST API for high-fidelity incremental updates like this, that would be helpful to know. (I did see the note in the API Swagger docs about using the from-index-date filter to capture incremental metadata updates.)
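
For context, here is a minimal sketch of the REST-based incremental harvest we have in mind, in case it helps frame the question. It assumes the public api.crossref.org/works endpoint, the from-index-date filter, and deep paging via cursor / next-cursor. The function names (build_url, fetch_json, harvest) are our own and purely illustrative. Whether a citation-count change or a new preprint-to-article relation actually updates a record's index date is exactly what we are unsure about.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

API = "https://api.crossref.org/works"


def build_url(since, cursor="*", rows=1000):
    """Build a /works query for records indexed on or after `since` (YYYY-MM-DD)."""
    params = {
        "filter": f"from-index-date:{since}",
        "rows": rows,
        "cursor": cursor,  # "*" starts a new deep-paging session
    }
    return f"{API}?{urlencode(params)}"


def fetch_json(url):
    """Fetch a URL and decode the JSON body (no retries or rate limiting here)."""
    with urlopen(url) as resp:
        return json.load(resp)


def harvest(since, fetch=fetch_json):
    """Yield every work returned for the index-date window, page by page."""
    cursor = "*"
    while True:
        message = fetch(build_url(since, cursor))["message"]
        items = message.get("items", [])
        if not items:
            break  # an empty page marks the end of the result set
        for item in items:
            yield item
        cursor = message.get("next-cursor")
        if not cursor:
            break
```

In our pipeline we would call something like `harvest("2024-05-01")` once a day (the date is a placeholder) and compare is-referenced-by-count and relation on each returned record against our stored copy.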