A new dataset of relationships involving research organisations

We’ve published a dataset of relationships involving research organisations. Research organisations in the dataset are identified by ROR IDs.

The dataset contains relationships involving research organisations deposited by Crossref members and discovered by an automated affiliation matching strategy. It includes data deposited until the end of March 2025. The following relationships are included:

  • contributor’s affiliations
  • institutional contributors
  • work’s institutions
  • grant investigator’s affiliations

The dataset contains:

  • 140,906,929 total assertions
  • 1,014,325 (0.7%) assertions contain a ROR ID deposited by Crossref members
  • 94,988,729 (67%) assertions contain a ROR ID discovered by an automated affiliation matching strategy

The affiliation matching strategy, developed in collaboration with ROR, was used to automatically match a free-text affiliation string to a ROR ID. More information about the matching approach and strategy can be found here.

7 Likes