Questions Regarding Japanese Content and the language Field in REST API

Dear Crossref Support & Community,

I hope this message finds you well.

We are a Japanese organization, and we’re interested in understanding how much Japanese-language content has been registered with Crossref and is being distributed via metadata.

While exploring Crossref’s REST API, we noticed that records include a field like the following:

“language”: “en”

We would like to ask for clarification on this field. Does “language”: “en” indicate:

  • That the article itself is written in English?
  • Or that the journal is English-language?
  • Or simply that the metadata (such as title and abstract) was submitted in English?

Thank you very much for your support.

Hello @ksakai ,

Thanks for asking!

This typically means that the content in question is published in English and thus will be cited in English.

If you have a specific example I’d be happy to give you more details.

-Isaac

Dear @ifarley ,

Thank you for your response and clarification.

As a follow-up, may I ask how the "language" field in the REST API is determined?

For example, in the following record, we did not specify a language attribute like <journal_article language="en" in the deposited metadata:

https://api.crossref.org/works/10.2964/jsik_2025_002

Despite this, the REST API output shows "language": "en". Could you let us know how this value is inferred in such cases?

We’d appreciate any further insight you could provide.

Best regards,
Kohei

Hello Kohei,

Thanks for the example. We’re not inferring the language here, we’re taking it directly from the metadata registered with us.

In the REST API result for DOI 10.2964/jsik_2025_002 - https://api.crossref.org/works/10.2964/jsik_2025_002 - you’ll see the language element here:

That is representing the language of the journal as a whole, and we’re getting that from the XML that is being submitted to us at the journal level. You can see this from the XML file that was registered for DOI 10.2964/jsik_2025_002 in submission https://doi.crossref.org/servlet/submissionAdmin?sf=detail&submissionID=1685138502

There is no language declared for the journal article, albeit two titles were submitted for the journal article:

Thus, we’re matching references based on both journal article titles.

Warm regards,

Isaac

Dear @ifarley ,

Thank you very much for your detailed explanation.
Based on your response, I understand that in this case the “language”: “en” value in the REST API output is not inferred from the article metadata, but instead comes from the language specified at the journal level.

I have a few follow-up questions:

  1. Does Crossref’s deposit schema include a metadata field specifically intended to indicate the language in which the article itself is written?
    For example, does the attribute <journal_article language=“” serve this purpose?
  2. We have some concerns that metadata aggregators such as OpenAlex may not be accurately capturing the publication language of non-English articles, including Japanese content.
    Since these services rely heavily on Crossref metadata, are there any best practices for depositing metadata for non-English articles to ensure that the article language is properly identified by downstream systems?

We would appreciate any guidance you can provide on this topic.

Thank you again for your continued support!