ISSN discrepancy between JSON and XML responses

abeechin · September 14, 2023, 4:27pm

Hello Crossref,

I noticed a very minor data discrepancy relating to the ISSN of a publication between the JSON and XML endpoints. In the later the ISSN is missing the hyphen, so appears as 8 numbers together.

https://0-api-crossref-org.pugwash.lib.warwick.ac.uk/works/10.1016/S1470-2045(22)00738-0
=>
"issn-type": [{
    "value": "1470-2045",
    "type": "print"
}],

vs

https://0-api-crossref-org.pugwash.lib.warwick.ac.uk/works/10.1016/S1470-2045(22)00738-0/transform/application/vnd.crossref.unixsd+xml
=> 
<issn media_type="print">14702045</issn>

And I was wondering if this was a one-off or symptomatic of a wider issue - perhaps with that publisher? Here is another DOI exhibiting the same issue: 10.29297/orbit.v1i2.50

Shayn · September 14, 2023, 4:43pm

Thanks for your question.

The XML reflects the ISSN exactly how the publisher submitted it to us. We accept them with or without the hyphen. They’re treated exactly the same either way.

When the XML is processed for indexing in our REST API (JSON), the ISSNs are normalized so they all include the hyphen, even if the publisher didn’t use the hyphen in the metadata they submitted to us. It’s just for consistency. There’s no difference in meaning.

abeechin · September 14, 2023, 5:37pm

OK, good to know, then we can do the same when pulling from the XML endpoint - many thanks for the rapid answer!

abeechin · September 18, 2023, 1:51pm

As a follow-up, do you have a list of fields where you apply similar normalisations? Mainly it would help us catch where we can also do this, rather than switch over our current implementation to the JSON endpoint.

ppolischuk · September 22, 2023, 8:24pm

Hi, I’m still looking into this but wanted to share some preliminary findings. Unfortunately we don’t have any user-friendly documentation about what normalization we do for the JSON output. As I look through the codebase, it looks to me like ISSNs are the only metadata field that we currently normalize.

However, we are in the process of standing up a new internal data model at Crossref, and many more metadata fields are planned for normalization. Here is the issue for that work, and here are the tests/specs. Be advised that this work is still pending and the specification is subject to change.

gbilder · September 25, 2023, 6:41am

Also- I know you said that you are trying to avoid having to switch the REST JSON API, but you may want to reconsider that approach.

First, as you have already noted, you will continually have to apply normalisations to the XML to match the normalisations applied to the JSON.

The XML is mostly just going to represent what the member registered with us. The REST API and JSON, on the other hand, will increasingly include:

additional metadata from Crossref and other sources
additional metadata types that are not registered via XML
normalisations (as with the ISSN) to help make the metadata more usable (e.g. by citation formatters, etc)
access to non-work metadata and functionality (e.g. member data, submissions status information, billing data, etc)

For example, our recently announced opening of the RetractionWatch data will only ever be made available via the REST API. Our upcoming enhanced relationships support will also only be available via the REST API.

Finally, it is also worth noting that the REST API follows the “be conservative in what you send, be liberal in what you accept” principle. So, even though the REST API represents the ISSN with the hyphen according to the ISSN guidelines, you can search and filter for ISSNs with or without the hyphen. For example:

https://0-api-crossref-org.pugwash.lib.warwick.ac.uk/journals/23636300/works?select=container-title,title,DOI

abeechin · October 5, 2023, 9:47am

Many thanks for the detailed response - I will feed it back to the team and advocate we switch over to capture the full benefits of the data your provide

abeechin · October 5, 2023, 9:48am

I appreciate the detailed answer and links - many thanks (and apologies for the delayed response, was out of office the last weeks).

Topic		Replies	Views
Understanding some fields of the Metadata Retrieval service via API Interfaces for Machines rest-api , participation-report , metadata-retrieval , fees	6	237	January 30, 2024
ISSN journal resource not found Metadata Retrieval rest-api , metadata-retrieval , journal	3	655	July 7, 2023
Where do I open issues with specific ISSNs? Technical Support rest-api , metadata-quality	4	910	June 9, 2022
Public XML metadata API returns 503 Metadata Retrieval bug , metadata-retrieval , xml_api , test_admin_tool , admin-tool	4	311	August 31, 2023
Discrepancy in Subjects between Public and Polite REST API Interfaces for Machines	2	356	April 23, 2024

ISSN discrepancy between JSON and XML responses

Related Topics