Exploiting the DataCite schema and Elastic Search for complex queries

The following are some complex searches formulated as part of a project to establish how deep searches in a subject domain such as chemistry can be formulated. They exploit in particular the Subject element in the DataCite schema. They are not necessarily optimal (in the sense that more efficient variations for the same search might be found), but are presented as examples for the community to experiment with.

EntrydescriptionElastic search query
1Media (MIME) type https://search.datacite.org/works?query=media.media_type:chemical/x-mnpub*
2Combining Media with the DataCite Subject https://search.datacite.org/works?query=media.media_type:chemical/x-mnpub*+AND+subjects.subjectScheme:inchikey+AND+subjects.subject:XZYDALXOGPZGNV-UHFFFAOYSA-M+AND+media.media_type:chemical/x-gaussian*
3Combining ORCID with Mediahttps://search.datacite.org/works?query=contributors.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390+AND+media.media_type:chemical/x-mnpub*
4xploiting Subject https://search.datacite.org/works?query=subjects.subjectScheme:Gibbs_Energy+AND+subjects.subject:"-39.946176"
5Exploiting Subject with range queryhttps://search.datacite.org/works?query=subjects.subjectScheme:Gibbs_energy+AND+subjects.subject:[\-649.1 TO \-649.8]
6Exploiting three different Subjectshttps://search.datacite.org/works?query=(subjects.subjectScheme:inchikey+AND+subjects.subject:KTOSDSJYNBIDCN-UHFFFAOYSA-N)+AND+(subjects.subjectScheme:Gibbs_Energy+AND+subjects.subject:"-1082.980914")
7Two different Media typeshttps://search.datacite.org/works?query=media.media_type:chemical/x-gaussian*+AND+media.media_type:chemical/x-mnpub*
8License type https://search.datacite.org/works?query=rightsList.rights:"Creative Commons Public Domain Dedication (CC0 1.0)"
9Exploiting subjectscheme https://search.datacite.org/works?query=media.media_type:chemical/x-mnpub*+AND+subjects.subjectScheme:NMR_Nucleus+AND+subjects.subject:1H
10Exploiting subjectscheme https://search.datacite.org/works?query=media.media_type:chemical/x-mnpub*+AND+subjects.subjectScheme:NMR_Pulse+AND+subjects.subject:1D
11Simple PID queryhttps://search.datacite.org/works?query=identifier:*10.14469/hpc*
12Combining ORCID with PID queryhttps://search.datacite.org/works?query=(contributors.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390)+AND+(identifier:*10.14469/hpc*)
13Combing researcher name with PID queryhttps://search.datacite.org/works?query=(identifier:*10.14469/hpc*)+AND+(contributors.contributor.contributorName:Henry+Rzepa)
14Entries in specific repository (Imperial) referencing specific Journal https://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021/acs.orglett*)+AND+(identifier:*10.14469/hpc*)
15Entries in specific repository (Cambridge) referencing specific Journal https://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021/acs.orglett*)+AND+(identifier:*10.17863/cam*)
18Entries in specific repository (Cambridge) referencing all publisher journalshttps://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021/acs*)+AND+(identifier:*10.17863/cam*)
16Entries in all repositories except one referencing specific Journalhttps://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021/acs.orglett*)+NOT+(identifier:*10.5517*)
17Entries in specific repository referencing one publisher https://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021*)+AND+(identifier:*10.5517*)
19Entires in all publisher journals, excluding one data repositoryhttps://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:10.1021*)+NOT+(identifier:*10.5517*)
20Entries in Institutional repository referencing datasetshttps://search.datacite.org/works?query=(relatedIdentifiers.relatedIdentifier:*10.14469/spiral*)+AND+(identifier:*)+AND+(types.resourceTypeGeneral:Dataset)

Interesting and/or valuable exploitations of the DataCite Schema/Search engine by other communities are hugely welcomed for inclusion here. We can all learn from them.

If you have a query or suggestion, please contact me at rzepa@imperial.ac.uk, ORCID: https://orcid.org/0000-0002-8635-8390