{"id":18344,"date":"2017-06-08T17:32:21","date_gmt":"2017-06-08T16:32:21","guid":{"rendered":"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=18344"},"modified":"2017-06-08T20:56:12","modified_gmt":"2017-06-08T19:56:12","slug":"how-to-search-data-repositories-for-fair-chemical-content-and-data-subjectscheme","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","title":{"rendered":"How to search data repositories for FAIR chemical content and data: SubjectScheme"},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"18344\">\n<p>As data repositories start to flourish, it is reasonable to ask questions such as <em>what sort of chemistry can be found there and how can I find it?<\/em> Here I give an updated<span id=\"cite_ITEM-18344-0\" name=\"citation\"><a href=\"#ITEM-18344-0\">[1]<\/a><\/span> worked example of a digital repository search for chemical content and also pose an important issue for the chemistry domain.<\/p>\n<p>Firstly, I should say this search is restricted just to those data repositories that submit indexing terms (metadata) to DataCite, which is the agency that will be used to conduct the searches. Each type of metadata is defined by a prefix or operator field (much in the same way that an advanced Google search can be <a href=\"http:\/\/www.googleguide.com\/advanced_operators_reference.html\">prefixed<\/a> with an operator, e.g. <strong>author:<\/strong><sup>\u2665<\/sup>). I will use just two such DataCite field prefixes<sup>\u2020<\/sup> here as exemplars (there are many more).<\/p>\n<ol>\n<li><strong>media:<\/strong> This specifies the media type for the data being searched. For restriction to chemistry one takes advantage of the <strong>chemical\/x-<\/strong> media type, as described previously.<span id=\"cite_ITEM-18344-1\" name=\"citation\"><a href=\"#ITEM-18344-1\">[2]<\/a><\/span><\/li>\n<li><strong>SubjectScheme:<\/strong> This is a new declaration, as specified in the DataCite V4 metadata schema.<span id=\"cite_ITEM-18344-2\" name=\"citation\"><a href=\"#ITEM-18344-2\">[3]<\/a><\/span> The subject scheme in effect declares a subject-specific term, and is designed to be used by domains such as chemistry.<\/li>\n<\/ol>\n<p>This latter is best illustrated by one specific example of a search which I will dissect here:<br \/>\n <a href=\"https:\/\/search.datacite.org\/works?query=media:chemical\\\/x\\-gaussian*+subjectScheme:inchikey+subject:XZYDALXOGPZGNV-UHFFFAOYSA-M+media:chemical\\\/x\\-mnpub*\"> <span style=\"background-color: cornflowerblue;\">https:\/\/search.datacite.org\/works?query=<\/span><span style=\"background-color: lightpink;\">media:chemical\\\/x\\-gaussian*<\/span><b style=\"background-color: plum;\">+S<\/b><span style=\"background-color: darkturquoise;\">ubjectScheme:inchikey<\/span><b style=\"background-color: plum;\">+<\/b><span style=\"background-color: deepskyblue;\">subject:XZYDALXOGPZGNV-UHFFFAOYSA-M<\/span><b style=\"background-color: plum;\">+<\/b><span style=\"background-color: sandybrown;\">media:chemical\\\/x\\-mnpub*<\/span><\/a><sup>\u2021<\/sup><\/p>\n<ol>\n<li><span style=\"background-color: cornflowerblue;\">https:\/\/search.datacite.org\/works?query=<\/span> <a href=\"https:\/\/search.datacite.org\/help.html\">queries the DataCite MDS<\/a><sup>\u2020<\/sup> (metadata store).<\/li>\n<li><span style=\"background-color: lightpink;\">media:chemical\\\/x\\-gaussian*<\/span> defines a media type which contains the string <span style=\"background-color: lightpink;\">chemical\/x-gaussian<\/span>, with the <span style=\"background-color: lightpink;\">*<\/span> being a wild-card which allows any characters to follow this string. This now is specifying any data repository where <strong>Gaussian<\/strong> files have been deposited and assigned this media type.<\/li>\n<li><b style=\"background-color: plum;\">+<\/b> represents a Boolean <b style=\"background-color: plum;\">AND<\/b> operator.<\/li>\n<li><span style=\"background-color: darkturquoise;\">SubjectScheme:inchikey<\/span> restricts a subject search to a <span style=\"background-color: darkturquoise;\">subjectScheme<\/span> having the value <span style=\"background-color: darkturquoise;\">inchikey<\/span>, whilst<\/li>\n<li><span style=\"background-color: deepskyblue;\">subject:XZYDALXOGPZGNV-UHFFFAOYSA-M<\/span> defines the value of the subject itself.<\/li>\n<li><span style=\"background-color: sandybrown;\">media:chemical\/x-mnpub<\/span> completes the search definition, this relating to the mandatory additional presence of an <strong>Mpublish<\/strong><span id=\"cite_ITEM-18344-3\" name=\"citation\"><a href=\"#ITEM-18344-3\">[4]<\/a><\/span> file indicating (spectroscopic, probably NMR) data readable by the MestreNova program.<\/li>\n<\/ol>\n<p>One hit with these restrictions has doi: <a href=\"http:\/\/doi.org\/10.14469\/HPC\/2635\">10.14469\/HPC\/2635<\/a> and clicking the button on the landing page for this object labelled <strong>metadata<\/strong> resolves to <em>e.g.<\/em><br \/>\n <a href=\"https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/1976\"><small>https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/2635<\/small><\/a>,<br \/>\n and downloads the metadata record for this object. Part of this record looks a bit like:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-18503\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg\" alt=\"\" width=\"450\" height=\"86\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg 1024w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-300x58.jpg 300w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-768x147.jpg 768w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171.jpg 1252w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/p>\n<p>This brings me to the important issue for the chemistry domain, which is to agree upon a core set of <strong>SubjectSchemes<\/strong> for implementation in\u00a0data repositories with domain-specific chemical content. The two subjects above, the InChI and the InChIKey seem obvious candidates for inclusion. But how the list is extended and how the SubjectScheme\u00a0is specified are now matters for the community to discuss. Perhaps the <a href=\"http:\/\/goldbook.iupac.org\/\">IUPAC GoldBook<\/a> is one starting point for the SubjectScheme URIs. Watch this space.<\/p>\n<hr \/>\n<p><sup>\u2021<\/sup>The \\ syntax indicates an <a href=\"http:\/\/lucene.apache.org\/core\/4_0_0\/queryparser\/org\/apache\/lucene\/queryparser\/classic\/package-summary.html#Escaping_Special_Characters\">&#8220;escaped&#8221; character<\/a>. Thus in chemicalx\\-gaussian a \\ ensured that the following \/ is treated as part of the search string, and not as part of the search syntax. Likewise <b>\\-<\/b> ensures the minus character is part of the string and not a syntactic negation. The current list of characters requiring escaping is <tt style=\"background-color: lightyellow;\">+ - &amp; | ! ( ) { } [ ] ^ \" ~ * ? : \\ \/<\/tt><\/p>\n<p><sup>\u2020<\/sup> The documentation lists common fields, but there are far more specified in V4 of their schema. The ones you see used here are not (yet?) documented at <a href=\"https:\/\/search.datacite.org\/help.html\">https:\/\/search.datacite.org\/help.html<\/a><\/p>\n<p><sup>\u2665<\/sup> This <a href=\"http:\/\/www.googleguide.com\/advanced_operators_reference.html\">Google page<\/a> has a rich plethora of powerful searches, which I suggest almost no-one knows about!<\/p>\n<hr \/>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-18344-0\">H.S. Rzepa, A. Mclean, and M.J. Harvey, \"InChI As a Research Data Management Tool\", <i>Chemistry International<\/i>, vol. 38, pp. 24-26, 2016. <a href=\"https:\/\/doi.org\/10.1515\/ci-2016-3-408\">https:\/\/doi.org\/10.1515\/ci-2016-3-408<\/a>\n\n<\/li>\n<li id=\"ITEM-18344-1\">H.S. Rzepa, P. Murray-Rust, and B.J. Whitaker, \"The Application of Chemical Multipurpose Internet Mail Extensions (Chemical MIME) Internet Standards to Electronic Mail and World Wide Web Information Exchange\", <i>Journal of Chemical Information and Computer Sciences<\/i>, vol. 38, pp. 976-982, 1998. <a href=\"https:\/\/doi.org\/10.1021\/ci9803233\">https:\/\/doi.org\/10.1021\/ci9803233<\/a>\n\n<\/li>\n<li id=\"ITEM-18344-2\">DataCite Metadata Working Group., \"DataCite Metadata Schema Documentation for the Publication and Citation of Research Data v4.0\", <i>DataCite e.V.<\/i>, 2016. <a href=\"https:\/\/doi.org\/10.5438\/0012\">https:\/\/doi.org\/10.5438\/0012<\/a>\n\n<\/li>\n<li id=\"ITEM-18344-3\">M.J. Harvey, A. McLean, and H.S. Rzepa, \"A metadata-driven approach to data repository design\", <i>Journal of Cheminformatics<\/i>, vol. 9, 2017. <a href=\"https:\/\/doi.org\/10.1186\/s13321-017-0190-6\">https:\/\/doi.org\/10.1186\/s13321-017-0190-6<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 18344 -->","protected":false},"excerpt":{"rendered":"<p>As data repositories start to flourish, it is reasonable to ask questions such as what sort of chemistry can be found there and how can I find it? Here I give an updated worked example of a digital repository search for chemical content and also pose an important issue for the chemistry domain. Firstly, I [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2],"tags":[499,2198,2202,2205,2204,2200,2201,1882,2203,2197,2195,1001,342,2199,2206,2196],"ppma_author":[2661],"class_list":["post-18344","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-chemical-content","tag-chemicalx-media-type","tag-chemicalx-gaussian","tag-company-datacite","tag-company-google","tag-digital-repository-search","tag-domain-specific-chemical-content","tag-media-type","tag-mediachemicalx-mnpub","tag-mediachemicalx-gaussian","tag-question","tag-search-definition","tag-search-engines","tag-search-string","tag-search-syntax","tag-subject-search"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>How to search data repositories for FAIR chemical content and data: SubjectScheme - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to search data repositories for FAIR chemical content and data: SubjectScheme - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"As data repositories start to flourish, it is reasonable to ask questions such as what sort of chemistry can be found there and how can I find it? Here I give an updated worked example of a digital repository search for chemical content and also pose an important issue for the chemistry domain. Firstly, I [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2017-06-08T16:32:21+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-06-08T19:56:12+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How to search data repositories for FAIR chemical content and data: SubjectScheme - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","og_locale":"en_GB","og_type":"article","og_title":"How to search data repositories for FAIR chemical content and data: SubjectScheme - Henry Rzepa&#039;s Blog","og_description":"As data repositories start to flourish, it is reasonable to ask questions such as what sort of chemistry can be found there and how can I find it? Here I give an updated worked example of a digital repository search for chemical content and also pose an important issue for the chemistry domain. Firstly, I [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2017-06-08T16:32:21+00:00","article_modified_time":"2017-06-08T19:56:12+00:00","og_image":[{"url":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg","type":"","width":"","height":""}],"author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"How to search data repositories for FAIR chemical content and data: SubjectScheme","datePublished":"2017-06-08T16:32:21+00:00","dateModified":"2017-06-08T19:56:12+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344"},"wordCount":607,"commentCount":1,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg","keywords":["chemical content","chemical\/x- media type","chemical\/x-gaussian","Company: DataCite","Company: Google","digital repository search","domain-specific chemical content","media type","media:chemical\/x-mnpub","media:chemical\\\/x\\-gaussian*","Question","search definition","search engines","search string","search syntax","subject search"],"articleSection":["Chemical IT"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","name":"How to search data repositories for FAIR chemical content and data: SubjectScheme - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg","datePublished":"2017-06-08T16:32:21+00:00","dateModified":"2017-06-08T19:56:12+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#primaryimage","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171.jpg","contentUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171.jpg","width":1252,"height":240},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"How to search data repositories for FAIR chemical content and data: SubjectScheme"}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-4LS","jetpack-related-posts":[{"id":15907,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15907","url_meta":{"origin":18344,"position":0},"title":"Global initiatives in research data management and discovery: searching metadata.","author":"Henry Rzepa","date":"March 7, 2016","format":false,"excerpt":"The upcoming ACS national meeting in San Diego has a CINF\u00a0(chemical information division) session entitled \"Global initiatives in research data management and discovery\". I have highlighted here just one slide from my contribution to this session, which addresses the discovery aspect of the session. Data, if you think about it,\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":28045,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=28045","url_meta":{"origin":18344,"position":1},"title":"Data Discovery: A pick-n-mix library of useful FAIR Data searches &#8211; and a call for new search suggestions.","author":"Henry Rzepa","date":"November 25, 2024","format":false,"excerpt":"With AI and Machine learning needing data in abundance, interest in data discovery is intense. However, this type of discovery is somewhat different from more traditional data base searches, in that it is particularly suited for machine discovery as well as by humans. The discovery searches are conducted using an\u2026","rel":"","context":"In &quot;Interesting chemistry&quot;","block_context":{"text":"Interesting chemistry","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=4"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16251,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16251","url_meta":{"origin":18344,"position":2},"title":"Metametadata: data about data about (chemical) data.","author":"Henry Rzepa","date":"April 16, 2016","format":false,"excerpt":"Scientists are familiar with the term data, at least in a scientific or chemical context, but appreciating metadata (meaning \"after\", or \"beyond\") is slightly more subtle, in the sense of using it to mean data about data. The challenge lies in clarifying where the boundary between data and its metadata\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":24314,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24314","url_meta":{"origin":18344,"position":3},"title":"A comparison of searches based on metadata records from three (update: five) research repositories.","author":"Henry Rzepa","date":"September 28, 2021","format":false,"excerpt":"In the previous blog post, I looked at the metadata records registered with DataCite for some chemical computational modelling files as published in three different repositories. Here I take it one stage further, by looking at how searches of the DataCite metadata store for three particular values of the metadata\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22043,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=22043","url_meta":{"origin":18344,"position":4},"title":"New generations of globally aggregating search engines &#8211; for (chemical) data.","author":"Henry Rzepa","date":"April 7, 2020","format":false,"excerpt":"Chemists have long been familiar with search engines that aspire to index a large proportion of the chemical literature. Think for example the old-generation (and commercial)\u00a0SciFinder (Scholar)\u00a0and Reaxys\u00a0or those that arrived in the 1990s in the online era\u2021 such as the non-commercial Pubchem or ChemSpider (there are more). But you\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2020\/04\/google-1024x1004.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":24723,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24723","url_meta":{"origin":18344,"position":5},"title":"Raw data: the evolution of FAIR data and crystallography.","author":"Henry Rzepa","date":"March 1, 2022","format":false,"excerpt":"Scientific data in chemistry has come a long way in the last few decades. Originally entangled into scientific articles in the form of tables of numbers or diagrams, it was (partially) disentangled into supporting information when journals became electronic in the late 1990s. The next phase was the introduction of\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18344","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=18344"}],"version-history":[{"count":36,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18344\/revisions"}],"predecessor-version":[{"id":18511,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18344\/revisions\/18511"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=18344"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=18344"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=18344"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=18344"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}