{"id":18385,"date":"2017-05-29T17:36:22","date_gmt":"2017-05-29T16:36:22","guid":{"rendered":"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=18385"},"modified":"2017-05-30T07:36:03","modified_gmt":"2017-05-30T06:36:03","slug":"challenges-in-reliably-representing-the-chemistry-of-crystal-structures","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385","title":{"rendered":"Challenges in reliably representing the chemistry of crystal structures."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"18385\">\n<p>The title here is taken from a presentation made by Ian Bruno from CCDC at the <a href=\"http:\/\/www.beilstein-institut.de\/en\/symposia\/open-science\">recent conference on Open Science<\/a>. It also addresses the <a href=\"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=18257\">theme here<\/a>\u00a0of the issues that might arise in assigning identifiers for any given molecule.<\/p>\n<p>The structure was represented as shown<span id=\"cite_ITEM-18385-0\" name=\"citation\"><a href=\"#ITEM-18385-0\">[1]<\/a><\/span> by the original authors, in which the bonding from S to Sn is indicated with both solid lines (a bond) and dotted lines\u00a0(an &#8220;interaction&#8221;).<img decoding=\"async\" class=\"aligncenter size-full wp-image-18387\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg\" alt=\"\" width=\"450\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg 654w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149-279x300.jpg 279w\" sizes=\"(max-width: 654px) 100vw, 654px\" \/><\/p>\n<p>Why would this matter? Well, to\u00a0enable any entry in the Cambridge structure database as\u00a0findable (the F of FAIR) it has to be given a unique identifier. There are in general three such identifiers assigned by the CCDC:<\/p>\n<ol>\n<li>The <strong>Refcode<\/strong>, in this case XONHIS. These six or seven letter codes are historically the oldest, and started off at least with an attempt if possible to assign some semantic inference from the name, even if only occasionally.\u00a0<\/li>\n<li>The <strong>CCDC deposition number<\/strong>, in this case 650011. This is the number that an author will receive immediately upon deposition, and you often find these identifiers quoted in supporting information files<\/li>\n<li>The <strong>DOI<\/strong> (digital object identifier), in this case\u00a0<a href=\"https:\/\/dx.doi.org\/10.5517\/ccptd3z\">10.5517\/ccptd3z<\/a>, which can be used to view the structure even if access to the full CSD is not available to the user. In that sense, the DOI is the FAIRest of the first three of these identifiers.<\/li>\n<li>However, CCDC reported that they are considering adding a 4th very common identifier, based on the <strong>InChI<\/strong> (International chemical identifier), which comes as a full string and with the structure of the molecule at least in part inferrable from it, together with \u00a0a shortened (almost) unique string which has the advantage of being &#8220;Googlable&#8221;. Both are\u00a0helpfully FAIR.<\/li>\n<\/ol>\n<p>It is this 4th identifier that is at issue here. InChIs are derived from atom connection tables; you need to define all bonds present in the molecule. And it is here that the dotted &#8220;bond&#8221;\/&#8221;interaction&#8221; above becomes a problem. This is the representation shown in the CSD database, which reveals that all the Sn&#8230;S interactions are classified as &#8220;bonds&#8221;, along with some creative(!) representations of the C&#8230;S bonds.<img decoding=\"async\" class=\"aligncenter size-full wp-image-18388\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/148.jpg\" alt=\"\" width=\"450\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/148.jpg 624w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/148-300x200.jpg 300w\" sizes=\"(max-width: 624px) 100vw, 624px\" \/><\/p>\n<p>So the InChI will very much depend on whether all the Sn&#8230;S contacts are termed as bonds or as interactions. To help clarify that, it is useful to show the typical range of lengths of such contacts. Below is a simple search for all Sn and S systems where the pair are either close in space (&lt; 3.5\u00c5) or have a bond specified between the two atoms.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-18390\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/Sn-S-non-1024x756.jpg\" alt=\"\" width=\"450\" height=\"332\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/Sn-S-non-1024x756.jpg 1024w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/Sn-S-non-300x221.jpg 300w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/Sn-S-non-768x567.jpg 768w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/Sn-S-non.jpg 1926w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/p>\n<p>The main cluster occurs at ~2.5\u00c5, but there is some evidence of a second peak at about 3.0\u00c5. The third distribution up to 3.5\u00c5 is probably a continuum of very weak dispersion interaction, which most molecules exhibit. The values for XONHIS are 2.521 and 2.996\u00c5, which match the two clusters above.<\/p>\n<p>So perhaps a quantum calculation can shed some light (DOI:\u00a0<a href=\"https:\/\/doi.org\/10.14469\/hpc\/2593\">10.14469\/hpc\/2593<\/a>)? The values on the right are the optimised bond lengths which are pretty similar to the crystal structure. On the left are the calculated Wiberg bond orders (B3LYP+D3BJ\/Def2-TZVPP\/chloroform calculation). These reveal both &#8220;bonds&#8221; have an order less than 1. The value of ~0.6 is probably not contentious, but it does graphically show that when a compound is indexed as having a &#8220;single bond&#8221; between two atoms, the quantitative bond order may be substantially less. What however would one make of a bond order of 0.214? Should it be classified as a bond, albeit a much weaker one than normal? Or should it instead simply be a rather strong &#8220;interaction&#8221; which is not classified as a bond? And perhaps one should have in mind the question &#8220;how sensitive is this result to the quantum mechanical procedure used?&#8221;<img decoding=\"async\" class=\"aligncenter size-full wp-image-18393\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/146.jpg\" alt=\"\" width=\"450\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/146.jpg 902w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/146-273x300.jpg 273w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/146-768x844.jpg 768w\" sizes=\"(max-width: 902px) 100vw, 902px\" \/><\/p>\n<p>Why does this distinction matter? Well, the InChI algorithm is based on simple connectivity; are two atoms connected by a bond or not? There are no nuances here. At the moment, this decision can be made by an algorithm based on the distance between any atom pair (whether computed or measured), but more often I suspect it derives from a &#8220;molfile&#8221; which is often derived from a human-drawn representation using a structure drawing program. It does rather boil down to the individual preferences of the human drawing the molecule. Due in part to such uncertainties, it was\u00a0estimated that only 22% of structures in the CSD can be used to generate a reliable InChI. Hydrogen bonds are almost always classified as non-bonds, which means their presence is rarely systematically flagged during the indexing of the structures. Organometallics often pose some of the greatest representational\u00a0problems\u00a0(there are many others).<\/p>\n<p>I will end by observing another class of structure that I deal with, &#8220;reaction transition states&#8221;. As you might imagine these forms are full of pairs of atoms with ambiguous bond lengths and hence connectivity. We currently have no truly reliable\u00a0method for assigning useful identifiers to them. So lots of challenges for the future then!<\/p>\n<p>&nbsp;<\/p>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-18385-0\">R. Reyes-Mart\u00ednez, R. Mejia-Huicochea, J.A. Guerrero-Alvarez, H. H\u00f6pfl, and H. Tlahuext, \"Synthesis, heteronuclear NMR and X-ray crystallographic studies of two dinuclear diorganotin(IV) dithiocarbamate macrocycles\", <i>Arkivoc<\/i>, vol. 2008, pp. 19-30, 2007. <a href=\"https:\/\/doi.org\/10.3998\/ark.5550190.0009.503\">https:\/\/doi.org\/10.3998\/ark.5550190.0009.503<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 18385 -->","protected":false},"excerpt":{"rendered":"<p>The title here is taken from a presentation made by Ian Bruno from CCDC at the recent conference on Open Science. It also addresses the theme here\u00a0of the issues that might arise in assigning identifiers for any given molecule. The structure was represented as shown by the original authors, in which the bonding from S [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2],"tags":[281,2181,2183,806,2167,2180,2182],"ppma_author":[2661],"class_list":["post-18385","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-author","tag-bruno","tag-chemical-identifier","tag-digital-object-identifier","tag-ian-bruno","tag-identifier","tag-inchi-algorithm"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Challenges in reliably representing the chemistry of crystal structures. - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Challenges in reliably representing the chemistry of crystal structures. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"The title here is taken from a presentation made by Ian Bruno from CCDC at the recent conference on Open Science. It also addresses the theme here\u00a0of the issues that might arise in assigning identifiers for any given molecule. The structure was represented as shown by the original authors, in which the bonding from S [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2017-05-29T16:36:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-05-30T06:36:03+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Challenges in reliably representing the chemistry of crystal structures. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385","og_locale":"en_GB","og_type":"article","og_title":"Challenges in reliably representing the chemistry of crystal structures. - Henry Rzepa&#039;s Blog","og_description":"The title here is taken from a presentation made by Ian Bruno from CCDC at the recent conference on Open Science. It also addresses the theme here\u00a0of the issues that might arise in assigning identifiers for any given molecule. The structure was represented as shown by the original authors, in which the bonding from S [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2017-05-29T16:36:22+00:00","article_modified_time":"2017-05-30T06:36:03+00:00","og_image":[{"url":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg","type":"","width":"","height":""}],"author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"Challenges in reliably representing the chemistry of crystal structures.","datePublished":"2017-05-29T16:36:22+00:00","dateModified":"2017-05-30T06:36:03+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385"},"wordCount":833,"commentCount":2,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg","keywords":["author","Bruno","chemical identifier","Digital Object Identifier","Ian Bruno","Identifier","InChI algorithm"],"articleSection":["Chemical IT"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385","name":"Challenges in reliably representing the chemistry of crystal structures. - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg","datePublished":"2017-05-29T16:36:22+00:00","dateModified":"2017-05-30T06:36:03+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#primaryimage","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg","contentUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/05\/149.jpg","width":654,"height":703},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18385#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"Challenges in reliably representing the chemistry of crystal structures."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-4Mx","jetpack-related-posts":[{"id":12182,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=12182","url_meta":{"origin":18385,"position":0},"title":"The Amsterdam Manifesto and crystal structures.","author":"Henry Rzepa","date":"March 18, 2014","format":false,"excerpt":"I have mentioned the Amsterdam manifesto before on these pages. It is worth repeating the eight simple principles: Data should be considered citable products of research. Such data should be held in persistent public repositories. If a publication is based on data not included with the article, those data should\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"doi-x2","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2014\/03\/doi-x2.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2014\/03\/doi-x2.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2014\/03\/doi-x2.jpg?resize=525%2C300 1.5x"},"classes":[]},{"id":20669,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=20669","url_meta":{"origin":18385,"position":1},"title":"A search of some major chemistry publishers for FAIR data records.","author":"Henry Rzepa","date":"April 12, 2019","format":false,"excerpt":"In recent years, findable data has become ever more important (the F in FAIR). Here I test that F using the DataCite search service. Firstly an introduction to this service. This is a metadata database about datasets and other research objects. One of the properties is\u00a0relatedIdentifier which records other identifiers\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":12932,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=12932","url_meta":{"origin":18385,"position":2},"title":"One molecule, one identifier: Viewing molecular files from a digital repository using metadata standards.","author":"Henry Rzepa","date":"September 8, 2014","format":false,"excerpt":"In the beginning (taken here as\u00a0prior to ~1980) libraries held\u00a0five-year printed consolidated indices of molecules, organised by formula or name (Chemical abstracts). This could occupy about 2m of shelf space for each five years. And an equivalent set of printed volumes from the Beilstein collection. Those of us who needed\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":24723,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24723","url_meta":{"origin":18385,"position":3},"title":"Raw data: the evolution of FAIR data and crystallography.","author":"Henry Rzepa","date":"March 1, 2022","format":false,"excerpt":"Scientific data in chemistry has come a long way in the last few decades. Originally entangled into scientific articles in the form of tables of numbers or diagrams, it was (partially) disentangled into supporting information when journals became electronic in the late 1990s. The next phase was the introduction of\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":17237,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17237","url_meta":{"origin":18385,"position":4},"title":"Molecules of the year? Pnictogen chains and 16 coordinate Cs.","author":"Henry Rzepa","date":"December 19, 2016","format":false,"excerpt":"I am completing my survey of the vote for molecule of the year candidates, which this year seems focused on chemical records of one type or another. The first article reports striving towards creating a molecule covering a complete column of the period table. In this case, group 7, containing\u2026","rel":"","context":"In &quot;crystal_structure_mining&quot;","block_context":{"text":"crystal_structure_mining","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=1745"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16251,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16251","url_meta":{"origin":18385,"position":5},"title":"Metametadata: data about data about (chemical) data.","author":"Henry Rzepa","date":"April 16, 2016","format":false,"excerpt":"Scientists are familiar with the term data, at least in a scientific or chemical context, but appreciating metadata (meaning \"after\", or \"beyond\") is slightly more subtle, in the sense of using it to mean data about data. The challenge lies in clarifying where the boundary between data and its metadata\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18385","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=18385"}],"version-history":[{"count":8,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18385\/revisions"}],"predecessor-version":[{"id":18398,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/18385\/revisions\/18398"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=18385"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=18385"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=18385"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=18385"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}