{"id":24561,"date":"2022-01-26T10:41:34","date_gmt":"2022-01-26T10:41:34","guid":{"rendered":"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=24561"},"modified":"2022-02-03T07:25:23","modified_gmt":"2022-02-03T07:25:23","slug":"data-base-or-data-repository-a-brief-and-very-selective-history-of-data-management-in-chemistry","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561","title":{"rendered":"Data base or Data repository? &#8211; A brief and very selective history of data management in chemistry."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"24561\">\n<p>Way back in the late 1980s or so, research groups in chemistry started to replace the filing of their paper-based research data by storing it in an easily retrievable digital form. This required a computer database and initially these were accessible only on specific dedicated computers in the laboratory. These gradually changed from the 1990s onwards into being accessible online, so that more than one person could use them in different locations. At least where I worked, the infrastructures<sup>\u2021<\/sup> to set up such databases were mostly not then available as part of the standard research provisions and so had to be installed and maintained by the group itself. The database software took many different forms and it was not uncommon for each group in a department to come up with a different solution that suited its needs best. The result was a proliferation of largely non-interoperable solutions which did not communicate with each other. Each database had to be searched locally and there could be ten or more such resources in a department. The knowledge of how the system operated also often resided in just one person, which tended to evaporate when this guru left the group.<\/p>\n<p>After the millennium, two newcomers started to appear, one being called an ELN (electronic laboratory notebook) and the second a data repository. The first was a heavily customised database containing research data as obtained from instruments, computers, images\/video, chemical structure drawings etc. ELNs, even to this day, have limitations of interoperability with other ELNs and the contents of an ELN are often closed, requiring authentication credentials to access. The data repository also started to appear in chemistry around this period. Even in its early incarnations, it could be associated with an ELN &#8220;front end&#8221; as part of the data pipeline; an early example of this coupling is described here.<span id=\"cite_ITEM-24561-0\" name=\"citation\"><a href=\"#ITEM-24561-0\">[1]<\/a><\/span> Another key phrase that became associated with repositories starting around 2014 was the concept of FAIR, including ideas such as the <strong>Findability<\/strong> (discoverability) and <strong>Interoperablity<\/strong> of data,<sup>\u2020<\/sup> a theme <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?s=FAIR\">often explored and illustrated<\/a> on this blog.<\/p>\n<p><a href=\"https:\/\/doi.org\/10.1021\/ci500302p\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-24586\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg\" alt=\"\" width=\"450\" height=\"229\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg 1024w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-300x153.jpg 300w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-768x390.jpg 768w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015.jpg 1200w\" sizes=\"auto, (max-width: 450px) 100vw, 450px\" \/><\/a><\/p>\n<p>These last seventeen years has seen organisations such as funding agencies and publishers increasingly mandating the use of such data management methods, using either a repository on its own or a combination of an ELN and repository as routine operations in research activity and publication processes. The close coupling of an ELN and repository is still however uncommon.\u00a0<\/p>\n<p>A colleague recently alerted me to a computational chemistry repository first launched in 2014;\u00a0<a href=\"https:\/\/www.iochem-bd.org\/home.jsp?action=about\" target=\"_blank\" rel=\"noopener\">www.iochem-bd.org<\/a>\u00a0 Reading the <strong>about<\/strong> text, I found these statements;<\/p>\n<ul>\n<li><em>Chem-BD is a digital repository aimed to manage and store Computational Chemistry files.<\/em><\/li>\n<li><em>Goals: Build a distributed database of computational chemistry results: reduce size and increase value. <\/em><\/li>\n<li><em>Set a common data standard among all quantum chemistry legacy formats (XML &#8211; CML<span id=\"cite_ITEM-24561-1\" name=\"citation\"><a href=\"#ITEM-24561-1\">[2]<\/a><\/span>) <\/em><\/li>\n<\/ul>\n<p>So this is <strong>both<\/strong> a database <strong>and<\/strong> a data repository, as well as espousing a commendable common data standard!<em><span id=\"cite_ITEM-24561-1\" name=\"citation\"><a href=\"#ITEM-24561-1\">[2]<\/a><\/span><\/em> I decided to explore the first two aspects here using this resource as an example.<\/p>\n<ul>\n<li>Whilst the absolute distinction between the two types can be blurry, the crucial difference between the two is that a database functions on curation via a structured index of the <strong>data<\/strong>, whilst a repository aspires to having FAIR attributes primarily through its\u00a0<strong>metadata <\/strong>as exposed by registration\u00a0(metadata is data describing the data).<\/li>\n<li>A database holds this data index locally and the Findability of the data is associated purely with the functionality of \u00a0the database. The data structures are defined by a database schema, describing in detail all the terms indexed (a key and its value) and searched using the values of these key pairs. This schema is unlikely to be exactly the same as <em>e.g.<\/em> databases on related topics, largely because the database is self-contained and self-consistent.<\/li>\n<li>A data repository also uses a schema (DOI: <a href=\"https:\/\/doi.org\/10.14454\/3w3z-sa82\" target=\"_blank\" rel=\"noopener\">10.14454\/3w3z-sa82<\/a> and<span id=\"cite_ITEM-24561-2\" name=\"citation\"><a href=\"#ITEM-24561-2\">[3]<\/a><\/span>)<sup>\u2660<\/sup> to express the key pairs, but this time it is expressed as metadata. Now, this metadata is registered externally to the repository using a registration agency.<span id=\"cite_ITEM-24561-2\" name=\"citation\"><a href=\"#ITEM-24561-2\">[3]<\/a><\/span> The metadata for each deposited object is assigned a persistent identifier known as a DOI. Although it might be indexed and searchable locally, it must be capable of also being searched in aggregated\/federated form using services provided by registration or other agencies. This independence of metadata is part of those FAIR criteria.<\/li>\n<li>Whereas a database can be very finely grained in order to describe individual properties of an object, repository metadata tends to be more coarsely grained to describe the object as a whole, to place it in context and to impart provenance.<\/li>\n<li>Both databases and repositories can have what is called an API (application programmer interface) to allow machine access (the <strong>A<\/strong> of FAIR) to the contents. Accessing the former would normally require bespoke code to be written and possibly authentication credentials, whereas information to access to repository held data is provided <em>via<\/em> the registered metadata (which does not normally require credentials). Access to the repository may also require code, but if the metadata is carefully standardised by adherence to the schema, the code can be made more general than that required for a database.<sup>\u2665<\/sup><\/li>\n<li>A typical entry in the <a href=\"https:\/\/www.iochem-bd.org\/home.jsp?action=about\" target=\"_blank\" rel=\"noopener\">www.iochem-bd.org<\/a> repository has a DOI of\u00a0<span style=\"font-size: 10pt;\"><tt><a href=\"https:\/\/doi.org\/10.19061\/iochem-bd-4-36\">10.19061\/iochem-bd-4-36<\/a><\/tt><\/span><\/li>\n<li>This DOI is registered with the CrossRef agency, one normally used for registering journal articles, rather than DataCite which is used for registering data and other research objects. The metadata for this DOI can be viewed using the resolution service <span style=\"font-size: 10pt;\"><a href=\"https:\/\/api.crossref.org\/works\/10.19061\/iochem-bd-4-36\/transform\/application\/vnd.crossref.unixsd+xml\" target=\"_blank\" rel=\"noopener\">https:\/\/api.crossref.org\/works\/10.19061\/iochem-bd-4-36\/transform\/application\/vnd.crossref.unixsd+xml<\/a><\/span>\u00a0and shows that it largely contains the bibliographic information typical of a journal article. So in this sense it is certainly a repository, but using a metadata schema that is more frequently used for journal articles than for data sets.<\/li>\n<li>The CrossRef metadata record also has an item\u00a0<span style=\"font-size: 10pt;\"><tt><span class=\"tag\">&lt;resource&gt;<\/span><a href=\"https:\/\/www.iochem-bd.org\/handle\/10\/235025\"><span class=\"text\">https:\/\/www.iochem-bd.org\/handle\/10\/235025<\/span><\/a><span class=\"tag\">&lt;\/resource&gt;<\/span><\/tt><\/span> which points to the so-called landing page for that item, but information about the properties of the actual data itself must be instead obtained directly from the repository.\u00a0<\/li>\n<li>Because the metadata describing the data is only held at this repository and not elsewhere (a local metadata record), it can only be queried locally and the query cannot be upon aggregated metadata \u00a0provided by the registration agency. A machine query would have to be constructed by coding a suitable request using the API provided for the database aspect of this repository.\u00a0<\/li>\n<\/ul>\n<p>This example has served to highlight just a few of the often quite subtle distinctions between <em>eg<\/em> a database and a data repository and that some examples can indeed be both. \u00a0It also highlights that repositories can have the attributes of \u00a0FAIR, which in themselves are driven by asking &#8220;what could a machine do to obtain data?&#8221;<sup>\u2665<\/sup> rather than what could a human achieve by browsing. So another question that arises when evaluating the characteristics of a repository is whether each item held there has a FAIR-enabling metadata record describing the data, a record which is registered in a manner that can be aggregated and hence used to find and access content across\u00a0multiple independent repositories.<\/p>\n<hr \/>\n<p>This post has DOI <a href=\"https:\/\/doi.org\/10.14469\/hpc\/10043\">10.14469\/hpc\/10043<\/a><\/p>\n<hr \/>\n<p><sup>\u2021<\/sup>Indeed in that era, few online\/Internet infrastructures were available as part of departmental resources. <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=10688\">See also here<\/a>. \u00a0<sup>\u2020<\/sup>In this last regard, I note a workshop devoted largely to such interoperability and machine access in chemistry coming up soon; <a href=\"https:\/\/www.cecam.org\/workshop-details\/1165\">https:\/\/www.cecam.org\/workshop-details\/1165<\/a> <sup>\u2660<\/sup>The CrossRef schema is not referenced using an assigned DOI: <a href=\"https:\/\/data.crossref.org\/reports\/help\/schema_doc\/5.3.1\/\">data.crossref.org\/reports\/help\/schema_doc\/5.3.1\/<\/a>.<sup>\u2665<\/sup>An example can be seen at doi: <a href=\"https:\/\/doi.org\/10.14469\/hpc\/10059\">10.14469\/hpc\/10059<\/a> Here, invoking a hyperlink based purely on the data DOI and the data media type required in turn calls code (Javascript) which retrieves the metadata held for that DOI and parses it to identify whether it indicates the presence of a file manifest. If it does, it identifies the type of manifest (ORE in this case) and the media types the manifest points to and finally uses that manifest to then retrieve data filtered by media type and pipes it into a visualiser (JSmol). In this case the endpoint is visualisation, but it could also be <em>eg<\/em> piped into an AI\/ML program for analysis. In this case only one instance of data is machine retrieved, but in principle it could be a multitude of data files obtained from a multitude of different locations and based on a multitude of criteria as filtered by <a href=\"https:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=22059\">suitable searches of registered metadata<\/a>.<span id=\"cite_ITEM-24561-3\" name=\"citation\"><a href=\"#ITEM-24561-3\">[4]<\/a><\/span><\/p>\n<hr \/>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-24561-0\">M.J. Harvey, N.J. Mason, and H.S. Rzepa, \"Digital Data Repositories in Chemistry and Their Integration with Journals and Electronic Notebooks\", <i>Journal of Chemical Information and Modeling<\/i>, vol. 54, pp. 2627-2635, 2014. <a href=\"https:\/\/doi.org\/10.1021\/ci500302p\">https:\/\/doi.org\/10.1021\/ci500302p<\/a>\n\n<\/li>\n<li id=\"ITEM-24561-1\">P. Murray-Rust, and H.S. Rzepa, \"Chemical Markup, XML, and the Worldwide Web. 1. Basic Principles\", <i>Journal of Chemical Information and Computer Sciences<\/i>, vol. 39, pp. 928-942, 1999. <a href=\"https:\/\/doi.org\/10.1021\/ci990052b\">https:\/\/doi.org\/10.1021\/ci990052b<\/a>\n\n<\/li>\n<li id=\"ITEM-24561-2\">H. Cousijn, T. Habermann, E. Krznarich, and A. Meadows, \"Beyond data: Sharing related research outputs to make data reusable\", <i>Learned Publishing<\/i>, vol. 35, pp. 75-80, 2022. <a href=\"https:\/\/doi.org\/10.1002\/leap.1429\">https:\/\/doi.org\/10.1002\/leap.1429<\/a>\n\n<\/li>\n<li id=\"ITEM-24561-3\">H.S. Rzepa, and S. Kuhn, \"A data\u2010oriented approach to making new molecules as a student experiment: artificial intelligence\u2010enabling FAIR publication of NMR data for organic esters\", <i>Magnetic Resonance in Chemistry<\/i>, vol. 60, pp. 93-103, 2021. <a href=\"https:\/\/doi.org\/10.1002\/mrc.5186\">https:\/\/doi.org\/10.1002\/mrc.5186<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 24561 -->","protected":false},"excerpt":{"rendered":"<p>Way back in the late 1980s or so, research groups in chemistry started to replace the filing of their paper-based research data by storing it in an easily retrievable digital form. This required a computer database and initially these were accessible only on specific dedicated computers in the laboratory. These gradually changed from the 1990s [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[2],"tags":[],"ppma_author":[2661],"class_list":["post-24561","post","type-post","status-publish","format-standard","hentry","category-chemical-it"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Data base or Data repository? - A brief and very selective history of data management in chemistry. - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data base or Data repository? - A brief and very selective history of data management in chemistry. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"Way back in the late 1980s or so, research groups in chemistry started to replace the filing of their paper-based research data by storing it in an easily retrievable digital form. This required a computer database and initially these were accessible only on specific dedicated computers in the laboratory. These gradually changed from the 1990s [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2022-01-26T10:41:34+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2022-02-03T07:25:23+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data base or Data repository? - A brief and very selective history of data management in chemistry. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561","og_locale":"en_GB","og_type":"article","og_title":"Data base or Data repository? - A brief and very selective history of data management in chemistry. - Henry Rzepa&#039;s Blog","og_description":"Way back in the late 1980s or so, research groups in chemistry started to replace the filing of their paper-based research data by storing it in an easily retrievable digital form. This required a computer database and initially these were accessible only on specific dedicated computers in the laboratory. These gradually changed from the 1990s [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2022-01-26T10:41:34+00:00","article_modified_time":"2022-02-03T07:25:23+00:00","og_image":[{"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg","type":"","width":"","height":""}],"author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"Data base or Data repository? &#8211; A brief and very selective history of data management in chemistry.","datePublished":"2022-01-26T10:41:34+00:00","dateModified":"2022-02-03T07:25:23+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561"},"wordCount":1446,"commentCount":0,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#primaryimage"},"thumbnailUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg","articleSection":["Chemical IT"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561","name":"Data base or Data repository? - A brief and very selective history of data management in chemistry. - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#primaryimage"},"thumbnailUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015-1024x521.jpg","datePublished":"2022-01-26T10:41:34+00:00","dateModified":"2022-02-03T07:25:23+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#primaryimage","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015.jpg","contentUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2022\/01\/Screenshot-1015.jpg","width":1200,"height":610},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=24561#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"Data base or Data repository? &#8211; A brief and very selective history of data management in chemistry."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-6o9","jetpack-related-posts":[{"id":20342,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=20342","url_meta":{"origin":24561,"position":0},"title":"Open Access journal publishing debates &#8211; the elephant in the room?","author":"Henry Rzepa","date":"November 4, 2018","format":false,"excerpt":"For perhaps ten years now, the future of scientific publishing has been hotly debated. The traditional models are often thought to be badly broken, although convergence to a consensus of what a better model should be is not apparently close. But to my mind, much of this debate seems to\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16952,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16952","url_meta":{"origin":24561,"position":1},"title":"The 2016 Bradley-Mason prize for open chemistry.","author":"Henry Rzepa","date":"October 4, 2016","format":false,"excerpt":"Peter Murray-Rust and I are delighted to announce that the 2016 award of the Bradley-Mason\u00a0prize for open chemistry\u00a0goes to\u00a0Jan Szopinski (UG) and\u00a0Clyde Fare (PG). Jan's open chemistry derives from a final year project looking at why atom charges derived from quantum chemical calculation of the electronic density represent chemical information\u2026","rel":"","context":"In &quot;Bradley-Mason Prize for Open Chemistry&quot;","block_context":{"text":"Bradley-Mason Prize for Open Chemistry","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2131"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":19603,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=19603","url_meta":{"origin":24561,"position":2},"title":"Examples please of  FAIR (data); good and bad.","author":"Henry Rzepa","date":"May 6, 2018","format":false,"excerpt":"The site fairsharing.org is a repository of information about FAIR (Findable, Accessible, Interoperable and Reusable) objects such as research data. A project to inject chemical components, rather sparse at the moment at the above site, is being promoted by workshops under the auspices of e.g. IUPAC and CODATA\u00a0and the GO-FAIR\u2026","rel":"","context":"In &quot;Interesting chemistry&quot;","block_context":{"text":"Interesting chemistry","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=4"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2018\/04\/240-1024x478.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":18257,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18257","url_meta":{"origin":24561,"position":3},"title":"The challenges in curating research data: one case study.","author":"Henry Rzepa","date":"April 28, 2017","format":false,"excerpt":"Research data (and its management) is rapidly emerging as a focal point for the development of research dissemination practices. An important aspect of ensuring that such data remains fit for purpose is identifying what curation activities need to be associated with it. Here I revisit one particular case study associated\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/04\/077-1.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":10998,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=10998","url_meta":{"origin":24561,"position":4},"title":"A two-publisher model for the scientific article: narrative+shared data.","author":"Henry Rzepa","date":"September 15, 2013","format":false,"excerpt":"I do go on rather a lot about enabling or\u00a0hyper-activating data. So do others. Why is sharing data important? Reproducibility is a cornerstone in science, To achieve this, it is important that scientific research be\u00a0open and transparent. Openly available research data is central to achieving this. It is estimated that\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16251,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16251","url_meta":{"origin":24561,"position":5},"title":"Metametadata: data about data about (chemical) data.","author":"Henry Rzepa","date":"April 16, 2016","format":false,"excerpt":"Scientists are familiar with the term data, at least in a scientific or chemical context, but appreciating metadata (meaning \"after\", or \"beyond\") is slightly more subtle, in the sense of using it to mean data about data. The challenge lies in clarifying where the boundary between data and its metadata\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","author_category":"1","first_name":"Henry","last_name":"Rzepa","user_url":"https:\/\/orcid.org\/0000-0002-8635-8390","job_title":"","description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London."}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/24561","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=24561"}],"version-history":[{"count":49,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/24561\/revisions"}],"predecessor-version":[{"id":24611,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/24561\/revisions\/24611"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=24561"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=24561"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=24561"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=24561"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}