{"id":17885,"date":"2017-03-30T10:13:18","date_gmt":"2017-03-30T09:13:18","guid":{"rendered":"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=17885"},"modified":"2017-03-30T11:27:46","modified_gmt":"2017-03-30T10:27:46","slug":"the-provenance-of-scientific-data-establishing-an-audit-trail","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885","title":{"rendered":"The provenance of scientific data &#8211; establishing an audit trail."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"17885\">\n<p>In an era when <a href=\"https:\/\/en.wikipedia.org\/wiki\/Alternative_facts\">alternative facts<\/a> and<a href=\"https:\/\/en.wikipedia.org\/wiki\/Fake_news\"> fake news<\/a> afflict us, the <a href=\"https:\/\/eos.org\/opinions\/the-importance-of-data-set-provenance-for-science\">provenance of scientific data<\/a> becomes ever more important. Especially if that data is available as open access and exploitable by others for both valid scientific reasons but\u00a0potentially also by those with other\u00a0motives. Here I consider the audit trail that might serve to establish data provenance in one typical situation in chemistry, the acquisition of NMR instrumental data.\u00a0<\/p>\n<p>Here I describe how such data is generated in my department; details may vary elsewhere.<\/p>\n<ol>\n<li>The prospective user of the NMR service is allocated a service ID. In our case, that ID relates to the research group rather than to individual researchers. This ID is parochial, it does not reference any other information about the user in the institute. Only the service manager has the information to associate this ID with real users and this information is normally not distributed.<\/li>\n<li>When a sample is submitted, this ID is used to create a new folder containing the data as a sub-folder of the group ID and located on the NMR data servers.<\/li>\n<li>The dataset itself<sup>\u2021<\/sup>\u00a0contains a number of files that contain an audit trail (names such as audita.txt, auditp.txt) with the fields:\u00a0##AUDIT TRAIL= <tt>$$ (NUMBER, WHEN, WHO, WHERE, PROCESS, VERSION, WHAT).\u00a0<\/tt>Typically, none of these files have propagated the original user ID under which the data was collected; to do so would require a programmatic connection between the local authentication systems and the spectrometer software used, a connection that is normally missing. Thus the<span style=\"color: #ff0000;\"> first break<\/span> in the provenance trail.<\/li>\n<li>In principle other audit trails can be inferred from these files, such as the <a href=\"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=17084\">unique identity of the instrument<\/a> provided by its manufacturer. Further information such as <em>e.g.<\/em> the probe used to collect the data (probes can be readily changed over) or any calibration data used in setting up the instrument for the data collection are by and large not recorded. To my knowledge, although an instrument can have a unique serial number, such serial numbers of swappable components such as probes are not recorded by the collection software.\u00a0Thus the <span style=\"color: #ff0000;\">second break<\/span> in the provenance trail.<\/li>\n<li>This data then needs to be processed by further software. In this case we use the <a href=\"http:\/\/mestrelab.com\/resources\/whats-new-in-mnova-11-0-0\/\">MestreNova system<\/a> for this task. Each dataset has editable assigned properties; below I show those\u00a0that can be associated with the spectrum (accessed with MestreNova using <strong><em>Edit\/Properties<\/em><\/strong>). All this comes from the information collected by the instrument. The user&#8217;s identity can be inserted into the &#8220;title&#8221; field, the display of which is <strong>off<\/strong> by default.\u00a0<img decoding=\"async\" class=\"aligncenter size-full wp-image-17888\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg\" alt=\"\" width=\"450\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg 1008w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073-300x99.jpg 300w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073-768x253.jpg 768w\" sizes=\"(max-width: 1008px) 100vw, 1008px\" \/><\/li>\n<li>There is also a section for parameters, a synonym for which might be\u00a0<strong>metadata<\/strong> and\u00a0accessed using this program from <strong><em>View\/Tables\/Parameter<\/em>s<\/strong>.\u00a0If <strong>Author<\/strong> was entered as a parameter in the dataset by the spectrometer software,\u00a0the\u00a0Mnova document would retrieve\u00a0that information. Equally, an <a href=\"http:\/\/orcid.org\/\">ORCID<\/a> identifier for the author entered at the time of data collection\u00a0and thus stored in the dataset could be read by Mnova, stored and displayed if configured to do so. It would be fair to say however that this option is rarely if indeed ever systematically\u00a0implemented by NMR instrument data collection software and so is never propagated to the data processing software (as highlighted in red below). Thus a third<span style=\"color: #ff0000;\">\u00a0break<\/span> in the provenance trail.<br \/>\n <img decoding=\"async\" class=\"aligncenter size-large wp-image-17910\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/077-389x1024.jpg\" alt=\"\" width=\"300\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/077-389x1024.jpg 389w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/077-114x300.jpg 114w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/077.jpg 660w\" sizes=\"(max-width: 389px) 100vw, 389px\" \/>This is also an alternative and this time formal <strong>metadata<\/strong> field that can be populated, by default as shown below with the type of spectrum and nucleus. These properties are not controlled in the sense of only allowing those terms that are present in a specified dictionary. The jargon for such control is a <strong>metadata schema. <\/strong>This\u00a0is not used here, since dissemination of this information is not intended; the software accepts whatever information it is given.\u00a0<br \/>\n <img decoding=\"async\" class=\"aligncenter size-full wp-image-17889\" src=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/074.jpg\" alt=\"\" width=\"450\" srcset=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/074.jpg 952w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/074-300x130.jpg 300w, https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/074-768x334.jpg 768w\" sizes=\"(max-width: 952px) 100vw, 952px\" \/> There are thus several opportunities to collect the identity of the experimenter\u00a0and thus attribute provenance to the collected data, but this does very much depend on the\u00a0will of\u00a0researchers,\u00a0institutions or publishers to enforce specific policies around this.\u00a0<span style=\"color: #ff0000;\">The fourth break in the provenance trail<\/span>.<\/li>\n<li>The dataset can then be uploaded (DOI:\u00a0<a href=\"https:\/\/doi.org\/10.14469\/hpc\/1291\">10.14469\/hpc\/1291<\/a>), at which stage provenance can finally be added using the <a href=\"http:\/\/orcid.org\/\">ORCID<\/a> credentials of the person publishing the dataset, who\u00a0of course\u00a0may or may not be the person who actually recorded the data! The full metadata for this specific collection can be seen at\u00a0<a href=\"https:\/\/data.datacite.org\/10.14469\/hpc\/1291\">data.datacite.org\/10.14469\/hpc\/1291<\/a>.\u00a0Or to put it another way, this is the first point in the provenance chain where the metadata is controlled by a schema and is also discoverable in a standard programmatic manner,<em> i.e.<\/em> the preceding link. The provenance is now formally\u00a0associated with the ORCID identifier using the <a href=\"http:\/\/datacite metadata schema\">DataCite metadata schema<\/a>. You should be aware that a local policy<sup>\u2020<\/sup> is that access to the\u00a0repository at\u00a0<a href=\"https:\/\/data.hpc.imperial.ac.uk\">https:\/\/data.hpc.imperial.ac.uk<\/a> is only allowed by\u00a0cross-authentication with <a href=\"http:\/\/orcid.org\/\">http:\/\/orcid.org\/<\/a> using the user&#8217;s ORCID. This identifier is then automatically propagated to the metadata held at <em>e.g.<\/em>\u00a0<a href=\"https:\/\/data.datacite.org\/10.14469\/hpc\/1095\">data.datacite.org\/10.14469\/hpc\/1095<\/a>. Currently however, none of any metadata originally recorded in either the instrumental file set or the processed MestreNova file is forwarded on to the metadata record held at DataCite; again <span style=\"color: #ff0000;\">loss of information and potentially of provenance<\/span>.\u00a0<\/li>\n<li>The peer-reviewed article resulting from the interpretation of this data however <strong>can<\/strong> be associated with the provenance introduced in the previous stage; see\u00a0<a href=\"https:\/\/data.datacite.org\/10.14469\/hpc\/1267\">data.datacite.org\/10.14469\/hpc\/1267<\/a>\u00a0 and the<em>\u00a0IsReferencedBy<\/em> property.\u00a0<\/li>\n<\/ol>\n<p>Now imagine if there was a common thread in all the stages of acquiring, processing and publishing this scientific data based on the ORCID.\u00a0<\/p>\n<ol>\n<li>Providing an ORCID could be made an essential requirement of access to the instrument.<\/li>\n<li>This information would be propagated to the dataset &#8230;<\/li>\n<li>by inclusion in one or more of the audit trail files.<\/li>\n<li>At this stage, further <a href=\"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=17084\">persistent identifiers <\/a>associated with the instrument manufacturer could be added, which help identify not only the instrument used, but sub-components such as the changeable probe. This would allow access to any calibration curves or probe sensitivity and other aspects.<\/li>\n<li>The ORCID and other relevant information could be picked up by the software used to convert the data into spectra and propagated into the metadata containers for this software &#8230;<\/li>\n<li>where its use is controlled by a specified schema.<\/li>\n<li>At this stage, the ORCID and information such as the nucleus recorded, the sample temperature <em>etc<\/em> can be propagated on to the final metadata records.<\/li>\n<li>And the reader of the article describing this work would have a formally defined provenance audit trail they could follow back to the start of the experiment or forward to a published article.\u00a0In this case, the data claims provenance (acquired from peer review) from the article, but it should also work in reverse with the article claiming provenance from the data on which it is based. The indexing of this bidirectional exchange is one of the exciting features that we should see emerging from CrossRef (holders of metadata about articles) and DataCite (holders of metadata about research data) in the near future.<\/li>\n<\/ol>\n<p>We are clearly a little\u00a0way from having the infrastructures described above for establishing such data audit trails. To do so will require cooperation from instrument manufacturers, at least in the example as charted above, as well as researchers, institutions, publishers, peer-reviewers and funding bodies. The first step would be to ensure that all scientists who intend collecting, processing and publishing data should claim\u00a0an ORCID. That remark is directed specifically at undergraduate, postgraduate and post-doctoral\u00a0researchers, not just at their supervisor or their PI (principal investigator). At a point when the discussion about alternate facts and perhaps even alternate data risks a general loss of confidence in science, we should be pro-active in\u00a0establishing trust in the scientific processes.<\/p>\n<hr \/>\n<p><sup>\u2021<\/sup> You can see an example obtained by this process at DOI: <a href=\"http:\/\/doi.org\/10.14469\/hpc\/1095\">10.14469\/hpc\/1095<\/a><\/p>\n<p><sup>\u2020<\/sup> This requirement is a strong driver for the uptake of ORCID amongst our student population.<\/p>\n<!-- kcite active, but no citations found -->\n<\/div> <!-- kcite-section 17885 -->","protected":false},"excerpt":{"rendered":"<p>In an era when alternative facts and fake news afflict us, the provenance of scientific data becomes ever more important. Especially if that data is available as open access and exploitable by others for both valid scientific reasons but\u00a0potentially also by those with other\u00a0motives. Here I consider the audit trail that might serve to establish [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":true,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2],"tags":[2099,1471,281,2093,2098,1211,1474,2094,2088,2097,2090,1879,2096,1705,2091,2089,1645,2092,2095,1180,1405,2087],"ppma_author":[2661],"class_list":["post-17885","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-acquisition","tag-archival-science","tag-author","tag-collection-software","tag-company-nmr","tag-data","tag-data-management","tag-data-processing-software","tag-evidence-law","tag-instrument-data-collection-software","tag-local-authentication-systems","tag-mestrenova","tag-mestrenova-system","tag-nuclear-magnetic-resonance","tag-principal-investigator","tag-provenance","tag-scientific-method","tag-service-manager","tag-spectrometer-software","tag-supervisor","tag-technologyinternet","tag-terminology"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>The provenance of scientific data - establishing an audit trail. - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The provenance of scientific data - establishing an audit trail. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"In an era when alternative facts and fake news afflict us, the provenance of scientific data becomes ever more important. Especially if that data is available as open access and exploitable by others for both valid scientific reasons but\u00a0potentially also by those with other\u00a0motives. Here I consider the audit trail that might serve to establish [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2017-03-30T09:13:18+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-03-30T10:27:46+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The provenance of scientific data - establishing an audit trail. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885","og_locale":"en_GB","og_type":"article","og_title":"The provenance of scientific data - establishing an audit trail. - Henry Rzepa&#039;s Blog","og_description":"In an era when alternative facts and fake news afflict us, the provenance of scientific data becomes ever more important. Especially if that data is available as open access and exploitable by others for both valid scientific reasons but\u00a0potentially also by those with other\u00a0motives. Here I consider the audit trail that might serve to establish [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2017-03-30T09:13:18+00:00","article_modified_time":"2017-03-30T10:27:46+00:00","og_image":[{"url":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg","type":"","width":"","height":""}],"author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"The provenance of scientific data &#8211; establishing an audit trail.","datePublished":"2017-03-30T09:13:18+00:00","dateModified":"2017-03-30T10:27:46+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885"},"wordCount":1332,"commentCount":2,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg","keywords":["Acquisition","Archival science","author","collection software","Company: NMR","data","Data management","data processing software","Evidence law","instrument data collection software","local authentication systems","Mestrenova","MestreNova system","Nuclear magnetic resonance","principal investigator","Provenance","Scientific method","service manager","spectrometer software","supervisor","Technology\/Internet","Terminology"],"articleSection":["Chemical IT"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885","name":"The provenance of scientific data - establishing an audit trail. - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#primaryimage"},"thumbnailUrl":"http:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg","datePublished":"2017-03-30T09:13:18+00:00","dateModified":"2017-03-30T10:27:46+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#primaryimage","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg","contentUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/03\/073.jpg","width":1008,"height":332},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17885#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"The provenance of scientific data &#8211; establishing an audit trail."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-4Et","jetpack-related-posts":[{"id":16628,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16628","url_meta":{"origin":17885,"position":0},"title":"Managing (open) NMR data: a working example using Mpublish.","author":"Henry Rzepa","date":"August 1, 2016","format":false,"excerpt":"In March, I posted from\u00a0the ACS meeting in San Diego on the topic of Research data: Managing spectroscopy-NMR, and noted a talk by MestreLab Research on\u00a0how a tool called Mpublish in the forthcoming release of their NMR analysis software Mestrenova could help. With that release now out, the opportunity arose\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":15916,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","url_meta":{"origin":17885,"position":1},"title":"Research data: Managing spectroscopy-NMR.","author":"Henry Rzepa","date":"March 16, 2016","format":false,"excerpt":"At the ACS conference, I have attended many talks these last four days, but one made some \"connections\" which intrigued me. I tell its story (or a part of it) here. But to start, try the following experiment. Find a Word document of .docx type on your hard drive Remove\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":21928,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=21928","url_meta":{"origin":17885,"position":2},"title":"Encouraging Submission of FAIR Data at the Journal of Organic Chemistry and Organic Letters","author":"Henry Rzepa","date":"February 14, 2020","format":false,"excerpt":"In a welcome move, one of the American chemical society journals has published an encouragement to submit what is called FAIR data to the journal.. A reminder that FAIR data is data that can be Found (F), Accessed (A), Interoperated(I) and Re-used( R). I thought I might try to explore\u2026","rel":"","context":"In &quot;Interesting chemistry&quot;","block_context":{"text":"Interesting chemistry","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=4"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":12513,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=12513","url_meta":{"origin":17885,"position":3},"title":"Disambiguation\/provenance of claimed scientific opinion and research.","author":"Henry Rzepa","date":"May 5, 2014","format":false,"excerpt":"My name is displayed pretty prominently on this blog, but it is not always easy to find out who the real person is behind many a blog. In science, I am troubled by such anonymity. Well, a new era is about to hit us. When you come across an Internet\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22059,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=22059","url_meta":{"origin":17885,"position":4},"title":"A cascading tutorial in finding rich NMR data using the Datacite datasearch engine.","author":"Henry Rzepa","date":"April 11, 2020","format":false,"excerpt":"In the previous post, I introduced three of a new generation of search engines specialising in the discovery of data. Data has some special features which make its properties slightly different from the conceptual (or natural language) searches we are used to performing for general information and so a search\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":17951,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17951","url_meta":{"origin":17885,"position":5},"title":"Supporting information: chemical graveyard or invaluable resource for chemical structures.","author":"Henry Rzepa","date":"March 31, 2017","format":false,"excerpt":"Nowadays, data supporting\u00a0most publications relating to the synthesis of organic compounds is more likely than not to be found in associated \"supporting information\" rather than the (often page limited) article itself. For example, this article has an SI which is paginated at 907; almost a mini-database in its own right!\u2020\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/17885","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=17885"}],"version-history":[{"count":29,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/17885\/revisions"}],"predecessor-version":[{"id":17918,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/17885\/revisions\/17918"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=17885"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=17885"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=17885"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=17885"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}