{"id":15916,"date":"2016-03-16T20:41:06","date_gmt":"2016-03-16T20:41:06","guid":{"rendered":"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=15916"},"modified":"2016-04-05T08:43:24","modified_gmt":"2016-04-05T07:43:24","slug":"research-data-managing-spectroscopy-nmr","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","title":{"rendered":"Research data: Managing spectroscopy-NMR."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"15916\">\n<p>At the ACS conference, I have attended many talks these last four days, but one made some &#8220;connections&#8221; which intrigued me. I tell its story (or a part of it) here.<\/p>\n<p>But to start, try the following experiment.<\/p>\n<ol>\n<li>Find a Word document of .docx type on your hard drive.<\/li>\n<li>Remove the .docx suffix and replace it with a .zip suffix.<\/li>\n<li>Expand it as if it is an archive (it is!).<\/li>\n<li>A folder is created and this itself contains four further folders. These all contain XML files, and in the sub-folder actually called word you will find something called <strong>document.xml<\/strong>. That file contains the visible content of the document; all the others are support documents, including styles etc.<\/li>\n<\/ol>\n<p>The reason this is important was made clear in Santi Dominguez&#8217; talk. Most of it was concerned with introducing Mbook, an ELN (electronic laboratory notebook), but the relevance to the above comes from his introduction of <strong>Mpublish<\/strong>, a forthcoming product targeting the area of research data management. What is the connection? Well, NMR spectrometers produce raw outputs as collections of files, much in the manner of the exploded Word document above. Some files contain the raw FID, others contain the acquisition parameters, etc. These files are then turned into the traditional spectra by suitable processing software such as Mestrenova (part of the same ecosystem as Mpublish). 
Most users of such programs then squirt the spectra into a PDF file and it is this last document that is preserved as &#8220;research data&#8221; &#8211;&nbsp;almost invariably this is the version sent off to journals as the supporting information or SI for the article. SI is called information for a good reason; in such a container it&nbsp;is very often not easily usable data, and functions just visually.<\/p>\n<p>So what <strong>is<\/strong> the problem? Well, the conversion of the NMR fileset (and quite possibly many other forms of spectroscopy) into a PDF file is a lossy process. It cannot be reversed; information has been lost. And really only a human can easily retrieve and interpret such a visual presentation.<\/p>\n<p>Santi described how Mpublish can assemble all the files associated with the instrumental outputs, optionally add chemical structure and other information, collect suitable metadata describing the contents and create a .zip archive. As we saw with Word, however, the suffix does not even need to be .zip. It was suggested that it be this information-complete archive&nbsp;that should really be used as SI to accompany an article in which NMR data is invoked&nbsp;to support the narrative. In the reverse process, anyone downloading this zip archive could themselves potentially acquire full access, without information loss, to the original NMR data. There is a little further magic that needs to be included to make the&nbsp;process work which I do not include here. When Mpublish becomes available to play with, I will complete that story here.<\/p>\n<p>It is good to report that software is starting to appear which enhances the management and reporting of research data as part of the publication process. The &#8220;rules&#8221; and &#8220;best practice&#8221; of this game are still being written, however.&nbsp;In this regard, I&nbsp;feel that it is the researchers themselves who must play&nbsp;a vital role&nbsp;in defining the rules. 
Let us not cede that role just&nbsp;to publishers.<\/p>\n<!-- kcite active, but no citations found -->\n<\/div> <!-- kcite-section 15916 -->","protected":false},"excerpt":{"rendered":"<p>At the ACS conference, I have attended many talks these last four days, but one made some &#8220;connections&#8221; which intrigued me. I tell its story (or a part of it) here. But to start, try the following experiment. Find a Word document of .docx type on your hard drive Remove the .docx suffix and replace [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2],"tags":[1704,1709,1707,1705,327,1699,33,1708,124,1706],"ppma_author":[2661],"class_list":["post-15916","post","type-post","status-publish","format-standard","hentry","category-chemical-it","tag-archive-formats","tag-chemical-structure","tag-eln","tag-nuclear-magnetic-resonance","tag-pdf","tag-research-data-management","tag-spectroscopy","tag-suitable-processing-software","tag-xml","tag-zip"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Research data: Managing spectroscopy-NMR. 
- Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Research data: Managing spectroscopy-NMR. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"At the ACS conference, I have attended many talks these last four days, but one made some &#8220;connections&#8221; which intrigued me. I tell its story (or a part of it) here. But to start, try the following experiment. Find a Word document of .docx type on your hard drive Remove the .docx suffix and replace [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2016-03-16T20:41:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2016-04-05T07:43:24+00:00\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Research data: Managing spectroscopy-NMR. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","og_locale":"en_GB","og_type":"article","og_title":"Research data: Managing spectroscopy-NMR. 
- Henry Rzepa&#039;s Blog","og_description":"At the ACS conference, I have attended many talks these last four days, but one made some &#8220;connections&#8221; which intrigued me. I tell its story (or a part of it) here. But to start, try the following experiment. Find a Word document of .docx type on your hard drive Remove the .docx suffix and replace [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2016-03-16T20:41:06+00:00","article_modified_time":"2016-04-05T07:43:24+00:00","author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"Research data: Managing spectroscopy-NMR.","datePublished":"2016-03-16T20:41:06+00:00","dateModified":"2016-04-05T07:43:24+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916"},"wordCount":561,"commentCount":1,"keywords":["Archive formats","chemical structure","ELN","Nuclear magnetic resonance","PDF","research data management","spectroscopy","suitable processing software","XML","Zip"],"articleSection":["Chemical IT"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916","name":"Research data: Managing spectroscopy-NMR. 
- Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"datePublished":"2016-03-16T20:41:06+00:00","dateModified":"2016-04-05T07:43:24+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=15916#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"Research data: Managing spectroscopy-NMR."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational 
Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-48I","jetpack-related-posts":[{"id":2962,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=2962","url_meta":{"origin":15916,"position":0},"title":"Data-round-tripping: wherein the future?","author":"Henry Rzepa","date":"December 7, 2010","format":false,"excerpt":"Moving (chemical) data around in a manner which allows its (automated) use in whichever context it finds itself must be a holy grail for all scientists and chemists. I posted earlier on the fragile nature of molecular diagrams making the journey between the editing program used to create them (say\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16628,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=16628","url_meta":{"origin":15916,"position":1},"title":"Managing (open) NMR data: a working example using Mpublish.","author":"Henry Rzepa","date":"August 1, 2016","format":false,"excerpt":"In March, I posted from\u00a0the ACS meeting in San Diego on the topic of Research data: Managing spectroscopy-NMR, and noted a talk by MestreLab Research on\u00a0how a tool called Mpublish in the forthcoming release of their NMR analysis software Mestrenova could help. 
With that release now out, the opportunity arose\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22059,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=22059","url_meta":{"origin":15916,"position":2},"title":"A cascading tutorial in finding rich NMR data using the Datacite datasearch engine.","author":"Henry Rzepa","date":"April 11, 2020","format":false,"excerpt":"In the previous post, I introduced three of a new generation of search engines specialising in the discovery of data. Data has some special features which make its properties slightly different from the conceptual (or natural language) searches we are used to performing for general information and so a search\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":106,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=106","url_meta":{"origin":15916,"position":3},"title":"A lab in a backpack","author":"Henry Rzepa","date":"April 3, 2009","format":false,"excerpt":"We recently developed a new computational chemistry practical laboratory here at Imperial College. I gave a talk about it at the recent ACS meeting in Salt Lake City. If you want to see the details of the lab, do go here. The talk itself contains further links and examples. 
Perhaps\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":28121,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=28121","url_meta":{"origin":15916,"position":4},"title":"The secrets of FAIR Metadata: optimisation for Chemical Compounds.","author":"Henry Rzepa","date":"December 11, 2024","format":false,"excerpt":"The idea of so-called FAIR (Findable, Accessible, Interoperable and Reusable) data is that each object has an associated metadata record which serves to enable the four aspects of FAIR. Each such record is itself identified by a persistent identifier known as a DOI. The trick in producing useful FAIR data\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":17939,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=17939","url_meta":{"origin":15916,"position":5},"title":"MOLinsight: A web portal for the processing of molecular structures by blind students.","author":"Henry Rzepa","date":"March 31, 2017","format":false,"excerpt":"Occasionally one comes across a web site that manages to combine being\u00a0unusual, interesting and also useful. Thus\u00a0www.molinsight.net\u00a0is I think a unique chemistry resource for blind and visually impaired students. 
If you think perhaps that it might be a little too specialised to be useful for you, go visit it first.\u2026","rel":"","context":"In &quot;Interesting chemistry&quot;","block_context":{"text":"Interesting chemistry","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=4"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/15916","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=15916"}],"version-history":[{"count":7,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/15916\/revisions"}],"predecessor-version":[{"id":15923,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/15916\/revisions\/15923"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=15916"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=15916"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=15916"},{"taxonomy":"auth
or","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=15916"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}