{"id":13248,"date":"2015-01-15T14:05:19","date_gmt":"2015-01-15T14:05:19","guid":{"rendered":"http:\/\/www.ch.imperial.ac.uk\/rzepa\/blog\/?p=13248"},"modified":"2023-09-16T19:48:19","modified_gmt":"2023-09-16T18:48:19","slug":"a-convincing-example-of-the-need-for-data-repositories-fair-data","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248","title":{"rendered":"A convincing example of the need for data repositories. FAIR Data."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"13248\">\n<p>Derek Lowe in his <a href=\"http:\/\/pipeline.corante.com\" target=\"_blank\" rel=\"noopener\">In the Pipeline blog<\/a> is famed for spotting unusual claims in the literature and subjecting them to analysis. This one is entitled\u00a0<a href=\"http:\/\/pipeline.corante.com\/archives\/2015\/01\/13\/odd_structures_subjected_to_powerful_computations.php\" target=\"_blank\" rel=\"noopener\">Odd Structures, Subjected to Powerful Computations<\/a>. He looks at this image below, and finds the structures represented there might be a mistake, based on his considerable experience of these kinds of molecules. I expect he had a gut feeling within seconds of seeing the diagram.<\/p>\n<p><img decoding=\"async\" src=\"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg\" alt=\"\" \/><\/p>\n<p>Indeed, so, you will now find that the <a href=\"http:\/\/pipeline.corante.com\/archives\/2015\/01\/13\/odd_structures_subjected_to_powerful_computations.php#1645888\" target=\"_blank\" rel=\"noopener\">authors have apparently acknowledged<\/a> a mistake<span id=\"cite_ITEM-13248-0\" name=\"citation\"><a href=\"#ITEM-13248-0\">[1]<\/a><\/span>. My interest piqued, I went to the article, and immediately tracked down the <a href=\"http:\/\/www.nature.com\/nchem\/journal\/v6\/n1\/extref\/nchem.1821-s1.pdf\" target=\"_blank\" rel=\"noopener\"> supplementary information<\/a>. Surely, if these molecules had been subjected to<span style=\"color: #ff0000;\"><strong> <em>powerful computation<\/em><\/strong><\/span>, this supporting information should contain coordinates of some kind that would allow a correlation\u00a0with the 2D structural representation shown above. I have just returned from <a href=\"https:\/\/www.force11.org\/meetings\/force2015\/detailed-agenda\" target=\"_blank\" rel=\"noopener\">FORCE2015<\/a>, a three-day event in Oxford. From the detailed agenda, you can see that a lot of the conference centered around what is called <a href=\"https:\/\/www.force11.org\/group\/fairgroup\" target=\"_blank\" rel=\"noopener\">FAIR Data<\/a>. FAIR stands for:<\/p>\n<ol>\n<li><span style=\"color: #0000ff;\">Findable<\/span><\/li>\n<li><span style=\"color: #0000ff;\">Accessible<\/span><\/li>\n<li><span style=\"color: #0000ff;\">Interoperable<\/span><\/li>\n<li><span style=\"color: #0000ff;\">Re-usable<\/span><\/li>\n<\/ol>\n<p>So I then set out to find if the <a href=\"http:\/\/www.nature.com\/nchem\/journal\/v6\/n1\/extref\/nchem.1821-s1.pdf\" target=\"_blank\" rel=\"noopener\"> supplementary information<\/a> WAS FAIR. Well, check for yourself (unlike the narrative article, the data should be accessible outside of the paywall, <em>i.e.<\/em> you should not need a subscription to access it). It is certainly big, running out to 45 pages, in the form of a paginated PDF file (the norm). The table of contents does not refer to data as such, but it does quote\u00a025\u00a0figures, from which you might just be able to extract some data. But no molecules as such! So:<\/p>\n<ol>\n<li>No data is findable, although the \u00a0PDF which might contain it is reasonably so.<\/li>\n<li>The data is not easily accessible,<\/li>\n<li>let alone interoperable (thus many of the charts were probably created using spreadsheet software, but the source files for these are not available),<\/li>\n<li>and not-reusable (certainly not without loss and possible error in any attempt at capture).<\/li>\n<\/ol>\n<p>I think it fair to say that the data for these powerful computations are not <strong>FAIR<\/strong>. Had we had at least some coordinates (the computations involved molecular mechanics based dynamics simulations, which certainly involve manipulating atom coordinates in some form) then the structures shown in the figure above could be checked, and perhaps even the apparent error would have been quickly spotted.<\/p>\n<p>Derek does not make the point about FAIR data (to be fair, he was not at FORCE2015) and so I will make the case. If you are reporting a computational model or simulation, there is no excuse for not supplying FAIR data to accompany it. If the data is FAIR it <strong>will<\/strong> be inter-operable and re-usable. And this will instantly allow anyone to check <em>e.g.<\/em> the structures above. You would not need to have Derek&#8217;s vast experience and instinct (although having it is also helps). And of course we might presume that there were 2-3 referees that also looked at the article, and presumably none of them requested FAIR data.<\/p>\n<p>Oh, if you are interested in my take on FAIR data, I gave a <a href=\"http:\/\/www.ch.ic.ac.uk\/rzepa\/talks\/force2015\/\" target=\"_blank\" rel=\"noopener\">talk about\u00a0<\/a>that at FORCE2015, which you are welcome to view; I hope it constitutes a FAIR talk!<\/p>\n<hr \/>\n<h4>Acknowledgments<\/h4>\n<p>This post has been cross-posted in PDF format at <a href=\"https:\/\/doi.org\/10.15200\/winn.142313.30279\" rel=\"noopener\" target=\"_blank\">Authorea<\/a>.<\/p>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-13248-0\">K.J. Kohlhoff, D. Shukla, M. Lawrenz, G.R. Bowman, D.E. Konerding, D. Belov, R.B. Altman, and V.S. Pande, \"Cloud-based simulations on Google Exacycle reveal ligand modulation of GPCR activation pathways\", <i>Nature Chemistry<\/i>, vol. 6, pp. 15-21, 2013. <a href=\"https:\/\/doi.org\/10.1038\/nchem.1821\">https:\/\/doi.org\/10.1038\/nchem.1821<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 13248 -->","protected":false},"excerpt":{"rendered":"<p>Derek Lowe in his In the Pipeline blog is famed for spotting unusual claims in the literature and subjecting them to analysis. This one is entitled\u00a0Odd Structures, Subjected to Powerful Computations. He looks at this image below, and finds the structures represented there might be a mistake, based on his considerable experience of these kinds [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2,1],"tags":[1314,1313,633,327,721],"ppma_author":[2661],"class_list":["post-13248","post","type-post","status-publish","format-standard","hentry","category-chemical-it","category-general","tag-created-using-spreadsheet-software","tag-derek-lowe","tag-oxford","tag-pdf","tag-simulation"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>A convincing example of the need for data repositories. FAIR Data. - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"A convincing example of the need for data repositories. FAIR Data. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"Derek Lowe in his In the Pipeline blog is famed for spotting unusual claims in the literature and subjecting them to analysis. This one is entitled\u00a0Odd Structures, Subjected to Powerful Computations. He looks at this image below, and finds the structures represented there might be a mistake, based on his considerable experience of these kinds [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2015-01-15T14:05:19+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-09-16T18:48:19+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"A convincing example of the need for data repositories. FAIR Data. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248","og_locale":"en_GB","og_type":"article","og_title":"A convincing example of the need for data repositories. FAIR Data. - Henry Rzepa&#039;s Blog","og_description":"Derek Lowe in his In the Pipeline blog is famed for spotting unusual claims in the literature and subjecting them to analysis. This one is entitled\u00a0Odd Structures, Subjected to Powerful Computations. He looks at this image below, and finds the structures represented there might be a mistake, based on his considerable experience of these kinds [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2015-01-15T14:05:19+00:00","article_modified_time":"2023-09-16T18:48:19+00:00","og_image":[{"url":"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg","type":"","width":"","height":""}],"author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"A convincing example of the need for data repositories. FAIR Data.","datePublished":"2015-01-15T14:05:19+00:00","dateModified":"2023-09-16T18:48:19+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248"},"wordCount":549,"commentCount":0,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#primaryimage"},"thumbnailUrl":"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg","keywords":["created using spreadsheet software","Derek Lowe","Oxford","PDF","simulation"],"articleSection":["Chemical IT","General"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248","name":"A convincing example of the need for data repositories. FAIR Data. - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#primaryimage"},"thumbnailUrl":"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg","datePublished":"2015-01-15T14:05:19+00:00","dateModified":"2023-09-16T18:48:19+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#primaryimage","url":"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg","contentUrl":"http:\/\/pipeline.corante.com\/Exacycle%20plot.jpg"},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=13248#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"A convincing example of the need for data repositories. FAIR Data."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-3rG","jetpack-related-posts":[],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/13248","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=13248"}],"version-history":[{"count":6,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/13248\/revisions"}],"predecessor-version":[{"id":26484,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/13248\/revisions\/26484"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=13248"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=13248"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=13248"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=13248"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}