{"id":31548,"date":"2026-06-17T11:45:19","date_gmt":"2026-06-17T10:45:19","guid":{"rendered":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548"},"modified":"2026-06-18T10:14:16","modified_gmt":"2026-06-18T09:14:16","slug":"evaluating-metadata-quality-and-completeness-for-research-data-using-the-new-datacite-tool","status":"publish","type":"post","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548","title":{"rendered":"Evaluating metadata quality and completeness for research data using the new DataCite Tool."},"content":{"rendered":"<div class=\"kcite-section\" kcite-section-id=\"31548\">\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31634\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-678.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p>Metadata are an essential way of enabling the discoverability and impact of scholarly resources such as (research) data and associated objects. It must adhere to a precisely described schema describing its properties,<span id=\"cite_ITEM-31548-0\" name=\"citation\"><a href=\"#ITEM-31548-0\">[1]<\/a><\/span> and such a conformant metadata record is an mandatory component of a (research) data repository. Access to the record is by resolving the DOI (Digital object identifier) for any repository item, as for example:<\/p>\n<p><a href=\"https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/15994\"><tt><small>https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.14469\/hpc\/15994<\/small><\/tt><\/a><br \/>\n<a href=\"https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.5281\/zenodo.20657236\"><tt><small>https:\/\/data.datacite.org\/application\/vnd.datacite.datacite+xml\/10.5281\/zenodo.20657236<\/small><\/tt><\/a><\/p>\n<p>where <strong><tt><small>10.14469\/hpc\/15994<\/small><\/tt><\/strong> and <strong><tt><small>10.5281\/zenodo.20657236<\/small><\/tt><\/strong> are the (in this example two) DOIs registered for the same specific repository dataset. Two different repositories are shown here, because the metadata is still mostly captured using the user-interface of the relevant repository, and the richness or completeness of the metadata can differ greatly between repositories. Whilst the registered metadata record has some mandatory components, many more are optional and it is often the case that these optional components are either not supported <em>via<\/em> a visual interface by the repository, or the user choses to omit them (a complete or &#8220;rich&#8221; metadata entry could be quite tedious for a human). This aspect of human time and their attention span can often result in sparse metadata records.<\/p>\n<p>In some cases, the metadata is captured using a programmed workflow and then registered using the equivalent of a command line interface (API) which requires no user involvement or interactive user responses<span id=\"cite_ITEM-31548-1\" name=\"citation\"><a href=\"#ITEM-31548-1\">[2]<\/a><\/span> and which tends to produce more systematically complete metadata records. Unfortunately, I think this mode of metadata provision must be relatively rare &#8211; although to be fair the metadata record itself does not carry details of the mechanism by which the metadata was populated. The two examples above were prepared using exactly the same API, and they largely differ in what elements of the total metadata schema each of the two repositories above actually support, rather than what a human had the patience for.<\/p>\n<p>So it is a welcome development that DataCite have recently made a Dashboard available that allows at a glance an inspection of either a specific metadata record or a collection of such records to be made. The start point is <a href=\"https:\/\/metadata.datacite.org\/\">https:\/\/metadata.datacite.org\/<\/a>\u00a0and here you can filter the record by\u00a0a) the repository, further filtered by\u00a0b) registration year and c) resource type (Figure 1).\u00a0Thus:<br \/>\n<img decoding=\"async\" class=\"aligncenter size-full wp-image-31562\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-663.jpg\" alt=\"\" width=\"540\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 1<\/strong>. The DataCite Metadata Dashboard,\u00a0showing a specified repository using the query<br \/>\n<tt><small><a href=\"https:\/\/metadata.datacite.org\/urks.helix?registrationYear=2026&amp;resourceType=dataset\">https:\/\/metadata.datacite.org\/urks.helix?registrationYear=2026&amp;resourceType=dataset<\/a><\/small><\/tt><\/p>\n<p>This dashboard now allows you to easily compare the two metadata records noted above, with the help of an additional DOI query filter which can be used to further narrow it down to a single dataset (queries 1 and 2).<\/p>\n<ol>\n<li><a href=\"https:\/\/metadata.datacite.org\/urks?registrationYear=2026&amp;query=id:10.14469\/HPC\/15994\"><tt><small>https:\/\/metadata.datacite.org\/urks?registrationYear=2026&amp;query=id:10.14469\/HPC\/15994<\/small><\/tt><\/a><\/li>\n<li><a href=\"https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=id:10.5281\/zenodo.20657236\"><tt><small>https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=id:10.5281\/zenodo.20657236<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p>A prominent difference between these queries is the <strong>Subjects<\/strong> metadata, with for example the <strong>subjectScheme<\/strong>\u00a0100% complete for example <strong>1<\/strong> (Figure 2) and 0% complete for example <strong>2<\/strong> (Figure 3).<br \/>\n<img decoding=\"async\" class=\"aligncenter size-full wp-image-31568\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-664.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 2<\/strong>. The Subjects panel of the DataCite Metadata Dashboard,\u00a0for\u00a0DOI: <a href=\"https:\/\/metadata.datacite.org\/urks?registrationYear=2026&amp;query=id:10.14469\/HPC\/15994\"><tt><small>10.14469\/HPC\/15994<\/small><\/tt><\/a><\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31567\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-665.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 3.<\/strong> The Subjects panel of the DataCite Metadata Dashboard,\u00a0for\u00a0DOI: <a href=\"https:\/\/metadata.datacite.org\/urks?registrationYear=2026&amp;query=id:10.14469\/HPC\/15994\"><tt><small>10.5281\/zenodo.20657236<\/small><\/tt><\/a><\/p>\n<h2>Using the query filter to explore a range of other searches.<\/h2>\n<p>Searches <strong>3<\/strong> and <strong>4<\/strong> specify an individual depositor by their ORCID identifier and 2026 as a publication year, for two different repositories.<\/p>\n<ol start=\"3\">\n<li><a href=\"https:\/\/metadata.datacite.org\/bl.imperial?registrationYear=2026&amp;query=contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390\"><tt><small>https:\/\/metadata.datacite.org\/bl.imperial?registrationYear=2026&amp;query=contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390<\/small><\/tt><\/a><\/li>\n<li><a href=\"https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390\"><tt><small>https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p>The <strong>Subjects<\/strong> panels\u00a0are shown in Figures 4 and 5. In these examples, both sets of depositions are made using the same automatic command line API<span id=\"cite_ITEM-31548-1\" name=\"citation\"><a href=\"#ITEM-31548-1\">[2]<\/a><\/span> so human error or their lack of attention is not the cause of the differences.<br \/>\n<img decoding=\"async\" class=\"aligncenter size-full wp-image-31568\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-664.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 4.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for the bl.imperial repository for query 3.<\/p>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31569\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-667.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 5.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for the cern.zenodo repository for query 4.<\/p>\n<p>Search 5 shows more direct use of a <strong>Subject<\/strong> filter (Figure 6) and use of this filter ensures that again the subjects metadata panel is well populated.<\/p>\n<ol start=\"5\">\n<li><a href=\"https:\/\/metadata.datacite.org\/urks?query=(media.media_type:application\/zip+OR+media.media_type:chemical\/x-mnova)+AND+(subjects.subjectScheme:*NMR_Nucleus)+AND+(subjects.subject:13C)+AND+(titles.title:*pyrazol*+OR+descriptions.description:*pyrazol*)\"><tt><small>https:\/\/metadata.datacite.org\/urks?query=(media.media_type:application\/zip+OR+media.media_type:chemical\/x-mnova)+AND+<br \/>\n(subjects.subjectScheme:*NMR_Nucleus)+AND+(subjects.subject:13C)+AND+<br \/>\n(titles.title:*pyrazol*+OR+descriptions.description:*pyrazol*)<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31571\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-668.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 6.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for the urks repository for query 5.<\/p>\n<p>Query 6 (Figure 7) identifies datasets that have a directly associated journal article, showing the population of the &#8220;high impact&#8221; <strong>relatedIdentifier<\/strong> property.<\/p>\n<ol start=\"6\">\n<li><a href=\"https:\/\/metadata.datacite.org\/urks?query=(types.resourceTypeGeneral:Dataset+OR+types.resourceTypeGeneral:Collection)+AND+(contributors.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390)+AND+(relatedIdentifiers.relatedIdentifierType:DOI+AND+relatedIdentifiers.resourceTypeGeneral:JournalArticle+AND+relatedIdentifiers.relatedIdentifier:*)\"><tt><small>https:\/\/metadata.datacite.org\/urks?query=(types.resourceTypeGeneral:Dataset+OR+types.resourceTypeGeneral:Collection)+AND+<br \/>\n(contributors.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390+OR+<br \/>\ncreators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390)+AND+(relatedIdentifiers.relatedIdentifierType:DOI+AND+<br \/>\nrelatedIdentifiers.resourceTypeGeneral:JournalArticle+AND+relatedIdentifiers.relatedIdentifier:*)<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31572\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-670.jpg\" alt=\"\" width=\"540\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 7.<\/strong> The RelatedIdentifiers\u00a0panel of the DataCite Metadata Dashboard for the urks repository for query 6.<\/p>\n<p>Query 7 showing again a very well populated Subjects panel (due of course to the filter applied below) with 100% occupancy of the subjectScheme.<\/p>\n<ol start=\"7\">\n<li><a href=\"https:\/\/metadata.datacite.org\/urks?query=(media.media_type:chemical\/x-gaussian-log+OR+media.media_type:chemical\/x-gaussian-checkpoint)+AND+(titles.title:*Endo*+OR+descriptions.description:*Endo*+OR+titles.title:*Exo*+OR+descriptions.description:*Exo*)+AND+&lt;br&gt;&lt;\/a&gt;(subjects.subjectScheme:*KIE*)+AND+subjects.subject:1H\/2H\"><tt><small>https:\/\/metadata.datacite.org\/urks?query=(media.media_type:chemical\/x-gaussian-log+OR+<br \/>\nmedia.media_type:chemical\/x-gaussian-checkpoint)+AND+<br \/>\n(titles.title:*Endo*+OR+<br \/>\ndescriptions.description:*Endo*+OR+titles.title:*Exo*+OR+descriptions.description:*Exo*)+AND+<br \/>\n(subjects.subjectScheme:*KIE*)+AND+<br \/>\nsubjects.subject:1H\/2H<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31575\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-672.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 8.<\/strong> The Subjects\u00a0panel of the DataCite Metadata Dashboard for the urks repository for query 7.<\/p>\n<p>Query 8 shows how well populated the Subjects panel is for a whole range of users (excluding one subject-loving suspect!). It would be interesting to see if this population (albeit only 4.7%) was achieved by manual entry or by automatic API calls.<\/p>\n<ol start=\"8\">\n<li><a href=\"https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=+NOT+(contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+creators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390)\"><tt><small>https:\/\/metadata.datacite.org\/cern.zenodo?registrationYear=2026&amp;query=+NOT+<br \/>\n(contributors.nameIdentifiers.nameIdentifier:0000-0002-8635-8390+OR+<br \/>\ncreators.nameIdentifiers.nameIdentifier:*0000-0002-8635-8390)<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31579\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-673.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 9.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for the cern.zenodo repository for query 8.<\/p>\n<p>Example 9 uses the <a href=\"https:\/\/inveniosoftware.org\/products\/rdm\/\" target=\"_blank\" rel=\"noopener\">InvenioRDM<\/a> repository system whilst <strong>10<\/strong> uses a bespoke repository created in 2016 with metadata richness in mind.<span id=\"cite_ITEM-31548-1\" name=\"citation\"><a href=\"#ITEM-31548-1\">[2]<\/a><\/span> Both these examples were crafted &#8220;by hand&#8221; rather than using an API tool and are limited only by the user interfaces of either repository.<\/p>\n<ol start=\"9\">\n<li><a href=\"https:\/\/metadata.datacite.org\/urks.helix?registrationYear=2026&amp;query=id:10.82186\/xjxch-zzb72\"><tt><small>https:\/\/metadata.datacite.org\/urks.helix?registrationYear=2026&amp;query=id:10.82186\/xjxch-zzb72<\/small><\/tt><\/a><\/li>\n<li><a href=\"https:\/\/metadata.datacite.org\/bl.imperial?registrationYear=2024&amp;query=id:10.14469\/hpc\/14835\"><tt><small>https:\/\/metadata.datacite.org\/bl.imperial?registrationYear=2024&amp;query=id:10.14469\/hpc\/14835<\/small><\/tt><\/a><\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"aligncenter size-full wp-image-31625\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-676.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 10.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for query 9.<br \/>\n<img decoding=\"async\" class=\"aligncenter size-full wp-image-31624\" src=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-677.jpg\" alt=\"\" width=\"400\" \/><\/p>\n<p style=\"text-align: center;\"><strong>Figure 11.<\/strong> The Subjects panel of the DataCite Metadata Dashboard for query 10.<\/p>\n<p><strong>Conclusions.<\/strong><\/p>\n<p>It is to be hoped that analysis of research data metadata records using the DataCite tool will rapidly lead to a greater and richer population of these records. Wherever possible, these records should be populated using automated methods which do not rely on the patience of a human. My own candidate for increased population is the Subjects field, which can be readily automated and the presence of which allows finely tuned searches of the DataCite metadata store to be made.<\/p>\n<hr \/>\n<p>DOI: <a href=\"https:\/\/doi.org\/10.59350\/ams3m-m3t92\">10.59350\/ams3m-m3t92<\/a><\/p>\n<h2>References<\/h2>\n    <ol class=\"kcite-bibliography csl-bib-body\"><li id=\"ITEM-31548-0\">DataCite Metadata Working Group., \"DataCite Metadata Schema Documentation for the Publication and Citation of Research Data and Other Research Outputs v4.7\", <i>DataCite<\/i>, 2026. <a href=\"https:\/\/doi.org\/10.14454\/qdd3-ps68\">https:\/\/doi.org\/10.14454\/qdd3-ps68<\/a>\n\n<\/li>\n<li id=\"ITEM-31548-1\">C. Cave-Ayland, M. Bearpark, C. Romain, and H. Rzepa, \"CHAMP is a HPC Access and Metadata Portal\", <i>Journal of Open Source Software<\/i>, vol. 7, pp. 3824, 2022. <a href=\"https:\/\/doi.org\/10.21105\/joss.03824\">https:\/\/doi.org\/10.21105\/joss.03824<\/a>\n\n<\/li>\n<\/ol>\n\n<\/div> <!-- kcite-section 31548 -->","protected":false},"excerpt":{"rendered":"<p>Metadata are an essential way of enabling the discoverability and impact of scholarly resources such as (research) data and associated objects. It must adhere to a precisely described schema describing its properties, and such a conformant metadata record is an mandatory component of a (research) data repository. Access to the record is by resolving the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"activitypub_content_warning":"","activitypub_content_visibility":"","activitypub_max_image_attachments":5,"activitypub_interaction_policy_quote":"anyone","activitypub_status":"federated","footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[4],"tags":[],"ppma_author":[2661],"class_list":["post-31548","post","type-post","status-publish","format-standard","hentry","category-interesting-chemistry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Evaluating metadata quality and completeness for research data using the new DataCite Tool. - Henry Rzepa&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Evaluating metadata quality and completeness for research data using the new DataCite Tool. - Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"Metadata are an essential way of enabling the discoverability and impact of scholarly resources such as (research) data and associated objects. It must adhere to a precisely described schema describing its properties, and such a conformant metadata record is an mandatory component of a (research) data repository. Access to the record is by resolving the [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548\" \/>\n<meta property=\"og:site_name\" content=\"Henry Rzepa&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-17T10:45:19+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-06-18T09:14:16+00:00\" \/>\n<meta name=\"author\" content=\"Henry Rzepa\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Henry Rzepa\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Evaluating metadata quality and completeness for research data using the new DataCite Tool. - Henry Rzepa&#039;s Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548","og_locale":"en_GB","og_type":"article","og_title":"Evaluating metadata quality and completeness for research data using the new DataCite Tool. - Henry Rzepa&#039;s Blog","og_description":"Metadata are an essential way of enabling the discoverability and impact of scholarly resources such as (research) data and associated objects. It must adhere to a precisely described schema describing its properties, and such a conformant metadata record is an mandatory component of a (research) data repository. Access to the record is by resolving the [&hellip;]","og_url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548","og_site_name":"Henry Rzepa&#039;s Blog","article_published_time":"2026-06-17T10:45:19+00:00","article_modified_time":"2026-06-18T09:14:16+00:00","author":"Henry Rzepa","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Henry Rzepa","Estimated reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#article","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548"},"author":{"name":"Henry Rzepa","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"headline":"Evaluating metadata quality and completeness for research data using the new DataCite Tool.","datePublished":"2026-06-17T10:45:19+00:00","dateModified":"2026-06-18T09:14:16+00:00","mainEntityOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548"},"wordCount":1166,"commentCount":0,"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#primaryimage"},"thumbnailUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-678.jpg","articleSection":["Interesting chemistry"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548","name":"Evaluating metadata quality and completeness for research data using the new DataCite Tool. - Henry Rzepa&#039;s Blog","isPartOf":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#primaryimage"},"image":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#primaryimage"},"thumbnailUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-678.jpg","datePublished":"2026-06-17T10:45:19+00:00","dateModified":"2026-06-18T09:14:16+00:00","author":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281"},"breadcrumb":{"@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#primaryimage","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-678.jpg","contentUrl":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2026\/06\/Screenshot-678.jpg","width":2140,"height":1242},{"@type":"BreadcrumbList","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=31548#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog"},{"@type":"ListItem","position":2,"name":"Evaluating metadata quality and completeness for research data using the new DataCite Tool."}]},{"@type":"WebSite","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#website","url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/","name":"Henry Rzepa&#039;s Blog","description":"Chemistry with a twist","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/#\/schema\/person\/2b40f7b9c872a4dc1547e040a11b6281","name":"Henry Rzepa","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g370be3a7397865e4fd161aefeb0a5a85","url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","caption":"Henry Rzepa"},"description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London.","sameAs":["https:\/\/orcid.org\/0000-0002-8635-8390"],"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?author=1"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pDef7-8cQ","jetpack-related-posts":[{"id":20634,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=20634","url_meta":{"origin":31548,"position":0},"title":"Questions about the (metadata) components of a scientific article.","author":"Henry Rzepa","date":"April 8, 2019","format":false,"excerpt":"The conventional procedures for reporting analysis or new results in science is to compose an \"article\", augment that perhaps with \"supporting information\" or \"SI\", submit to a journal which undertakes peer review, with revision as necessary for acceptance and finally publication. If errors in the original are later identified, a\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":18465,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18465","url_meta":{"origin":31548,"position":1},"title":"FAIR Research data: Gravitational waves as an example from the astrophysics community.","author":"Henry Rzepa","date":"June 2, 2017","format":false,"excerpt":"In 2016, the world heard that gravitational waves had been detected and\u00a0now a third instance is reported.\u2021 Given that the data associated with these detections are perhaps amongst the most important instances in recent times, I thought I might take a peek at how it was managed. The original report\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/117-1024x584.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":21960,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=21960","url_meta":{"origin":31548,"position":2},"title":"The Persistent Identifier ecosystem expands &#8211; to instruments!","author":"Henry Rzepa","date":"March 21, 2020","format":false,"excerpt":"A PID or persistent identifier has been in common use in scientific publishing for around 20 years now. It was introduced as a DOI (Digital Object Identifier), and the digital object in this case was the journal article. From 2000 onwards, DOIs started appearing for most journal articles, journals having\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2020\/03\/ecosystem-1024x937.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":18344,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=18344","url_meta":{"origin":31548,"position":3},"title":"How to search data repositories for FAIR chemical content and data: SubjectScheme","author":"Henry Rzepa","date":"June 8, 2017","format":false,"excerpt":"As data repositories start to flourish, it is reasonable to ask questions such as what sort of chemistry can be found there and how can I find it? Here I give an updated worked example of a digital repository search for chemical content and also pose an important issue for\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2017\/06\/171-1024x196.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":20675,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=20675","url_meta":{"origin":31548,"position":4},"title":"The &#8220;Accessible&#8221; in FAIR (data).","author":"Henry Rzepa","date":"April 18, 2019","format":false,"excerpt":"In a previous post, I looked at the Findability of FAIR data in common chemistry journals. Here I move on to the next letter, the A = Accessible. The attributes of A include: (meta)data are retrievable by their identifier using a standardized communication protocol. the protocol is open, free and\u2026","rel":"","context":"In &quot;Chemical IT&quot;","block_context":{"text":"Chemical IT","link":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?cat=2"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":26941,"url":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/?p=26941","url_meta":{"origin":31548,"position":5},"title":"Internet Archeology:  reviving a 2001 article published in the Internet Journal of Chemistry.","author":"Henry Rzepa","date":"March 21, 2024","format":false,"excerpt":"In the mid to late 1990s as the Web developed, it was becoming more obvious that one area it would revolutionise was of scholarly journal publishing. Since the days of the very first scientific journals in the 1650s, the medium had been firmly rooted in paper. Even printed colour only\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/www.ch.ic.ac.uk\/rzepa\/blog\/wp-content\/uploads\/2024\/03\/Screenshot-297-1024x698.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]}],"jetpack_likes_enabled":false,"authors":[{"term_id":2661,"user_id":1,"is_guest":0,"slug":"admin","display_name":"Henry Rzepa","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/897b6740f7f599bca7942cdf7d7914af5988937ae0e3869ab09aebb87f26a731?s=96&d=blank&r=g","author_category":"1","first_name":"Henry","last_name":"Rzepa","user_url":"https:\/\/orcid.org\/0000-0002-8635-8390","job_title":"","description":"Henry Rzepa is Emeritus Professor of Computational Chemistry at Imperial College London."}],"_links":{"self":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/31548","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=31548"}],"version-history":[{"count":76,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/31548\/revisions"}],"predecessor-version":[{"id":31713,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=\/wp\/v2\/posts\/31548\/revisions\/31713"}],"wp:attachment":[{"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=31548"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=31548"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=31548"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.ch.ic.ac.uk\/rzepa\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fppma_author&post=31548"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}