I reminisced about the wonderfully naive but exciting Web period of 1993-1994. That period introduced us for the first time to server-log analysis, and to the idea of hits on a web page. One of our first attempts at crowd-sourcing and analysis was to run an electronic conference in heterocyclic chemistry and to look at how the attendees visited the individual posters and presentations by analysing the server logs.
Archive for the ‘Chemical IT’ Category
Internet Archaeology: Blasts from the past.
Friday, October 11th, 2013
In 1993-1994, when the Web (synonymous in most minds now with the Internet) was still young, the pace of progress was so rapid that some wag worked out that one “web-year” was like a dog-year, worth about 7 years of normal human time. So in this respect, 1994 is now some 133 web-years ago. Long enough for an archaeological excavation.
Publishing a procedure with a doi.
Wednesday, October 2nd, 2013
In the two-publisher model I proposed a post or so back, I showed an example of how data can be incorporated (transcluded) into the story narrative of a scientific article, with that story and the data each having their own independently citable reference (using a doi for the citation). Here I take it a step further, by publishing a functional procedure in a digital repository[1] and assigning it its own doi:10.6084/m9.figshare.811862.
References
- H.S. Rzepa, "Script for creating an NCI surface as a JVXL compressed file from a (Gaussian) cube of total electron density", 2013. https://doi.org/10.6084/m9.figshare.811862
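To illustrate how a doi written in the `doi:` form used above maps onto the resolvable link shown in the reference entry, here is a minimal sketch. The function name `doi_to_url` is hypothetical, and the only assumption about the resolver is the `https://doi.org/` prefix that already appears in the reference list; nothing here is taken from the published script itself.

```python
from urllib.parse import quote

# Resolver prefix as it appears in the reference entries of this page.
RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn 'doi:10.x/y' or a bare '10.x/y' into a resolver URL (illustrative helper)."""
    doi = doi.strip()
    # Drop an optional 'doi:' label, whatever its letter case.
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    # Every DOI name starts with the directory indicator '10.'
    if not doi.startswith("10."):
        raise ValueError(f"not a DOI: {doi!r}")
    # Percent-encode anything unsafe in a URL path, keeping '/' separators.
    return RESOLVER + quote(doi, safe="/")


if __name__ == "__main__":
    print(doi_to_url("doi:10.6084/m9.figshare.811862"))
```

The same helper works for any of the other dois cited on this page, since each reference entry already uses the same resolver prefix.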
A two-publisher model for the scientific article: narrative+shared data.
Sunday, September 15th, 2013
I do go on rather a lot about enabling or hyper-activating[1] data. So do others[2]. Why is sharing data important?
References
- O. Casher, G.K. Chandramohan, M.J. Hargreaves, C. Leach, P. Murray-Rust, H.S. Rzepa, R. Sayle, and B.J. Whitaker, "Hyperactive molecules and the World-Wide-Web information system", Journal of the Chemical Society, Perkin Transactions 2, pp. 7, 1995. https://doi.org/10.1039/p29950000007
- R. Van Noorden, "Data-sharing: Everything on display", Nature, vol. 500, pp. 243-245, 2013. https://doi.org/10.1038/nj7461-243a
The Amsterdam Manifesto on Data Citation Principles
Wednesday, July 31st, 2013
The Amsterdam manifesto espouses the principles of citable open data. It is a short document, and it is worth re-stating its eight points here:
150,000,000 DFT calculations on 2,300,000 compounds!
Friday, July 5th, 2013
The title of this post summarises the contents of a new molecular database: www.molecularspace.org[1] and I picked up on it by following the post by Jan Jensen at www.compchemhighlights.org (a wonderful overlay journal that tracks recent interesting articles). The molecularspace project more formally is called “The Harvard Clean Energy Project: Large-scale computational screening and design of organic photovoltaics on the world community grid”. It reminds me of a 2005 project by Peter Murray-Rust et al. built on the same sort of concept[2] (the World-Wide-Molecular-Matrix, or WWMM[3]), although the new scale is certainly impressive. Here I report my initial experiences looking through molecularspace.org.
References
- J. Hachmann, R. Olivares-Amaya, S. Atahan-Evrenk, C. Amador-Bedolla, R.S. Sánchez-Carrera, A. Gold-Parker, L. Vogt, A.M. Brockway, and A. Aspuru-Guzik, "The Harvard Clean Energy Project: Large-Scale Computational Screening and Design of Organic Photovoltaics on the World Community Grid", The Journal of Physical Chemistry Letters, vol. 2, pp. 2241-2251, 2011. https://doi.org/10.1021/jz200866s
- P. Murray-Rust, H.S. Rzepa, J.J.P. Stewart, and Y. Zhang, "A global resource for computational chemistry", Journal of Molecular Modeling, vol. 11, pp. 532-541, 2005. https://doi.org/10.1007/s00894-005-0278-1
- P. Murray-Rust, S.E. Adams, J. Downing, J.A. Townsend, and Y. Zhang, "The semantic architecture of the World-Wide Molecular Matrix (WWMM)", Journal of Cheminformatics, vol. 3, 2011. https://doi.org/10.1186/1758-2946-3-42
Research data and the “h-index”.
Monday, June 24th, 2013
The blog post by Rich Apodaca entitled “The Horrifying Future of Scientific Communication” is very thought-provoking and well worth reading. He takes us through disruptive innovation, and how it might impact upon how scientists communicate their knowledge. One solution floated for us to ponder is that “supporting Information, combined with data mining tools, could eliminate most of the need for manuscripts in the first place”. I am going to juxtapose that suggestion with something else I recently discovered.
Digital repositories. An update to the update.
Monday, August 13th, 2012
QR codes and InChI strings.
Sunday, July 22nd, 2012
A month or so ago at a workshop I was attending, a speaker included in his introductory slide a QR (Quick Response) Code. It is a feature of most digital eco-systems that there is probably already “an app for it”. So I thought I would jump on the bandwagon by coding an InChI string. Here it is below: