Tag Archives: Chemistry Data

Navigating an Internet of Chemistry via ChemSpider

The internet is a rich source of chemistry related data and, nowadays, if a chemist knows how to initiate a search, data can be sourced for millions of chemicals online. The nature of online data varies from simple molecule diagrams, to experimental and predicted properties, encyclopedic articles, synthetic routes, analytical data, patents and publications. The array of information now accessible is distributed across thousands of sites giving rise to the information overload commonly associated with the Google-type searches on the internet. In addition the purest language of chemistry, that of chemical structures, is not fully supported on the web as yet. This presentation will provide an overview of how the internet is being meshed together using data aggregation and standardization approaches to enable a structure-searchable internet for chemistry. The speaker will present an overview of the ChemSpider platform (, the challenges of linking together over 400 internet resources and 26 million unique chemicals, and discuss how members of the chemistry community can directly contribute to enhancing the availability of quality data online.

This is a movie of the talk I gave using the BigBlueButton platform to students and faculty at the University of Arkansas, Little Rock.

Leave a comment

Posted by on October 19, 2011 in Publications and Presentations


Tags: , , , , ,

ChemSpider – Does Community Engagement work to Build a Quality Online Resource for Chemists?

This is my presentation at the Skolnik Symposium at ACS Denver to honor the contributions of Alexander “Sandy” Lawson to our domain of Cheminformatics.

ChemSpider – Does Community Engagement work to Build a Quality Online Resource for Chemists?

With an intention to provide a high quality free internet resource of chemistry related data for the community, ChemSpider has aggregated almost 25 million compounds linked out to over 400 data sources and provided a platform for the community to both deposit and curate data. This experiment in crowdsourcing for chemistry has now been running for over three years. This presentation will review a number of aspects of the project including (a) the level of community participation in depositing and curating data; (b) the nature of data and content supplied by the community; (c) how ChemSpider is used by the community; (d) using game-based systems to assist in data curation; (e) algorithmic-based approaches to data validation and filtering; and (f) sharing data curation efforts with other online databases.


Leave a comment

Posted by on August 30, 2011 in Publications and Presentations


Tags: , , , , , , , ,