Archive for category ChemSpider Chemistry
This is the second presentation I gave at the ACS Meeting in Indianapolis
Accessing chemical health and safety data online using Royal Society of Chemistry resources
The internet has opened up access to large amounts of chemistry related data that can be harvested and assembled into rich resources of value to chemists. The Royal Society of Chemistry’s ChemSpider database has assembled an electronic collection of over 28 million chemicals from over 400 data sources and some of the assembled data is certainly of value to those searching for chemical health and safety information. Since ChemSpider is a text and structure searchable database chemists are able to find relevant information using both of their general search approaches. This presentation will provide an overview of the types of chemical health and safety data and information made available via ChemSpider and discuss how the data are sourced, aggregated and validated. We will examine how the data can be made available via mobile devices and examine the issue of data quality and its potential impacts on such a database.
This is the first presentation I gave at the ACS Meeting in Indianapolis
Apps and approaches to mobilizing chemistry from the Royal Society of Chemistry
Mobilizing chemistry by delivering data and content from Royal Society of Chemistry resources has become an important component of our activities to increase accessibility. Content includes access to our publications, our magazine content and our chemistry databases. Mobile devices also allow us to deliver access to tools to support teaching, game-based learning, annotation and curation of data. This presentation will provide an overview of our varied activities in enhancing access to chemistry related data and materials. This will include providing data feeds associated with RSC graphical databases, our experiences in optical structure recognition using smartphone apps and our future vision for supporting chemistry on mobile devices.
The future of scientific information & communication presented at the SUNY Potsdam Academic Festival
This is a LONG presentation….I talk about the “It’s All About Me” attitude that can positively feed science….we want to share OUR science, we want people to know about our opinions, our activities, our collaborators, we want to get funding, recognition and attribution. And why not…it can all be to the benefit of science.
This presentation was given at the SUNY Potsdam Academic Festival
The future of scientific information & communication
Our access to scientific information has changed in ways that were hardly imagined even by the early pioneers of the internet. The immense quantities of data and the array of tools available to search and analyze online content continues to expand while the pace of change does not appear to be slowing. While scientists now have access to the enormous capacities and capability of the internet the vast majority of scientific communication continues to be through peer-reviewed scientific journals. The measure of a scientist’s contribution is primarily represented by their publication profile and the citations to their published works and offers an incomplete view of their activities. However, we are at the beginning of a new revolution where the ability to communicate offers the opportunity to embrace new forms of publishing and where scientific participation and influence will be measured in new ways. This presentation will provide an overview of our new generation of “openness” in which open source, open standards, open access and open data are proliferating. The future of scientific information and communication will be underpinned by these efforts, influenced by increasing participation from the scientific community and facilitated collaboration and ultimately accelerate scientific progress.
Presentation given at ACS New Orleans Spring Meeting
As a result of the advent of internet technologies supporting participation on the internet via blogs, wikis and other social networking approaches, chemists now have an opportunity to contribute to the growing chemistry content on the web. As scientists an important skill to develop is the ability to succinctly report in a published format the details of scientific experimentation. The Royal Society of Chemistry provides a number of online systems to share chemistry data, the most well known of these being the ChemSpider database. In parallel the ChemSpider SyntheticPages (CSSP) platform is an online publishing platform for scientists, and especially students, to publish the details of chemical syntheses that they have performed. Using the rich capabilities of internet platforms, including the ability to display interactive spectral data and movies, CSSP is an ideal environment for students to publish their work, especially syntheses that might not support mainstream publication.
Presentation given at ACS New Orleans Spring Meeting
ChemSpider is one of the chemistry community’s primary public compound databases. Containing tens of millions of chemical compounds and its associated data ChemSpider serves data to many tens of websites and software applications at this point. This presentation will provide an overview of the expanding reach of the ChemSpider platform and the nature of solutions that it helps to enable. We will also discuss some of the future directions for the project that are envisaged and how we intend to continue expanding the impact for the platform.
Presentation given at ACS New Orleans Spring Meeting
Chemistry online is represented in various ways including publications, presentations, blog posts, wiki-contributions, data depositions, curations and annotations. Encouraging participation from the community to participate in and comment on the information delivered via these various formats would likely provide for a rich dialog exchange in some cases and improved data quality in others. At the Royal Society of Chemistry we have a number of platforms that are amenable to contribution. This presentation will provide an overview of our experiences in engaging the community to interact with our various forms of content and discuss new approaches we are utilizing to encourage crowdsourced participation.
As a scientist myself, and after years of pulling my hair out over various projects (as well as some of the conversations I have with my twin boys) there are times I wish that I wasn’t “thinning on top”. This video uses the holiday season to help encourage those of us who are “follicly challenged” to realize that science will give us the solution. This movie is a collaboration between my good friend Professor Alex Tropsha from the University of North Carolina in Chapel Hill (http://pharmacy.unc.edu/Directory/tropsha) and myself (Antony Williams, http://www.chemconnector.com/) where we did a few takes …literally 8 different shots only, in less than an hour. With a little bit of waxing lyrical to an old classical Christmas Song, and a quick self-training course in Microsoft Movie Maker (BRILLIANT SOFTWARE) I threw this together in about 2 hours. Enjoy! And there is hope for all us who, because of the excess heat from our skulls from “all that thinking”, are evaporating the follicles.
Please do note that Alex is modeling a ChemSpider SyntheticPages labcoat. Want to win one? Then you need to write a few articles here: http://cssp.chemspider.com/
I would like to acknowledge my mother for giving me a singing voice that sounds like I am gargling broken glass.
Second talk delivered today at ACS Philadelphia…
Mining public domain data as a basis for drug repurposing
Online databases containing high throughput screening and other property data continue to proliferate in number. Many pharmaceutical chemists will have used databases such as PubChem, ChemSpider, DrugBank, BindingDB and many others. This work will report on the potential value of these databases for providing data to be used to repurpose drugs using cheminformatics-based approaches (e.g. docking, ligand-based machine learning methods). This work will also discuss the potentially related applications of the Open PHACTS project, a European Union Innovative Medicines Initiative project, that is utilizing semantic web based approaches to integrate large scale chemical and biological data in new ways. We will report on how compound and data quality should be taken into account when utilizing data from online databases and how their careful curation can provide high quality data that can be used to underpin the delivery of molecular models that can in turn identify new uses for old drugs.
I had the privilege of co-chairing a session on Mobile Chemistry today with Harry Pence. The session was “Mobile devices, augmented reality, and the mobile classroom”
Putting chemistry into the hands of students – chemistry made mobile using resources from the Royal Society of Chemistry
The increasing prevalence of mobile devices offers the opportunity to provide chemistry students with easy access to a multitude of resources. As a publisher the RSC provides a myriad of content to chemists including an online database of over 26 million chemical compounds, tools for learning spectroscopy and access to scientific literature and other educational materials. This presentation will provide an overview of our efforts to make RSC content more mobile, and therefore increasingly available to chemists. In particular it will discuss our efforts to provide access to chemistry related data of high value to students in the laboratory. It will include an overview of spectroscopy tools for the review and analysis of various forms of spectroscopy data.
Chemistry is complex. Anybody who has been involved with the creation of electronic datafiles containing thousands of chemical compounds and associated data (chemical names, properties etc) will tell you that errors creep in. ChemSpider has >28 million unique chemical entities and these have been sourced from many different places/groups/individuals. Some of these have been deprecated as we have determined, both manually and algorithmically, that the data are in error. Over the years we have learned a lot about data quality and ways in which algorithms can be applied to data prior to deposition on ChemSpider.
Some obvious structure-based errors that can be checked for would include: hypervalency (e.g. pentavalent carbons), charge imbalance (a compound has no neutralizing counterion for example), absence of stereochemistry (e.g. a compound with 12 possible stereocenters only has one assigned). There are many other such errors that can be detected algorithmically. It’s the old adage of why apply a human to what a computer can fix. With this in mind we have been working on a system called the ChemSpider Validation and Standardization Platform (CVSP for short). This system will serve multiple purposes. It will be one of the foundation blocks for checking structure-based data for our publications (i.e. catch bad chemistry before it is published!), it will be used for validating chemistry for our databases (Natural Product Updates, Methods in Organic Synthesis and Catalysts and Catalyzed Reactions), it will be used to check and validate depositions going into ChemSpider, it will serve data related to the Open PHACTS project and it will serve the community by providing an online website where you can upload your own SDF files (and other file formats in future) to validate the structures.
I won’t go into detail here about all of the functionality and capability of the system as we will discuss this in further detail on this blog. However, we will be unveiling the system in its present form at the ACS meeting in Philadelphia. Come along and meet some of the team involved in building CVSP and give us your feedback!