Archive for category ChemSpider Chemistry

Presentation at the 2014 Allen Press Emerging Trends in Scholarly Publishin Seminar

Today I gave a presentation at the 2014 Allen Press Emerging Trends in Scholarly Publishing™ Seminar here in Washington DC. Over coffee I had very positive feedback about what we are doing at RSC and various comments about “real science exposed by a publisher”. The abstract and Slideshare presentation are below.

The Application of Text and Data Mining to Enhance the Royal Society of Chemistry Publication Archive

The Royal Society of Chemistry (RSC) is one of the world’s most prominent scientific societies and STM publishers. Our contributions to the scientific community include the delivery of a myriad of resources to support the chemistry community to access chemistry-related data, information and knowledge. This includes ChemSpider, a compound centric platform linking together over 30 million chemical compounds with internet-based resources. Using this compound database and its associated chemical identifiers as a basis the RSC is utilizing text and data mining approaches to data enable our published archive of scientific publications. This presentation will provide an overview of our technical approaches to text and data enable our archive of scientific articles, how we are developing an integrated database of chemical compounds, reactions, physical and analytical data and how it will be used to facilitate scientific discovery.

No Comments

Accessing Royal Society of Chemistry resources and making chemistry mobile

This is a presentation I gave at the ACS Dallas meeting on March 19th 2014

Accessing Royal Society of Chemistry resources and making chemistry mobile

The ongoing drive towards mobile devices is now simply one of generic ubiquity. It is less an issue of whether a scientist has a mobile device but rather what brand, what generation and what apps do they have installed. Chemistry has fast been moving to mobile devices for a number of years now and today is it possible to draw chemical compounds, perform searches of databases both on device and in the cloud. Modeling of data using server based platforms is increasing in scope and capabilities. The Royal Society of Chemistry was early in recognizing the potential power of mobile platforms in terms to allowing scientists to access data and the benefits of such devices to allowing students access to data and content. This presentation will provide an overview of our efforts to date in supporting chemistry technologies on mobile devices and our recent developments in this domain.

 

No Comments

Data enhancing the Royal Society of Chemistry publication archive

This is a presentation I gave at the ACS Dallas meeting on March 19th 2014

Data enhancing the Royal Society of Chemistry publication archive

The Royal Society of Chemistry has an archive of hundreds of thousands of published articles containing various types of chemistry related data – compounds, reactions, property data, spectral data etc. RSC has a vision of extracting as much of these data as possible and providing access via ChemSpider and its related projects. To this end we have applied a combination of text-mining extraction, image conversion and chemical validation and standardization approaches. The outcome of this project will result in new chemistry related data being added to our chemical and reaction databases and in the ability to more tightly couple web-based versions of the articles with these extracted data. The ability to search across the archive will be enhanced as a result. This presentation will report on our progress in this data extraction project and discuss how we will ultimately use similar approaches in our publishing pipeline to enhance article markup for new publications.

 

No Comments

The UK National Chemical Database Service as an integration of commercial and public chemistry services to support chemists in the United Kingdom

This is a presentation I gave at the ACS National Meeting in Dallas on Wednesday 19th March 2014

The UK National Chemical Database Service – an integration of commercial and public chemistry services to support chemists in the United Kingdom

At a time when the data explosion has simply been redefined as “Big”, the hurdles associated with building a subject-specific data repository for chemistry are daunting. Combining a multitude of non-standard data formats for chemicals, related properties, reactions, spectra etc., together with the confusion of licensing and embargoing, and providing for data exchange and integration with services and platforms external to the repository, the challenge is significant. This all at a time when semantic technologies are touted as the fundamental technology to enhance integration and discoverability. Funding agencies are demanding change, especially a change towards access to open data to parallel their expectations around Open Access publishing. The Royal Society of Chemistry has been funded by the Engineering and Physical Science Research of the UK to deliver a “chemical database service” for UK scientists. This presentation will provide an overview of the challenges associated with this project and our progress in delivering a chemistry repository capable of handling the complex data types associated with chemistry. The benefits of such a repository in terms of providing data to develop prediction models to further enable scientific discovery will be discussed and the potential impact on the future of scientific publishing will also be examined.

 

No Comments

Ontology work at the Royal Society of Chemistry #ACSDallas

This is a presentation at I gave at the ACS Spring meeting in Dallas, Texas on March 17th 2014

Ontology work at the Royal Society of Chemistry

We provide an overview of the use we make of ontologies at the Royal Society of Chemistry.  Our engagement with the ontology community began in 2006 with preparations for Project Prospect, which used ChEBI and other Open Biomedical Ontologies to mark up journal articles. Subsequently Project Prospect has evolved into DERA (Digitally Enhancing the RSC Archive) and we have developed further ontologies for text markup, covering analytical methods and name reactions. Most recently we have been contributing to CHEMINF, an open-source cheminformatics ontology, as part of our work on disseminating calculated physicochemical properties of molecules via the Open PHACTS. We show how we represent these properties and how it can serve as a template for disseminating different sorts of chemical information.

 

No Comments

Big data challenges associated with building a national data repository for chemistry

I gave a presentation at the ICIC 2013 meeting in Vienna focused on the “Big data challenges associated with building a national data repository for chemistry“. The Slideshare presentation is shown below.

At a time when the data explosion has simply been redefined as “Big”, the hurdles associated with building a subject-specific data repository for chemistry are daunting. Combining a multitude of non-standard data formats for chemicals, related properties, reactions, spectra etc., together with the confusion of licensing and embargoing, and providing for data exchange and integration with services and platforms external to the repository, the challenge is significant. This all at a time when semantic technologies are touted as the fundamental technology to enhance integration and discoverability. Funding agencies are demanding change, especially a change towards access to open data to parallel their expectations around Open Access publishing. The Royal Society of Chemistry has been funded by the Engineering and Physical Science Research of the UK to deliver a “chemical database service” for UK scientists. This presentation will provide an overview of the challenges associated with this project and our progress in delivering a chemistry repository capable of handling the complex data types ssociated with chemistry. The benefits of such a repository in terms of providing data to develop prediction models to further enable scientific discovery will be discussed and the potential impact on the future of scientific publishing will also be examined.

No Comments

The importance of standards for data exchange and interchange on the Royal Society of Chemistry eScience platforms

This is my seventh and LAST talk at the ACS Meeting in Indianapolis:

The importance of standards for data exchange and interchange on the Royal Society of Chemistry eScience platforms

The Royal Society of Chemistry provides access to a number of databases hosting chemicals data, reactions, spectroscopy data and prediction services. These databases and services can be accessed via web services utilizing queries using standard data formats such as InChI and molfiles. Data can then be downloaded in standard structure and spectral formats allowing for reuse and repurposing. The ChemSpider database integrates to a number of projects external to RSC including Open PHACTS that integrates chemical and biological data. This project utilizes semantic web data standards including RDF. This presentation will provide an overview of how structure and spectral data standards have been critical in allowing us to integrate many open source tools, ease of integration to a myriad of services and underpin many of our future developments.

No Comments

Digitizing documents to provide a public spectroscopy database

This is my sixth presentation at the ACS Fall Meeting in Indianapolis:

Digitizing documents to provide a public spectroscopy database

RSC hosts a number of platforms providing free access to chemistry related data. The content includes chemical compounds and associated experimental and predicted data, chemical reactions and, increasingly, spectral data. The ChemSpider database primarily contains electronic spectral data generated at the instrument, converted into standard formats such as JCAMP, then uploaded for the community to access. As a publisher RSC holds a rich source of spectral data within our scientific publications and associated electronic supplementary information. We have undertaken a project to Digitally Enable the RSC Archive (DERA) and as part of this project are converting figures of spectral data into standard spectral data formats for storage in our ChemSpider database. This presentation will report on our progress in the project and some of the challenges we have faced to date.

 

 

No Comments

Accessing chemical health and safety data online using Royal Society of Chemistry resources

This is the second presentation I gave at the ACS Meeting in Indianapolis

Accessing chemical health and safety data online using Royal Society of Chemistry resources

The internet has opened up access to large amounts of chemistry related data that can be harvested and assembled into rich resources of value to chemists. The Royal Society of Chemistry’s ChemSpider database has assembled an electronic collection of over 28 million chemicals from over 400 data sources and some of the assembled data is certainly of value to those searching for chemical health and safety information. Since ChemSpider is a text and structure searchable database chemists are able to find relevant information using both of their general search approaches. This presentation will provide an overview of the types of chemical health and safety data and information made available via ChemSpider and discuss how the data are sourced, aggregated and validated. We will examine how the data can be made available via mobile devices and examine the issue of data quality and its potential impacts on such a database.

 

No Comments

Apps and approaches to mobilizing chemistry from the Royal Society of Chemistry

This is the first presentation I gave at the ACS Meeting in Indianapolis

Apps and approaches to mobilizing chemistry from the Royal Society of Chemistry

Mobilizing chemistry by delivering data and content from Royal Society of Chemistry resources has become an important component of our activities to increase accessibility. Content includes access to our publications, our magazine content and our chemistry databases. Mobile devices also allow us to deliver access to tools to support teaching, game-based learning, annotation and curation of data. This presentation will provide an overview of our varied activities in enhancing access to chemistry related data and materials. This will include providing data feeds associated with RSC graphical databases, our experiences in optical structure recognition using smartphone apps and our future vision for supporting chemistry on mobile devices.

No Comments