Created: 29 July 2021
IBM Research Europe and Science of Synthesis/Thieme Chemistry
Collaboration accelerates discovery in organic chemistry
Stuttgart, Germany – The RXN for Chemistry cloud platform helps synthetic organic chemists in predicting the outcome of chemical reactions using artificial intelligence (AI) which is trained on data. Thus, a prerequisite for optimal prediction results is high-quality datasets. The cooperation between IBM and Thieme Chemistry aims at improving the prediction outcomes using synthesis data from Thieme's expert curated digital publication source for organic chemistry - Science of Synthesis.
IBM launched RXN for Chemistry in 2018. The cloud platform uses an artificial intelligence model called Molecular Transformer which applies neural machine translation models to predict the outcome of a chemical reaction and thus, improve synthesis planning in organic chemistry.
"The challenge for organic chemists is that there are hundreds of thousands of possible reactions of organic compounds. To address this, we used natural language processing models for all RXN prediction tasks. The RXN models have no built-in chemistry and are not based on codified rules. Every chemical prediction is based on the knowledge learned from the data during training. With AI, cloud and automation, today we can accelerate discovery in organic chemistry by a factor of ten," says Dr. Teodoro Laino, Distinguished Scientist at IBM Research Europe.
Driving technical innovation with high-quality, diverse, and well-structured data
"Tools for translating from one language to another are only as good as the data on which the algorithms are trained," says Dr. Alain Vaucher, Research Scientist at IBM. "Our assumption is that this is also true for predicting chemical synthesis results: the results depend very much on the underlying data."
Earlier this year IBM Research and Thieme Chemistry incorporated expert synthesis data from Thieme's expert curated digital publication source on organic chemistry – Science of Synthesis – into RXN for Chemistry and initial results show that Thieme-trained models predict correct reactions twice as often as baseline models when tested on Science of Synthesis chemistry.
Initial results of forward reaction prediction on reactions from Science of Synthesis.
"We are pleased to be directly involved in this innovative project, which is of high importance for the chemistry community," says Dr. M. Fiona Shortt de Hernandez, Senior Director Product Management, Strategic Partnerships and Science of Synthesis at Thieme Chemistry. "Six highly-renowned organic synthesis experts and their groups have agreed to test the retrained models. Together this collaboration will help drive the development of state-of-the-art custom-fit tools for organic chemists," Shortt affirms.
“The collaboration with Thieme is an important landmark between AI solution providers and domain specific data publishers, with important business opportunities for both," says Laino. “I am very excited to share these preliminary results and curious to see how they will lead in the next months to an improved AI experience for synthetic organic chemists.”
Would you be interested in using IBM RXN for Chemistry, trained on Science of Synthesis, as a cloud service if it should become available later? Please contact ibmrxn@thieme-chemistry.com.
IBM RXN for Chemistry is available for free at: https://rxn.res.ibm.com