Prediction of chemical reaction yields using deep learning

نویسندگان

چکیده

Abstract Artificial intelligence is driving one of the most important revolutions in organic chemistry. Multiple platforms, including tools for reaction prediction and synthesis planning based on machine learning, have successfully become part chemists’ daily laboratory, assisting domain-specific synthetic problems. Unlike retrosynthetic models, yields has received less attention spite enormous potential accurately predicting conversion rates. Reaction describing percentage reactants converted to desired products, could guide chemists help them select high-yielding reactions score routes, reducing number attempts. So far, yield predictions been predominantly performed high-throughput experiments using a categorical (one-hot) encoding reactants, concatenated molecular fingerprints, or computed chemical descriptors. Here, we extend application natural language processing architectures predict properties given text-based representation reaction, an encoder transformer model combined with regression layer. We demonstrate outstanding performance two experiment sets. An analysis reported open-source USPTO data set shows that their distribution differs depending mass scale, limiting applicability predictions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Learning for Chemical Compound Stability Prediction

This paper explores the idea of using deep neural networks with various architectures and a novel initialization method, to solve a critical topic in the field of materials science. Understanding the relationship between the composition and the property of materials is essential for accelerating the course of materials discovery. Data driven approaches using advanced machine learning to derive ...

متن کامل

Toxicity Prediction using Deep Learning

Everyday we are exposed to various chemicals via food additives, cleaning and cosmetic products and medicines — and some of them might be toxic. However testing the toxicity of all existing compounds by biological experiments is neither financially nor logistically feasible. Therefore the government agencies NIH, EPA and FDA launched the Tox21 Data Challenge within the “Toxicology in the 21st C...

متن کامل

Prediction and Prevention of Chemical Reaction Hazards – Learning by Simulation

Assignments based on dynamic simulation of a batch reactor and a semi-batch reactor in which exothermic reactions are conducted, are used to teach students the various aspects of process safety. The students can observe temperature runaway taking place because of incidents, such as overcharging, cooling water failure, pipe blockage and excessive initial heating. They can derive various strategi...

متن کامل

DeepTox: Toxicity Prediction using Deep Learning

The Tox21 Data Challenge has been the largest effort of the scientific community to compare computational methods for toxicity prediction. This challenge comprised 12,000 environmental chemicals and drugs which were measured for 12 different toxic effects by specifically designed assays. We participated in this challenge to assess the performance of Deep Learning in computational toxicity predi...

متن کامل

Traffic Prediction using a Deep Learning Paradigm

For many years intelligent transportation systems (ITS) have been collecting and processing huge amounts of data from numerous sensors to generate a ground truth of urban traffic. Such data has set the foundation of traffic theory, planning and simulation to create rule-based systems. It has also been used in many different studies in data-driven short-term traffic flow forecasting with promisi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Machine learning: science and technology

سال: 2021

ISSN: ['2632-2153']

DOI: https://doi.org/10.1088/2632-2153/abc81d