Language modeling has seen impressive progress over the last few years. The invention of the Transformer architecture sparked a revolution in many fields of machine learning, leading to breakthroughs in chemistry and biology. In this chapter, we explore how analogies between chemical and natural language have inspired the use of Transformers to tackle important bottlenecks in the drug discovery process, such as retrosynthetic planning and chemical space exploration. The revolution started with models that addressed specific tasks using a single type of data, such as linearised molecular graphs; these models then evolved to incorporate other data types, such as spectra from analytical instruments, synthesis actions, and human language. A more recent trend leverages developments in large language models, giving rise to a wave of models capable of solving generic tasks in chemistry, all facilitated by the flexibility of natural language. As we continue to explore and harness these capabilities, we can look forward to a future where machine learning plays an even more integral role in accelerating drug discovery.
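To make the notion of a linearised molecular graph concrete, the minimal sketch below uses SMILES, the most common text encoding of a molecular graph; it assumes the RDKit library (our choice for illustration, not something prescribed by the chapter) is installed, and is purely illustrative.

```python
# Minimal sketch: a molecule is a graph (atoms = nodes, bonds = edges),
# and SMILES "linearises" that graph into a string that sequence models
# such as Transformers can process like a sentence.
# Assumes RDKit is installed (pip install rdkit); RDKit is used here only
# as an illustrative toolkit.
from rdkit import Chem

# Build the molecular graph of aspirin from a SMILES string.
mol = Chem.MolFromSmiles("CC(=O)Oc1ccccc1C(=O)O")

# The underlying object is a graph: count its nodes and edges.
print(mol.GetNumAtoms(), "atoms,", mol.GetNumBonds(), "bonds")

# Canonical SMILES: a deterministic linearisation of that graph.
print(Chem.MolToSmiles(mol))
```

Because the linearisation is just a string, it can be tokenised and modelled with the same machinery used for natural-language text, which is what makes the chemical-language analogy practical.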