Can Large Language Models Empower Molecular Property Prediction?

07/14/2023
by   Chen Qian, et al.
0

Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines. Conventionally, a molecule graph can be represented either as a graph-structured data or a SMILES text. Recently, the rapid development of Large Language Models (LLMs) has revolutionized the field of NLP. Although it is natural to utilize LLMs to assist in understanding molecules represented by SMILES, the exploration of how LLMs will impact molecular property prediction is still in its early stage. In this work, we advance towards this objective through two perspectives: zero/few-shot molecular classification, and using the new explanations generated by LLMs as representations of molecules. To be specific, we first prompt LLMs to do in-context molecular classification and evaluate their performance. After that, we employ LLMs to generate semantically enriched explanations for the original SMILES and then leverage that to fine-tune a small-scale LM model for multiple downstream tasks. The experimental results highlight the superiority of text explanations as molecular representations across multiple benchmark datasets, and confirm the immense potential of LLMs in molecular property prediction tasks. Codes are available at <https://github.com/ChnQ/LLM4Mol>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2023

Extracting Molecular Properties from Natural Language with Multimodal Contrastive Learning

Deep learning in computational biochemistry has traditionally focused on...
research
05/27/2023

What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks

Large Language Models (LLMs) with strong abilities in natural language p...
research
07/16/2021

Property-aware Adaptive Relation Networks for Molecular Property Prediction

Molecular property prediction plays a fundamental role in drug discovery...
research
11/09/2020

Explaining Deep Graph Networks with Molecular Counterfactuals

We present a novel approach to tackle explainability of deep graph netwo...
research
12/07/2020

ATOM3D: Tasks On Molecules in Three Dimensions

Computational methods that operate directly on three-dimensional molecul...
research
11/20/2022

Heterogenous Ensemble of Models for Molecular Property Prediction

Previous works have demonstrated the importance of considering different...
research
11/29/2022

BARTSmiles: Generative Masked Language Models for Molecular Representations

We discover a robust self-supervised strategy tailored towards molecular...

Please sign up or login with your details

Forgot password? Click here to reset