GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

08/14/2023
by   Pengfei Liu, et al.
0

Large language models have made significant strides in natural language processing, paving the way for innovative applications including molecular representation and generation. However, most existing single-modality approaches cannot capture the abundant and complex information in molecular data. Here, we introduce GIT-Mol, a multi-modal large language model that integrates the structure Graph, Image, and Text information, including the Simplified Molecular Input Line Entry System (SMILES) and molecular captions. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture capable of mapping all modalities into a unified latent space. Our study develops an innovative any-to-language molecular translation strategy and achieves a 10 captioning, a 5 in molecule generation validity compared to baseline or single-modality models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2023

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

Although instruction-tuned large language models (LLMs) have exhibited r...
research
07/01/2021

Multimodal Graph-based Transformer Framework for Biomedical Relation Extraction

The recent advancement of pre-trained Transformer models has propelled t...
research
11/25/2022

Molecular Joint Representation Learning via Multi-modal Information

In recent years, artificial intelligence has played an important role on...
research
08/09/2023

Prompting In-Context Operator Learning with Sensor Data, Equations, and Natural Language

In the growing domain of scientific machine learning, in-context operato...
research
07/22/2023

Extracting Molecular Properties from Natural Language with Multimodal Contrastive Learning

Deep learning in computational biochemistry has traditionally focused on...
research
04/25/2022

Translation between Molecules and Natural Language

Joint representations between images and text have been deeply investiga...
research
07/09/2019

M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention

Generative adversarial networks have led to significant advances in cros...

Please sign up or login with your details

Forgot password? Click here to reset