Skip to content

KU-MedAI/ArcDFI

Repository files navigation

ArcDFI: Attention Regularization guided by CYP450 Interactions for Predicting Drug-Food Interactions

img

Abstract

Drug-food interactions are an integral part of health and safety especially when consuming certain foods during drug treatment. They are characterized by situations where foods consisting of various small food compounds alter the pharmacokinetics or pharmacodynamics of a drug compound. Especially, it is known that CYP450 enzyme families play a key role in explaining drug-food interactions (DFIs). Previous works have introduced computational approaches for predicting DFIs but lack incorporation of drug-CYP interaction (DCI) information and also have limited generalizability in drug or food compounds unseen during model training. In this paper, we introduce ArcDFI, a model that utilizes Attention Regularization guided by CYP450 Interactions for predicting Drug-Food Interactions. Our experiments conducted on stricter evaluation settings (cold drug and cold food) show ArcDFI's strong generalizability in both unseen drug and food compounds, compared with other baseline models. Analysis on ArcDFI's cross attention mechanism between the CYP450 isoenzymes and compound substructures provides insights of its current understanding of DCIs that lead to explaining its rationale behind DFI predictions. Although the attention regularization method helps ArcDFI develop its understanding of DCIs, we conclude that expanding the DCI interactions and gathering additional data can further improve ArcDFI's predictability and interpretability.

Required Libraries

Install the following external libraries to run this project:

deepchem
matplotlib
numpy
omegaconf
pandas
rdkit
scikit-learn
scipy
seaborn
torch
torch_geometric

Download Instructions

To run the model and reproduce results, please download the following files from Google Drive:

  1. Model Checkpoint
    File: arcdfi.ckpt
    Download Link

  2. DFI Dataset
    File: dfi_final.csv
    Download Link

After downloading, place the files in the appropriate directories:

├── data/
│   └── dfi_240901.csv
├── checkpoints/
│   └── arcdfi.ckpt

As stated in our paper, we release both Drug-Food Interaction and Drug-CYP450 Interaction dataset. The dataset zip file from the Google Drive contains both Drug-Food Interaction Dataset dfi_final.csv and Drug-CYP450 Interaction Dataset dci_final.csv. Only the dfi_final.csv file is needed for the model training process.

Model Training Instructions

Simply run the following command,

python train.py -cn arcdfi -pn <wandb_project_name> -gn <wandb_group_name> -sn <wandb_session_name> -ds <newdrug_or_newfood>

Attention Weights Analysis

img img

You can also reproduce the visualization results of the attention weights by running the Jupyter Notebook ArcDFI Analysis.ipynb. Make sure that the downloaded files are in the correct directories.

Contributors

Name Affiliation Email
Mogan Gim Department of Biomedical Engineering,
Hankuk University of Foreign Studies, Yongin, South Korea
gimmogan@hufs.ac.kr
Jaewoo Kang Department of Computer Science,
Korea University, Seoul, South Korea
kangj@korea.ac.kr
Donghyeon Park* Department of AI and Data Science,
Sejong University, Seoul, South Korea
parkdh@sejong.ac.kr
Minji Jeon* Department of Biomedical Informatics, Department of Medicine,
Korea University College of Medicine, Seoul, South Korea
mjjeon@korea.ac.kr
  • *: Co-corresponding Authors

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published