ArcDFI: Attention Regularization guided by CYP450 Interactions for Predicting Drug-Food Interactions
Drug-food interactions are an integral part of health and safety especially when consuming certain foods during drug treatment. They are characterized by situations where foods consisting of various small food compounds alter the pharmacokinetics or pharmacodynamics of a drug compound. Especially, it is known that CYP450 enzyme families play a key role in explaining drug-food interactions (DFIs). Previous works have introduced computational approaches for predicting DFIs but lack incorporation of drug-CYP interaction (DCI) information and also have limited generalizability in drug or food compounds unseen during model training. In this paper, we introduce ArcDFI, a model that utilizes Attention Regularization guided by CYP450 Interactions for predicting Drug-Food Interactions. Our experiments conducted on stricter evaluation settings (cold drug and cold food) show ArcDFI's strong generalizability in both unseen drug and food compounds, compared with other baseline models. Analysis on ArcDFI's cross attention mechanism between the CYP450 isoenzymes and compound substructures provides insights of its current understanding of DCIs that lead to explaining its rationale behind DFI predictions. Although the attention regularization method helps ArcDFI develop its understanding of DCIs, we conclude that expanding the DCI interactions and gathering additional data can further improve ArcDFI's predictability and interpretability.
Install the following external libraries to run this project:
deepchem
matplotlib
numpy
omegaconf
pandas
rdkit
scikit-learn
scipy
seaborn
torch
torch_geometric
To run the model and reproduce results, please download the following files from Google Drive:
-
Model Checkpoint
File:arcdfi.ckpt
Download Link -
DFI Dataset
File:dfi_final.csv
Download Link
After downloading, place the files in the appropriate directories:
├── data/
│ └── dfi_240901.csv
├── checkpoints/
│ └── arcdfi.ckptAs stated in our paper, we release both Drug-Food Interaction and Drug-CYP450 Interaction dataset. The dataset zip file from the Google Drive contains both Drug-Food Interaction Dataset dfi_final.csv and Drug-CYP450 Interaction Dataset dci_final.csv. Only the dfi_final.csv file is needed for the model training process.
Simply run the following command,
python train.py -cn arcdfi -pn <wandb_project_name> -gn <wandb_group_name> -sn <wandb_session_name> -ds <newdrug_or_newfood>You can also reproduce the visualization results of the attention weights by running the Jupyter Notebook ArcDFI Analysis.ipynb. Make sure that the downloaded files are in the correct directories.
| Name | Affiliation | |
|---|---|---|
| Mogan Gim | Department of Biomedical Engineering, Hankuk University of Foreign Studies, Yongin, South Korea |
gimmogan@hufs.ac.kr |
| Jaewoo Kang | Department of Computer Science, Korea University, Seoul, South Korea |
kangj@korea.ac.kr |
| Donghyeon Park* | Department of AI and Data Science, Sejong University, Seoul, South Korea |
parkdh@sejong.ac.kr |
| Minji Jeon* | Department of Biomedical Informatics, Department of Medicine, Korea University College of Medicine, Seoul, South Korea |
mjjeon@korea.ac.kr |
- *: Co-corresponding Authors


