Volume 45, Issue 32 pp. 2929-2940

RESEARCH ARTICLE

Enhancing protein-ligand binding affinity prediction through sequential fusion of graph and convolutional neural networks

Yimin Yang,

Yimin Yang

Department of Physics, University of Science and Technology of China, Hefei, China

Search for more papers by this author

Ruiqin Zhang,

Corresponding Author

Ruiqin Zhang

[email protected]

orcid.org/0000-0001-6897-4010

Department of Physics, City University of Hong Kong, Hong Kong, China

Correspondence

Ruiqin Zhang, Department of Physics, City University of Hong Kong, Hong Kong 999077, China.

Email: [email protected]

Zijing Lin, Department of Physics, University of Science and Technology of China, Hefei 230026, China.

Email: [email protected]

Search for more papers by this author

Zijing Lin,

Corresponding Author

Zijing Lin

[email protected]

orcid.org/0000-0001-9270-1717

Department of Physics, University of Science and Technology of China, Hefei, China

Hefei National Laboratory, University of Science and Technology of China, Hefei, China

Correspondence

Ruiqin Zhang, Department of Physics, City University of Hong Kong, Hong Kong 999077, China.

Email: [email protected]

Zijing Lin, Department of Physics, University of Science and Technology of China, Hefei 230026, China.

Email: [email protected]

Search for more papers by this author

Yimin Yang,

Yimin Yang

Department of Physics, University of Science and Technology of China, Hefei, China

Search for more papers by this author

Ruiqin Zhang,

Corresponding Author

Ruiqin Zhang

[email protected]

orcid.org/0000-0001-6897-4010

Department of Physics, City University of Hong Kong, Hong Kong, China

Correspondence

Ruiqin Zhang, Department of Physics, City University of Hong Kong, Hong Kong 999077, China.

Email: [email protected]

Zijing Lin, Department of Physics, University of Science and Technology of China, Hefei 230026, China.

Email: [email protected]

Search for more papers by this author

Zijing Lin,

Corresponding Author

Zijing Lin

[email protected]

orcid.org/0000-0001-9270-1717

Department of Physics, University of Science and Technology of China, Hefei, China

Hefei National Laboratory, University of Science and Technology of China, Hefei, China

Correspondence

Ruiqin Zhang, Department of Physics, City University of Hong Kong, Hong Kong 999077, China.

Email: [email protected]

Zijing Lin, Department of Physics, University of Science and Technology of China, Hefei 230026, China.

Email: [email protected]

Search for more papers by this author

First published: 02 September 2024

https://doi.org/10.1002/jcc.27499

Citations: 1

Share a link

Email
Wechat
Bluesky

Abstract

Predicting protein-ligand binding affinity is a crucial and challenging task in structure-based drug discovery. With the accumulation of complex structures and binding affinity data, various machine-learning scoring functions, particularly those based on deep learning, have been developed for this task, exhibiting superiority over their traditional counterparts. A fusion model sequentially connecting a graph neural network (GNN) and a convolutional neural network (CNN) to predict protein-ligand binding affinity is proposed in this work. In this model, the intermediate outputs of the GNN layers, as supplementary descriptors of atomic chemical environments at different levels, are concatenated with the input features of CNN. The model demonstrates a noticeable improvement in performance on CASF-2016 benchmark compared to its constituent CNN models. The generalization ability of the model is evaluated by setting a series of thresholds for ligand extended-connectivity fingerprint similarity or protein sequence similarity between the training and test sets. Masking experiment reveals that model can capture key interaction regions. Furthermore, the fusion model is applied to a virtual screening task for a novel target, PI5P4Kα. The fusion strategy significantly improves the ability of the constituent CNN model to identify active compounds. This work offers a novel approach to enhancing the accuracy of deep learning models in predicting binding affinity through fusion strategies.

CONFLICT OF INTEREST STATEMENT

The authors declare no competing financial interest.

Open Research

DATA AVAILABILITY STATEMENT

The source code developed in this work can be found at https://github.com/IanYMY/GCNN.

Supporting Information

REFERENCES

1J. Li, A. Fu, L. Zhang, Interdiscip. Sci. 2019, 11, 320.
10.1007/s12539-019-00327-w
PubMed Web of Science® Google Scholar
2Q. U. Ain, A. Aleksandrova, F. D. Roessler, P. J. Ballester, Rev. Comput. 2015, 5, 405.
10.1002/wcms.1225
CAS Google Scholar
3P. J. Ballester, J. B. O. Mitchell, Bioinformatics 2010, 26, 1169.
10.1093/bioinformatics/btq112
CAS PubMed Web of Science® Google Scholar
4Z. X. Cang, G. W. Wei, PLoS Comput. Biol. 2017, 13, e1105690.
10.1371/journal.pcbi.1005690
Google Scholar
5C. Wang, Y. K. Zhang, J. Comput. Chem. 2017, 38, 169.
10.1002/jcc.24667
PubMed Web of Science® Google Scholar
6M. Wojcikowski, M. Kukielka, M. M. Stepniewska-Dziubinska, P. Siedlecki, Bioinformatics 2019, 35, 1334.
10.1093/bioinformatics/bty757
CAS PubMed Web of Science® Google Scholar
7Z. Cang, L. Mu, G.-W. Wei, PLoS Comput. Biol. 2018, 14, e1005929.
10.1371/journal.pcbi.1005929
PubMed Web of Science® Google Scholar
8N. Sanchez-Cruz, J. L. Medina-Franco, J. Mestres, X. Barril, Bioinformatics 2021, 37, 1376.
10.1093/bioinformatics/btaa982
CAS PubMed Web of Science® Google Scholar
9D. D. Nguyen, G. W. Wei, Int. J. Numer. Meth. Biomed. 2019, 35, e3179.
10.1002/cnm.3179
PubMed Web of Science® Google Scholar
10J. Jimenez, M. Skalic, G. Martinez-Rosell, G. De Fabritiis, J. Chem. Inf. Model. 2018, 58, 287.
10.1021/acs.jcim.7b00650
CAS PubMed Web of Science® Google Scholar
11M. M. Stepniewska-Dziubinska, P. Zielenkiewicz, P. Siedlecki, Bioinformatics 2018, 34, 3666.
10.1093/bioinformatics/bty374
CAS PubMed Web of Science® Google Scholar
12L. Zheng, J. Fan, Y. Mu, ACS Omega 2019, 4, 15956.
10.1021/acsomega.9b01997
CAS PubMed Web of Science® Google Scholar
13E. N. Feinberg, D. Sur, Z. Q. Wu, B. E. Husic, H. H. Mai, Y. Li, S. S. Sun, J. Y. Yang, B. Ramsundar, V. S. Pande, ACS Central Sci. 2018, 4, 1520.
10.1021/acscentsci.8b00507
CAS PubMed Web of Science® Google Scholar
14D. J. Jiang, C. Y. Hsieh, Z. X. Wu, Y. Kang, J. K. Wang, E. C. Wang, B. Liao, C. Shen, L. Xu, J. Wu, D. S. Cao, T. J. Hou, J. Med. Chem. 2021, 64, 18209.
10.1021/acs.jmedchem.1c01830
CAS PubMed Web of Science® Google Scholar
15H. M. Shen, Y. Z. Zhang, C. H. Zheng, B. Wang, P. Chen, Int. J. Mol. Sci. 2021, 22, 4023.
10.3390/ijms22084023
CAS PubMed Google Scholar
16D. Jones, H. J. Kim, X. H. Zhang, A. Zemla, G. Stevenson, W. F. D. Bennett, D. Kirshner, S. E. Wong, F. C. Lightstone, J. E. Allen, J. Chem. Inf. Model. 2021, 61, 1583.
10.1021/acs.jcim.0c01306
CAS PubMed Web of Science® Google Scholar
17G. W. Kyro, R. I. Brent, V. S. Batista, J. Chem. Inf. Model. 2023, 63, 1947.
10.1021/acs.jcim.3c00251
CAS PubMed Web of Science® Google Scholar
18H. Öztürk, A. Özgür, E. Ozkirimli, Bioinformatics 2018, 34, i821.
10.1093/bioinformatics/bty593
CAS PubMed Web of Science® Google Scholar
19T. Nguyen, H. Le, T. P. Quinn, T. Nguyen, T. D. Le, S. Venkatesh, Bioinformatics 2020, 37, 1140.
10.1093/bioinformatics/btaa921
Web of Science® Google Scholar
20A. Dhillon, G. K. Verma, Prog. Artif. Intell. 2020, 9, 85.
10.1007/s13748-019-00203-0
Web of Science® Google Scholar
21A. Roitberg, T. Pollert, M. Haurilet, M. Martin, R. Stiefelhagen, 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vol. 15-20, IEEE, Long Beach, CA 2019, p. 198.
Google Scholar
22J. Zhou, G. Cui, S. Hu, Z. Zhang, C. Yang, Z. Liu, L. Wang, C. Li, M. Sun, AI Open 2020, 1, 57.
10.1016/j.aiopen.2021.01.001
Web of Science® Google Scholar
23S. Brody, U. Alon, E. Yahav, ePrint arXiv 2021 https://arxiv.org/abs/2105.14491
Google Scholar
24K. Cho, B. V. Merrienboer, Ç. Gülçehre, D. Bahdanau, F. Bougares, H. Schwenk, Y. Bengio, Conference on Empirical Methods in Natural Language Processing, ACL Anthology, Doha, Qatar 2014, p. 1724.
Google Scholar
25Y. Li, D. Tarlow, M. Brockschmidt, R. S. Zemel, ePrint arXiv 2016 https://arxiv.org/abs/1511.05493
Google Scholar
26J. Lee, I. Lee, J. Kang, ePrint arXiv 2019 https://arxiv.org/abs/1904.08082
Google Scholar
27Z. H. Liu, M. Y. Su, L. Han, J. Liu, Q. F. Yang, Y. Li, R. X. Wang, Acc. Chem. Res. 2017, 50, 302.
10.1021/acs.accounts.6b00491
CAS PubMed Web of Science® Google Scholar
28P. W. Rose, A. Prlic, A. Altunkaya, C. X. Bi, A. R. Bradley, C. H. Christie, L. Di Costanzo, J. M. Duarte, S. Dutta, Z. K. Feng, R. K. Green, D. S. Goodsell, B. Hudson, T. Kalro, R. Lowe, E. Peisach, C. Randle, A. S. Rose, C. H. Shao, Y. P. Tao, Y. Valasatava, M. Voigt, J. D. Westbrook, J. Woo, H. W. Yang, J. Y. Young, C. Zardecki, H. M. Berman, S. K. Burley, Nucleic Acids Res. 2017, 45, D271.
10.1093/nar/gkw1042
CAS PubMed Web of Science® Google Scholar
29M. Y. Su, Q. F. Yang, Y. Du, G. Q. Feng, Z. H. Liu, Y. Li, R. X. Wang, J. Chem. Inf. Model. 2019, 59, 895.
10.1021/acs.jcim.8b00545
CAS PubMed Web of Science® Google Scholar
30N. M. O'Boyle, M. Banck, C. A. James, C. Morley, T. Vandermeersch, G. R. Hutchison, J. Cheminformatics 2011, 3, 33.
10.1186/1758-2946-3-33
CAS PubMed Web of Science® Google Scholar
31E. F. Pettersen, T. D. Goddard, C. C. Huang, G. S. Couch, D. M. Greenblatt, E. C. Meng, T. E. Ferrin, J. Comput. Chem. 2004, 25, 1605.
10.1002/jcc.20084
CAS PubMed Web of Science® Google Scholar
32 RDKit. RDKit: Open-source cheminformatics https://www.rdkit.org. 2023.
Google Scholar
33R. X. Wang, L. H. Lai, S. M. Wang, J. Comput. Aided Mol. Des. 2002, 16, 11.
10.1023/A:1016357811882
CAS PubMed Web of Science® Google Scholar
34Y. Li, J. Y. Yang, J. Chem. Inf. Model. 2017, 57, 1007.
10.1021/acs.jcim.7b00049
CAS PubMed Web of Science® Google Scholar
35J. C. Yang, C. Shen, N. Huang, Front. Pharmacol. 2020, 11, 69.
10.3389/fphar.2020.00069
CAS PubMed Web of Science® Google Scholar
36D. Rogers, M. Hahn, J. Chem. Inf. Model. 2010, 50, 742.
10.1021/ci100050t
CAS PubMed Web of Science® Google Scholar
37Y. Zhang, NW-Align. http://zhanglab.dcmb.med.umich.edu/NW-align
Google Scholar
38K. Stierand, M. Rarey, ACS Med. Chem. Lett. 2010, 1, 540.
10.1021/ml100164p
CAS PubMed Web of Science® Google Scholar
39L. Wortmann, N. Bräuer, S. J. Holton, H. Irlbacher, J. Weiske, C. Lechner, R. Meier, J. Karén, C. B. Siöberg, V. Pütter, C. D. Christ, A. ter Laak, P. Lienau, R. Lesche, B. Nicke, S. H. Cheung, M. Bauser, A. Haegebarth, F. von Nussbaum, D. Mumberg, C. Lemos, J. Med. Chem. 2021, 64, 15883.
10.1021/acs.jmedchem.1c01245
CAS PubMed Web of Science® Google Scholar
40B. M. Emerling, J. B. Hurov, G. Poulogiannis, K. S. Tsukazawa, R. Choo-Wing, G. M. Wulf, E. L. Bell, H. S. Shim, K. A. Lamia, L. E. Rameh, G. Bellinger, A. T. Sasaki, J. M. Asara, X. Yuan, A. Bullock, G. M. DeNicola, J. X. Song, V. Brown, S. Signoretti, L. C. Cantley, Cell 2013, 155, 844.
10.1016/j.cell.2013.09.057
CAS PubMed Web of Science® Google Scholar
41S. Chen, C. C. Tjin, X. Gao, Y. Xue, H. Y. Jiao, R. L. Zhang, M. N. Wu, Z. Y. He, J. Ellman, Y. Ha, Proc. Natl. Acad. Sci. U. S. A. 2021, 118, e21180291180.
Google Scholar
42H. M. G. Willems, S. Edwards, H. K. Boffey, S. J. Chawner, C. Green, T. Romero, D. Winpenny, J. Skidmore, J. H. Clarke, S. P. Andrews, RSC Med. Chem. 2023, 14, 934.
10.1039/D3MD00039G
CAS PubMed Google Scholar
43D. Mendez, A. Gaulton, A. P. Bento, J. Chambers, M. De Veij, E. Félix, M. P. Magariños, J. F. Mosquera, P. Mutowo, M. Nowotka, M. Gordillo-Marañón, F. Hunter, L. Junco, G. Mugumbate, M. Rodriguez-Lopez, F. Atkinson, N. Bosc, C. Radoux, A. Segura-Cabrera, A. Hersey, A. R. Leach, Nucleic Acids Res. 2019, 47, D930.
10.1093/nar/gky1075
CAS PubMed Web of Science® Google Scholar
44D. R. Koes, M. P. Baumgartner, C. Camacho, J. Chem. Inf. Model. 2013, 53, 1893.
10.1021/ci300604z
CAS PubMed Web of Science® Google Scholar
45O. Trott, A. J. Olson, J. Comput. Chem. 2010, 31, 455.
10.1002/jcc.21334
CAS PubMed Web of Science® Google Scholar
46L. Li, B. Wang, S. O. Meroueh, J. Chem. Inf. Model. 2011, 51, 2132.
10.1021/ci200078f
CAS PubMed Web of Science® Google Scholar

Citing Literature

Volume45, Issue32

December 15, 2024

Pages 2929-2940

Enhancing protein-ligand binding affinity prediction through sequential fusion of graph and convolutional neural networks

Abstract

CONFLICT OF INTEREST STATEMENT

Open Research

DATA AVAILABILITY STATEMENT

Supporting Information

REFERENCES

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Enhancing protein-ligand binding affinity prediction through sequential fusion of graph and convolutional neural networks

Abstract

CONFLICT OF INTEREST STATEMENT

Open Research

DATA AVAILABILITY STATEMENT

Supporting Information

REFERENCES

Citing Literature

References

Related

Information