A deep learning approach to increase the accuracy of predicting protein- compound interactions

Loading...
Thumbnail Image
Date
2019-09-12
Authors
Gunathilake, P. M. P. C.
Roshan, W. M. P. G. D. M.
Jayarathna, P. D. C. R.
Journal Title
Journal ISSN
Volume Title
Publisher
University of Peradeniya
Abstract
In drug repositioning and discovery, a system-level identification of drug-target direct interactions is important. However, even today a wide range of drug compounds and protein targets makes it challenging and expensive to identify the interactions between them. For protein-compound interaction prediction, traditional similarity-based computational models were used and they rarely utilize the hidden features from currently available large-scale unlabelled compound and protein data. They often limit their usage on relatively small-scale datasets. This work proposes a new schema for predicting the protein compound interaction and binding score using deep learning approach. Most accurate features from a large set of proteins and compounds were selected and converted into a form of numerical values as the input data. The known interactions between selected compound and protein pairs were taken in binary form as the target dataset. The numerical input dataset and the binary target dataset were used to train a deep neural network model under a supervised learning process. The neural network was optimized and trained with different activation functions and optimization algorithms. The trained neural network model was then used to predict the protein-compound interactions. Also, the second approach was made to predict the protein-compound binding score. The same set of protein-compound pairs were selected as the input of the neural network model. A multiple levels of numerical values representing the binding score between each protein- compound pair in the input dataset was taken as the target instead of the binary target in the interaction prediction method. Then the neural network model was optimized and trained to predict the binding score for a protein-compound pair. The cross validated accuracy for protein-compound binding score prediction neural network was 99.6% and for the protein-compound interaction prediction neural network it was 77.32%.
Description
Keywords
Deep learning , Artificial neural network , Machine learning , Protein compound interaction , Drug discovery
Citation
Collections