loading page

Novel Recognition Method for the Locale of Membrane Proteins by Employing Deep Learning Algorithm
  • +2
  • Mehwish Faiz,
  • Saad Jawaid Khan,
  • Fahad Azim,
  • Nazia Ejaz,
  • Fahad Shamim
Mehwish Faiz
Ziauddin University
Author Profile
Saad Jawaid Khan
Ziauddin University

Corresponding Author:[email protected]

Author Profile
Fahad Azim
Ziauddin University
Author Profile
Nazia Ejaz
Balochistan University of Engineering & Technology Khuzdar
Author Profile
Fahad Shamim
Liaquat University of Medical and Health Sciences
Author Profile

Abstract

Membrane proteins are those biomolecules that are attached to or incorporated into the membranes of cells and their organelles. Depending on their functions, they are located in various regions of a cell and are essential to several cellular processes. The locale revelation of these biomolecules is critical as it portrays their activities. Most protein subcellular localization predictors have been trained particularly on globular type and perform poorly on those residing on membranes, specifically through Deep Learning. To overcome this issue, membrane proteins are forecasted in three distinct locations, (a) plasma membrane, (b) internal membrane, and (c) organelle membrane. Features are extracted through Pseudo Amino Acid Composition and some other features from a redundancy curtailed MemLoci dataset. Pseudo Amino Acid Composition is an illustrious approach that excerpts factual protein information through amino acid sequences. Another key feature is that the Pseudo Amino Acid Composition’s impact is unrelated to the Deep Learning Execution of these membrane proteins. This novel study employs four deep learning models, including (a) Artificial Neural Networks (ANN), (b) Recurrent neural networks (RNN), (c) Convolutional neural networks (CNN), and (c) Long Short Term Memory (LSTM). After extensive experimentation, the accuracy of yields is 83.2%, 83.4%, 82.4%., and 80.5% respectively. The outcomes indicate that the simple RNN and ANN models, which are less used in the research, are more suitable compared to the other two models CNN and LSTM which are frequently implemented models in proteomics. The results of the first two models is approximately similar, with a difference of 0.2% among each other, however, they surpass the other two models with better of outcomes in the range of 2 - 3%.