Leveraging Graph Machine Learning for Moonlighting Protein Prediction: A Local PPI Network and Physiochemical Feature Approach

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

Leveraging Graph Machine Learning for Moonlighting Protein Prediction: A Local PPI Network and Physiochemical Feature Approach

Authors

Zhou, H.; Sarkar, R.

Abstract

Moonlighting proteins (MPs) play critical roles in cellular functions and disease, yet their complex nature challenges traditional identification methods. This research delves into the potential of Graph Machine Learning (GML) to effectively utilize Protein-Protein Interaction (PPI) networks and physicochemical properties for MP prediction. Our study focuses on two GML architectures: Graph Attention Networks (GAT) and Graph Convolutional Networks(GCN), which are augmented by integrating quasi-sequence-order (QSOrder) and amino acid composition (AAC) to enrich the feature representation. We evaluated these models on a dataset comprising 310 proteins, finding that GML substantially outperforms conventional methodologies such as Support Vector Machines (SVM), K-nearest neighbors (KNN), and Random Forest (RF), especially in scenarios with limited data where Deep Neural Networks(DNNs) typically struggle. Notably, the GAT model, when enhanced with QSOrder and PPI data, exhibited exceptional accuracy, and robustness in precision, F1-score, and ROC-AUC metrics. This indicates a significant edge in capturing the complex predictive signals embedded in PPI networks. Our findings strongly support the integration of GML in MP research and suggest a novel approach for exploring MPs through their interactive partners.

Follow Us on

0 comments

Add comment