Review Article
Role of Artificial Intelligence in Increasing Agricultural Productivity and Ensuring Global Food Security through Study of Plant Protein-Protein Interaction Networks and Interactomes
Debasree Sarkar
Department of Biotechnology, SRM Institute of Science and Technology, Tiruchirappalli, India
*Corresponding author:Debasree Sarkar, Department of Biotechnology, SRM Institute of Science and Technology, Tiruchirappalli, India. Email Id: debasreesarkar@ist.srmtrichy.edu.in
Copyright: © Sarkar D. 2025. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Article Information:Submission: 02/05/2025; Accepted: 03/06/2025; Published: 06/06/2025
Abstract
To better understand cells, diseases, and how to treat them, researchers focus on protein-protein interactions (PPIs). Since there are many difficulties in food production due to the environment, studying interactomes can ease these problems and help reveal new facts about how crops interact with pathogens.
To study PPIs in plants, researchers rely on yeast two-hybrid (Y2H) systems, affinity purification with mass spectrometry (AP-MS), bimolecular fluorescence complementation (BiFC), and computational prediction programmes. Yet, they have some problems, including a high error rate and the difficulty in obtaining
similar results on repeating the same experiments. The areas of machine learning and deep learning within AI have helped in PPIs by boosting the accuracy, speed, and ability to work on large datasets. AI is being applied more often to plant protein-protein interaction networks and interactomes, helping people to
better understand the details of biological systems that will determine the future prospects of agriculture and global food security.
Keywords: Plant Proteomics; Artificial Intelligence; Protein-Protein Interaction Networks; Host-Pathogen Interactions; Agricultural Biotechnology.
Abbreviations
PPI-Protein-protein Interaction; Y2H-Yeast two-hybrid; AP-MSAffinity
Purification with Mass Spectrometry; BiFC-Bimolecular
Fluorescence Complementation; 2D-LC Two-dimensional Liquid
Chromatography; 2D-DIGE Two-dimensional Difference Gel
Electrophoresis; AI-Artificial Intelligence; ML-Machine Learning;
DL-Deep Learning; PAMP Pathogen-associated Molecular Pattern;
PTI-PAMP-triggered Immunity; KGF-GNN-Knowledge Graph Fused
Graph Neural Network; PAN-Protein-associated Network; CNNConvolutional
Neural Network; RNN-Recurrent Neural Network;
GSN-Generative Stochastic Network; LSTM-Long Short-term
Memory; GO-Gene Ontology; LC-MS Liquid Chromatography
Mass Spectrometry; DIA-Data-independent Acquisition; DDA
Data-dependent Acquisition; GAN- Generative Adversarial Network;
IoT-Internet of Things.
Materials and Methods
PubMed (https://pubmed.ncbi.nlm.nih.gov/) was used as the
primary resource for searching the relevant articles using keywords/
search terms like “artificial intelligence in plant proteomics”/
“artificial intelligence for prediction of protein-protein interactions”/
“artificial intelligence for prediction of host-pathogen interactions in
plants”/ “protein-protein interactions in plant immunity” etc.
Introduction
Understanding how living things work, how diseases start, and
ways to treat them all come down to how proteins interact with
each other [1]. Hence, addressing issues in food production brought
on by environmental stressors can be aided by comprehensive
interactome research to understand protein interactions in plants,
which are essential for many biological processes like development,
stress responses, and signalling pathways [2,3]. Furthermore,
analysis of host-pathogen interactions can provide novel insights
into how pathogens manipulate host defences for their own benefit
[4]. These findings might help us better understand how plants
defend themselves by showing us how plant proteins and pathogen
molecules interact and respond to each other.
To understand PPIs in plants, scientists are depending mainly on
proteomics techniques since it provides a better insight into various
biological processes. Researchers depend on high-throughput tools
to look into and understand how cells interact [2,5]. Using the yeast
two-hybrid (Y2H) system, scientists are able to study PPIs that take
place naturally in cells, which made it easier to study the binary
interactions between individual proteins, and thereby helped with
plant biology studies [6].Another way to figure this out is to use a
purification method called affinity purification together with mass
spectrometry (AP-MS) in plant tissues. It can be made more effective
by testing proteins with fluorescent tags and without labels to get
rid of any nonspecifically precipitated proteins [7]. Other methods
include bimolecular fluorescence complementation (BiFC) for
visualizing protein interactions in living cells, and in silico prediction
tools for computational analysis of potential interactions [3]. Twodimensional
liquid chromatography (2D-LC) and fluorescence two dimensional
difference gel electrophoresis (2D-DIGE) have also
made it easier for researchers to identify proteins regulated by specific
stimuli in plants [8].
Recent progress in proteomic technology has led to the
development of new methods that can detect very small amounts of
proteins more accurately, and allow us to look at a high volume of
samples at the same time. These include multidimensional protein
identification technology, OFFGEL electrophoresis, and filter-aided
sample preparation methods, which allow for the identification of
thousands of proteins and the detection of transient or weak affinity
interactions [9]. Nevertheless, the integration of multiple techniques,
such as Y2H, AP-MS, and computational predictions, is crucial for
constructing comprehensive and reliable plant protein interactome
networks [2].
In conclusion, the combination of various proteomic methods,
including Y2H, AP-MS, BiFC, and advanced mass spectrometry
techniques, allows scientists to understand the inter-relationships
between proteins in plant systems. With the help of these methods,
researchers can study plant development, functions, and diseases,
guiding new discoveries and using them in agricultural practice
forcrop improvement[2,3]. Yet, each of them is subject to flaws such
as having too many false positives and making results hard to repeat
[10]. On the other hand, recent advancements in artificial intelligence
(AI), particularly machine learning (ML) and deep learning (DL), have
revolutionized the study of PPIs. These computational approaches
have improved the reliability, efficiency, and scale of predicting
PPIs, helping researchers to explore and discover new connections
between proteinsand their biological significance. While there is less
comprehensive experimental data available on plant interactomes
than for some other organisms, this still creates an opportunity for
exploring how plant protein interactions may improve crop yields
and help plants resist various factors causing biotic and abiotic stress.
For example, PPIs play a crucial role in regulating plant defense
responses against pathogens and pests by detecting pathogen associated
molecular patterns (PAMPs) and initiating PAMPtriggered
immunity (PTI), a first line of defense against pathogens
[11]. Hence, AI is increasingly being applied to study plant protein protein
interaction (PPI) networks and interactomes, offering new
mechanistic insights into plant physiology and host-pathogen
interactions.
Methodologies in AI-assisted PPI studies
1. Structure-Based Approaches:
Structure-based methods look at the three-dimensional shape of
a protein to predict how it might interact, giving better results than
methods that rely solely on protein sequences. Taking into account the
location and the parts of a protein involved in binding and catalysis
boosts the accuracy of these approaches. Even so, issues like lacking
structural data and efficient ways of using negative samples continue
to exist, so there is a need for bringing together experimental and
computational tools [12].2. Graph Neural Networks (GNNs):
Graph neural networks have emerged as powerful tools for
modeling PPIs. The Knowledge Graph Fused Graph Neural Network
(KGF-GNN) constructs protein-associated networks (PANs) and
extracts topological and semantic features. This end-to-end learning
framework fuses features from PANs and PPI networks, significantly
outperforming state-of-the-art models [13].3. Deep Learning Techniques:
Deep learning methods, such as convolutional neural networks
(CNNs), recurrent neural networks (RNNs), and generative stochastic
networks (GSNs), have been widely adopted for PPI prediction. CNNs
excel at extracting hierarchical features from biological sequences,
while GSNs handle uncertainty effectively. Long short-term memory
(LSTM) networks capture temporal dependencies, though scalability
remains a challenge [14].4. Multi-Modal Approaches:
The integration of multi-modal data, such as protein sequences,
3D structures, and gene ontology (GO), has enhanced PPI prediction.
Vision transformers and pre-trained language models are used to
extract features from structural and sequence data, respectively. These
multi-modal frameworks have demonstrated superior performance
compared to uni-modal approaches [15,16].5. Language Models:
Large language models, such as ProtBERT, have been fine-tuned
for PPI prediction. These models achieve state-of-the-art performance
by learning from synthetic and real datasets, demonstrating their
utility in high-throughput protein interaction prediction.Role of AI in in MS-based proteomics:
Artificial intelligence (AI) plays a transformative role in mass
spectrometry (MS)-based proteomics data analysis by enhancing
data processing, quality control, and interpretation. AI technologies
are integrated across the proteomics workflow to extract meaningful
insights from complex datasets as summarized in (Table 1), thereby
addressing challenges such as data complexity and the need for
standardized analytical frameworks, ultimately refining the quality
and practicality of proteomics data, in the following aspects:Quality Control and Data Acquisition: AI models, such as
the one developed in the iDIA-QC software, improve quality
control in MS-based proteomics by detecting subtle changes
in liquid chromatography-mass spectrometry (LC-MS)
status. This model, trained on data-independent acquisition
(DIA) files, outperforms traditional data-dependent
acquisition (DDA) methods, achieving high accuracy in
validation datasets [17].
Data Analysis and Interpretation: AI improves the
understanding of MS data by filtering out false signals and
increasing its accuracy [18]. This is particularly crucial
in the context of infectious diseases, where AI aids in the
identification of biomarkers and elucidation of disease
mechanisms. Machine learning (ML) strategies are also
employed to tackle data challenges in proteomics, with a
focus on developing robust models through high-quality
datasets and standardization efforts [19]. Techniques like self supervised
pre-training and multitask learning are explored
to address data scarcity and improve model performance.
AI applications in studying plant PPI networks:
Deep Learning Models: AI models like DWPPI use deep
neural networks to predict plant PPIs by integrating multi-source
information, achieving high accuracy in datasets from plants like
Arabidopsis thaliana, maize, and rice [28].Integration of Data Types: AI methods are advancing by
integrating various data types, such as molecular structure and
interactome data, to enhance prediction accuracy. This approach
has shown significant improvements over traditional methods in
predicting protein-compound interactions [29].
Network Biology: AI-driven network biology helps identify
critical nodes in plant interactomes, which are often targets for
pathogens. This understanding can aid in developing disease-resistant
plant varieties [30, 31].
Benefits of using AI in plant proteomics research
Improved Prediction Accuracy: AI models have demonstrated
superior performance in predicting PPIs compared to traditional
methods, with high accuracy values indicating robust predictive
capabilities [32].
Data Integration: Combining different data sources, such as
sequence and structural information, enhances the predictive power
of AI models, providing a more comprehensive understanding of
protein interactions [29].
Pathogen Interaction Studies: AI tools are instrumental in
studying host-pathogen interactions, revealing how pathogens target
specific nodes in plant interactomes, which can inform strategies for
enhancing plant immunity [30,31].
Potential applications of artificial intelligence in improving agricultural output
Artificial intelligence (AI) could really help find new ways to
treat plant diseases and make crops grow better. The integration
of AI technologies like generative AI, machine learning, and deep
learning can help farmers find and control diseases more effectively
by making it easier for them to recognise problems and predict what
might happen if something goes wrong. These advancements make
it possible to help farmers find easy, accurate, and helpful ways to
control plant diseases, which really helps make growing crops better
and more reliable so there is enough food for everyone. AI applications
in agriculture cover a lot of ground, from checking for diseases in
real time to helping predict when outbreaks might occur, and more
farmers are using them to make their farming more sustainable.
AI in Disease Detection and Management
Generative AI can enhance disease identification and prediction,
offering tools for better management strategies, and optimization of
food production processes[32].
Machine learning and deep learning techniques, such as
convolutional neural networks (CNNs), are used for accurate disease
detection, achieving high accuracy and efficiency in identifying plant
diseases [33,34].
Multi-model deep learning approaches, including VGG16 and
MobileNetV2, provide scalable, non-invasive solutions for early
disease detection, achieving up to 99% efficiency [35].
Improving Crop Yields:
AI-driven systems can optimize crop protection and yield by
providing timely and informed decision-making tools for farmers.
The judicious use of generative models like Generative Adversarial
Networks (GANs) and deep reinforcement learning to simulate
scenarios, generate realistic data, and evaluate agricultural strategies,
can help in creating predictive analytics for crop yields, and the
development of personalized nutrition plans, thereby enabling
sustainable and efficient agricultural practices [32, 33].The integration of AI with IoT-enabled devices and satellite
imagery allows for real-time monitoring and prediction of disease
outbreaks, which can help in implementing preventive measures to
protect crops [36].
AI models trained on comprehensive datasets, such as Leafsnap,
can identify patterns and symptoms of leaf diseases, aiding in efficient
disease management and minimizing crop losses[34].
Challenges and Future Directions:
Despite significant progress, several challenges remain in AIdriven
PPI studies. These include:
Data Limitations: The scarcity of high-resolution structural data
and the need for reliable negative samples hinder model training and
validation [12].
Scalability: Deep learning models often struggle with scalability,
particularly when handling large proteomic datasets [14].
Interpretability: The complexity of deep learning models can
make it difficult to interpret predictions, limiting their practical utility
[14].
While AI offers transformative potential in agriculture, challenges
such as data scarcity, computational constraints, and socioeconomic
barriers to AI adoption remain. Addressing these issues is crucial
for maximizing the benefits of AI technologies in agriculture.
Additionally, fostering user trust and acceptance of AI-driven systems
among farmers is essential for widespread adoption and success in
improving crop yields and disease prevention [35,36]. Future work
could focus on making sure the data used is more reliable, and that it
is easier to understand how the machine learning models work, and
also on bringing in different kinds of data to help these methods work
better with diverse datasets.
Conclusion
There has been a paradigm shift in the domain of plant biology
and agricultural biotechnology by the use of AI in the discovery of
plant protein-protein interaction networks. Both AI and machine
learning use unique data to predict information about plant
interactomes accurately, with remarkable benefits for agriculture
and biotechnology. Relying on AI while studying plant PPIs and
interactomes provides better understanding of the processes involved
in plant growth and development. Leveraging machine learning and
deep learning with AI increases the accuracy, speed, and capacity
of predicting PPIs. Such progress is necessary to increase food
production and protect crops from various challenges brought by
climate change.